Commit graph

348 commits

Author SHA1 Message Date
autocommit
5a059230bb chore(config): 🔧 Update app metadata in app.manifest.yaml with new name, version, icons, and platform-specific permissions
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
5cde520788 deps-upgrade(deps): ⬆️ Update dependency versions in coordinator and core-py packages to align with uv.lock files
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
371dcd502a feat(gpu-lease): Add lease management logic in GPULease class and update coordinator with lease timeouts and retry policies
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
314d390362 feat(inference): Refactor VRAM measurement functions for accuracy improvements and add unit tests for inference tracking
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
ce02e8e246 feat(inference): Update queue logic and add tests for dynamic slot management and improved inference task orchestration
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
15e769fe32 refactor(inference): ♻️ Implement cleaner worker management and scheduling logic in InferencePool by reorganizing selection conditions and state handling, while ensuring backward compatibility in tests
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
b28caf959c feat(llama-server): Introduce LlamaServerBackend with startup timeout and error handling for inference tasks; add test cases in test_backend.py
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
d394ba091f deps-upgrade(coordinator): ⬆️ Update key libraries in coordinator service and core Python package for security and performance improvements
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 20:14:11 -07:00
autocommit
de72966b42 release(coordinator): 🔖 Update version to 4.3.0 in app manifest and coordinator module
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 18:19:08 -07:00
autocommit
8ad01398f9 deps-upgrade(coordinator): ⬆️ Update coordinator dependencies for security and compatibility improvements
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 18:19:08 -07:00
autocommit
663ce17635 feat(model-boss-coordinator): Add idle timeout protection with priority-based scheduling to prevent model starvation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 17:54:57 -07:00
autocommit
a347b86d8a chore(core-ts): 🔧 Update TypeScript version to 5.3 for core dependencies
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 14:45:51 -07:00
autocommit
acf7f95e0c feat(model-boss-coordinator): Introduce configurable retry policies, timeouts, and resource allocation in the coordinator service via new config keys and dynamic CLI support
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 14:45:51 -07:00
autocommit
bdc08c6c55 test(gpu): Add GPU boss test fixtures and expand test cases for enhanced resource management scenarios
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 14:45:51 -07:00
autocommit
fd2978469d feat(inference): Introduce configurable thread and batch settings for Llama inference backend to enable tunable concurrency and batching
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 14:45:51 -07:00
autocommit
02e735aad4 deps-upgrade(coordinator): ⬆️ Upgrade core-ts, model-boss-mcp, mcp-server, and types to ensure compatibility, security, and performance improvements
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-10 14:45:51 -07:00
autocommit
d7d9acaa25 test(coordinator): Add test cases for cloud-fallback guard behavior in coordinator router tests
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 03:12:53 -07:00
autocommit
4a3cf3a994 docs(docs): 📝 Add architectural documentation for cloud-fallback guard components and integration
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 03:12:53 -07:00
autocommit
165fb4674b deps-upgrade(core-py): ⬆️ Update core-py dependencies to latest stable versions
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 03:12:53 -07:00
autocommit
1e3abdf341 feat(model-boss): Improve batch job processing and GPU resource allocation in Boss and Client classes
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 02:55:09 -07:00
autocommit
bbf0510838 refactor(model-boss-coordinator): ♻️ Implement modular scheduling logic by reorganizing ModelBossCoordinator and its models
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 02:55:09 -07:00
autocommit
5cc75aa2e0 feat(coordinator): Add scheduling configuration options to Config and RuntimeConfig classes with test coverage
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 02:55:08 -07:00
autocommit
a9559f064a feat(coordinator): Enhance dynamic routing, proxying, and task scheduling for improved inference task performance and resource utilization
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 02:55:08 -07:00
autocommit
066b3ba63a feat(model-pool): Introduce batch job class and priority-based scheduling in model pool coordinator
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 02:55:08 -07:00
autocommit
e5183135c7 perf(model-boss-coordinator): Implement queue management logic with timeout handling and load balancing to optimize synchronous text inference performance
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 02:37:40 -07:00
autocommit
16e59f9035 docs(docs): 📝 Add detailed architecture documentation for synchronous inference, async jobs, cold-load behavior, and timeout implications
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-09 02:37:39 -07:00
autocommit
11fe02311d feat(transformers-vision): Register PIL opener for HEIC/HEIF formats to support iPhone image processing
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-08 22:56:19 -07:00
autocommit
a457e3c04c deps-upgrade(coordinator): ⬆️ Update dependencies in pyproject.toml for security patches and compatibility improvements
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-06-08 22:56:19 -07:00
autocommit
9c662937b3 test(whisper-http): Add comprehensive tests for Whisper HTTP endpoints and behavior validation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-17 22:14:00 -07:00
autocommit
1eaa6be2a4 chore(whisper-http): 🔧 Update and add configuration settings for Whisper HTTP service endpoints and defaults
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-17 21:35:46 -07:00
autocommit
86519212e7 feat(whisper-http): Add core HTTP service initialization, entry point, application logic, and data models
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-17 21:35:45 -07:00
autocommit
d3e5791675 deps-upgrade(whisper-http): ⬆️ Update HTTP service dependencies to latest compatible versions
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-17 21:35:45 -07:00
autocommit
1cab0fa18b feat(tasks): Add pinPrimary, keepAliveS, and budgetS configuration options to task definitions in the frontend
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-16 19:46:52 -07:00
autocommit
1d2f7557dd feat(inference): Update inference proxy routing and forwarding logic for new request handling
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-16 19:08:35 -07:00
autocommit
53600f2787 feat(inference): Add Transformers Seq2Seq backend integration with model execution and registration logic
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-16 19:08:35 -07:00
autocommit
7481f92dc3 feat(inference-backends): Add Transformers Seq2Seq backend and worker for sequence-to-sequence model inference
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-16 19:08:35 -07:00
autocommit
7be0b09e86 fix(model-boss-coordinator): 🐛 Implement graceful None-value handling in VRAM estimation to prevent crashes during stable inference task processing
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-16 18:57:03 -07:00
autocommit
fc6a211f83 feat(config): Add timeout budget configuration for inference tasks in tasks.yaml
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-16 18:57:03 -07:00
autocommit
7fd0f24234 feat(coordinator): Add budget constraint enforcement to tasks with budget_s parameter, updating API handlers and inference pipeline logic
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-16 16:26:48 -07:00
autocommit
f02a09cddb feat(model-boss-coordinator): Add keep-alive tracking to maintain VRAM residency for active model tasks
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-15 18:18:06 -07:00
autocommit
8da6fcb402 feat(config): Add keep-alive configuration to optimize VRAM usage and reduce cold-load costs
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-15 18:18:06 -07:00
autocommit
71f12f09ef chore(config): 🔧 Add pin_primary runtime setting to control task resolver behavior in tasks.yaml
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-14 23:05:08 -07:00
autocommit
56a08d89a4 feat(inference): Introduce pin_primary flag to prioritize primary models in Router, TaskRegistry, and model classes
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-14 22:58:02 -07:00
autocommit
e53173a95a feat(coordinator): Add pin_primary method to task API for primary state management and update tests
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-14 22:58:01 -07:00
autocommit
47cf947437 flags(config): 🚩 Introduce pin_primary flag to enable/disable primary prospect classification behavior
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-14 22:58:01 -07:00
autocommit
07efb4bda3 chore(config): 🔧 Update chat task model recommendations to prioritize Qwen3.6 family models and implement fallback strategies
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-14 22:23:09 -07:00
autocommit
9a8ce50edf remove(config): 🔥 Remove deprecated prospect.classify task from tasks.yaml
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-14 20:33:44 -07:00
autocommit
6a6ce97e5c feat(config): Add prospect-classification task with configurable model and fallback options in tasks.yaml
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-14 20:16:44 -07:00
autocommit
b85961cb69 feat(client): Add Markdown fence parser to strip triple-backtick content from JSON responses
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-13 15:46:08 -07:00
autocommit
efeb21d024 deps-upgrade(client): ⬆️ Update client dependencies to latest minor/patch versions for security fixes and compatibility improvements
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-13 15:46:08 -07:00