autocommit
2bb4e44c71
docs(benchmarks): 📝 Add raw benchmark data for ministral-14b-reasoning model with performance metrics and evaluation results
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-11 10:19:36 -07:00
autocommit
c8da606f6f
docs(model-encyclopedia): 📝 Update benchmark results for ministral-14b-reasoning model in LLM documentation
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-11 10:19:36 -07:00
autocommit
cb0676bb43
docs(model-encyclopedia): 📝 Add ministral-14b-reasoning benchmark data to model encyclopedia docs
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-11 02:29:31 -07:00
autocommit
6725724e27
docs(model-encyclopedia): 📝 Add benchmark results for llm_reasoning performance metrics to model encyclopedia documentation
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-11 02:29:31 -07:00
autocommit
e2628ee650
docs(model-encyclopedia): 📝 Add detailed model encyclopedia entries for Qwen3.6 variants and Mistral-small-3.2-24b with technical specs and usage guidance
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-05-10 23:36:36 -07:00
Claude Code
506f66ee04
perf(inference-specific): ⚡ Refactor inference task pooling and queuing logic to optimize throughput and resource utilization
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 17:50:38 -07:00
Claude Code
82814ed90e
docs(model-encyclopedia): 📝 Improve model encyclopedia documentation with expanded definitions, usage examples, and structured entries
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 13:09:15 -07:00
Claude Code
ea0ea05b2e
docs(docs): 📝 Update architecture and consumer integration documentation to clarify system design and integration patterns
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-25 22:57:30 -07:00
Lilith
c6eff0fcd9
feat(coordinator): ✨ Add Oracle-specific routing logic with documentation and test coverage
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-08 22:08:20 -07:00
Lilith
76f525fb7f
docs(docs): 📝 Update deprecation warnings, migration paths, and CLI documentation to reflect changes in GPUBOSS_INTEGRATION.md, LLAMA_SEGFAULT_FIX.md, MIGRATION.md, and CLI.md while cleaning TODO.md
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-02-28 20:04:41 -08:00
Lilith
0690309799
docs(cli): 📝 Update CLI documentation and migration guides with clearer instructions, setup steps, and paths for core-py, core-ts, and loaders-py packages
...
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-02-25 15:48:38 -08:00
Lilith
b39ce3a725
feat: model-boss architecture with extracted loaders and InferenceRouter
...
Architecture refactor per the plan:
Packages:
- core-py: Lean model-boss with GPUBoss, RAMBoss, path resolution
- core-ts: TypeScript client (@lilith/model-boss)
- loaders-py: Extracted direct model loaders (optional, for dev/testing)
New in core-py:
- InferenceRouter for service discovery and routing
- LlamaHttpClient, DiffusionHttpClient typed clients
- VRAM estimation utilities
Services use model-boss for:
- VRAM lease coordination (GPUBoss)
- Model path resolution
- Service discovery (InferenceRouter)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-17 12:39:05 -08:00