A security-first lifecycle for every AI model you download — scan it for malware, size-gate it before it touches your disk, bench it under a memory-pressure throttle that physically cannot let it freeze your machine, and auto-offload the weights if it loses. Every model in your local pool is safe, tested, and earns its disk space. Includes a printable setup PDF.
Check your evaluation history for prior verdicts. Verify disk headroom. Confirm no duplicate already installed.
Queries model size before downloading. Blocks models that exceed your machine's RAM.
ollama pull or huggingface-cli download — Gemini, Kimi, Qwen, Llama, Mistral, and others.
6 virus checks run automatically. Non-zero exit = stop.
Speed bench (3-run median tok/s) + 3 quality prompts per task type. RAM-intensive — run when you're not actively using the computer; the throttle watchdog pauses or aborts the bench under memory pressure so it can't freeze your machine.
DEPLOY: wire into router. DECLINE: auto-offload weights. WATCH: re-evaluate in 30 days.
Log the verdict — including bytes reclaimed by offload — so future evaluations skip re-testing the same model.
Reclaim disk on DECLINE / BLOCKED-too-large. The verdict persists, the weights don't — you never pay disk for a model you've already declined.