The Spark and the Spine: Architecture and Why Local
Core stack, multi-LLM router, persona system with LoRA lineage, and how MCP turned reasoning into real work.
A practical, clean-room walkthrough of a local-first, multi-persona AI platform: architecture, image/music pipelines via ComfyUI, MCP “thinking-to-doing” bridges, and the patterns that actually held up in production.
Download the Architecture Field Guide (PDF)Core stack, multi-LLM router, persona system with LoRA lineage, and how MCP turned reasoning into real work.
Why Lesson Creator fell short and how E‑book Creator improved quality with persona iteration and contracts.
Chatterbox voice cloning, “Virtual Me,” and the pragmatic realities of validation vs. automation.
Network/firewall issues, model management, quality gates with Vision LLM, and dynamic LoRA loading.
FlashAttention2 to SDPA, PyTorch/CUDA pinning, xcodec checkpoints, and the 13-second breakthrough.
Incremental dev plans, runtime tests, and the roadmap to SOTA with reflection and test‑first strategies.
Reusable patterns for long-running workflows, artifacts, retries, and sleep cycles.
Personal AI at the edge, specialized personas, and the case for democratized AI capabilities.
Mneme is a production AI system built by Dave Wheeler (aka Weller Davis). It features a multi‑persona architecture with LoRA fine‑tuning, an MCP tool layer for real‑world actions, ComfyUI pipelines for image/music, and a family of autonomous creator modules.