Desktop · Live · 2026 · Own product

AI Knowledge Hub.

I built Knowledge Hub for clients with sensitive documents who couldn't pipe them through OpenAI. It offers RAG over your local files, streaming chat, and an Ollama model manager, all running 100% on your machine. It now lives inside EnterpriseCore as a first-class module.

Role: Sole engineer
Stack: FastAPI · React · Ollama · SQLite
Status: Beta · ships inside EnterpriseCore
Cloud: Zero (fully local)
AI Knowledge Hub local RAG interface

The problem.

Cloud-hosted RAG is a privacy and lock-in problem for any business handling internal documents, client work, or regulated data. Off-the-shelf "chat with your docs" tools push everything to a vendor's servers. I wanted the same UX, but with zero data egress.

What I built.

How I shipped it.

  1. SQLite-vec for vectors. No external vector DB. The embeddings live in the same SQLite file as the rest of the app.
  2. Ollama via subprocess. Inference goes through Ollama's local HTTP server; the desktop app spawns and manages it.
  3. Streaming end-to-end. FastAPI streams tokens via SSE; React renders them as they arrive.
  4. Citations as a primary feature. The retrieval layer threads source metadata through to the response, so citations are built in rather than bolted on.
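
To make the four points above concrete, here is a minimal sketch of the data flow, not the shipped code. It keeps embeddings in the same SQLite file as the chunk text, carries source metadata through retrieval, and emits tokens as SSE frames followed by a citations frame. Several things are assumptions for illustration: the `chunks` table and its columns, every function name, and the toy three-dimensional vectors. To stay standard-library-only, the sketch swaps sqlite-vec for a brute-force cosine scan in Python; a real build would let sqlite-vec do the search and would get embeddings and tokens from Ollama.

```python
import json
import math
import sqlite3
import struct
from typing import Iterator


def pack(vec: list[float]) -> bytes:
    """Serialize a vector as little-endian float32 so it fits in a BLOB column."""
    return struct.pack(f"<{len(vec)}f", *vec)


def unpack(blob: bytes) -> list[float]:
    """Inverse of pack(): 4 bytes per float32 value."""
    return list(struct.unpack(f"<{len(blob) // 4}f", blob))


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """One SQLite file holds text, source metadata, and embeddings (hypothetical schema)."""
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS chunks ("
        "id INTEGER PRIMARY KEY, source TEXT NOT NULL, page INTEGER, "
        "text TEXT NOT NULL, embedding BLOB NOT NULL)"
    )
    return db


def add_chunk(db: sqlite3.Connection, source: str, page: int,
              text: str, embedding: list[float]) -> None:
    db.execute(
        "INSERT INTO chunks (source, page, text, embedding) VALUES (?, ?, ?, ?)",
        (source, page, text, pack(embedding)),
    )


def retrieve(db: sqlite3.Connection, query_vec: list[float], k: int = 2) -> list[dict]:
    """Brute-force top-k by cosine similarity (stand-in for a sqlite-vec query).

    Each hit keeps its source and page, so citations travel with the text.
    """
    rows = db.execute("SELECT source, page, text, embedding FROM chunks").fetchall()
    scored = [
        {"source": s, "page": p, "text": t, "score": cosine(query_vec, unpack(e))}
        for s, p, t, e in rows
    ]
    return sorted(scored, key=lambda r: r["score"], reverse=True)[:k]


def sse_events(tokens: Iterator[str], hits: list[dict]) -> Iterator[str]:
    """Format tokens as SSE frames, then send the citations as a final frame."""
    for tok in tokens:
        # json.dumps escapes newlines, so each token stays on one data line.
        yield f"event: token\ndata: {json.dumps(tok)}\n\n"
    citations = [{"source": h["source"], "page": h["page"]} for h in hits]
    yield f"event: citations\ndata: {json.dumps(citations)}\n\n"


# Toy usage with made-up documents and hand-written vectors.
db = open_store()
add_chunk(db, "handbook.pdf", 3, "Refunds are issued within 14 days.", [1.0, 0.0, 0.0])
add_chunk(db, "contract.docx", 1, "Either party may terminate with notice.", [0.0, 1.0, 0.0])
hits = retrieve(db, [0.9, 0.1, 0.0], k=1)
for frame in sse_events(iter(["Refunds ", "take ", "14 days."]), hits):
    print(frame, end="")
```

In a FastAPI route, a generator like `sse_events` could be handed to `StreamingResponse` with `media_type="text/event-stream"`; on the React side, the client would append each `token` event to the message and render the `citations` event as source links.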

Chat with your docs. Your docs never leave.

— the entire pitch in eight words

Like what you read? I can ship this for you.

Send a one-line scope and I'll quote within 24h. Three engagement shapes — fixed-price MVP, embeddable widget, or maintenance retainer.