PPLPPL
... /loading ...
INITIALIZING_
SYS / BOOT
A fully offline AI assistant that reads private documents and answers questions with context — no API keys, no cloud. Built with Ollama, ChromaDB, and FastAPI.
A fully offline, privacy-first AI assistant built on a local RAG (Retrieval-Augmented Generation) pipeline. It ingests private documents — PDFs, web pages, plain text — and answers natural language questions with source context. No API keys, no cloud, no data leaving the machine.
Most AI assistants send your data to cloud APIs. For sensitive documents — contracts, internal specs, personal notes — this is a non-starter. This project proves you can run a capable QA system entirely on local hardware.
Documents (PDF/Web/Text)
→ Parser (pypdf / BeautifulSoup)
→ Embedder (sentence-transformers)
→ Vector Store (ChromaDB)
↓
User Query → Similarity Search → Context Chunks
→ Ollama (local LLM: llama3 / mistral)
→ Answer with cited sources
| Layer | Technology |
|---|---|
| LLM Runtime | Ollama (llama3, mistral) |
| Embeddings | sentence-transformers (all-MiniLM) |
| Vector DB | ChromaDB (persistent local store) |
| API Layer | FastAPI |
| GUI | PyQt6 desktop app, Textual TUI |
| Parsers | pypdf, BeautifulSoup |
ChromaDB's persistent mode writes embeddings to disk, so re-ingesting the same document is idempotent — it deduplicates by content hash. This made incremental document updates trivial.