Chat With Your Documents Locally Using Karpathy's LLM Wiki

Nariman · May 2, 2026, 7:11pm

In this video, we build an agent that chats with our documents without any RAG. Instead, it uses Andrej Karpathy’s idea of an LLM wiki and runs entirely on local tools. This can be a strong alternative to RAG, where the LLM often has to rediscover knowledge from scratch on every question. The idea here is different: instead of retrieving from raw documents at query time, the LLM works from a knowledge base that has already been organized and made searchable.

We use Ollama’s gemma4 model as the LLM, LangChain to create our agent and provide it with tools and memory, Streamlit to create a chat UI, and Obsidian to view the generated markdown documents.
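To make the idea concrete, here is a minimal sketch of the "search the wiki instead of the raw documents" step, using only the Python standard library. It is not the code from the video: the function names `write_page` and `search_wiki`, the one-file-per-page layout, and the simple word-count ranking are all assumptions for illustration. In the real app the LLM would write the pages, and a function like `search_wiki` would be handed to the agent as a tool.

```python
import re
import tempfile
from pathlib import Path


def write_page(wiki_dir: Path, title: str, body: str) -> Path:
    """Store one wiki page as a markdown file (hypothetical layout)."""
    wiki_dir.mkdir(parents=True, exist_ok=True)
    # Turn the title into a file-friendly slug, e.g. "Refund Policy" -> "refund-policy".
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    path = wiki_dir / f"{slug}.md"
    path.write_text(f"# {title}\n\n{body}\n", encoding="utf-8")
    return path


def search_wiki(wiki_dir: Path, query: str, top_k: int = 3) -> list[tuple[str, int]]:
    """Rank pages by how many of their words appear in the query (assumed scoring)."""
    terms = set(re.findall(r"[a-z0-9]+", query.lower()))
    scored = []
    for path in sorted(wiki_dir.glob("*.md")):
        words = re.findall(r"[a-z0-9]+", path.read_text(encoding="utf-8").lower())
        score = sum(1 for word in words if word in terms)
        if score > 0:
            scored.append((path.name, score))
    # Highest score first; ties are broken by file name for stable output.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:top_k]


def demo() -> list[tuple[str, int]]:
    """Build a tiny two-page wiki in a temp folder and query it."""
    with tempfile.TemporaryDirectory() as tmp:
        wiki = Path(tmp) / "wiki"
        # In the real workflow the LLM would write these pages from the source documents.
        write_page(wiki, "Refund Policy", "Refunds are issued within 14 days of purchase.")
        write_page(wiki, "Shipping Times", "Orders ship within 2 business days.")
        return search_wiki(wiki, "refund days")


if __name__ == "__main__":
    for name, score in demo():
        print(name, score)
```

Because the pages are plain markdown files in a folder, the same folder can be opened as an Obsidian vault to browse what was generated.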

You can watch it here: https://youtu.be/4D8FjzJXJd4
