Build a Voice RAG with Deepseek, LangChain and Streamlit

In this video, we will build a Voice RAG (Retrieval-Augmented Generation) system using Deepseek, LangChain, and Streamlit to chat with audio files and answer complex questions about them. I will guide you step by step in setting up Ollama’s Deepseek R1 LLM model, transcribing audio using OpenAI’s Whisper, integrating everything with a LangChain-powered RAG, and then showing you how to use a simple Streamlit interface so you can query your audio files in real time. If you’re curious about the Deepseek model or how to build RAGs that even support audio files, this tutorial is for you.

You can watch it here: https://youtu.be/HT4a6A_wXdA

2 Likes

This topic was automatically closed 180 days after the last reply. New replies are no longer allowed.