AI Voice Assistant App with Llama3, RAG, STT and TTS for Real Time Voice Chat with Documents

Author: Sandip's Technology Channel
Published At: 2025-01-30T00:00:00
Length: 20:14

Description

In this project, a voice assistant application ("Evi") is built with Streamlit in Python, using LangChain, RAG (retrieval-augmented generation), the Llama 3.3 model (an open-source LLM), pyttsx3 (a text-to-speech library), speech_recognition (a speech-to-text library), Hugging Face Instruct Embeddings, Chroma (an open-source vector database), PyPDFLoader, and related tools. After launching the Streamlit app, the user only needs to upload one PDF document and click the "Start Voice Chat" button. Evi then starts chatting with the user in real time about the document content. The user can ask any number of questions about the document, and Evi answers them one after another.

GitHub Link: https://github.com/dharsandip/Voice_assistant_llm_rag_tts_stt_app
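The flow described above (split the document, retrieve the relevant passages for each spoken question, generate an answer, speak it) can be sketched roughly as follows. This is a minimal illustration and not the code from the linked repository: every function name here is hypothetical, a simple bag-of-words similarity stands in for the Hugging Face embeddings and the Chroma vector store, and the speech-to-text, LLM, and text-to-speech steps are passed in as plain callables so the loop can run without a microphone or a model.

```python
import math
import re
from collections import Counter
from typing import Callable, List, Optional


def split_into_chunks(text: str, chunk_size: int = 40) -> List[str]:
    """Split document text into chunks of at most chunk_size words."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]


def bag_of_words(text: str) -> Counter:
    """Toy stand-in for an embedding model (assumption): lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(question: str, chunks: List[str], k: int = 2) -> List[str]:
    """Return the k chunks most similar to the question (the vector-store role)."""
    q = bag_of_words(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, bag_of_words(c)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, context_chunks: List[str]) -> str:
    """Combine the retrieved context and the question into one LLM prompt."""
    context = "\n".join(context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


def voice_chat(
    chunks: List[str],
    listen: Callable[[], Optional[str]],   # speech-to-text step (hypothetical hook)
    generate: Callable[[str], str],        # LLM call (hypothetical hook)
    speak: Callable[[str], None],          # text-to-speech step (hypothetical hook)
    stop_word: str = "goodbye",
) -> int:
    """Answer spoken questions one after another; return the number of answered turns."""
    turns = 0
    while True:
        question = listen()
        # Stop when nothing was heard or the user says the stop word.
        if question is None or stop_word in question.lower():
            break
        prompt = build_prompt(question, retrieve(question, chunks))
        speak(generate(prompt))
        turns += 1
    return turns
```

In an app like the one described, `listen` would presumably wrap speech_recognition, `speak` would wrap pyttsx3, and `generate` would call the Llama 3.3 model, while the chunks would come from the PDF loaded with PyPDFLoader. Keeping those three steps as injected callables is a design choice of this sketch, made so that the chat loop can be exercised with scripted questions.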

LinkedIn: https://www.linkedin.com/in/sandip-dhar-40145546/

#voiceassistant, #llama3, #aiapplication, #aivoiceassistant, #rag, #groq, #speechtotext, #python, #streamlitlibrary

Translated At: 2025-04-01T07:10:17Z
