Showing results for whisper Vector Vector
GitHub Repo
https://github.com/PranavGarud/Speech_Diarization
PranavGarud/Speech_Diarization
A Speech Diarization system that processes WAV files, extracts MFCC features, generates X-vectors/i-vectors, clusters speakers using AHC or Spectral Clustering, and transcribes speech using ASR models (Wav2Vec2, Whisper, DeepSpeech) while minimizing Diarization Error Rate (DER).
GitHub Repo
https://github.com/petermartens98/OpenAI-Whisper-Audio-Transcription-And-Summarization-Chatbot
petermartens98/OpenAI-Whisper-Audio-Transcription-And-Summarization-Chatbot
Web app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also intelligently chat about their transcriptions with a GPT4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.
GitHub Repo
https://github.com/peterw/JarvisBase
peterw/JarvisBase
Question-answering chatbot using OpenAI's GPT-3.5-turbo model, DeepLake for the vector database, and the Whisper API for voice transcription. The chatbot also uses Eleven Labs to generate audio responses.
GitHub Repo
https://github.com/Mouez-Yazidi/WhisperMesh
Mouez-Yazidi/WhisperMesh
WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and a sophisticated vector database. Leveraging the RAG framework from Haystack, it ensures engaging, data-driven conversations that adapt to your preferred style.
GitHub Repo
https://github.com/Ashish-Abraham/DocWhisperer-Qdrant
Ashish-Abraham/DocWhisperer-Qdrant
A Retrieval-Augmented Generation (RAG) System for PDF Chat using Qdrant Vector Database.
GitHub Repo
https://github.com/sazonovanton/SirChatalot
sazonovanton/SirChatalot
SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities, tools and semantic search in vector DB.
GitHub Repo
https://github.com/Shaunwei/RealChar
Shaunwei/RealChar
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
GitHub Repo
https://github.com/BBC-Esq/VectorDB-Plugin
BBC-Esq/VectorDB-Plugin
Program that lets you ask questions about your documents, audio, and video files.
GitHub Repo
https://github.com/Mj23978/OpenServer