speech-recognition

Here are 4,651 public repositories matching this topic...

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Updated Jun 11, 2024
Python

dictation-toolbox / dragonfly

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi, and CMU PocketSphinx.

python speech-recognition

Updated Jun 11, 2024
Python

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Jun 11, 2024
C

Detilisi / Umbrella

A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.

text-to-speech automation sqlite-database mvvm entity-framework clean-architecture speech-recognition cqrs-pattern intent-recognition communitytoolkit maui-app

Updated Jun 11, 2024
C#

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 11, 2024
Python

espnet / espnet

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Jun 11, 2024
Python

modelscope / FunClip

Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated Jun 11, 2024
Python

openvinotoolkit / openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated Jun 11, 2024
C++

TheSoftDiamond / Kazushin

Customizable TTS Chat Bot using OpenAI & Google Cloud TTS/ElevenLabs

python text-to-speech twitch ai chatbot tts speech-recognition openai speech-to-text gpt googlecloud gemini-api twitchio elevenlabs

Updated Jun 11, 2024
Python

omarx11 / chatin-v2

Talk to Rawan voice-to-voice on the web using speech recognition or text-to-speech, powered by ElevenLabs and ChatGPT.

bot website text-to-speech ai nextjs chatbot speech-recognition tailwindcss speach-to-text vercel supabase chatgpt elevenlabs

Updated Jun 11, 2024
JavaScript

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Jun 11, 2024
Python

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!