asr

Here are 1,026 public repositories matching this topic...

robmsmt / SpeechLoop

Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?

python speech speech-recognition speech-to-text asr speech-analysis asr-benchmark speechrecognition speech-api asr-model

Updated Oct 5, 2022
Python

BScUniversityCollaborations / automatic-speech-recognition

Star

Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.

python classifier automatic-speech-recognition asr openslr mel-spectrogram recognition-algorithms

Updated Sep 12, 2023
Python

lizunowa / project-asr-metrics

Star

🧑🏻‍🎓 📑 October'20 - April'21. Group uni project. The project topic is Speech-to-Text Assessment Tool. It is a research-type project, most of the documentation is in a private GitLab repository.

asr asr-benchmark

Updated Jun 8, 2021
Jupyter Notebook

HeyHera / Hera

Star

This project presents Hera, an Operating System level voice recognition package that understands voice commands to perform actions to simplify the user’s workflow. We propose a modernistic way of interacting with Linux systems, where the latency of conventional physical inputs are minimized through the use of natural language speech recognition.

python scikit-learn nlu spacy kivy tts asr wake-word-detection sgd-classifier vosk nix-tts

Updated Jul 12, 2022
Python

maximkm / DLA_ASR_HW

Star

ASR pytorch project

transformers pytorch lm beam-search asr asr-model bpe

Updated Oct 16, 2022
Python

SanchezCris / SDR-Automatic-Speech-Recognition

Star

FM signal capturing system and voice recognition for the assistance of individuals with hearing impairments.

python speech-recognition sdr automatic-speech-recognition speech-to-text gnuradio asr software-defined-radio wav2vec2

Updated Apr 17, 2023
Python

jevil25 / Lip-Read-ML-Model

Star

This is a Machine Learning project. This model takes video of person face as input and predicts the word. It uses tensorflow and keras for training the model. It uses Sequential models for trainning and predicting. It used relu and softmax as activation functions

machine-learning tensorflow asr

Updated Aug 8, 2023
Jupyter Notebook

anonymous-demos / Multimodal-All-In-One

Star

Multi-talker Speech Recognition, Separation and Diarization, Everything Streaming All-at-Once

multi-channel multi-modal asr microphone-array speech-se speech-re multi-talker diariz

Updated Jul 16, 2023

kingabzpro / hindiSpeechPro-Automatic-Speech-Recognization

Star

The project,being part of Kagglex BIPOC Mentorship Program final project, aims to train two separate Hindi ASR models using the Facebook Wav2Vec2 (300M parameters) and OpenAI Whisper-Small models, respectively. The goal is to compare their performance, with a target WER of less than 13%, across various Hindi accents and dialects.

transformer speech-recognition whisper asr hindi-language wav2vec2