Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
-
Updated
Oct 5, 2022 - Python
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.
🧑🏻🎓 📑 October'20 - April'21. Group uni project. The project topic is Speech-to-Text Assessment Tool. It is a research-type project, most of the documentation is in a private GitLab repository.
This project presents Hera, an Operating System level voice recognition package that understands voice commands to perform actions to simplify the user’s workflow. We propose a modernistic way of interacting with Linux systems, where the latency of conventional physical inputs are minimized through the use of natural language speech recognition.
ASR pytorch project
FM signal capturing system and voice recognition for the assistance of individuals with hearing impairments.
This is a Machine Learning project. This model takes video of person face as input and predicts the word. It uses tensorflow and keras for training the model. It uses Sequential models for trainning and predicting. It used relu and softmax as activation functions
Multi-talker Speech Recognition, Separation and Diarization, Everything Streaming All-at-Once
The project,being part of Kagglex BIPOC Mentorship Program final project, aims to train two separate Hindi ASR models using the Facebook Wav2Vec2 (300M parameters) and OpenAI Whisper-Small models, respectively. The goal is to compare their performance, with a target WER of less than 13%, across various Hindi accents and dialects.
Download speech datasets (English and non-English) for Automatic Speech Recognition
Metric evaluator for Automatic Speech Recognition using the HATS dataset
Dialect-Pronunciation-Dictionary
194999-Uyghur-Pronunciation-Dictionary
Number Speech Dataset in Mandarin and Dialects
North American English Speech Dataset
1044-Hours-Minnan-Dialect-Speech-Data-by-Mobile-Phone
Korean Speech Dataset
Shanghai Dialect Speech Dataset
Add a description, image, and links to the asr topic page so that developers can more easily learn about it.
To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."