Streaming Speech-to-Text Web Server. Share transcript to many in realtime.
Updated Dec 26, 2022 - Python
Vietnamese speech recognition using the QuartzNet model (NVIDIA) + Flask demo
Hugging Face Audio coursework
Fine-tuning code for making DeepSpeech robust to adversarial attacks.
Myanmar (Burmese) Language Grapheme to Phoneme Converter
An ASR (Automatic Speech Recognition) system that takes individual recordings as input. Each recording is a sentence of 5-10 spoken English digits separated by adequate pauses. The system segments each sentence using a classifier that distinguishes background from foreground sounds.
FM signal capturing system and voice recognition for the assistance of individuals with hearing impairments.
North American English Speech Dataset
Shanghai Dialect Speech Dataset
The dataset of Korean conversational speech
ATC ASR; internship at Asr Gooyesh Company
An Automatic Speech Recognition System for the Kabyle language.
[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
This web app translates Arabic speech or text into Moroccan Sign Language videos, fostering communication between the hearing and Moroccan deaf communities.
Real-time Nigerian church sermons speech transcriber
This repository contains the project work done during my bachelor's degree.
Whisper Transcription Service