Real-time microphone noise suppression on Linux.
-
Updated
Apr 28, 2024 - Go
Real-time microphone noise suppression on Linux.
Automagically synchronize subtitles with video.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Python AI assistant 🧠
A python package to build AI-powered real-time audio applications
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
An audio/acoustic activity detection and audio segmentation tool
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Voice Activity Detection based on Deep Learning & TensorFlow
Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
Auto transcribe tool based on whisper
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Add a description, image, and links to the voice-activity-detection topic page so that developers can more easily learn about it.
To associate your repository with the voice-activity-detection topic, visit your repo's landing page and select "manage topics."