AI powered ytp/sentence mixing for audio and video.
-
Updated
Jun 13, 2024 - Svelte
AI powered ytp/sentence mixing for audio and video.
Protocol buffers and other common resources.
Build high-performance AI models with modular building blocks
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python SDK for working with Voicegain Speech-to-Text
Official Python SDK for Deepgram's automated speech recognition APIs.
End-to-End Speech Processing Toolkit
A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.
V.I.S.O.R., my in-development assistant
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!
Generates a continuously morphing dynamic wallpaper from real-time speech input.
A PyTorch-based Speech Toolkit
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
🗣 Discord voice-chat speech recognition
Official repository of my Master's Thesis project: "Developing an AI-Powered Voice Assistant for an iOS Payment App"
Tools for handling speech data in machine learning projects.
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."