speech-recognition

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jun 13, 2024
Python

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 13, 2024
Python

piaseckijulian / Sentinel

Sponsor

Star

🚀AI Voice Chatbot

ai sentinel speech-recognition

Updated Jun 13, 2024
Python

openvinotoolkit / openvino

Star

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated Jun 13, 2024
C++

huggingface / transformers

Star

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Updated Jun 13, 2024
Python

voicegain / python-sdk

Star

Python SDK for working with Voicegain Speech-to-Text

python sdk speech-recognition speech-to-text sdk-python

Updated Jun 12, 2024
Python

deepgram / deepgram-python-sdk

Star

Official Python SDK for Deepgram's automated speech recognition APIs.

python speech-recognition hacktoberfest asr deepgram automated-speech-recognition

Updated Jun 12, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Jun 12, 2024
Python

Detilisi / Umbrella

Star

A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.

text-to-speech automation sqlite-database mvvm entity-framework clean-architecture speech-recognition cqrs-pattern intent-recognition communitytoolkit maui-app

Updated Jun 12, 2024
C#

Edw590 / VISOR---A-Virtual-Assistant

Star

V.I.S.O.R., my in-development assistant

ai artificial-intelligence assistant voice-recognition speech-recognition personal-assistant jarvis virtual-assistant jarvis-ai ai-assistant jarvis-assistant jarvis-ready llama3 jarvis-llm

Updated Jun 12, 2024
Go

I5UCC / VRCTextboxSTT

Star

A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!