Pictalk is an open-source application designed to help individuals with speech impediments communicate effectively using pictograms and pictures
🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI.
Revamp your morning routine and supercharge productivity with Dispatch. The ultimate Apple Shortcut powered by ChatGPT and ElevenLabs.
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript
ChatTTS is a generative speech model for daily dialogue.
One minute of voice data can be enough to train a good TTS model (few-shot voice cloning).
MARS5 speech model (TTS) from CAMB.AI
A list of recommended voices for the Web Speech API
eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.
A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.
Easy voice-based accounting software with multiple languages, for lay people and for people with disabilities. Also available live at: https://linuxguist.github.io/voice-acct-local/
Companion application for Elite Dangerous
An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.
Customizable TTS Chat Bot using OpenAI & Google Cloud TTS/ElevenLabs
Talk to Rawan voice-to-voice on the web using speech recognition or text-to-speech, powered by ElevenLabs and ChatGPT.
A multi-purpose, cat-themed web app created for college students, by college students.
VITS-based Voice Conversion focused on simplicity, quality and performance.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
MSSpeechServer is a REST server based on the Microsoft Speech Platform that provides text-to-speech (TTS) functionality for Windows. This project is designed to run on the Linux x86_64 platform and supports Docker images. It provides two main APIs for reading voice libraries and generating TTS.