A machine learning project that translates voice from one language to another in real-time while preserving the tone and emotion of the speaker, and outputs the result in MP3 format.
Python3, SpeechRecognition, pyaudio, google-trans-new, gTTS, playsound, deep-translator
-
Clone this project and create virtualenv (recommended) and activate virtualenv.
# Create virtualenv virtualenv -p python3 env # Linux/MacOS source env/bin/activate # Windows env\Scripts\activate
-
Install require dependencies.
pip install -r requirements.txt
-
Run code and speech (have fun).
python main.py
- Google Translate API error 404: This error occurs when the Google Translate API is not available or the request is not valid. To fix this, we will use the
deep_translator
library instead ofgoogle_trans_new
. - Speech recognition does not work for continuous input: The
speech_recognition.Recognizer().listen()
method does not work for continuous input data. To fix this, we will use therecord()
method with a timer.
Update the record()
timer to listen until the speaker stops speaking.