English male text-to-speech model trained on the multi-dataset dataset at 22050 Hz and is available to synthesize the English language.
Model Description
This English male text-to-speech model is trained on the multi-dataset dataset at 22050 Hz and is available to synthesize the English language. The model is based on the tortoise-v2 encoder.
pip install tts
tts --text "Hello, world!" --model_name tts_models/en/multi-dataset/tortoise-v2
Voice Samples
default (M)
English
English is a West Germanic language that originated in England and is now one of the most widely spoken languages in the world. It belongs to the Indo-European language family and is closely related to German and Dutch. English has a diverse vocabulary and is known for its global influence as a lingua franca. It uses the Latin alphabet with modifications, including the addition of letters such as ð and þ in Old English. English features a complex phonetic system with a wide range of vowel and consonant sounds.
Multi-Dataset
The multi-dataset refers to a combination or fusion of multiple speech datasets. It is often created by merging different datasets to increase the diversity and size of the training data for speech-related tasks.
Tortoise V2
TorToiSe is a text-to-speech (TTS) program which can mimic voices given 2-4 examples. It is composed of five separately-trained neural networks that are pipelined together to produce the final output.
Follow AI Models on Google News
An easy & free way to support AI Models is to follow our google news feed! More followers will help us reach a wider audience!
Google News: AI Models