You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paul Pham c219b3a2ac
Remove errant 'wav'
3 years ago
..
long Initial commit of TTS training and Berkshire Meeting 1994 sounds. 3 years ago
original Initial commit of TTS training and Berkshire Meeting 1994 sounds. 3 years ago
wavs Updated noise-cleaned wavs and stats file. Gonna try this on an AWS GPU machine. 3 years ago
README.md Initial commit of TTS training and Berkshire Meeting 1994 sounds. 3 years ago
config.json Rename converted to wavs for TTS training format. Add script to repeat metadata.csv column for ljspeech format. 3 years ago
example-config.json Rename converted to wavs for TTS training format. Add script to repeat metadata.csv column for ljspeech format. 3 years ago
example-metadata.csv Remove errant 'wav' 3 years ago
modify_config.py Rename converted to wavs for TTS training format. Add script to repeat metadata.csv column for ljspeech format. 3 years ago
modify_metadata.py Updated noise-cleaned wavs and stats file. Gonna try this on an AWS GPU machine. 3 years ago
noise_cleanup.py Updated noise-cleaned wavs and stats file. Gonna try this on an AWS GPU machine. 3 years ago
requirements.txt Updated noise-cleaned wavs and stats file. Gonna try this on an AWS GPU machine. 3 years ago
scale_stats.npy Updated noise-cleaned wavs and stats file. Gonna try this on an AWS GPU machine. 3 years ago
train.py Updated noise-cleaned wavs and stats file. Gonna try this on an AWS GPU machine. 3 years ago

README.md

Index of Berkshire Annual Meeting Transcripts

From Warren Buffett archive at CNBC.

HOWTO To Create Training Set of GOAT Warren Buffett for Mozilla TTS

Following https://stackoverflow.com/a/66307612

Use the LiveRecorder Firefox plugin to record from Youtube.

Convert from webm to audio-only ogg.

ffmpeg -i berkshire-1994.webm -vn -acodec copy ./berkshire-1994-00:00_10:00.ogg

Split by silence. This automated splitting of files is the killer app for command-line audio tools, imo.

sox ./berkshire-1994-00:00_10:00.ogg berkshire-1994-00:00_10:00-.wav silence 1 0.2 0.5% 1 0.2 0.5% : newfile : restart

Play with trimming (start and duration) then when you narrow down the clip, you can use sox to trim. This is where GUIs like Audacity are still better.

play ./berkshire-1994-00:00_10:00-149.wav trim 0:00 0:02.5
sox ./berkshire-1994-00:00_10:00-149.wav ./berkshire-1994-00:00_10:00-149a.wav trim 0:00 0:02.5

https://stackoverflow.com/questions/9667081/how-do-you-trim-the-audio-files-end-using-sox

https://unix.stackexchange.com/questions/381890/play-audio-file-from-a-certain-time-step-in-terminal