Whisper was trained on 680,000 hours of audio from the web, with the amount of audio varying widely between languages.