= Session =

Beats, Rhymes, and Neural Nets

The Web Speech API has been the undisputed champion of in-browser speech recognition since its release in 2012. Backed by Google APIs, it has allowed developers to achieve near-real-time recognition, transcription, and text-to-speech synthesis for more than a decade.

In recent years, a venture-backed contender has appeared, threatening to take the crown away from our beloved browser APIs. Whisper promises to be the most accurate, most tolerant, and quickest way to transcribe speech. But does this hold true when it's applied to the humble web browser?

In this battle of the ages, Tony sets out to pit Whisper against the browser in search of the best option for helping write rap lyrics. Following an introduction to the two tech stacks, Tony will iterate through a series of tests to see which one is best suited to the world of Hip-Hop, leading to a rap crescendo you will never forget.

Session Info

Talk
September 18th, 2025
21:00
Base42
35 minutes
web_platform speech_recognition ai