Build your own Alexa with the ESP32 and TensorFlow Lite
We’ve been building towards this project in the previous set of videos. And we’re now ready to build our very own DIY Alexa!
All the code for this project is on GitHub -
What are we building - 1:15
Wake Word Detection - 2:27
Command Recognition - 11:47
Digging into the code - 16:30
What’s life all about Marvin? - 21:52
To detect the wake words we stream audio from either an I2S microphone or from the ADC. The wake word detector looks at a 1-second window of audio. The spectrogram of the audio is calculated and fed into a TensorFlow Lite model.
Once we detect the wake word we stream the audio up to to recognise the user’s intent.
It works surprisingly well for such a small model, there are improvements that could be made with more training data.
I’ll leave the access token for live for as long as I can, but at some point, you will need to generate your own application.
Let me know how you get on in the comments!
Related Videos:
Audio Input
Audio Output
And TensorFlow Lite for machine learning
Components you could use:
MAX98357 -
TinyPico -
INMP441 I2S Microphone:
ICS-43434 I2S Microphone:
ESP32 Dev board:
Analogue Audio Amplifier:
Speakers:
---
Want to help support the channel? I’m accepting coffee on
1 view
334
85
5 days ago 00:08:10 1
AI Agents Will Create MILLIONAIRES in 2025 – Are You Ready
2 weeks ago 00:40:25 1
40MIN Unilateral Leg Strength - Lower Body Workout
2 weeks ago 00:06:07 1
PATRIARKH - WIERSZALIN IV (feat. Eliza Sacharczuk) (Official Video) | Napalm Records