tinyML Asia 2021 Chanwoo Kim: A review of on-device fully neural end-to-end speech recognition...
A review of on-device fully neural end-to-end speech recognition and synthesis algorithms
Chanwoo KIM, Corporative Vice President, Samsung
In this talk, we review various end-to-end automatic speech recognition and speech synthesis algorithms and their optimization techniques for on-device applications. Conventional speech recognition systems comprise a large number of discrete components such as an acoustic model, a language model, a pronunciation model, a text-normalizer, an inverse-text normalizer, a decoder based on a Weighted Finite-State Transducer (WFST), and so on. To obtain sufficiently high speech recognition accuracy with such conventional speech recognition systems, a very large language model (up to 100 GB) is usually needed. Hence, the corresponding WFST size becomes enormous, which prohibits their on-device implementation. Recently, fully neural network end-to-end speech recognition algorithms have been proposed. Examples include speech recognition systems based on Connectionist Tempo
5 views
30
8
3 years ago 00:53:07 6
tinyML Asia 2021 Jingpeng Xiang: Soundplus
3 years ago 00:27:54 1
tinyML Asia 2021 Dongsoo Lee: Extremely low-bit quantization for Transformers
3 years ago 00:23:01 1
tinyML Asia 2021 Haochen Xie: An approach to dynamically integrate heterogenous AI components...
3 years ago 00:26:53 5
tinyML Asia 2021 Yihong Wu: Lightweight visual localization with deep learning
3 years ago 00:13:25 7
tinyML Asia 2021 Video Poster: Plant Growth and LAI Estimation using quantized Embedded Regression..
3 years ago 00:27:16 1
tinyML Asia 2021 Flora Salim: Learning compact representation with less (labelled) data from sensors
3 years ago 00:29:39 2
tinyML Asia 2021 Joshua Chang: Sensor Fusion using Machine Learning: Smart Forehead Temperature...
3 years ago 00:08:10 1
tinyML Asia 2021 Video Poster: AI Enabled Low-Cost Stethoscope
3 years ago 00:16:40 13
tinyML Asia 2021 Zou Yuanhao: TinyML Heat Image Face Recognition on Wio-Terminal
3 years ago 00:09:30 1
tinyML Asia Video Poster Neuton AI: Bringing Big Ideas into Tiny Devices Bottoms-up Approach to...
3 years ago 00:08:08 5
tinyML Asia 2021 Video Poster: Cyberon DSpotter: A phoneme-based local voice recognition solution
3 years ago 00:28:21 2
tinyML Asia 2021 Justin Kao: A lightweight face detection method working with Himax Ultra-Low...
3 years ago 00:21:24 1
tinyML Asia 2021 Anton Kroger: Airborne sound maintenance in remote sites using low power...
3 years ago 00:14:38 2
tinyML Asia 2021 Video Poster: Efficient inference of low-resolution optic flow on low power...