* denotes equal contribution.

CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting

Sichen Jin, Youngmoon Jung, Seungjin Lee, Jaeyoung Roh, Changwoo Han, Hoonyoung Cho

In: Proceedings of INTERSPEECH 2024. [pdf]


A More Accurate Internal Language Model Score Estimation for the Hybrid Autoregressive Transducer

Kyungmin Lee*, Haeri Kim*, Sichen Jin, Jinhwan Park, Youngho Han

In: Proceedings of INTERSPEECH 2023. [pdf]


Conformer-based On-device Streaming Speech Recognition with KD Compression and Two-pass Architecture

Jinhwan Park*, Sichen Jin*, Sungsoo Kim*, Junmo Park*, Dhairya Sandhyana, Changheon Lee, Myoungji Han, Jungin Lee, Seokyeong Han, Changwoo Han, Chanwoo Kim

In: Proceedings of SLT 2022. [pdf]


Streaming on-device end-to-end ASR system for privacy-sensitive voice-typing

Abhinav Garg, Gowtham Vadisetti, Dhananjaya Gowda, Sichen Jin, Aditya Jayasimha, Youngho Han, Jiyeon Kim, Junmo Park, Kwangyoun Kim, Sooyeon Kim, Youngyoon Lee, Kyungbo Min, Chanwoo Kim

In: Proceedings of INTERSPEECH 2020. [pdf]


Attention based on-device streaming speech recognition with large speech corpus

Kwangyoun Kim*, Kyungmin Lee*, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim

In: Proceedings of ASRU 2019. [pdf]


End-to-end training of a large vocabulary end-to-end speech recognition system

Chanwoo Kim, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda

In: Proceedings of ASRU 2019. [pdf]