I’m a speech researcher at Samsung Research, focusing on a broad range of areas within speech recognition and spoken keyword spotting. My work includes:

In addition to my research in speech processing, I am generally interested in the interpretability of neural networks. I believe that understanding the inner mechanisms of current award-winning large models will help us gain controllability and design more computationally efficient models. On that note, I am interested in the following research topics.

  • Speech foundation models: What can we learn from the speech represenations of each layers of the speech foundation models? What is the best way to utilize the phonetic and synthetic information for downstream tasks?
  • Interpretability of large models: How are the emergent abilities such as reasoning presented in neural networks? How can we extract the circuits and make them more explicit?
  • Controllability and Efficiency: How can we apply our understandings about the neural models to make them more controllable and efficient?

Prior to Samsung Research, I received B.S. in CSE from Seoul National University.


News!


:loudspeaker: (09/24) I will be presenting my paper on spoken keyword detection at INTERSPEECH 2024!