Bo-Wen Chen is a skilled machine learning researcher with a focus on digital speech processing. Holding a Master’s degree from the Graduate Institute of Communication Engineering at National Taiwan University, he possesses a strong academic foundation under the guidance of Prof. Hung-Yi Lee. Bo-Wen’s expertise lies in exploring novel topics within speech processing and acoustic modeling using deep learning techniques. One of his notable contributions is the development of a duplex speech chain model that facilitates simultaneous Text-to-Speech and Automatic Speech Recognition.
M.S. in Graduate Institute of Communication Engineering, 2018-2022
National Taiwan University
B.S. in Electrical Engineering, 2014-2018
National Taiwan University
Fairseq, Pytorch, Tensorflow
Kaldi
Text-to-Speech (TTS),
Automatic Speech Recognition (ASR)
See Publications for more details.
Supervised by Prof. Hung-Yi Lee
The paper investigates reversible neural network layers for constructing a duplex speech chain model that utilizes bidirectional supervision signals. It introduces a novel approach to address speech synthesis and recognition challenges while analyzing the impact of bidirectional supervision on performance.
Randomly weighted neural networks have potential in computer vision and demonstrate a positive correlation in performance with fully trained models. This paper confirms the continued positive correlation and explores the value of randomly weighted networks in audio source separation, a challenging generative audio task.