Automatic Multilingual Speech Recognition |
( Volume 7 Issue 5,May 2020 ) OPEN ACCESS |
Author(s): |
Nguyen Tuan Anh , Tran Thi Ngoc Linh , Dang Thi Hien |
Keywords: |
Automatic Speech Recognition (ASR), multi-languages, Vietnamese and Chinese ASR system, LIS-Net model |
Abstract: |
Automatic Speech Recognition (ASR) for multi-languages is currently attracting more and more attention; however, development is still hampered by the need for language experts. End-to-End ASR simplifies their work by directly predicting the output character based on the acoustic input. This study presents the improvement of LIS-Net model for End-to-End Vietnamese and Chinese ASR system. In this study, an efficient yet accurate end-to-end multilingual multi-speaker ASR model has developed, allowing direct conversion of raw speech audio signals into text of multiple languages. This study proposes a new method of coding labels specifically for multiple languages by pagination labels by language. The results of this study are significantly improved compared to that of baseline models. |
Paper Statistics: |
Cite this Article: |
Click here to get all Styles of Citation using DOI of the article. |