
Automatic Multilingual Speech Recognition
(Volume 7, Issue 5, May 2020) OPEN ACCESS
Author(s):
Nguyen Tuan Anh, Tran Thi Ngoc Linh, Dang Thi Hien
Keywords:
Automatic Speech Recognition (ASR), multi-languages, Vietnamese and Chinese ASR system, LIS-Net model
Abstract:
Automatic Speech Recognition (ASR) for multiple languages is attracting growing attention; however, development is still hampered by the need for language experts. End-to-end ASR simplifies this work by predicting output characters directly from the acoustic input. This study presents an improved LIS-Net model for an end-to-end Vietnamese and Chinese ASR system. An efficient yet accurate end-to-end multilingual, multi-speaker ASR model has been developed that converts raw speech audio directly into text in multiple languages. The study also proposes a new label-coding method designed for multiple languages, in which the labels are paginated by language. The results are significantly better than those of the baseline models.
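The abstract does not spell out how labels are paginated by language. One possible reading, sketched below purely as an illustration, is that each language receives its own contiguous block ("page") of label IDs inside a single shared output vocabulary, so that a predicted ID identifies both the character and its language. The names `build_pages`, `encode`, `decode`, and `BLANK_ID`, as well as the choice to reserve ID 0 for a blank symbol, are assumptions and are not taken from the paper.

```python
from typing import Dict, List, Tuple

# Assumption: ID 0 is reserved for a blank/padding symbol, as is common in
# CTC-style training. The paper's actual convention is not stated.
BLANK_ID = 0


def build_pages(charsets: Dict[str, str]) -> Dict[str, Dict[str, int]]:
    """Give each language a contiguous, non-overlapping block of label IDs."""
    pages: Dict[str, Dict[str, int]] = {}
    offset = BLANK_ID + 1
    for lang in sorted(charsets):            # fixed order keeps IDs reproducible
        chars = sorted(set(charsets[lang]))  # unique characters of this language
        pages[lang] = {ch: offset + i for i, ch in enumerate(chars)}
        offset += len(chars)                 # the next language starts a new page
    return pages


def encode(text: str, lang: str, pages: Dict[str, Dict[str, int]]) -> List[int]:
    """Map a transcript to label IDs using the page of its language."""
    return [pages[lang][ch] for ch in text]


def decode(ids: List[int], pages: Dict[str, Dict[str, int]]) -> List[Tuple[str, str]]:
    """Map label IDs back to (language, character) pairs, skipping blanks."""
    inverse = {
        idx: (lang, ch)
        for lang, table in pages.items()
        for ch, idx in table.items()
    }
    return [inverse[i] for i in ids if i != BLANK_ID]


if __name__ == "__main__":
    # Toy character sets; a real system would derive them from training transcripts.
    pages = build_pages({"vi": "anh", "zh": "你好"})
    print(encode("anh", "vi", pages))
    print(decode(encode("你好", "zh", pages), pages))
```

Under this reading, a character shared by two languages would receive a different ID in each page, which keeps the per-language label sets disjoint at the cost of a larger output layer.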