T R A C K       P A P E R
ISSN:2394-3661 | Crossref DOI | SJIF: 5.138 | PIF: 3.854

International Journal of Engineering and Applied Sciences

(An ISO 9001:2008 Certified Online and Print Journal)

Automatic Multilingual Speech Recognition

( Volume 7 Issue 5,May 2020 ) OPEN ACCESS
Author(s):

Nguyen Tuan Anh , Tran Thi Ngoc Linh , Dang Thi Hien

Keywords:

Automatic Speech Recognition (ASR), multi-languages, Vietnamese and Chinese ASR system, LIS-Net model

Abstract:

Automatic Speech Recognition (ASR) for multi-languages is currently attracting more and more attention; however, development is still hampered by the need for language experts. End-to-End ASR simplifies their work by directly predicting the output character based on the acoustic input. This study presents the improvement of LIS-Net model for End-to-End Vietnamese and Chinese ASR system. In this study, an efficient yet accurate end-to-end multilingual multi-speaker ASR model has developed, allowing direct conversion of raw speech audio signals into text of multiple languages. This study proposes a new method of coding labels specifically for multiple languages by pagination labels by language. The results of this study are significantly improved compared to that of baseline models.

Paper Statistics:

Total View : 711 | Downloads : 702 | Page No: 41-46 |

Cite this Article:
Click here to get all Styles of Citation using DOI of the article.