Please use this identifier to cite or link to this item: https://dspace.ctu.edu.vn/jspui/handle/123456789/10457
Title: Development of high-performance and large-scale Vietnamese automatic speech recognition systems
Authors: Do, Quoc Truong
Pham, Ngoc Phuong
Tran, Hoang Tung
Luong, Chi Mai
Keywords: ASR
Automatic speech recognition
Vietnamese corpora
Vietnamese speech recognition
Year of publication: 2018
Series/Report no.: Journal of Computer Science and Cybernetics; Vol. 34(04), pp. 335–348
Abstract: Automatic Speech Recognition (ASR) systems automatically convert human speech into the corresponding transcription. They have a wide range of applications, such as robot control, call center analytics, and voice chatbots. Recent studies on ASR for English have reported performance that surpasses human ability. Those systems were trained on large amounts of data and performed well in many environments. For Vietnamese, there have been many studies on improving the performance of existing ASR systems; however, many of them were conducted on small-scale data, which does not reflect realistic scenarios. Although the corpora used to train those systems were carefully designed to maintain phonetic balance, efforts to collect them at a large scale are still limited. Specifically, only one particular Vietnamese accent was evaluated in existing works. In this paper, we first describe our efforts in collecting a large data set that covers all three major accents of Vietnam, from the Northern, Central, and Southern regions. Then, we detail our ASR system development procedure, which uses the collected data set and evaluates different model architectures to find the best structure for Vietnamese. In the VLSP 2018 challenge, our system achieved the best performance with 6.5% WER, and on our internal test set of more than 10 hours of speech collected in real environments, the system also performed well with 11% WER.
Identifier: http://dspace.ctu.edu.vn/jspui/handle/123456789/10457
ISSN: 1813-9663
Collection: Tin học và Điều khiển học (Journal of Computer Science and Cybernetics)

Files in this item:
File    Description    Size    Format
_file_                 5 MB    Adobe PDF

Use of materials in the Digital Library must comply with copyright law.