The NO_TRAIN_NO_GAIN system for O-COCOSDA and VLSP 2022 - A-MSV shared task: ASIAN multilingual speaker verification

Nguyen, Ngoc Dung; Ly, Nhat Nam; Le, Trong Khanh

Please use this identifier to cite or link to this item: https://dspace.ctu.edu.vn/jspui/handle/123456789/117419

Full metadata record

DC Field	Value	Language
dc.contributor.author	Nguyen, Ngoc Dung	-
dc.contributor.author	Ly, Nhat Nam	-
dc.contributor.author	Le, Trong Khanh	-
dc.date.accessioned	2025-06-23T07:26:40Z	-
dc.date.available	2025-06-23T07:26:40Z	-
dc.date.issued	2024	-
dc.identifier.issn	1813-9663	-
dc.identifier.uri	https://dspace.ctu.edu.vn/jspui/handle/123456789/117419	-
dc.description.abstract	This paper proposes a semi-supervised multilingual speaker verification (MSV) system submitted for the 2 tasks, MSV for the Asian language inside the training set (T01) and outside the training set (T02) in O-COCOSDA and VLSP challenge 2022. To solve the problem, our strategy is training a baseline acoustic model with given labeled data (MSV CommonVoice) and fine-tuning the trained acoustic model with both given labeled data and given unlabeled data (MSV Youtube). To achieve the fine-tuning step, the unlabeled data is converted to labeled data by pseudo labeling technique using the clustering method with the embedding vectors extracted from the trained acoustic model. Besides, we also apply test-time augmentation, back-end scoring, and score normalization with the AS-Norm technique to improve the result. When evaluated on the VLSP 2022 challenge's given test set, our best system with baseline ECAPA-TDNN achieves an equal error rate (EER) of 2.296% in T01 and 3.3296% in T02, which ranks second rank in both two tasks.	vi_VN
dc.language.iso	en	vi_VN
dc.relation.ispartofseries	Tạp chí Tin học và Điều khiển học (Journal of Computer Science and Cybernetics);Vol.40, No.01 .- P.67-77	-
dc.subject	Speaker verification	vi_VN
dc.subject	ECAPA-TDNN	vi_VN
dc.subject	GMM	vi_VN
dc.subject	Fine-tuning	vi_VN
dc.subject	Score normalization	vi_VN
dc.title	The NO_TRAIN_NO_GAIN system for O-COCOSDA and VLSP 2022 - A-MSV shared task: ASIAN multilingual speaker verification	vi_VN
dc.type	Article	vi_VN
Appears in Collections:	Tin học và Điều khiển học (Journal of Computer Science and Cybernetics)

Files in This Item:

File	Description	Size	Format
_file_ Restricted Access		860.62 kB	Adobe PDF
Your IP: 216.73.216.255

Show simple item record

LRC Digital repo

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets