Speech command recognition using self-supervised asr in Tamil

Tharmakulasingham, I; Thayasivam, U

UoM IR
→
Research Publications
→
Conference Proceedings
→
UoM Conferences
→
Faculty of Engineering Research Unit (ERU & MERCon)
→
MERCon - 2023
→
View Item

dc.contributor.author	Tharmakulasingham, I
dc.contributor.author	Thayasivam, U
dc.contributor.editor	Abeysooriya, R
dc.contributor.editor	Adikariwattage, V
dc.contributor.editor	Hemachandra, K
dc.date.accessioned	2024-03-21T08:00:06Z
dc.date.available	2024-03-21T08:00:06Z
dc.date.issued	2023-12-09
dc.identifier.citation	I. Tharmakulasingham and U. Thayasivam, "Speech Command Recognition Using Self-Supervised ASR in Tamil," 2023 Moratuwa Engineering Research Conference (MERCon), Moratuwa, Sri Lanka, 2023, pp. 137-142, doi: 10.1109/MERCon60487.2023.10355402.	en_US
dc.identifier.uri	http://dl.lib.uom.lk/handle/123/22361
dc.description.abstract	ASR technology has made significant advancements, approaching performance levels comparable to humans. However, developing ASR models for all languages, especially Low Resource Languages (LRLs), presents challenges due to limited resources. Recent studies on intent classification in LRLs employ transfer learning techniques, leveraging state-of-the-art English ASR models to improve performance in these domains. Additionally, the emerging trend of self-supervised learning has proven advantageous in developing sophisticated ASR models, requiring less labeled data to achieve high performance. In our research, we utilized a self-supervised ASR model to classify intent in an LRL, specifically the Tamil language. We compared two methods for the same ASR framework: a fine-tuning approach and a transfer learning approach with a limited amount of labeled data in Tamil. Our findings indicate that the fine-tuning method outperforms the transfer learning technique. Moreover, the model exhibited a noteworthy increase in accuracy compared to the established phoneme-based speech intent classification methodology in Tamil. This study represents a significant step forward in enhancing speech recognition capabilities for LRLs.	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.relation.uri	https://ieeexplore.ieee.org/document/10355402	en_US
dc.subject	ASR model	en_US
dc.subject	Speech intent classification	en_US
dc.subject	Low- resource-language	en_US
dc.subject	Wav2Vec 2.0	en_US
dc.subject	Tamil	en_US
dc.title	Speech command recognition using self-supervised asr in Tamil	en_US
dc.type	Conference-Full-text	en_US
dc.identifier.faculty	Engineering	en_US
dc.identifier.department	Engineering Research Unit, University of Moratuwa	en_US
dc.identifier.year	2023	en_US
dc.identifier.conference	Moratuwa Engineering Research Conference 2023	en_US
dc.identifier.place	Katubedda	en_US
dc.identifier.pgnos	pp. 137-142	en_US
dc.identifier.proceeding	Proceedings of Moratuwa Engineering Research Conference 2023	en_US
dc.identifier.email	[email protected]	en_US
dc.identifier.email	[email protected]	en_US