dc.contributor.author |
Tharmakulasingham, I |
|
dc.contributor.author |
Thayasivam, U |
|
dc.contributor.editor |
Abeysooriya, R |
|
dc.contributor.editor |
Adikariwattage, V |
|
dc.contributor.editor |
Hemachandra, K |
|
dc.date.accessioned |
2024-03-21T08:00:06Z |
|
dc.date.available |
2024-03-21T08:00:06Z |
|
dc.date.issued |
2023-12-09 |
|
dc.identifier.citation |
I. Tharmakulasingham and U. Thayasivam, "Speech Command Recognition Using Self-Supervised ASR in Tamil," 2023 Moratuwa Engineering Research Conference (MERCon), Moratuwa, Sri Lanka, 2023, pp. 137-142, doi: 10.1109/MERCon60487.2023.10355402. |
en_US |
dc.identifier.uri |
http://dl.lib.uom.lk/handle/123/22361 |
|
dc.description.abstract |
ASR technology has made significant advancements,
approaching performance levels comparable to humans. However,
developing ASR models for all languages, especially Low
Resource Languages (LRLs), presents challenges due to limited
resources. Recent studies on intent classification in LRLs employ
transfer learning techniques, leveraging state-of-the-art English
ASR models to improve performance in these domains. Additionally,
the emerging trend of self-supervised learning has proven
advantageous in developing sophisticated ASR models, requiring
less labeled data to achieve high performance. In our research, we
utilized a self-supervised ASR model to classify intent in an LRL,
specifically the Tamil language. We compared two methods for
the same ASR framework: a fine-tuning approach and a transfer
learning approach with a limited amount of labeled data in Tamil.
Our findings indicate that the fine-tuning method outperforms
the transfer learning technique. Moreover, the model exhibited
a noteworthy increase in accuracy compared to the established
phoneme-based speech intent classification methodology in Tamil.
This study represents a significant step forward in enhancing
speech recognition capabilities for LRLs. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
IEEE |
en_US |
dc.relation.uri |
https://ieeexplore.ieee.org/document/10355402 |
en_US |
dc.subject |
ASR model |
en_US |
dc.subject |
Speech intent classification |
en_US |
dc.subject |
Low- resource-language |
en_US |
dc.subject |
Wav2Vec 2.0 |
en_US |
dc.subject |
Tamil |
en_US |
dc.title |
Speech command recognition using self-supervised asr in Tamil |
en_US |
dc.type |
Conference-Full-text |
en_US |
dc.identifier.faculty |
Engineering |
en_US |
dc.identifier.department |
Engineering Research Unit, University of Moratuwa |
en_US |
dc.identifier.year |
2023 |
en_US |
dc.identifier.conference |
Moratuwa Engineering Research Conference 2023 |
en_US |
dc.identifier.place |
Katubedda |
en_US |
dc.identifier.pgnos |
pp. 137-142 |
en_US |
dc.identifier.proceeding |
Proceedings of Moratuwa Engineering Research Conference 2023 |
en_US |
dc.identifier.email |
[email protected] |
en_US |
dc.identifier.email |
[email protected] |
en_US |