WTASR: Wavelet Transformer for Automatic Speech Recognition of Indian Languages

Tripti Choudhary; Vishal Goyal; Atul Bansal

doi:10.26599/BDMA.2022.9020017

Open Access

Tripti Choudhary¹, Vishal Goyal¹, Atul Bansal²

¹ Department of Electronics and Communication, GLA University, Mathura 281406, India

² Chandigarh University, Mohali 140413, India

Abstract

Automatic speech recognition (ASR) systems translate speech signals into the corresponding text representation. This translation is used in a variety of applications, such as voice-enabled commands, assistive devices, and bots. There is a significant lack of efficient ASR technology for Indian languages. In this paper, a wavelet transformer for automatic speech recognition (WTASR) of Indian languages is proposed. Speech signals contain both high- and low-frequency components whose distribution changes over time because of variation in the speaker's speech. Wavelets allow the network to analyze the signal at multiple scales. The wavelet decomposition of the signal is fed into the network to generate the text. The transformer network comprises an encoder-decoder system for speech translation. The model is trained on an Indian language dataset to translate speech into the corresponding text. The proposed method is compared with other state-of-the-art methods. The results show that the proposed WTASR has a low word error rate and can be used for effective speech recognition of Indian languages.
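The multiscale analysis mentioned in the abstract can be illustrated with a small sketch. The abstract does not state which wavelet family or how many decomposition levels WTASR uses, so the Haar wavelet, the three levels, the synthetic two-tone signal, and the function names `haar_step` and `haar_decompose` below are all assumptions chosen for illustration, not the authors' implementation.

```python
import math


def haar_step(signal):
    """One level of the orthonormal Haar DWT: return (approximation, detail)."""
    signal = list(signal)
    if len(signal) % 2:
        signal.append(signal[-1])  # pad odd-length input by repeating the last sample
    s = math.sqrt(2.0)
    # Pairwise sums keep the slow (low-frequency) trend at half the resolution.
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    # Pairwise differences keep the fast (high-frequency) changes.
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_decompose(signal, levels):
    """Multilevel decomposition, coarsest band first: [cA_n, cD_n, ..., cD_1]."""
    coeffs = []
    approx = list(signal)
    for _ in range(levels):
        if len(approx) < 2:
            break  # nothing left to split
        approx, detail = haar_step(approx)
        coeffs.insert(0, detail)
    coeffs.insert(0, approx)
    return coeffs


# Toy stand-in for a speech frame (assumed): a slow tone plus a weaker fast tone.
sample_rate = 64
signal = [
    math.sin(2 * math.pi * 2 * n / sample_rate)
    + 0.5 * math.sin(2 * math.pi * 20 * n / sample_rate)
    for n in range(sample_rate)
]

bands = haar_decompose(signal, levels=3)
band_lengths = [len(band) for band in bands]
print(band_lengths)
```

Each band describes the same frame at a different time-frequency resolution. In a pipeline of the kind the abstract describes, coefficients like these, rather than the raw waveform, would be the features passed to the encoder-decoder network.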

Keywords

transformer; wavelet; automatic speech recognition (ASR); Indian language

Big Data Mining and Analytics

Volume 6, Issue 1, March 2023

Pages 85-91

DOI: 10.26599/BDMA.2022.9020017

Cite this article:

Choudhary T, Goyal V, Bansal A. WTASR: Wavelet Transformer for Automatic Speech Recognition of Indian Languages. Big Data Mining and Analytics, 2023, 6(1): 85-91. https://doi.org/10.26599/BDMA.2022.9020017

Received: 31 May 2022

Revised: 06 June 2022

Accepted: 21 June 2022

Published: 24 November 2022

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).