Please use this identifier to cite or link to this item:
http://ir.lib.seu.ac.lk/handle/123456789/2633
Title: | Design and development of automatic speech recognition system for Tamil language using CMU Sphinx 4 |
Authors: | Kalith, IM. |
Keywords: | Speech recognition; CMU Sphinx 4; Tamil language |
Issue Date: | 28-Mar-2012 |
Publisher: | Faculty of Applied Sciences, South Eastern University of Sri Lanka |
Citation: | Empowering regional development through science and technology, First Annual Science Research Session 2012 |
Abstract: | This paper presents the design and development of a speech recognition system for the Tamil language. The system is based on CMU Sphinx 4, an open-source automatic speech recognition (ASR) engine developed by Carnegie Mellon University, and is intended to be adapted to speaker-specific, continuous speech. One of its main components is a core Tamil speech recognition system that can be trained with field-specific data. The target domain is the accent spoken by illiterate Tamil speakers from the Eastern area of Sri Lanka. A phonetically rich and balanced sentence text corpus was developed and recorded in a controlled environment to build a speaker-specific speech corpus. Using this speech corpus, the system was trained and tested with speaker-specific data (the same words uttered by the same person) and speaker-independent data (different words uttered by a different person). The system currently achieves a satisfactory best performance of 39.5% word error rate (WER) on speaker-specific data and an unsatisfactory rate on speaker-independent data, a result the paper describes as comparable with the best word error rates of most continuous speech recognition systems available for any language. |
URI: | http://ir.lib.seu.ac.lk/handle/123456789/2633 |
ISBN: | 9789556270273 |
Appears in Collections: | ASRS - FAS 2012 |
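The abstract reports its results as a word error rate (WER). As a minimal sketch of how that metric is conventionally computed (substitutions, deletions and insertions divided by the number of reference words, found with a word-level edit distance), the following may help. It is an illustration only and not the evaluation tool used in the paper; the function name `word_error_rate` and the English example sentences are hypothetical placeholders, not taken from the Tamil corpus.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Hypothetical helper for illustration; whitespace tokenisation is an assumption.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (rolling-row Levenshtein distance).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missed)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("was" -> "is") and one deletion ("and") over 6 reference words.
    wer = word_error_rate(
        "the system was trained and tested",
        "the system is trained tested",
    )
    print(f"WER = {wer:.1%}")
```

Under this definition, a WER of 39.5% corresponds to roughly four word-level errors for every ten reference words.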