英文摘要 |
Tamil language is widely used in southern India, Sri Lanka and Singapore. This paper investigates the design and implementation strategies for a Tamil speech recognition system. It utilizes the speech features of the 149 common Tamil mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Tamil pronunciation rules. These 10 utterances are collected through reading five rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone one, while the second one has falling pitch of tone four. Mel-frequency cepstral coefficients, linear predicted cepstral coefficients, and hidden Markov model are used as the two feature models and the syllable recognition model respectively. The recognized syllable strings are then refined by phonotactical rules to obtain the optimal result. Under the Intel Core 2 Quad 2.5 GHz personal computer and Ubuntu 10.04 operating system environment, a correct phrase recognition rate of 88.74% can be reached for a 3,500 Tamil phrase database. The average computation time for the system is within 1.5 seconds, and the training time for the systems is about one and half hours. |