英文摘要 |
Using speech as the input of the human-machine interface, we need to consider the effect of the pronunciation variations on speech recognition. We usually use the speaker adaptation and speech adaptation technique to solve above problem, but the result of accent speech recognition is not good enough. This paper presents a framework to combine two phone models to reconstruct acoustic model. First we build accented acoustic model and unaccented acoustic model, than we combine the accented acoustic model into the unaccented acoustic model with the state tying and Gradient tree boosting algorithm. We can adjust the unaccented model by this framework, to reconstruct acoustic model and robust the accented speech recognition performance. |