英文摘要 |
In recent years, speech recognition applications have introduced a variety of remote operating systems, such as car voice assistants, smart speakers, etc. In these systems, far-field speech recognition plays a key role. This paper mainly presents our experiments and results on far-field speech recognition on smart speaker devices. We use data augmentation methods, simulated far-field speech and neural network-based acoustic models to reduce the character error rate (CER). In the experimental part, this paper recorded the parallel test corpus of three distances using the smart speaker. The 50cm situation corpus can be reduced from 13.31% to 8.41%, the relative improvement is 36.8%, and the 80cm situation corpus is reduced from 19.20% to 10.89% with relative improvement of 43.2%. |