His main research interests include: speech signal processing, speech recognition, language recognition, speaker recognition/confirmation (voiceprint recognition/confirmation), keyword detection, audio-based information masking (watermarking), target audio retrieval, content-based music retrieval, target voice change, digital home and so on.
The laboratory has undertaken projects such as National 863 and 973, National Network and Information Security Center, National Natural Science Foundation, Knowledge Innovation Project of Chinese Academy of Sciences, and Hundred Talents Program of Chinese Academy of Sciences. It has an international level and domestic leading audio/voice information classification and processing platform, which mainly includes: non-specific large vocabulary continuous speech recognition system, language recognition system, speaker recognition/confirmation system, recognition confidence evaluation and keyword detection system, music retrieval system based on humming, fixed audio detection system, voice tone sandhi system, noise elimination system, audio watermark coding and decoding system, etc.
In terms of industrialization, Zhongke Xinli Voice Lab can provide world-class voice technology products and solutions. R&D products cover server platforms (telecom grade), desktop platforms and embedded platforms (wireless terminal devices, PDA, handheld devices, etc.). ). Telecom-grade speech recognition products developed by Zhongke Xinli have been commercialized by more than 20 provincial telecom operators in China. Desktop speech recognition products have become the bundled software of Intel digital home desktop computers; Embedded platform products have been integrated into the products of many domestic mobile phone manufacturers and PDA manufacturers.