A communication voice transmission method and system based on voice enhancement
By generating multi-turn interaction feature vectors through multi-channel speech acquisition and prediction networks, calculating the criticality and dynamic weight of future speeches, and adaptively adjusting enhancement parameters in real time, the dynamic adjustment problem of speech enhancement systems in multi-person, multi-turn interaction scenarios is solved, improving speech clarity and stability, and enhancing communication quality.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- SHENZHEN YITENGJIE INFORMATION TECHNOLOGY CO LTD
- Filing Date
- 2026-04-10
- Publication Date
- 2026-06-19
AI Technical Summary
Existing voice enhancement systems struggle to dynamically adjust enhancement strategies in multi-person, multi-turn interactive scenarios, resulting in the suppression of key statements or the weakening of background information, which affects the naturalness of the call and the integrity of the information.
By combining multi-channel speech acquisition with historical interaction information and network status, multi-round interaction feature vectors are generated. A prediction network is used to calculate the keyness and dynamic weight of future speeches. Real-time sequence modeling and adaptive adjustment of enhancement parameters are performed to build an adaptive judgment mechanism to improve speech clarity and stability.
In multi-person, multi-turn interaction scenarios, the targeting and stability of voice enhancement have been improved, the distortion and auditory discomfort caused by frequent fluctuations in enhancement parameters have been reduced, and the communication quality has been improved.
Smart Images

Figure CN122245330A_ABST