Audio-vision collaborative lip language recognition method and system
A lip language recognition method, applied in the fields of visual speech recognition and lip language recognition, that addresses problems such as factors that are rarely considered, the difficulty of covering different situations, and the resulting increase in the difficulty of lip language recognition, so as to improve feature extraction ability and achieve good classification performance.
Embodiment Construction
[0083] The present invention exploits the synchronization between audio and video to propose a lip language recognition method based on audio-visual collaborative learning. The method defines three levels of metric learning: visual-visual, audio-audio, and visual-audio. Learning the three metrics simultaneously not only reduces the training time and the number of training stages, but also enables better collaborative learning between the visual and audio modalities. With the help of audio information, the visual model of the present invention can extract more discriminative features, thereby improving the performance of the lip language recognition model. The present invention includes the following key technical points:
[0084] Key point 1: the present invention proposes an audio-visual collaborative learning mechanism that uses audio to assist the learning of the visual model, and designs three levels of metric learning (audio-audio, visual-visual, and audio-visual) so that the model c...
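The three-level metric learning described above can be illustrated with a minimal sketch. The text available here does not state the loss actually used, so everything below is an assumption made for illustration: a margin-based contrastive loss on Euclidean distance stands in for each metric, the three terms are summed with hypothetical weights, and all function names (`contrastive`, `intra_modal_loss`, `cross_modal_loss`, `three_level_metric_loss`) are invented for this example. The embeddings are plain lists of floats standing in for the outputs of the visual and audio models.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive(dist, same, margin=1.0):
    """Assumed pairwise loss: pull same-class pairs together and push
    different-class pairs at least `margin` apart."""
    return dist ** 2 if same else max(0.0, margin - dist) ** 2


def intra_modal_loss(feats, labels, margin=1.0):
    """Visual-visual or audio-audio term: mean loss over all distinct pairs
    of samples within one modality."""
    pairs = list(combinations(range(len(feats)), 2))
    if not pairs:
        return 0.0
    total = sum(
        contrastive(euclidean(feats[i], feats[j]), labels[i] == labels[j], margin)
        for i, j in pairs
    )
    return total / len(pairs)


def cross_modal_loss(visual, audio, labels, margin=1.0):
    """Visual-audio term: mean loss over every visual/audio combination,
    including i == j (the two streams of the same synchronized clip)."""
    n = len(visual)
    if n == 0:
        return 0.0
    total = sum(
        contrastive(euclidean(visual[i], audio[j]), labels[i] == labels[j], margin)
        for i in range(n)
        for j in range(n)
    )
    return total / (n * n)


def three_level_metric_loss(visual, audio, labels,
                            weights=(1.0, 1.0, 1.0), margin=1.0):
    """Weighted sum of the three terms, computed in one pass so that all
    three metrics are learned simultaneously (weights are hypothetical)."""
    w_vv, w_aa, w_va = weights
    return (
        w_vv * intra_modal_loss(visual, labels, margin)
        + w_aa * intra_modal_loss(audio, labels, margin)
        + w_va * cross_modal_loss(visual, audio, labels, margin)
    )


# Toy batch: three clips, two word classes, 2-D embeddings per modality.
visual_feats = [[0.0, 0.0], [0.0, 0.0], [2.0, 0.0]]
audio_feats = [[0.0, 0.0], [0.0, 0.0], [2.0, 0.0]]
word_labels = [0, 0, 1]
loss = three_level_metric_loss(visual_feats, audio_feats, word_labels)
```

In this sketch the cross-modal term is what lets audio guide the visual model: a visual embedding is penalized when it lies far from audio embeddings of the same word or close to audio embeddings of a different word. In a real training setup the loss would be computed on differentiable tensors and backpropagated through both models.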