Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech recognition method and system for dysarthria based on visual facial contour movement

A technology for dysarthria and speech recognition, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as confusion of vowels and consonants, difficulty in meeting speakers, and difficulty in collecting speech with dysarthria, so as to improve accuracy. Effect

Active Publication Date: 2022-05-24
BEIJING TECHNOLOGY AND BUSINESS UNIVERSITY
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, because dysarthria speech has many defects such as difficulty in collecting and confusion of vowels and consonants, simply using speech acoustic feature parameters as the basis for speech recognition, the accuracy of dysarthria speech recognition is not high, and it is difficult to meet the needs of speakers Needs in actual communication application scenarios

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition method and system for dysarthria based on visual facial contour movement
  • Speech recognition method and system for dysarthria based on visual facial contour movement
  • Speech recognition method and system for dysarthria based on visual facial contour movement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.

[0055] like figure 1 As shown, in one embodiment of the present invention, a novel speech recognition method for dysarthria based on visual facial contour motion trajectory includes four main steps:

[0056] Step S1, acquiring multi-modal data, including the photographed facial motion video of the dysarthria person speaking and the voice data synchronized with the video.

[0057] The video is the facial movement process of the person with dysarthria when speaking, which is captured by the camera equipment. This step should be as easy to operate as possible to facilitate specific implement...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for speech recognition of dysarthria based on visual facial contour movement. The system includes multi-modal data acquisition, multi-modal fusion feature calculation, multi-modal speech recognition calculation and language model calculation modules; multi-modal The state data acquisition calculation module is used to obtain the facial contour motion video data of the dysarthria and the voice data synchronized with the video; the multi-modal fusion feature calculation module is used to fuse the facial contour motion features and speech acoustic features; multi-modal speech recognition The calculation module is used to obtain the mapping relationship from multimodal features to phoneme characters; the language model calculation module is used to obtain the mapping relationship from phoneme characters to Chinese sentences. The present invention obtains the fused multimodal features by fusing the speech acoustic feature parameters and the pronunciation actions of the dysarthria, and utilizes the fused multimodal features to perform dysarthria speech recognition, thereby effectively improving dysarthria speech recognition Accuracy.

Description

technical field [0001] The invention relates to the technical field of speech recognition for dysarthria, in particular to a method and a system for enhancing speech recognition for dysarthria based on the movement trajectory of visual facial contours, which can be applied to assist speech rehabilitation of dysarthria. Background technique [0002] Existing research shows that the proportion of dysarthria caused by stroke is 30%-40%, of which 15% of dysarthria caused by stroke cannot be completely recovered. The dysarthria seriously affects the speaker's ability to communicate, resulting in a decline in the quality of life, which brings both physical and psychological pain to the dysarthria. Therefore, improving the intelligibility of dysarthria speech and effectively improving the speech recognition ability of dysarthria has important social significance and practical value. [0003] At present, the mainstream practice in the field of dysarthria speech recognition technolo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/183G10L15/16G10L15/22G10L15/25G10L15/26G10L25/24
CPCG10L15/183G10L15/22G10L15/26G10L15/16G10L15/25G10L25/24
Inventor 钱兆鹏于重重苏小苏
Owner BEIJING TECHNOLOGY AND BUSINESS UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products