Speech recognition method and system for dysarthria based on visual facial contour movement

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology for dysarthria and speech recognition, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as confusion of vowels and consonants, difficulty in meeting speakers, and difficulty in collecting speech with dysarthria, so as to improve accuracy. Effect

Active Publication Date: 2022-05-24

BEIJING TECHNOLOGY AND BUSINESS UNIVERSITY

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, because dysarthria speech has many defects such as difficulty in collecting and confusion of vowels and consonants, simply using speech acoustic feature parameters as the basis for speech recognition, the accuracy of dysarthria speech recognition is not high, and it is difficult to meet the needs of speakers Needs in actual communication application scenarios

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0054] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.

[0055] like figure 1 As shown, in one embodiment of the present invention, a novel speech recognition method for dysarthria based on visual facial contour motion trajectory includes four main steps:

[0056] Step S1, acquiring multi-modal data, including the photographed facial motion video of the dysarthria person speaking and the voice data synchronized with the video.

[0057] The video is the facial movement process of the person with dysarthria when speaking, which is captured by the camera equipment. This step should be as easy to operate as possible to facilitate specific implement...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method and system for speech recognition of dysarthria based on visual facial contour movement. The system includes multi-modal data acquisition, multi-modal fusion feature calculation, multi-modal speech recognition calculation and language model calculation modules; multi-modal The state data acquisition calculation module is used to obtain the facial contour motion video data of the dysarthria and the voice data synchronized with the video; the multi-modal fusion feature calculation module is used to fuse the facial contour motion features and speech acoustic features; multi-modal speech recognition The calculation module is used to obtain the mapping relationship from multimodal features to phoneme characters; the language model calculation module is used to obtain the mapping relationship from phoneme characters to Chinese sentences. The present invention obtains the fused multimodal features by fusing the speech acoustic feature parameters and the pronunciation actions of the dysarthria, and utilizes the fused multimodal features to perform dysarthria speech recognition, thereby effectively improving dysarthria speech recognition Accuracy.

Description

technical field [0001] The invention relates to the technical field of speech recognition for dysarthria, in particular to a method and a system for enhancing speech recognition for dysarthria based on the movement trajectory of visual facial contours, which can be applied to assist speech rehabilitation of dysarthria. Background technique [0002] Existing research shows that the proportion of dysarthria caused by stroke is 30%-40%, of which 15% of dysarthria caused by stroke cannot be completely recovered. The dysarthria seriously affects the speaker's ability to communicate, resulting in a decline in the quality of life, which brings both physical and psychological pain to the dysarthria. Therefore, improving the intelligibility of dysarthria speech and effectively improving the speech recognition ability of dysarthria has important social significance and practical value. [0003] At present, the mainstream practice in the field of dysarthria speech recognition technolo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L15/183G10L15/16G10L15/22G10L15/25G10L15/26G10L25/24

CPCG10L15/183G10L15/22G10L15/26G10L15/16G10L15/25G10L25/24

Inventor钱兆鹏于重重苏小苏

OwnerBEIJING TECHNOLOGY AND BUSINESS UNIVERSITY

Speech recognition method and system for dysarthria based on visual facial contour movement

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology