Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A voice input correction processing method, device, electronic device and storage medium

A voice input and processing technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as non-standard pronunciation, voice communication obstacles, and unrecognizable problems of users, so as to reduce voice communication barriers, smoother voice communication, The effect of good voice communication

Active Publication Date: 2022-02-22
ZHEJIANG UNIV +1
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in actual application scenarios, the user's language growth environment or physical physiological reasons may cause the user's pronunciation to be not very standard.
[0004] For this part of users whose pronunciation is not standard, the use of general-purpose speech recognition may have problems such as ineffective recognition, such as inaccurate recognition, or even failure to recognize, which makes the voice communication of this part of users using speech recognition technology a great obstacle, which seriously affects users. Experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A voice input correction processing method, device, electronic device and storage medium
  • A voice input correction processing method, device, electronic device and storage medium
  • A voice input correction processing method, device, electronic device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, not all of them.

[0057] The speech input correction processing methods provided by the following embodiments of the present application can be applied to any scene that requires speech recognition, for example, it can be applied to speech assistance tools integrated in the operating system of electronic devices, instant messaging applications, etc. Speech-to-text tools, preset voice input tools, preset voice unlock tools and other scenarios can realize voice-to-text conversion, or can cooperate with other speech synthesis tools to realize voice in voice communication s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application provides a voice input correction processing method, device, electronic device and storage medium, and relates to the technical field of voice recognition. The method includes: acquiring the speech to be recognized input by the user; performing feature extraction on the first speech to be recognized to obtain the speech features to be recognized; using a speech correction model corresponding to the user to recognize the features of the speech to be recognized, and obtaining the recognition corresponding to the speech to be recognized Text, the speech correction model is obtained by model training based on the training voice features and the specified text, and the training voice features are obtained by feature extraction according to the training voice of the user reading the specified text, and the specified text is the specified text that satisfies the preset syllable combination conditions; For the updated text of the recognized text; the speech correction model is updated according to the updated text and the features of the speech to be recognized. The present application can reduce the speech communication barriers for users with non-standard pronunciation based on the speech recognition technology, and improve the user experience.

Description

technical field [0001] The present application relates to the technical field of voice recognition, and in particular, to a voice input correction processing method, device, electronic equipment, and storage medium. Background technique [0002] With the development of speech recognition technology, the speech recognition function is used in more and more application scenarios. [0003] Most of the current speech recognition functions are realized by speech recognition models, and the training of the speech recognition models is carried out by using a training library based on standard speech. However, in actual application scenarios, the user's language growth environment or physical physiological reasons may cause the user's pronunciation to be not very standard. [0004] For this part of users whose pronunciation is not standard, the use of general-purpose speech recognition may have problems such as ineffective recognition, such as inaccurate recognition, or even failur...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/07
CPCG10L15/063G10L15/07G10L2015/0631G10L2015/0635
Inventor 胡志鹏杨天格卜佳俊
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products