Supercharge Your Innovation With Domain-Expert AI Agents!

Voice recognition method and device, equipment and storage medium

A speech recognition and speech technology, which is applied in speech recognition, speech analysis, instruments, etc., can solve the problems that the speech cannot be recognized correctly, and the speech can be recognized, so as to improve the accuracy and ensure the effect of fast decoding

Pending Publication Date: 2021-03-12
BEIJING SINOVOICE TECH CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current speech recognition technology cannot recognize the tone shift in the speech without increasing the decoding path, so the speech with the pitch shift pronunciation cannot be correctly recognized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice recognition method and device, equipment and storage medium
  • Voice recognition method and device, equipment and storage medium
  • Voice recognition method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

[0069] Speech modulation can refer to the phenomenon that in actual human pronunciation, the pitch of syllables will change when they are uttered continuously, that is, the pitch of some syllables will be changed by the influence of the following pitch. For example, the original pronunciation of "Prime Minister" is "zong3li3". As "zong2li3", this is the tone sandhi within the word. The current speech recognition technology cannot recognize the words in the speech where th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice recognition method, device and equipment and a storage medium, and relates to the technical field of voice recognition. According to the invention, rhythm detection is carried out on a to-be-recognized voice, tone modification is carried out on the phoneme posterior probability according to the rhythm detection result, and decoding path search is carried out according to the tone-modified phoneme posterior probability, so that the voice recognition accuracy is improved. Rhythm prediction is carried out on the to-be-recognized voice to obtain a rhythm structure ofthe to-be-recognized voice; pronunciation prediction is carried out on the to-be-recognized voice according to the acoustic characteristics of the to-be-recognized voice to obtain a plurality of phoneme posterior probabilities of the to-be-recognized voice; according to the rhythm structure, tone changing is carried out on one or more phoneme posterior probabilities in the plurality of phoneme posterior probabilities; and path search is carried out in a finite state converter according to the plurality of phoneme posteriori probabilities after tone change, and decoding is carried out to obtain a corresponding text of the to-be-recognized voice.

Description

technical field [0001] The present application relates to the technical field of speech recognition, in particular to a speech recognition method, device, equipment and storage medium. Background technique [0002] Automatic Speech Recognition (ASR) is a technology that studies how to convert human speech recognition into text. It is widely used in voice dialing, voice navigation, indoor device control, voice document retrieval, simple dictation data entry, etc. in service. [0003] In actual human pronunciation, pitch changes will occur when syllables are uttered continuously, that is, the tones of some syllables will be affected by the tones of the following tones and change. The current speech recognition technology cannot recognize the tone shift in the speech without increasing the decoding path, which leads to the inability to correctly recognize the speech with the pitch shift pronunciation. Contents of the invention [0004] Embodiments of the present application...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/18G10L15/02G10L15/26G10L15/14G10L15/06
CPCG10L15/1807G10L15/02G10L15/26G10L15/142G10L15/144G10L15/063G10L2015/025
Inventor 郑晓明李健武卫东陈明
Owner BEIJING SINOVOICE TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More