Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice identification method and device

A speech recognition and speech data technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of lower recognition rate, poor use effect, and insignificant improvement effect

Active Publication Date: 2013-04-24
HUAWEI DEVICE CO LTD
View PDF3 Cites 45 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, voice assistant software usually works relatively well in quiet environments such as offices, but it does not work well in noisy environments (such as in vehicle environments); the industry generally adopts software noise reduction methods to improve speech recognition rate, but the improvement effect is not obvious, and sometimes even reduces the recognition rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice identification method and device
  • Voice identification method and device
  • Voice identification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] figure 1 It is a flow chart of a speech recognition method provided by Embodiment 1 of the present invention.

[0044] Such as figure 1 As shown, Embodiment 1 of the present invention provides a method for speech recognition that may specifically include:

[0045] S100, acquiring voice data;

[0046] The user starts the voice recognition software such as the voice assistant on the device, and obtains the voice data input by the user through the microphone. It should be understood that the voice data may not be input by a user, but may also be input by a machine, including any data containing information.

[0047] S101. Acquire a first confidence value according to the voice data. The first confidence value refers to the degree to which a specific individual believes in the truth of a specific proposition. In the embodiment of the present invention, it is the degree to which the device believes in the authenticity of the voice data recognition result. That is, the ...

Embodiment 2

[0065] image 3 It is a flow chart of another implementation manner of a speech recognition method provided by Embodiment 2 of the present invention.

[0066] Embodiment 2 of the present invention is described on the basis of Embodiment 1 of the present invention. Such as image 3 As shown, in step S102 in Embodiment 1, the noise scene specifically includes: noise type; noise magnitude.

[0067] The noise type refers to the noise environment in which the user inputs voice data, that is, it can be understood as whether the user is in a noise environment on the road, in an office, or in a vehicle.

[0068] The noise level represents the level of noise in the noise environment where the user is inputting voice data at that time. Optionally, the noise size includes: signal-to-noise ratio and noise energy level. The signal-to-noise ratio is the ratio of voice data to noise data power, often expressed in decibels. Generally, the higher the signal-to-noise ratio, the smaller the ...

Embodiment 3

[0112] Figure 4 It is a flow chart of another implementation manner of a speech recognition method provided by Embodiment 3 of the present invention.

[0113] This embodiment is described on the basis of Embodiment 1, as Figure 4 As shown, the step S103 method of embodiment 1 specifically includes:

[0114] S1031. According to the corresponding relationship between the noise scene and the pre-stored empirical data of the confidence value adjustment value, acquire the confidence value adjustment value corresponding to the noise scene;

[0115]According to the noise type in the noise scene, the noise size and the empirical data of the confidence value adjustment value obtained through a large number of simulation measurements, the confidence value adjustment value corresponding to the noise scene is obtained. The noise type indicates the type of environment in which the user performs speech recognition, and the noise level indicates the noise level of the type of environment...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a voice identification method which comprises the following steps of: acquiring voice data; acquiring a first confidence level value according to the voice data; acquiring a noise scene according to the voice data; acquiring a second confidence level value corresponding to the noise scene according to the first confidence level value; and if the second confidence level value is greater than or equal to a pre-stored confidence level threshold value, processing the voice data. The invention also discloses a device. According to the noise scene, the method and the device can be used for flexibly adjusting the confidence level value, so that the voice identification rate in the noise environment is greatly improved.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of speech processing, and in particular, to a speech recognition method and device. Background technique [0002] Users generally use voice assistant software for voice recognition on terminal devices such as mobile phones. The process of voice recognition with software such as voice assistants is that the user starts the voice assistant software to obtain voice data; the voice data is sent to the noise reduction module for noise reduction processing; the voice data after noise reduction processing is sent to the voice recognition engine; the voice recognition engine Return the recognition result to the voice assistant; in order to reduce misjudgment, the voice assistant judges the correctness of the recognition result according to the confidence threshold, and then presents it. [0003] At present, voice assistant software usually works relatively well in quiet environments such as of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/26G10L21/02
CPCG10L17/20G10L17/12
Inventor 蒋洪睿王细勇梁俊斌郑伟军周均扬
Owner HUAWEI DEVICE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products