Voice identification method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and speech data technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of lower recognition rate, poor use effect, and insignificant improvement effect

Active Publication Date: 2013-04-24

HUAWEI DEVICE CO LTD

View PDF3 Cites 45 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] At present, voice assistant software usually works relatively well in quiet environments such as offices, but it does not work well in noisy environments (such as in vehicle environments); the industry generally adopts software noise reduction methods to improve speech recognition rate, but the improvement effect is not obvious, and sometimes even reduces the recognition rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0043] figure 1 It is a flow chart of a speech recognition method provided by Embodiment 1 of the present invention.

[0044] Such as figure 1 As shown, Embodiment 1 of the present invention provides a method for speech recognition that may specifically include:

[0045] S100, acquiring voice data;

[0046] The user starts the voice recognition software such as the voice assistant on the device, and obtains the voice data input by the user through the microphone. It should be understood that the voice data may not be input by a user, but may also be input by a machine, including any data containing information.

[0047] S101. Acquire a first confidence value according to the voice data. The first confidence value refers to the degree to which a specific individual believes in the truth of a specific proposition. In the embodiment of the present invention, it is the degree to which the device believes in the authenticity of the voice data recognition result. That is, the ...

Embodiment 2

[0065] image 3 It is a flow chart of another implementation manner of a speech recognition method provided by Embodiment 2 of the present invention.

[0066] Embodiment 2 of the present invention is described on the basis of Embodiment 1 of the present invention. Such as image 3 As shown, in step S102 in Embodiment 1, the noise scene specifically includes: noise type; noise magnitude.

[0067] The noise type refers to the noise environment in which the user inputs voice data, that is, it can be understood as whether the user is in a noise environment on the road, in an office, or in a vehicle.

[0068] The noise level represents the level of noise in the noise environment where the user is inputting voice data at that time. Optionally, the noise size includes: signal-to-noise ratio and noise energy level. The signal-to-noise ratio is the ratio of voice data to noise data power, often expressed in decibels. Generally, the higher the signal-to-noise ratio, the smaller the ...

Embodiment 3

[0112] Figure 4 It is a flow chart of another implementation manner of a speech recognition method provided by Embodiment 3 of the present invention.

[0113] This embodiment is described on the basis of Embodiment 1, as Figure 4 As shown, the step S103 method of embodiment 1 specifically includes:

[0114] S1031. According to the corresponding relationship between the noise scene and the pre-stored empirical data of the confidence value adjustment value, acquire the confidence value adjustment value corresponding to the noise scene;

[0115]According to the noise type in the noise scene, the noise size and the empirical data of the confidence value adjustment value obtained through a large number of simulation measurements, the confidence value adjustment value corresponding to the noise scene is obtained. The noise type indicates the type of environment in which the user performs speech recognition, and the noise level indicates the noise level of the type of environment...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention provides a voice identification method which comprises the following steps of: acquiring voice data; acquiring a first confidence level value according to the voice data; acquiring a noise scene according to the voice data; acquiring a second confidence level value corresponding to the noise scene according to the first confidence level value; and if the second confidence level value is greater than or equal to a pre-stored confidence level threshold value, processing the voice data. The invention also discloses a device. According to the noise scene, the method and the device can be used for flexibly adjusting the confidence level value, so that the voice identification rate in the noise environment is greatly improved.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of speech processing, and in particular, to a speech recognition method and device. Background technique [0002] Users generally use voice assistant software for voice recognition on terminal devices such as mobile phones. The process of voice recognition with software such as voice assistants is that the user starts the voice assistant software to obtain voice data; the voice data is sent to the noise reduction module for noise reduction processing; the voice data after noise reduction processing is sent to the voice recognition engine; the voice recognition engine Return the recognition result to the voice assistant; in order to reduce misjudgment, the voice assistant judges the correctness of the recognition result according to the confidence threshold, and then presents it. [0003] At present, voice assistant software usually works relatively well in quiet environments such as of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/26G10L21/02

CPCG10L17/20G10L17/12

Inventor 蒋洪睿王细勇梁俊斌郑伟军周均扬

Owner HUAWEI DEVICE CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice identification method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology