Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and apparatus for voice recognition, electronic device, and computer readable storage medium

A speech recognition and computer technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as poor user experience, false triggering, and high processing pressure on keyword detection devices

Inactive Publication Date: 2018-12-21
MOBVOI INFORMATION TECH CO LTD
View PDF9 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, in the process of carrying out the invention, the inventor found that as long as the surrounding environment is not in the silent state, the keyword detection is triggered, because there may be some noises in the surrounding environment, such as people screaming, dog barking and other sounds, may be falsely triggered to collect sounds in the surrounding environment, and trigger keyword detection with the syllables in the voice model, resulting in a higher probability of false triggers; moreover, after false triggering, it is necessary to collect the sounds of the surrounding environment in real time Keyword detection leads to high processing pressure on the collection device and keyword detection device, and after keyword detection is falsely triggered, other voice commands may be falsely triggered to execute, resulting in poor user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for voice recognition, electronic device, and computer readable storage medium
  • Method and apparatus for voice recognition, electronic device, and computer readable storage medium
  • Method and apparatus for voice recognition, electronic device, and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] The embodiment of the present invention provides a method for speech recognition, such as figure 1 As shown, the method includes:

[0036] Step S101 , based on the sound in the current environment, determine whether the current environment is in a silent state.

[0037] The embodiment of the present invention may be executed by a terminal device, or may be executed by a server. It is not limited in the embodiment of the present invention.

[0038] For the embodiment of the present invention, the sound in the current environment may be monitored in real time, or at preset intervals, to determine whether the current environment is in a silent state.

[0039] Step S102, when it is determined that the current environment does not belong to the silent state, determine whether the sound in the current environment belongs to preset noise through the first model and / or the second model.

[0040] For the embodiment of the present invention, the first model may be the SPN mode...

Embodiment 2

[0047] The embodiment of the present invention provides another possible implementation manner. On the basis of the first embodiment, the method shown in the second embodiment is also included, wherein,

[0048] In step S102, through the first model and / or the second model, it is determined whether the sound in the current environment belongs to the preset noise, and the step SA (not marked in the figure) is also included before, wherein,

[0049] Step SA, creating and training the first model and / or the second model.

[0050] Wherein, the first model is used to determine whether the sound in the current environment belongs to the noise generated by humans, and the second model is used to determine whether the sound in the current environment belongs to the noise not generated by humans.

[0051] For the embodiment of the present invention, the first model and / or the second model can be created in the existing garbage model; the first model parallel to the garbage model and / or...

Embodiment 3

[0062] Another possible implementation of the embodiment of the present invention further includes the operations shown in the third embodiment on the basis of the first embodiment, wherein,

[0063] Step S101 includes step S1011 (not marked in the figure), step S1012 (not marked in the figure) and step S1013 (not marked in the figure), wherein,

[0064] Step S1011. Determine whether the decibel corresponding to the sound in the current environment is greater than a preset threshold.

[0065] For the embodiment of the present invention, an application scenario corresponding to the current environment is determined, and a corresponding preset threshold is determined based on the application scenario corresponding to the current environment. In the embodiment of the present invention, different application scenarios may correspond to different preset thresholds, or different application scenarios may correspond to the same preset threshold. It is not limited in the embodiment o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a method and apparatus for voice recognition, an electronic device, and a computer readable storage medium. The method comprises: on the basis of sounds in a current environment, whether the current environment is in a mute state is determined; if not, whether the sounds in the current environment belong to preset noises is determined by a first model and / or a second model; and if not, a voice model is triggered to carry out keyword detection. Therefore, the probability of false triggering is reduced and the processing pressures of the collecting deviceand the keyword detecting device are reduced, so that the user experience is improved.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of speech recognition, and specifically, embodiments of the present invention relate to a speech recognition method, device, electronic equipment, and computer-readable storage medium. Background technique [0002] With the development of information technology, speech recognition technology develops accordingly. Using speech recognition technology to identify whether the sound in the current environment can trigger the execution of operations, for example, using speech recognition technology to recognize that the voice uttered by the user contains keywords that trigger the opening of the terminal device. , when the keyword that triggers the terminal device to be turned on is recognized, the terminal device is controlled to start up. Therefore, how to execute the trigger operation based on voice recognition becomes a key issue. [0003] In the prior art, a voice recognition method for v...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/20G10L15/02G10L15/22
CPCG10L15/02G10L15/20G10L15/22G10L2015/027G10L2015/223
Inventor 胡亚光
Owner MOBVOI INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products