Voice recognition method and apparatus, and storage medium

A voice recognition technology for audio data, applied in the field of communication. It addresses the problems of high battery power consumption, impaired mobility, and reduced standby time of mobile terminals, with the effect of reducing system power consumption, improving performance, and prolonging standby time.

Active Publication Date: 2017-11-17
TENCENT TECH (SHENZHEN) CO LTD
8 Cites, 47 Cited by

AI Technical Summary

Problems solved by technology

For this reason, the prior art proposes either using an external power supply or using a physical button for wake-up. However, an external power supply inevitably limits mobility, and wake-up by a physical button means that voice wake-up cannot be realized. In other words, in existing schemes, maintaining both mobility and the voice wake-up function inevitably consumes a large amount of battery power, which greatly reduces the standby time of the mobile terminal and degrades its performance.


Examples

Embodiment 1

[0046] This embodiment is described from the perspective of a speech recognition device, which may be integrated into a device such as a mobile terminal. The mobile terminal may include a mobile phone, a wearable smart device, a tablet computer, and/or a notebook computer, among other equipment.

[0047] This embodiment provides a speech recognition method, comprising: acquiring audio data; performing fuzzy speech recognition on the audio data through a DSP; when the fuzzy speech recognition result indicates that a wake-up word is present, waking up, by the DSP, the CPU in a dormant state; and performing, by the CPU, semantic analysis on the audio data and performing corresponding operations according to the analysis result.
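
The flow in paragraph [0047] can be sketched as a small simulation. This is a minimal illustration only, not the patented implementation: the names `CpuState`, `dsp_fuzzy_detect`, `cpu_semantic_analysis`, and `handle_audio`, the wake-up word, the threshold, and the use of text strings in place of real audio are all assumptions. The fuzzy match is approximated with `difflib.SequenceMatcher` from the Python standard library, which is a stand-in and not a technique named in the source.

```python
from difflib import SequenceMatcher
from enum import Enum


class CpuState(Enum):
    DORMANT = "dormant"
    AWAKE = "awake"


WAKE_WORD = "hello phone"   # hypothetical wake-up word
FUZZY_THRESHOLD = 0.8       # hypothetical similarity threshold


def dsp_fuzzy_detect(audio_text: str) -> bool:
    """Stand-in for the DSP's low-power fuzzy recognition (assumed behaviour).

    Compares the leading part of the audio (represented here as text) with
    the wake-up word and accepts an approximate match.
    """
    head = audio_text.lower()[: len(WAKE_WORD)]
    return SequenceMatcher(None, head, WAKE_WORD).ratio() >= FUZZY_THRESHOLD


def cpu_semantic_analysis(audio_text: str) -> str:
    """Stand-in for the CPU's semantic analysis: map text to an operation."""
    text = audio_text.lower()
    if "call" in text:
        return "open_dialer"
    if "music" in text:
        return "play_music"
    return "no_op"


def handle_audio(audio_text: str, cpu_state: CpuState):
    """Run one pass of the flow; returns (new CPU state, operation or None)."""
    if not dsp_fuzzy_detect(audio_text):
        # No wake-up word: the CPU state is left unchanged.
        return cpu_state, None
    # Wake-up word found: the DSP wakes the CPU, which then analyses the audio.
    cpu_state = CpuState.AWAKE
    return cpu_state, cpu_semantic_analysis(audio_text)


if __name__ == "__main__":
    print(handle_audio("hello phone call mom", CpuState.DORMANT))
    print(handle_audio("dog barking", CpuState.DORMANT))
```

The point of the split is that only `dsp_fuzzy_detect` runs on every audio input, while the more expensive `cpu_semantic_analysis` runs only after a wake-up word is detected.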

[0048] As shown in Figure 1c, the specific flow of the speech recognition method may be as follows:

[0049] 101. Acquire audio data.

[0050] For exam...

Embodiment 2

[0082] According to the method described in Embodiment 1, an example will be given below for further detailed description.

[0083] In this embodiment, it will be described by taking the speech recognition device integrated in a mobile terminal as an example.

[0084] As shown in Figure 2a, the specific flow of a speech recognition method may be as follows:

[0085] 201. The mobile terminal collects the audio data through the MIC.

[0086] The MIC may be independent of the mobile terminal or built into it. The audio data may include data converted from various forms of sound, and the category of sound is not limited; for example, it may be speech, animal sounds, the sound of an object being struck, and/or music, and so on.

[0087] 202. The mobile terminal performs fuzzy speech recognition on the audio data through the DSP. If the fuzzy speech recognition result indicates that there is a wake-up word, perform step...

Embodiment 3

[0124] In order to better implement the above method, an embodiment of the present invention also provides a speech recognition device, which can be integrated into a mobile terminal such as a mobile phone, a wearable smart device, a tablet computer, and/or a notebook computer.

[0125] For example, referring to Figure 3a, the speech recognition device may include an acquisition unit 301, a fuzzy recognition unit 302, a wake-up unit 303 and a processing unit 304, as follows:

[0126] (1) acquisition unit 301;

[0127] The acquisition unit 301 is configured to acquire audio data.

[0128] For example, the acquisition unit 301 may specifically be configured to acquire the audio data through a MIC, such as a built-in MIC module of a mobile terminal.

[0129] (2) fuzzy recognition unit 302;

[0130] The fuzzy recognition unit 302 is configured to perform fuzzy speech recognition on the audio data through the DSP.

[0131] Wherein, the mode of fuzzy speech recognition can have m...
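
The unit structure listed in paragraph [0125] can be mirrored in code as a device object that wires four components together. The sketch below is illustrative only: the class and method names are hypothetical, each unit is reduced to a plain Python callable, and the roles given to units 303 and 304 are assumed from the method in Embodiment 1, since their descriptions are not shown here. The stub behaviours in the usage lines are likewise assumptions.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class SpeechRecognitionDevice:
    """Hypothetical composition of units 301-304 from paragraph [0125]."""
    acquire: Callable[[], str]               # acquisition unit 301
    fuzzy_recognize: Callable[[str], bool]   # fuzzy recognition unit 302 (DSP side)
    wake_cpu: Callable[[], None]             # wake-up unit 303 (assumed role)
    process: Callable[[str], str]            # processing unit 304 (assumed role, CPU side)

    def run_once(self) -> Optional[str]:
        audio = self.acquire()
        if not self.fuzzy_recognize(audio):
            return None      # no wake-up word: the CPU is not woken
        self.wake_cpu()
        return self.process(audio)


# Usage with stub units (all behaviour here is assumed for illustration).
events = []
device = SpeechRecognitionDevice(
    acquire=lambda: "wake up and play music",
    fuzzy_recognize=lambda audio: "wake up" in audio,
    wake_cpu=lambda: events.append("cpu_woken"),
    process=lambda audio: "play_music" if "music" in audio else "no_op",
)
result = device.run_once()
```

Keeping each unit behind its own callable means a stub can be swapped for a real DSP or MIC binding without changing `run_once`.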

Abstract

The embodiments of the present invention disclose a voice recognition method and apparatus, and a storage medium. According to one embodiment, after audio data is obtained, fuzzy voice recognition is performed on the audio data through a DSP; when it is determined that a wake-up word exists, the DSP wakes up a CPU in a dormant state, the CPU performs semantic analysis on the audio data, and a corresponding operation is then performed according to the analysis result. With this scheme, mobility and the voice wake-up function are retained while system power consumption is greatly reduced, which prolongs the standby time of the mobile terminal and improves its performance.

Description

Technical field

[0001] The present invention relates to the field of communication technology, in particular to a speech recognition method, device and storage medium.

Background

[0002] With the development of artificial intelligence, intelligent hardware products have also developed rapidly. The so-called smart hardware products refer to hardware devices integrated with artificial intelligence functions, such as smart mobile terminals (referred to as mobile terminals). The core of intelligent hardware products is inseparable from interaction with "people", and voice interaction, as a natural interaction method with a low learning cost, has become the mainstream technology of intelligent hardware products.

[0003] In voice interaction, how to perform voice wake-up is an important issue. Taking a mobile terminal as an example, in the prior art, in order to realize rapid voice wake-up, it is generally required that the recording function of the terminal is...

Application Information

Patent Type & Authority: Applications (China)
IPC (8): H04M1/725, H04W52/02, G10L15/26, G06F17/27, G06F1/32, H04M1/72409, H04M1/72448
CPC: H04W52/0261, G06F1/3215, G06F1/3293, G10L15/26, G06F40/30, H04M1/72409, H04M1/72448, G10L2015/088, G10L15/14, G10L15/16, G10L15/22, G10L15/1822, G10L21/02, G06F1/3231, G10L2015/221, G10L15/02, G10L15/083, G10L15/1815, G10L2015/223
Inventor: 唐惠忠
Owner: TENCENT TECH (SHENZHEN) CO LTD