Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice detection method and voice detection device for portable terminal

A portable terminal and voice detection technology, which is applied in voice analysis, voice recognition, instruments, etc., can solve the problems of undetectable voice, unstable voice energy and recording volume, premature recognition of the starting point of voice recognition, etc., and achieve stability Speech recognition, effect of improving accuracy

Active Publication Date: 2018-05-08
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

It can be seen that in the speech detection process, due to the low setting of the speech energy threshold, the background sound (noise) is considered as valid data, which causes the starting point of speech recognition to be identified too early; and if the speech energy threshold is set higher, then in such as image 3 In the case shown in the case, the start point of the speech may not be detected
[0008] In addition, when portable terminals such as smartphones and tablet computers use the voice recognition function in a moving state, the received voice energy and the volume of the recording will be unstable, thereby affecting the accuracy of voice data recognition and the accuracy of the user's voice. Detection of start and end points

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice detection method and voice detection device for portable terminal
  • Voice detection method and voice detection device for portable terminal
  • Voice detection method and voice detection device for portable terminal

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] When performing speech recognition in a state of motion, because the distance between the portable terminal and the sound source of the speech is in a state of change, the received speech energy is inconsistent, so the speech energy thresholds used to identify speech endpoints (that is, speech start and speech end) are not applicable at the same time Speech detection in motion and static state.

[0020] The general idea of ​​the present invention is to dynamically set the speech energy threshold for speech recognition by detecting the motion of the portable terminal and according to the change of the detected motion relative to the speech sound source, so that the dynamically set speech energy threshold can be More accurately detect the start point and end point of the user's voice, optimize the detection of the user's voice, and improve the accuracy of the recognition result. On this basis, the volume of the recorded voice data is also adjusted according to the detecte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice detection method and a voice detection device for a portable terminal. The voice detection method includes: detecting the motion of the portable terminal; and setting a voice energy threshold for voice recognition according to the detected change of the motion relative to a voice sound source. By detecting the movement of the portable terminal and according to the change of the movement relative to the sound source of the speech, dynamically set the speech energy threshold for speech recognition; based on the dynamically set speech energy threshold, the start point and end of the user's speech can be recognized more accurately points to improve the accuracy of speech recognition.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice detection method and device for a portable terminal. Background technique [0002] In technologies related to speech detection such as speech recognition, it is necessary to accurately detect the start point and end point of the speech, obtain valid speech data and perform corresponding processing (for example, record and upload the recorded data to the server). [0003] In the prior art, the detection of the start point and the end point of the speech needs to refer to the preset speech energy threshold, and the time for the energy of the detected speech to change from below the speech energy threshold to above the speech energy threshold The point is considered to be the starting point of the user's voice (speech); the energy of the detected voice changes from above the voice energy threshold to below the voice energy threshold and remains unchanged for a peri...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/20G10L21/02
Inventor 刘俊启
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products