Method for detecting active voice endpoints

An active voice endpoint detection technology, applied in the field of voice recognition and detection, that addresses the misjudgment of active and inactive voice and the resulting drop in speech recognition accuracy, and improves the accuracy of both the endpoint judgment and the recognition.

Inactive Publication Date: 2008-07-23
WUDI SCI & TECH (XIAN) CO LTD

AI Technical Summary

Problems solved by technology

[0007] Whether the current input signal frame is an active voice segment (speech during the conversation) or an inactive voice segment (silence or background noise during pauses in the conversatio...


Embodiment Construction

[0019] Voice activity detection determines whether a human voice is present in a signal. In recent years it has been widely used in communications to reduce power consumption. When used for speech recognition, it is a pre-processing step that strongly affects the recognition result: accurate voice activity detection reduces the influence of noise and improves the recognition rate. Traditional voice activity detection mostly relies on information such as speech energy or the zero-crossing rate. The present invention adds a multiple linear regression inference function and other evaluation procedures to this traditional method and applies the result to endpoint detection of the speech, thereby completing the pre-processing for speech recognition.
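The two traditional features named in [0019], frame energy and zero-crossing rate, can be illustrated with a minimal sketch. The function names, the frame length and hop parameters, and the choice of sum-of-squares energy are assumptions made for illustration; the patent text available here does not give the exact formulas.

```python
def split_frames(samples, frame_len, hop):
    """Split a sample sequence into fixed-length frames.

    A trailing partial frame is dropped (an illustrative choice).
    """
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def frame_energy(frame):
    """Short-time energy, taken here as the sum of squared amplitudes."""
    return sum(x * x for x in frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)
```

Voiced speech typically shows high energy and a low zero-crossing rate, while unvoiced consonants show the opposite pattern, which is why the two features are usually considered together.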

[0020] Therefore, in order to solve the problem that the accuracy rate of speech recognition is lowered due to the insufficient parameters of speech acquisit...


Abstract

The invention relates to a method for detecting active voice endpoints, comprising the following steps: 1) receiving continuous speech and extracting frames from it; 2) calculating the energy of each frame obtained in step 1) and deriving an energy threshold from those energies; 3) calculating the zero-crossing rate of each frame obtained in step 1) and deriving a zero-crossing rate threshold from those rates; 4) using a linear regression inference method, with the energies from step 2) and the zero-crossing rates from step 3) as input parameters, to judge whether each frame is active voice or inactive voice; and 5) locating the active voice starting point and the active voice end point among the frames judged in step 4) according to the energy threshold and the zero-crossing rate threshold. The invention increases the accuracy with which the active voice starting point and end point are judged, and thereby also improves the accuracy of speech recognition.
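Steps 2) to 5) of the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the patented procedure: the threshold rule (a multiple of the mean over leading frames assumed to be noise), the training of the regression on frames labelled 1 for active and 0 for inactive, the 0.5 decision cutoff, and all function names are hypothetical, since the available text does not specify them.

```python
def noise_threshold(values, n_noise=5, factor=3.0):
    """Assumed rule: threshold = factor * mean of the leading frames,
    which are taken to contain only background noise."""
    lead = values[:n_noise]
    return factor * sum(lead) / len(lead)


def fit_linear_regression(X, y):
    """Least-squares fit of y ~ b0 + b1*x1 + ... via the normal equations.

    Assumes the feature columns are not collinear; otherwise a pivot is zero.
    """
    rows = [[1.0] + list(x) for x in X]
    n = len(rows[0])
    # Augmented normal-equation matrix [A^T A | A^T y].
    M = [[sum(r[i] * r[j] for r in rows) for j in range(n)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * b for a, b in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def predict(coef, features):
    """Evaluate the fitted linear model on one feature tuple."""
    return coef[0] + sum(c * f for c, f in zip(coef[1:], features))


def classify_frames(coef, energies, zcrs, cutoff=0.5):
    """Mark a frame active when the regression output reaches the cutoff."""
    return [predict(coef, (e, z)) >= cutoff for e, z in zip(energies, zcrs)]


def find_endpoints(active, energies, zcrs, e_thr, z_thr):
    """Return (start, end) indices of the first and last active frame that
    also exceeds either threshold, or None if there is no such frame."""
    keep = [i for i, a in enumerate(active)
            if a and (energies[i] > e_thr or zcrs[i] > z_thr)]
    return (keep[0], keep[-1]) if keep else None
```

In use, the coefficients would be fitted once on labelled frames, each incoming frame would then be classified from its energy and zero-crossing rate, and the endpoints would be read off the classified sequence. Combining the regression decision with the threshold check in `find_endpoints` is one plausible reading of step 5), not a confirmed one.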

Description

Technical field

[0001] The invention relates to a voice recognition detection method, in particular to a method for detecting active voice endpoints that improves the accuracy of active voice recognition.

Background technique

[0002] Once the original analog voice signal has been digitized, it can be used directly for recognition. However, the large amount of data makes processing slow and inefficient, and it is impractical to store all of the original speech as standard reference samples. Features must therefore be extracted according to the characteristics of the digitized voice signal, yielding suitable feature parameters for comparison and recognition. Obtaining representative feature parameters for the voice signal also reduces the amount of data and increases efficiency. Generally, the flow of speaker-independent Chinese speech recognition is shown in Figure 1 and includes the following steps:

[0003] Step 1) Voice signal i...


Application Information

IPC(8): G10L15/04, G10L15/00, G10L11/02, G10L15/05
Inventor: 廖崇伯, 陈淮琰
Owner: WUDI SCI & TECH (XIAN) CO LTD