
Method for detecting active voice endpoints

An active and inactive voice detection technology, applied in the field of voice recognition and detection. It addresses the reduced voice recognition accuracy caused by misjudging frames, and improves both the accuracy of endpoint judgment and the accuracy of recognition.

Inactive Publication Date: 2011-06-15
WUDI SCI & TECH (XIAN) CO LTD

AI Technical Summary

Problems solved by technology

[0007] Existing judgment methods decide whether the current input frame belongs to an active voice segment (the speech in a conversation) or an inactive voice segment (the silence or background noise during pauses in a conversation), but misjudgments still occur. When a frame is misjudged, the target voice from which feature parameters are acquired contains both active and inactive voice, so the accuracy of voice recognition decreases.



Examples


Embodiment Construction

[0019] Voice activity detection determines whether a human voice is present. In recent years it has been widely used in communications to reduce energy consumption. In speech recognition it serves as a pre-processing step and strongly influences the recognition result: accurate voice activity detection reduces the influence of noise and improves the recognition rate. Traditional voice activity detection mostly relies on information such as voice energy or zero-crossing rate. The present invention adds a multiple linear regression deduction function and further judgment steps to this traditional approach, and performs the endpoint detection that voice recognition requires so that the pre-processing can be completed successfully.
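The two traditional features named above, frame energy and zero-crossing rate, can be illustrated with a minimal sketch. The frame length, hop size, and function names below are assumptions chosen for illustration; the source text does not specify them.

```python
import numpy as np

def frame_signal(signal, frame_len=256, hop=128):
    """Split a 1-D signal into overlapping frames, one frame per row.

    frame_len and hop are illustrative values, not taken from the patent.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = max(0, 1 + (len(signal) - frame_len) // hop)
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return signal[idx]

def short_time_energy(frames):
    """Sum of squared samples in each frame."""
    return np.sum(frames ** 2, axis=1)

def zero_crossing_rate(frames):
    """Fraction of adjacent sample pairs in each frame whose signs differ."""
    signs = np.sign(frames)
    signs[signs == 0] = 1  # treat exact zeros as positive so silence has no crossings
    return np.mean(signs[:, 1:] != signs[:, :-1], axis=1)

# Example: one second of silence followed by a rapidly alternating signal.
sig = np.concatenate([np.zeros(1024), np.tile([1.0, -1.0], 512)])
frames = frame_signal(sig)
energy = short_time_energy(frames)
zcr = zero_crossing_rate(frames)
```

Silent frames give low values for both features, while voiced or noisy frames raise one or both, which is why detectors of this kind combine the two.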

[0020] Therefore, in order to solve the problem that the accuracy of voice recognition is reduced due to insufficient voice parameters in ...


Abstract

The invention relates to a method for detecting active voice endpoints, comprising the following steps: 1) receiving continuous voice and obtaining frames from it; 2) calculating the energy of each frame obtained in step 1) and deriving an energy threshold from those energies; 3) calculating the zero-crossing rate of each frame obtained in step 1) and deriving a zero-crossing rate threshold from those rates; 4) using a linear regression deduction method, with the energies from step 2) and the zero-crossing rates from step 3) as input parameters, to judge whether each frame is active voice or inactive voice; and 5) obtaining the active voice start point and end point from the frames judged in step 4) according to the energy threshold and the zero-crossing rate threshold. The invention increases the accuracy with which the active voice start and end points are judged, and thereby improves the accuracy of voice recognition.
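Steps 2) to 5) can be sketched roughly as follows. This is not the patented procedure: the abstract does not say how the thresholds are derived, how the regression is fitted, or how the thresholds and the regression result are combined. The sketch assumes a noise-based threshold (mean plus a multiple of the standard deviation of the first few frames), a least-squares fit on hand-labelled frames, and a simple AND/OR combination rule. All function names and parameters are hypothetical.

```python
import numpy as np

def noise_threshold(values, n_noise=5, k=3.0):
    """Assumed rule: mean + k * std of the first n_noise frames (taken as background)."""
    head = np.asarray(values[:n_noise], dtype=float)
    return head.mean() + k * head.std()

def fit_linear_regression(energy, zcr, labels):
    """Least-squares fit of label ~ w0 + w1 * energy + w2 * zcr.

    labels (1 = active, 0 = inactive) are assumed to come from annotated data.
    """
    X = np.column_stack([np.ones(len(energy)), energy, zcr])
    w, *_ = np.linalg.lstsq(X, np.asarray(labels, dtype=float), rcond=None)
    return w

def classify_frames(energy, zcr, w, cutoff=0.5):
    """Mark a frame active when the regression output exceeds cutoff (assumed 0.5)."""
    X = np.column_stack([np.ones(len(energy)), energy, zcr])
    return X @ w > cutoff

def find_endpoints(active, energy, zcr, e_thr, z_thr):
    """Return (start, end) frame indices of each active run.

    Assumed combination rule: a frame is kept when the regression marks it
    active and its energy or zero-crossing rate exceeds the matching threshold.
    """
    keep = np.asarray(active) & ((np.asarray(energy) > e_thr) | (np.asarray(zcr) > z_thr))
    padded = np.concatenate([[False], keep, [False]])
    edges = np.flatnonzero(padded[1:] != padded[:-1])  # alternating run starts and run ends
    return [(int(s), int(e) - 1) for s, e in zip(edges[0::2], edges[1::2])]

# Toy per-frame features: five background frames, three speech frames, two background frames.
energy = np.array([0.1] * 5 + [5.0, 6.0, 5.0] + [0.1] * 2)
zcr = np.array([0.05] * 5 + [0.3] * 3 + [0.05] * 2)
labels = [0] * 5 + [1] * 3 + [0] * 2

w = fit_linear_regression(energy, zcr, labels)
active = classify_frames(energy, zcr, w)
endpoints = find_endpoints(active, energy, zcr,
                           noise_threshold(energy), noise_threshold(zcr))
```

In practice the regression weights would be fitted once on annotated recordings and then reused on new input, rather than fitted on the signal being segmented as in this toy example.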

Description

Technical field

[0001] The invention relates to a voice recognition and detection method, in particular to a method for detecting active voice endpoints that improves the accuracy of recognizing active voice.

Background technique

[0002] Once the original analog voice signal has been digitized, it can be used directly for recognition. However, the volume of data is large, so processing takes too long and is inefficient, and it is impractical to store all of the original voice as standard reference samples. Instead, features are acquired according to the characteristics of the digital voice signal to obtain suitable feature parameters for comparison and recognition. Obtaining representative feature parameters of the voice signal also reduces the amount of data and improves efficiency. The existing speaker-independent Chinese speech recognition process is generally as shown in Figure 1 and includes the following steps:

[0003] ...


Application Information

Patent Type & Authority Patents (China)
IPC IPC(8): G10L15/04, G10L15/00, G10L11/02, G10L15/05
Inventor 廖崇伯, 陈淮琰
Owner WUDI SCI & TECH (XIAN) CO LTD