Method for detecting movable voice endpoint

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An inactive voice and voice technology, applied in the field of voice recognition and detection, can solve the problems of misjudgment of judgment and reduction of the accuracy rate of voice recognition, and achieve the effect of improving the accuracy rate and the accuracy rate of judgment.

Inactive Publication Date: 2008-07-23

WUDI SCI & TECH (XIAN) CO LTD

View PDF0 Cites 5 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] Whether the current input signal sound frame is an active voice (activevoice, meaning the voice of the conversation in the conversation) paragraph or an inactive voice (inactive voice, meaning the silence or background noise that pauses in the conversation) is determined by the currently used judgment method. Misjudgments still occur

If a misjudgment occurs, when the feature parameters are acquired, because the target voice includes active voice and inactive voice, the accuracy of voice recognition will decrease

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0019] Voice activity detection is used to determine whether there is a human voice. In recent years, it has been widely used in communication to save energy consumption. If it is used for speech recognition, it belongs to the pre-processing of speech recognition, which has a great impact on the recognition results. Accurate voice activity detection can reduce the impact of noise and improve the recognition rate. Traditional voice activity detection mostly uses information such as voice energy or zero-crossing rate to judge. The present invention specifically adds a mathematical deduction function of multiple linear regression and other evaluation procedures to the above-mentioned voice activity detection method. The speech is used for endpoint detection to successfully complete the pre-processing of speech recognition.

[0020] Therefore, in order to solve the problem that the accuracy rate of speech recognition is lowered due to the insufficient parameters of speech acquisit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a detecting method of active voice end, which comprises the following steps including 1) receiving continuous voice and obtaining frame from the continuous voice, 2) calculating the energies of the frame obtained in step 1) and obtaining energy threshold value according to the energies, 3) respectively calculating the zero-crossing rate of the frame obtained in step 1) and obtaining the zero-crossing rate threshold value according to the zero-crossing rates, 4) using linear regression deductive method to judge whether each frame is active voice or inactive voice by taking the energies obtained in step 2) and the zero-crossing rates obtained in step 3) as input parameter of the linear regression deductive method and 5) obtaining active voice starting point and active voice end point in the active voices or inactive voices of the step 4) according to the energy threshold value and the zero-crossing rate threshold value. The invention increases the judging accuracy rate of the active voice starting point and active voice end point, and also improves the correctness rate of voice identification.

Description

technical field [0001] The invention relates to a voice recognition detection method, in particular to a detection method for active voice endpoints used to improve the correct rate of active voice recognition. Background technique [0002] After the original voice analog signal is digitized, it can be directly used for identification, but due to the large amount of data, long processing time, and poor efficiency, it is impossible to store all the original voice as a standard voice reference sample, so it must be According to the characteristics of the digitized voice signal, feature acquisition is performed to obtain appropriate feature parameters for comparison and identification. Moreover, obtaining representative feature parameters for the voice signal can reduce the amount of data and increase efficiency. Generally, the flow of Chinese speech recognition for non-specific speakers is shown in Figure 1, which includes the following steps: [0003] Step 1) Voice signal i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/04G10L15/00G10L11/02G10L15/05

Inventor 廖崇伯陈淮琰

Owner WUDI SCI & TECH (XIAN) CO LTD

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method for detecting movable voice endpoint

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology