Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Self-adaptive endpoint detection method and self-adaptive endpoint detection system for isolate word speech recognition

An endpoint detection, isolated word technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of large noise impact, and achieve the effect of good effect, simple implementation, and increased robustness

Inactive Publication Date: 2013-10-23
ZHENGZHOU SCI TECH INFORMATION RESINST
View PDF5 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This innovation improves upon existing methods that use double thresholds or other techniques like signal processing technology (SPT) to identify words without being influenced from background noises such as music sounds or conversations during conversation. By adding this technique into our algorithms we aim to improve their effectiveness while also reducing interference caused by these sources.

Problems solved by technology

Technological Problem: The technical problem addressed in this patented method for detecting spoken words accurately without affecting other sounds like music during conversations between users who are speaking over different languages.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-adaptive endpoint detection method and self-adaptive endpoint detection system for isolate word speech recognition
  • Self-adaptive endpoint detection method and self-adaptive endpoint detection system for isolate word speech recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The specific implementation method of the present invention will be further described below in conjunction with the accompanying drawings.

[0018] Such as figure 1 , figure 2 Shown, the present invention comprises the following steps:

[0019] 1. Voice input

[0020] Input a speech signal containing isolated words to be detected.

[0021] 2. Speech preprocessing and detection threshold parameter selection

[0022] Perform amplitude translation and normalization processing on the speech signal, and then divide the speech signal into frames, and calculate the short-term average energy and short-term average zero-crossing rate of each frame of speech. The high threshold EFVU and low threshold EFVL, the zero-crossing threshold ZCRT, and the upper limit LENU and lower limit LENL of the speech length of isolated words are preset according to experiments and experience.

[0023] 3. Rough estimation of the start and end points of isolated words

[0024] 3-1. Rough estim...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a self-adaptive endpoint detection method and a self-adaptive endpoint detection system for isolate word speech recognition. The self-adaptive endpoint detection method for isolate word speech recognition comprises the following steps: a, a voice input step, wherein a voice signal containing an isolate word to be recognized is input; b, a voice preprocessing step, wherein the voice signal is subjected to amplitude translation and normalization and framing processing operation, and short time average energy and a short time average zero-crossing rate of each frame of voice are calculated; c, an isolate word endpoint rough detection step, wherein isolate word endpoints are roughly estimated through utilization of the short time average energy and the short time average zero-crossing rate of each frame of the voice signal and constraint on the shortest length of continuous voice frames before and after the end points, d, a detection threshold self-adaptive adjustment and accurate endpoint detection step, wherein through utilization of constraint on the smallest time duration and the largest time duration of the isolate word, the detection threshold is subjected to dynamic adjustment operation, the voice endpoints are subjected to front and back fine adjustment, and accurate isolate word endpoints are obtained; e, an isolate word endpoint output and isolate word voice recognition step, wherein the accurate isolate word endpoints are output and isolate word recognition is realized by using voice recognizing technologies.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Owner ZHENGZHOU SCI TECH INFORMATION RESINST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products