
System and method for detecting language voice frequency

A speech audio detection technology based on phoneme and language information, applied in speech analysis, speech recognition, instruments, etc. It addresses problems such as performance degradation and achieves improved detection performance, good detection stability, and good practicability.

Active Publication Date: 2015-06-03
SUZHOU CHIVOX INFORMATION TECH CO LTD
4 Cites, 37 Cited by

AI Technical Summary

Problems solved by technology

This technology has good accuracy and versatility, but its performance drops sharply on short speech, so it has certain limitations.


Embodiment Construction

[0032] To make the objects, technical solutions and advantages of the present invention clearer, embodiments of the invention are described in further detail below with reference to the accompanying drawings.

[0033] Figure 1 is a schematic structural diagram of a speech audio detection system provided by an embodiment of the present invention. Referring to Figure 1, the system includes: an acoustic feature extraction module, a phoneme recognition module, an acoustic confidence calculation module, a language confidence calculation module, a prosody feature extraction module and a classification discrimination module. Specifically:

[0034] The acoustic feature extraction module extracts acoustic features from the input speech signal; the acoustic features include at least the fundamental frequency feature of the input audio.

[0035] The acoustic features may include: PLP (Perceptual Linear Predictive) fe...
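The fundamental frequency feature named in [0034] can be illustrated with a small sketch. The text available here does not say how the patent computes it, so the autocorrelation method below is an assumption chosen for simplicity, and the function name `estimate_f0`, the 60 to 400 Hz search range and the 0.3 voicing threshold are all hypothetical.

```python
import math

def estimate_f0(frame, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate the fundamental frequency (Hz) of one audio frame.

    Illustrative autocorrelation estimator, not the patent's method.
    Returns 0.0 when the frame looks unvoiced or silent.
    """
    n = len(frame)
    mean = sum(frame) / n
    x = [s - mean for s in frame]           # remove DC offset
    energy = sum(s * s for s in x)
    if energy == 0.0:
        return 0.0                          # silent frame

    # Candidate pitch periods, expressed as lags in samples.
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(int(sample_rate / f_min), n - 1)

    best_lag, best_corr = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Autocorrelation at this lag, normalized by the frame energy.
        corr = sum(x[i] * x[i + lag] for i in range(n - lag)) / energy
        if corr > best_corr:
            best_lag, best_corr = lag, corr

    # Hypothetical voicing threshold: weak periodicity counts as unvoiced.
    if best_corr < 0.3:
        return 0.0
    return sample_rate / best_lag

# Usage: a synthetic 200 Hz tone sampled at 8 kHz (50 ms frame).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200 * i / sample_rate) for i in range(400)]
f0 = estimate_f0(tone, sample_rate)
```

A per-frame value like this would be one input among the acoustic features; a production system would more likely use a dedicated pitch tracker with smoothing across frames.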


Abstract

The invention discloses a speech audio detection system and method, belonging to the technical field of speech signal processing. The system comprises an acoustic feature extraction module, a phoneme recognition module, an acoustic confidence calculation module, a language confidence calculation module, a prosodic feature extraction module and a classification discrimination module. By jointly using acoustic confidence, language confidence and prosodic feature information, the system markedly improves detection performance. It is suitable for detecting audio of different lengths, has good detection stability, and can handle a variety of non-target-language audio and noise audio, which gives it good practicability. It can also be extended quickly to new types of non-target language by providing the acoustic model and language model of the new language and then re-training the classifier model, so the system structure is flexible and extensible.
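The final classification step described in the abstract can be sketched as follows. The abstract does not name the classifier type or the exact feature layout, so this example assumes a logistic regression over a hypothetical three-value vector `[acoustic_confidence, language_confidence, pitch_variation]`; the toy training values are invented purely for illustration and are not data from the patent.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def train_classifier(samples, labels, epochs=500, lr=0.5):
    """Fit a logistic regression with plain stochastic gradient descent."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            p = sigmoid(bias + sum(w * v for w, v in zip(weights, x)))
            err = p - y                     # gradient of the log loss
            weights = [w - lr * err * v for w, v in zip(weights, x)]
            bias -= lr * err
    return weights, bias

def is_target_language(features, weights, bias, threshold=0.5):
    """Return True if the audio is judged to be target-language speech."""
    score = sigmoid(bias + sum(w * v for w, v in zip(weights, features)))
    return score >= threshold

# Hypothetical features: [acoustic_confidence, language_confidence, pitch_variation]
train_x = [
    [0.90, 0.80, 0.30], [0.80, 0.90, 0.40], [0.85, 0.70, 0.35],  # target speech
    [0.30, 0.20, 0.60], [0.20, 0.40, 0.10], [0.40, 0.10, 0.70],  # non-target / noise
]
train_y = [1, 1, 1, 0, 0, 0]
weights, bias = train_classifier(train_x, train_y)
```

Under this sketch, supporting a new non-target language would mean adding feature vectors computed with that language's acoustic and language models to the training set and calling `train_classifier` again, which mirrors the re-training step the abstract describes.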

Description

Technical field

[0001] The invention relates to the technical field of speech signal processing, in particular to a speech audio detection system and method.

Background art

[0002] The actual application environment of speech technology is usually very complex, and the audio received by a system may contain many non-target-language sounds, such as speech in different languages, music, natural noise and artificial noise. Such audio can seriously affect the usability and user experience of speech technology. It is therefore necessary to detect and filter this audio efficiently by technical means.

[0003] Among such technologies, the most typical are language recognition technology and noise detection technology. Language recognition technology uses the phonological information contained in speech (such as special pronunciation units, or different distributions or combinations of pronunciation units) to determine ...


Application Information

IPC(8): G10L25/03, G10L15/02, G10L15/06
Inventor: 王欢良, 杨嵩, 代大明, 袁军峰, 惠寅华, 林远东
Owner: SUZHOU CHIVOX INFORMATION TECH CO LTD