Phoneme based voice recognition method and system

A speech recognition and phoneme technology, applied in speech recognition, speech analysis, special data processing applications, etc. It addresses the problems of requiring low environmental noise, unrecognizable words, and a low word recognition rate, and achieves high accuracy, high stability, and high recognition efficiency.

Inactive Publication Date: 2007-07-04
潘建强
Cites: 2; Cited by: 31

AI Technical Summary

Problems solved by technology

Continuous speech recognition systems based on this technology have the following disadvantages: 1. the speaker's pronunciation must be very standard; 2. the recognition environment must be very quiet, with little environmental noise; 3. ... recognition; 4. the recognition rate depends on the topic, that is, on the content of the template library, and words that are not in the template cannot be recognized; 5. the recognition system must establish a large-capacity recognition template l

Method used


Examples


Embodiment Construction

[0062] According to the present invention, a phoneme is the basic unit of a speech signal that human hearing can distinguish. Phonemes are divided into voiced and unvoiced according to whether the vocal cords vibrate during pronunciation. The vocal cords do not vibrate for unvoiced sounds. The spectrum of an unvoiced sound has no energy-concentrated frequency point in the low-frequency region below 400 Hz, so unvoiced sounds can be said to have no fundamental frequency; their waveforms resemble noise and have a high zero-crossing rate. For voiced sounds, by contrast, the vocal cords vibrate, the volume is higher than that of unvoiced sounds, and the sound carries farther. In everyday conversation, the spectrum of a voiced sound has energy-concentrated frequency points in the low-frequency region of 60-400 Hz. The lowest of these frequencies is called the fundamental frequency, ...
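The idea of taking the lowest energy-concentrated frequency in the 60-400 Hz band as the fundamental frequency can be sketched as follows. This is only an illustration under stated assumptions, not the procedure of the patent: the function names, the naive DFT, the 8000 Hz sampling rate, the 512-sample frame, and the `rel_threshold` rule for deciding what counts as "energy-concentrated" are all hypothetical choices.

```python
import cmath
import math


def dft_magnitude(frame, k):
    """Magnitude of DFT bin k, computed directly (no FFT) for clarity."""
    n_points = len(frame)
    return abs(sum(x * cmath.exp(-2j * math.pi * k * n / n_points)
                   for n, x in enumerate(frame)))


def estimate_fundamental(frame, fs, f_low=60.0, f_high=400.0, rel_threshold=0.5):
    """Return the lowest frequency in [f_low, f_high] whose bin magnitude
    reaches rel_threshold times the strongest bin in that band, or None
    if the band holds no energy. rel_threshold is an assumed heuristic."""
    n_points = len(frame)
    k_low = math.ceil(f_low * n_points / fs)
    k_high = math.floor(f_high * n_points / fs)
    mags = {k: dft_magnitude(frame, k) for k in range(k_low, k_high + 1)}
    peak = max(mags.values(), default=0.0)
    if peak == 0.0:
        return None
    for k in sorted(mags):
        if mags[k] >= rel_threshold * peak:
            return k * fs / n_points
    return None


FS = 8000  # assumed sampling rate in Hz
N = 512    # assumed frame length in samples

# Synthetic "voiced" frame: a 125 Hz fundamental plus two weaker harmonics.
harmonics = [(1, 1.0), (2, 0.8), (3, 0.6)]
voiced_frame = [
    sum(amp * math.sin(2 * math.pi * h * 125 * n / FS) for h, amp in harmonics)
    for n in range(N)
]

f0 = estimate_fundamental(voiced_frame, FS)
```

The test tone is chosen so that 125 Hz falls exactly on a DFT bin; for real speech the estimate is only accurate to within one bin width, and spectral leakage can shift it slightly.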


Abstract

A phoneme-based speech recognition method and system comprising: A) converting the analog speech signal into a digital speech signal; B) measuring the short-time zero-crossing rate: if it is lower than a preset value, the signal is judged to be voiced and processed as voiced; if it is higher than the preset value, the signal is judged to be unvoiced and processed as unvoiced; C) applying a spectrum transform to the preprocessed data to extract features; D) analyzing the feature data; E) outputting the phoneme sequence according to the analysis result. The method and system apply different processing to unvoiced and voiced sounds; in particular, voiced phonemes are modeled on the spectrum of a single pitch period. This resolves the defects of current speech input recognition systems and offers high recognition efficiency, high accuracy, and high stability.
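Step B above can be sketched as a short-time zero-crossing rate test. This is a minimal illustration, not the patented implementation: the function names, the threshold of 0.1, the 8000 Hz sampling rate, and the 256-sample frame are assumptions, since the source only speaks of "a preset value". The source also does not say how a rate exactly equal to the preset value is handled; the sketch treats it as unvoiced.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def classify_frame(frame, threshold=0.1):
    """Label a frame 'voiced' if its zero-crossing rate is below the
    threshold, otherwise 'unvoiced'. The threshold value is hypothetical."""
    return "voiced" if zero_crossing_rate(frame) < threshold else "unvoiced"


FS = 8000  # assumed sampling rate in Hz
N = 256    # assumed frame length in samples

# A 150 Hz sine stands in for a voiced frame (few zero crossings).
low_tone = [math.sin(2 * math.pi * 150 * n / FS) for n in range(N)]

# A sign-alternating sequence stands in for a noise-like unvoiced frame.
noisy = [0.5 * (-1) ** n for n in range(N)]

labels = [classify_frame(low_tone), classify_frame(noisy)]
```

In practice the threshold would have to be tuned to the sampling rate and frame length, because the zero-crossing rate of a given sound scales with both.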

Description

Technical field

[0001] The invention relates to the technical field of computer speech recognition, in particular to a phoneme-based speech recognition method and system.

Background technique

[0002] The fast Fourier transform (FFT) of sequences is one of the most important tools for discrete-time signal analysis and processing. If the signal is a sequence of finite length, the frequency spectrum of the sequence can be obtained by directly performing an FFT operation on the sequence. For analog signals, when using the FFT for spectrum analysis, the signal must first be sampled to make it a discrete signal. According to the sampling theorem, the sampling frequency fs should be greater than twice the highest frequency of the signal. According to the relationship between digital frequency and analog frequency, the analog frequency resolution obtained when using an N-point FFT for spectrum analysis is:

[0003] ΔF = fs / N    (1)

[0004] Therefore...
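Equation (1) can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names and the example values (8000 Hz sampling, 256 points, a 10 Hz target) are assumptions, and restricting N to powers of two reflects common radix-2 FFT practice rather than anything stated in the source.

```python
def frequency_resolution(fs, n_points):
    """Analog frequency resolution of an N-point FFT: delta_F = fs / N."""
    return fs / n_points


def min_fft_points(fs, target_resolution):
    """Smallest power-of-two N for which fs / N <= target_resolution.
    The power-of-two restriction is an assumption of this sketch."""
    n_points = 1
    while fs / n_points > target_resolution:
        n_points *= 2
    return n_points


# With an assumed 8000 Hz sampling rate, a 256-point FFT resolves 31.25 Hz.
delta_f = frequency_resolution(8000, 256)

# Resolving 10 Hz or finer at the same rate needs at least 1024 points.
n_needed = min_fft_points(8000, 10)
```

The trade-off is visible directly: at a fixed sampling rate, finer frequency resolution requires a longer analysis frame.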

Claims


Application Information

IPC(8): G10L15/02, G10L15/00, G06F17/00, G10L25/93
Inventor 潘建强
Owner 潘建强