Phoneme based voice recognition method and system

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech recognition and phoneme technology, applied in speech recognition, speech analysis, special data processing applications, etc., can solve the problems of low environmental noise, unrecognizable, and low word recognition rate, and achieve high accuracy, high stability, and recognition efficiency. high effect

Inactive Publication Date: 2007-07-04

潘建强

View PDF2 Cites 31 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The continuous speech recognition system based on this technology has the following disadvantages: 1. The speaker's pronunciation is required to be very standard; 2. The recognition environment is required to be very quiet and the environmental noise is small; Recognition; 4. The recognition rate is related to the topic, that is, it is related to the content of the template library, and words that are not in the template cannot be recognized; 5. The recognition system is required to establish a large-capacity recognition template library. 6. The repeatability is very poor, and the same In a word, if it cannot be recognized correctly, the recognition results will be different multiple times

[0013] The existing continuous speech recognition system adopts mixed recognition of unvoiced and voiced sounds, samples the speech signal with a fixed duration, and extracts speech recognition features. Because it cannot guarantee that each sampling is for a single phoneme, the extracted recognition features have poor stability and poor recognition effect. far from ideal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0062] According to the present invention, in a speech signal, a phoneme is a basic unit that can be distinguished by human hearing. According to whether the vocal cords vibrate during pronunciation, phonemes can be divided into voiced and unvoiced. The vocal cords do not vibrate when making unvoiced sounds. In the frequency spectrum of unvoiced sounds, there is no energy-concentrated frequency point in the low frequency area below 400Hz. It can be said that unvoiced sounds have no fundamental frequency, and their waveforms are similar to noise waves. The zero-crossing rate is high. Different from unvoiced sounds, the vocal cords will vibrate when voiced sounds are produced, the volume is higher than that of unvoiced sounds, and the transmission distance is farther. In daily conversations, there are frequency points where energy is concentrated in the low frequency region of 60-400Hz in the spectrum of voiced sounds. The lowest frequency is called the fundamental frequency , ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A voice recognizing method and system based on the phoneme includes: A) the analog voice signal is transferred into digital voice signal; B) the short-time zero-crossing ratio is detected, if the short-time zero-crossing ratio is less than the preset value, it is judged to sonant to processed as sonant, if the short-time zero-crossing ratio is higher than the preset value, it is judged to surd to processed as surd; C) the data after pretreatment is spectrum transformed to pick up character; D) the character data is analyzed; E) the phoneme sequence is output according to the analyzed result. The voice recognizing method and system can introduce different process method to surd and sonant, specially the sonant phoneme is modeled based on the single keynote cycle spectrum; it resolves the defect of current voice input recognizing system. It possesses advantages of high recognizing efficiency, high accuracy and high stability.

Description

technical field [0001] The invention relates to the technical field of computer speech recognition, in particular to a phoneme-based speech recognition method and system. Background technique [0002] The fast Fourier transform-FFT of sequences is one of the most important tools for discrete-time signal analysis and processing. If the signal is a sequence of finite length, the frequency spectrum of the sequence can be obtained by directly performing FFT operation on the sequence. For analog signals, when using FFT for spectrum analysis, the signal must first be sampled to make it a discrete signal. According to the sampling theorem, the sampling frequency fs should be greater than twice the highest frequency of the signal. According to the relationship between digital frequency and analog frequency, the analog frequency resolution can be obtained when using N-point FFT for spectrum analysis: [0003] ΔF=fs / N -----------------------------------------(1) [0004] Therefore...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/02G10L15/00G06F17/00G10L25/93

Inventor潘建强

Owner潘建强

Phoneme based voice recognition method and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology