Voice recognition method facing specific crowd

A speech recognition technology for specific target groups, applied in speech recognition, speech analysis, instruments, etc. It addresses the complex algorithms, heavy computation, and limited applicability of existing methods, reducing the amount of calculation while improving effectiveness and reliability.

Active Publication Date: 2012-04-25
HANGZHOU PINGPONG INTELLIGENT TECH CO LTD

AI Technical Summary

Problems solved by technology

[0009] The current mainstream speech recognition technology is based on the theory of statistical pattern recognition. Because the algorithms are complex and computationally expensive, recognition is generally performed on a PC, which limits its application in portable embedded devices.

Examples

Embodiment Construction

[0037] The speech recognition method of the present invention is described in detail below, taking elderly people over 60 years old as the example target group.

[0038] Referring to Figure 1, a speech recognition method for a specific target group includes:

[0039] S1: sampling the voice signal, converting it from an analog signal to a digital signal;

[0040] Because digital signal processing technology is mature and convenient, analog signals are generally converted to digital signals before processing. A speech signal is a one-dimensional analog signal that varies continuously in both time and amplitude. It must first be sampled and passed through analog-to-digital conversion, which yields a signal that is discrete in both time and amplitude, before it can be processed.

[0041] According to the Nyquist sampling theorem, when the sampling frequency is greater than twice the highest frequency of the signal, the origi...
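Step S1 can be illustrated with a minimal Python sketch of sampling and quantization. It is not taken from the patent: the 8 kHz sampling rate, the 3.4 kHz speech band limit, the 16-bit depth, and the function names `sample_and_quantize` and `satisfies_nyquist` are all assumptions chosen for illustration.

```python
import math

def satisfies_nyquist(fs_hz, f_max_hz):
    """True when the sampling rate exceeds twice the highest signal frequency."""
    return fs_hz > 2 * f_max_hz

def sample_and_quantize(signal, duration_s, fs_hz, bits=16):
    """Sample a continuous-time signal and quantize it to signed integers.

    `signal` is a function of time in seconds returning an amplitude in [-1, 1].
    The result is discrete in time (one value every 1/fs_hz seconds) and
    discrete in amplitude (integers in the signed `bits`-bit range).
    """
    n_samples = int(round(duration_s * fs_hz))
    full_scale = 2 ** (bits - 1) - 1
    samples = []
    for k in range(n_samples):
        x = signal(k / fs_hz)
        x = max(-1.0, min(1.0, x))  # clip to the converter's input range
        samples.append(int(round(x * full_scale)))
    return samples

# Assumed example values: a 1 kHz test tone sampled at 8 kHz for 10 ms.
FS_HZ = 8000
tone = lambda t: math.sin(2 * math.pi * 1000.0 * t)
pcm = sample_and_quantize(tone, 0.01, FS_HZ)

# Assumed band limit of 3.4 kHz; 8 kHz is more than twice that.
ok = satisfies_nyquist(FS_HZ, 3400)
```

A signal component at exactly half the sampling rate does not satisfy the strict inequality, which is why `satisfies_nyquist(8000, 4000)` returns `False`.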

Abstract

The invention discloses a speech recognition method for a specific target group. The method comprises the following steps: first, sampling the voice signal and converting it from an analog signal to a digital signal; then performing front-end processing on the digital voice signal, namely pre-emphasis, windowing, framing, and endpoint detection; next, extracting features from the voice signal using the discrete wavelet transform; and finally, after training on samples, recognizing the feature-extracted voice signal with a discrete hidden Markov model. During front-end processing and feature extraction, the spectral features and pronunciation characteristics of the target group are fully taken into account and the extraction of voice information is optimized, which simplifies both the processing and the information extraction. Recognition precision is thus maintained while the amount of calculation and the storage required during recognition are greatly reduced, making speech recognition feasible on an embedded platform.
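The front-end and feature-extraction stages named in the abstract can be sketched in Python as follows. This is a rough illustration under stated assumptions, not the patent's implementation: the pre-emphasis coefficient 0.97, the Hamming window, the short-time energy threshold, the Haar wavelet, the three decomposition levels, and every function name are hypothetical choices, since the visible text does not specify them. The discrete hidden Markov model stage is omitted.

```python
import math

def pre_emphasis(x, alpha=0.97):
    """First-order high-pass filter: y[n] = x[n] - alpha * x[n-1]."""
    if not x:
        return []
    return [x[0]] + [x[n] - alpha * x[n - 1] for n in range(1, len(x))]

def frame_signal(x, frame_len, hop):
    """Split x into overlapping frames of frame_len samples, hop samples apart."""
    return [x[s:s + frame_len] for s in range(0, len(x) - frame_len + 1, hop)]

def hamming(frame):
    """Apply a Hamming window (frame length must be greater than 1)."""
    n = len(frame)
    return [v * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
            for i, v in enumerate(frame)]

def detect_endpoints(frames, threshold):
    """Return (first, last) indices of frames whose energy exceeds threshold."""
    energies = [sum(v * v for v in f) for f in frames]
    active = [i for i, e in enumerate(energies) if e > threshold]
    return (active[0], active[-1]) if active else None

def haar_dwt(x):
    """One level of the orthonormal Haar DWT: (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    return approx, detail

def dwt_band_energies(frame, levels=3):
    """Energy of each detail band plus the final approximation band."""
    feats = []
    approx = list(frame)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        feats.append(sum(d * d for d in detail))
    feats.append(sum(a * a for a in approx))
    return feats

# Assumed test input: 20 ms silence, 40 ms of a 1 kHz tone, 20 ms silence at 8 kHz.
fs = 8000
silence = [0.0] * 160
tone = [math.sin(2 * math.pi * 1000.0 * n / fs) for n in range(320)]
signal = silence + tone + silence

frames = [hamming(f) for f in frame_signal(pre_emphasis(signal), frame_len=64, hop=32)]
endpoints = detect_endpoints(frames, threshold=0.1)
first, last = endpoints
features = [dwt_band_energies(f) for f in frames[first:last + 1]]
```

Each entry of `features` is a short vector of sub-band energies for one speech frame. In a discrete HMM recognizer, such vectors would typically be vector-quantized into codebook symbols before training and decoding; that step is not shown here.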

Description

Technical field

[0001] The invention relates to the technical field of voice signal processing and recognition, in particular to a speech recognition method for specific target groups.

Background technique

[0002] Speech recognition is a technology by which a machine converts human voice signals into corresponding text or commands through a process of recognition and understanding; that is, it either transcribes the voice signal word by word into written language or responds correctly to the requests and commands contained in the speech. With the development of science and the popularization of computers, people have placed higher demands on the ways they communicate with computers. This has driven the development of speech recognition technology and made it an important research direction in the field of speech processing. Speech recognition has broad application prospects, and its development, maturity and practicality will promote the rapid development of many in...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/14, G10L15/02
Inventor: 熊伟, 刘伟, 王飞浪, 谢伟良, 陈鑫娜, 陈鹏, 熊俱扬, 熊鑫
Owner: HANGZHOU PINGPONG INTELLIGENT TECH CO LTD