Voice identification method utilizing segmenting-layering construction method

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as large amount of calculation, difficulty in real-time completion, and impact on recognition effect, so as to ensure time-consuming recognition, reduce dependence, and ensure recognition effect Effect

Inactive Publication Date: 2012-12-12

NORTHWESTERN POLYTECHNICAL UNIV

View PDF4 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, these two methods have their own advantages and disadvantages. If the layered construction method is used in general, although the recognition rate is high, the calculation amount is too large and it is difficult to complete it in real time. If the cutting method is used, although the calculation amount is small, the recognition effect depends heavily on In continuous Chinese speech, sometimes it is difficult to accurately judge the boundaries between characters, which will affect the recognition effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment example

[0098] Implementation conditions: the parameters required by the algorithm are shown in Table 3

[0099] Table 3 parameter description

[0100] parameter symbol

Parameter Description

parameter value

f

Sampling frequency

8000

len

frame length

256

inc

frame shift

80

Δ 1

threshold

0.02

α

threshold

0.2

Zcr1

threshold

0.05

Zcr2

threshold

0.15

Zcr3

threshold

0.5

C 0 E1

threshold

0.05

C 0 E2

threshold

0.15

minlen

threshold

15

max silence

threshold

15

maxlen

threshold

35

[0101] In addition, the special frame parameters of each frame of speech are composed of 12-dimensional MFCC parameters and 12-dimensional MFCC difference parameters, a total of 12-dimensional data, and the speech model is a hidden Markov model with 4 st...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice identification method utilizing a segmenting-layering construction method. The method comprises the following steps of: first, acquiring a voice signal and framing the voice signal; then, extracting voice characteristic parameters and calculating normalized complexity energy and normalized zero-crossing rate of each frame of voice; next, segmenting the voice, wherein each segmented voice only has a single word or double words; and finally, identifying each segmented voice respectively. By the method, the dependence of an identification result on the segmenting precision is reduced; when the pronunciations of two words are continuous and are difficult to segment, the two words are identified by adopting a two-layer layering construction method, and the computation amount of the two-layer layering construction method is acceptable and can be finished in real time; and therefore, the identification effect and the identification time consumption are ensured at the same time.

Description

technical field [0001] The invention relates to the field of speech recognition, especially continuous speech recognition technology. Background technique [0002] In the continuous speech recognition technology, two methods are usually adopted, that is, the layered construction method is adopted in the whole, or the speech signal is first cut into isolated words, and then the result is obtained by matching. However, these two methods have their own advantages and disadvantages. If the layered construction method is used in general, although the recognition rate is high, the calculation amount is too large and it is difficult to complete it in real time. If the cutting method is used, although the calculation amount is small, the recognition effect depends heavily on In continuous Chinese speech, sometimes it is difficult to accurately judge the boundaries between characters, which will affect the recognition effect. Contents of the invention [0003] In order to overcome...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/00G10L15/02G10L15/04

Inventor 董月汉

Owner NORTHWESTERN POLYTECHNICAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice identification method utilizing segmenting-layering construction method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment example

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology