Supercharge Your Innovation With Domain-Expert AI Agents!

Voice identification method utilizing segmenting-layering construction method

A speech recognition and speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as large amount of calculation, difficulty in real-time completion, and impact on recognition effect, so as to ensure time-consuming recognition, reduce dependence, and ensure recognition effect Effect

Inactive Publication Date: 2012-12-12
NORTHWESTERN POLYTECHNICAL UNIV
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these two methods have their own advantages and disadvantages. If the layered construction method is used in general, although the recognition rate is high, the calculation amount is too large and it is difficult to complete it in real time. If the cutting method is used, although the calculation amount is small, the recognition effect depends heavily on In continuous Chinese speech, sometimes it is difficult to accurately judge the boundaries between characters, which will affect the recognition effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice identification method utilizing segmenting-layering construction method
  • Voice identification method utilizing segmenting-layering construction method
  • Voice identification method utilizing segmenting-layering construction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment example

[0098] Implementation conditions: the parameters required by the algorithm are shown in Table 3

[0099] Table 3 parameter description

[0100] parameter symbol

Parameter Description

parameter value

f

Sampling frequency

8000

len

frame length

256

inc

frame shift

80

Δ 1

threshold

0.02

α

threshold

0.2

Zcr1

threshold

0.05

Zcr2

threshold

0.15

Zcr3

threshold

0.5

C 0 E1

threshold

0.05

C 0 E2

threshold

0.15

minlen

threshold

15

max silence

threshold

15

maxlen

threshold

35

[0101] In addition, the special frame parameters of each frame of speech are composed of 12-dimensional MFCC parameters and 12-dimensional MFCC difference parameters, a total of 12-dimensional data, and the speech model is a hidden Markov model with 4 st...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice identification method utilizing a segmenting-layering construction method. The method comprises the following steps of: first, acquiring a voice signal and framing the voice signal; then, extracting voice characteristic parameters and calculating normalized complexity energy and normalized zero-crossing rate of each frame of voice; next, segmenting the voice, wherein each segmented voice only has a single word or double words; and finally, identifying each segmented voice respectively. By the method, the dependence of an identification result on the segmenting precision is reduced; when the pronunciations of two words are continuous and are difficult to segment, the two words are identified by adopting a two-layer layering construction method, and the computation amount of the two-layer layering construction method is acceptable and can be finished in real time; and therefore, the identification effect and the identification time consumption are ensured at the same time.

Description

technical field [0001] The invention relates to the field of speech recognition, especially continuous speech recognition technology. Background technique [0002] In the continuous speech recognition technology, two methods are usually adopted, that is, the layered construction method is adopted in the whole, or the speech signal is first cut into isolated words, and then the result is obtained by matching. However, these two methods have their own advantages and disadvantages. If the layered construction method is used in general, although the recognition rate is high, the calculation amount is too large and it is difficult to complete it in real time. If the cutting method is used, although the calculation amount is small, the recognition effect depends heavily on In continuous Chinese speech, sometimes it is difficult to accurately judge the boundaries between characters, which will affect the recognition effect. Contents of the invention [0003] In order to overcome...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/00G10L15/02G10L15/04
Inventor 董月汉
Owner NORTHWESTERN POLYTECHNICAL UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More