Modeling method and modeling device for language identification

A modeling method and language technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem that time series nonlinear information is not effectively used, and achieve the effect of reducing computational complexity, small changes, and simple implementation

Inactive Publication Date: 2010-11-24
TSINGHUA UNIV
View PDF5 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] 3) The subspace projection and subspace compensation techniques of GMM-SVM are both based on linear space, and the nonlinear information implied by the time series is not effectively utilized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Modeling method and modeling device for language identification
  • Modeling method and modeling device for language identification
  • Modeling method and modeling device for language identification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0032] To achieve the purpose of the present invention, the embodiment of the present invention discloses a modeling method for language recognition. figure 1 A block diagram of the modeling method is shown. Such as figure 1 As shown, the method includes the following steps:

[0033] S101: Input voice data, preprocess the voice data to obtain feature sequences, and map the feature vectors into supervectors according to the coordinate system selection algorithm and feature vector mapping algorithm, project and compensate the supervector...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a modeling method for language identification, which comprises the following steps of: inputting voice data, preprocessing the voice data to obtain a characteristic sequence, mapping a characteristic vector to form a super vector, performing projection compensation on the super vector, and establishing a training language model through an algorithm of a support vector machine; and adopting the steps to obtain a super vector to be measured of the voice to be measured, performing the projection compensation on the super vector to be measured, grading the super vector to be measured by utilizing the language model, and identifying language types of the voice to be measured. The embodiment of the invention also provides a modeling device for the language identification, which comprises a voice preprocessing module, a characteristic extraction module, a multi-coordinate system origin selection module, a characteristic vector mapping module, a subspace extraction module, a subspace projection compensation module, a training module and an identification module. According to the method and the device which are provided by the embodiment of the invention, information which is invalid to the identification in high-dimension statistics is removed, the correction rate of the language identification is improved, and the computational complexity on an integrated circuit is reduced.

Description

technical field [0001] The present invention relates to speech recognition, pattern recognition and signal processing, in particular, the present invention relates to a modeling method and device for language recognition. Background technique [0002] Language recognition refers to the technology of using machines to distinguish the language type of a given speech. Language recognition technology is the front end of the multilingual processing system, which can be used in voice humanized services, voice security monitoring and other fields. [0003] At present, the most popular system modeling method in the field of language recognition is: extracting spectral layer features from the preprocessed speech, and then using GMM (Gaussian Mixture Models, Gaussian Mixture Model) or SVM (Support Vector Machine, Support Vector Machine) to carry out system modeling. [0004] Commonly used spectral layer features are Mel Frequency Cepstral Coefficients (MFCC), Linear Prediction Cepst...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/06G10L21/02
Inventor 何亮张卫强刘加
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products