Embedded Chinese-English mixed voice recognition method and system for non-specific people

A speaker-independent, mixed-language speech technology, applied in speech recognition, speech analysis, instruments, etc. It addresses problems such as speech recognition being impossible to realize, and achieves a low computational load, a high recognition rate, and a small storage footprint.
CN101604522A · Inactive · Publication Date: 2009-12-16 · 北京森博克智能科技有限公司

Patent Information

Authority / Receiving Office: CN · China
Current Assignee / Owner: 北京森博克智能科技有限公司
Publication Date: 2009-12-16
Estimated Expiration: Not applicable · inactive patent


Abstract

The invention relates to a speaker-independent speech recognition method and system that supports mixed Chinese-English speech and targets embedded applications. It uses an acoustic model trained on a large amount of speech data, together with an acoustic modeling unit set compatible with both Chinese and English pronunciation, to achieve speaker-independent Chinese-English mixed speech recognition. Several background models are used: the Gaussian mixture model (GMM) parameters are obtained by mean-adaptive training from the background models, and the differences between the GMM means and the background model means are then vector quantized, which compresses the model parameters. In the recognition stage, fast Gaussian selection, acoustic score pre-computation, and a simplified GMM are used, so that the recognition computation and the model storage space are greatly reduced and the method and system can be applied in a wide range of embedded application systems.
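
The compression step in the abstract (storing each adapted GMM mean as a quantized offset from a background model mean) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names, the one-to-one pairing of adapted and background means, and the use of plain k-means to train the codebook are not taken from the patent text, which does not specify these details here.

```python
import random

def nearest(codebook, v):
    """Index of the codeword closest to v (squared Euclidean distance)."""
    return min(range(len(codebook)),
               key=lambda k: sum((a - b) ** 2 for a, b in zip(codebook[k], v)))

def train_codebook(vectors, size, iters=10, seed=0):
    """Plain k-means, used here as a stand-in for the (unspecified) VQ training."""
    rng = random.Random(seed)
    codebook = [list(v) for v in rng.sample(vectors, size)]
    for _ in range(iters):
        buckets = [[] for _ in codebook]
        for v in vectors:
            buckets[nearest(codebook, v)].append(v)
        for k, bucket in enumerate(buckets):
            if bucket:  # keep the old codeword if no vector was assigned to it
                codebook[k] = [sum(col) / len(bucket) for col in zip(*bucket)]
    return codebook

def compress(adapted_means, background_means, size):
    """Quantize the offsets (adapted mean minus background mean).

    Returns the codebook and one codeword index per Gaussian; only these,
    plus the shared background means, would need to be stored.
    """
    offsets = [[m - b for m, b in zip(am, bm)]
               for am, bm in zip(adapted_means, background_means)]
    codebook = train_codebook(offsets, size)
    indices = [nearest(codebook, o) for o in offsets]
    return codebook, indices

def reconstruct(background_means, codebook, indices):
    """Approximate the adapted means as background mean plus quantized offset."""
    return [[b + d for b, d in zip(bm, codebook[i])]
            for bm, i in zip(background_means, indices)]
```

Under these assumptions, each Gaussian costs one small integer index instead of a full mean vector, and the reconstruction error is bounded by how well the codebook covers the offsets. Because adaptation changes only the means, the offsets tend to be small and cluster well, which is what makes a compact codebook plausible.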

Description

Technical field

[0001] The invention relates to the technical field of automatic speech recognition. It is a speaker-independent speech recognition method and system that supports mixed Chinese and English speech and targets embedded application environments with limited computing and storage resources.

Background art

[0002] Speech is the most natural and convenient way for human beings to communicate and obtain information. Intelligent voice interaction technology mainly includes speech recognition, speech synthesis, and speech evaluation. Intelligent voice interaction will be the next breakthrough in human-computer interaction after the graphical user interface (GUI).

[0003] Speech recognition is a technology that lets machines understand human speech by automatically converting speech signals into text and related information. It is a very important and critical part of intelligent ...
