Method and system for recognizing speech of agglutinative language

A speech recognition technology for agglutinative languages, applied in speech recognition, speech analysis, instruments, etc. It addresses the high confusability of the acoustic model and the unsatisfactory results obtained on triphone (three-factor) modeling units, reducing acoustic model confusability and improving recognition performance.

Active Publication Date: 2013-04-03

INST OF ACOUSTICS CHINESE ACAD OF SCI +1

4 Cites, 12 Cited by


AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

There is still no effective solution to the second challenge, acoustic modeling under severe coarticulation. Some researchers have tried to reduce the confusability of the acoustic model by introducing the concept of allophones under short-term features. Experiments show that this approach gives a clear improvement on basic monophone (single-factor) modeling units, but the results are not ideal on the triphone (three-factor) modeling units used in conventional speech recognition systems.

Method used



Embodiment Construction

[0017] The technical solutions of the present invention will be described in further detail below with reference to the accompanying drawings and embodiments.

[0018] The embodiments of the present invention address the problems encountered in acoustic modeling for speech recognition of agglutinative languages, which include Korean and Uyghur. For convenience, Korean is used as the example below. A refined speech analysis method is used to discover differences between phonemes, so that Korean phonemes that the speech recognition system previously treated as having identical pronunciation can be distinguished. This reduces the confusability of the acoustic model and improves the overall performance of the system.

[0019] In Korean speech recognition systems, the main cause of high acoustic model confusability is coarticulation. Given that human coarticulation usually affects hundreds of millis...


Abstract

The embodiment of the invention relates to a method and a system for recognizing Korean speech. The method comprises the following steps: extracting long-term speech features; calculating the posterior probability of an extended phoneme set from the long-term features; performing PCA (Principal Component Analysis) dimension reduction on the posterior probability to obtain MLP (Multilayer Perceptron) features based on the long-term features; and performing speech recognition on the MLP features within a Gaussian mixture model-hidden Markov model (GMM-HMM) framework to obtain a recognition result. The method and the system exploit the strength of long-term features in capturing coarticulation to subdivide the Korean phoneme set into finer classes, which effectively reduces the confusability of the acoustic models and improves recognition performance.
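The feature-extraction steps in the abstract can be sketched as follows. This is only an illustrative sketch, not the patented implementation: the context window size, the single hidden layer with random untrained weights, the phone-set size, the logarithm applied before PCA, and every function name are assumptions introduced for the example. The final GMM-HMM decoding stage is omitted.

```python
import numpy as np


def stack_context(frames, left, right):
    """Concatenate each frame with its neighbours to form a long-term feature."""
    num_frames = frames.shape[0]
    # Repeat the edge frames so that every position has a full context window.
    padded = np.pad(frames, ((left, right), (0, 0)), mode="edge")
    windows = [padded[i:i + num_frames] for i in range(left + right + 1)]
    return np.concatenate(windows, axis=1)


def mlp_posteriors(x, w1, b1, w2, b2):
    """One-hidden-layer MLP with a softmax output over the phone set."""
    hidden = np.tanh(x @ w1 + b1)
    logits = hidden @ w2 + b2
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    exp = np.exp(logits)
    return exp / exp.sum(axis=1, keepdims=True)


def pca_reduce(data, num_components):
    """Project data onto its leading principal components."""
    centered = data - data.mean(axis=0)
    # Rows of vt are principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:num_components].T


# Hypothetical sizes: 200 frames of 13-dim short-term features,
# a 31-frame context window, 60 extended phone classes, 25 output dims.
rng = np.random.default_rng(0)
short_term = rng.standard_normal((200, 13))
long_term = stack_context(short_term, left=15, right=15)

hidden_size, num_phones = 64, 60
w1 = 0.01 * rng.standard_normal((long_term.shape[1], hidden_size))
b1 = np.zeros(hidden_size)
w2 = 0.01 * rng.standard_normal((hidden_size, num_phones))
b2 = np.zeros(num_phones)

posteriors = mlp_posteriors(long_term, w1, b1, w2, b2)
# Assumption: take log posteriors before PCA; the abstract only says PCA is
# applied to the posterior probability.
mlp_features = pca_reduce(np.log(posteriors + 1e-10), num_components=25)
```

In a real system the MLP weights would be trained on frame-level labels from the extended phoneme set, and `mlp_features` would then be passed to a GMM-HMM recognizer in place of (or alongside) the short-term features.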

Description

Technical field

[0001] The invention relates to the field of speech recognition, in particular to a method and system for speech recognition of agglutinative languages.

Background technique

[0002] Agglutinative languages mainly rely on changes in word endings to express grammatical relationships, and their typical feature is that word-level units are formed by concatenating a large number of morphemes. Compared with Chinese, which is an analytic language, this agglutinative nature brings many new challenges to speech recognition and greatly degrades performance under the traditional speech recognition framework. The most important of these challenges can be summarized in two points. The first concerns language modeling: natural Korean language units, such as the words separated by spaces, are not well suited to serve as language model modeling units. The second concerns acoustic modeling: the severe co-pronunciat...

Claims


Application Information


IPC(8): G10L15/02, G10L15/08

Inventor: 颜永红, 徐及, 潘接林

Owner: INST OF ACOUSTICS CHINESE ACAD OF SCI
