Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Deep-learning-technology-based automatic accent classification method and apparatus

A deep learning and accent technology, applied in speech analysis, speech recognition, instruments, etc., to achieve the effect of improving performance, good classification performance, and good accent classification effect

Inactive Publication Date: 2016-06-01
INST OF AUTOMATION CHINESE ACAD OF SCI +2
View PDF5 Cites 40 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Model Although discriminative training methods (discriminative training methods) of the GMM model such as the minimum classification error criterion (MCE) have been used to suppress the confusion area and also increase the discriminative ability of the accent model, the performance of the text-independent accent classification method is still low. needs to be further improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep-learning-technology-based automatic accent classification method and apparatus
  • Deep-learning-technology-based automatic accent classification method and apparatus
  • Deep-learning-technology-based automatic accent classification method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] It should be understood that the following detailed description of the various examples and drawings is not intended to limit the invention to the particular illustrative embodiments; the described illustrative embodiments merely exemplify the various steps of the invention, the scope of which is defined by the appended claims to define.

[0029] The present invention replaces the shallow model used in traditional methods by establishing an automatic accent classification algorithm based on deep learning technology, and learns more robust accents through the stronger differentiation of deep neural network itself and its deep structure in deep learning technology. Sticky high-level features that improve the performance of automatic accent classification algorithms. A step further is that the use of deep learning technology can more effectively utilize features containing speech context information, thereby further improving the classification effect of automatic accent c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a deep-learning-technology-based automatic accent classification method and apparatus. The method comprises: mute voice elimination is carried out on all accent voices in a training set and mel-frequency cepstrum coefficient (MFCC) feature extraction is carried out; according to the extracted MFCC feature, deep neural networks of various accent voices are trained to describe acoustic characteristics of various accent voices, wherein the deep neural networks are forward artificial neural networks at least including two hidden layers; probability scores of all voice frames of a to-be-identified voice at all accent classifications in the deep neural networks are calculated and an accent classification tag with the largest probability score is set as a voice identification tag of the voice frame; and the voice classification of each voice frame in the to-be-identified voice is used for carrying out majority voting to obtain a voice classification corresponding to the to-be-identified voice. According to the invention, context information can be utilized effectively and thus a classification effect better than a traditional superficial layer model can be provided.

Description

technical field [0001] The invention relates to a method for de-muting speech with an accent and extracting features, a deep neural network modeling, parameter selection, training and inference method, and accent classification. Background technique [0002] Accents in speech are divided into native accents and foreign accents. Native accent refers to the accent produced by the speaker who is influenced by the dialect in his native language when he pronounces it in his native language. Foreign language accent refers to the accent produced by the speaker who is influenced by the pronunciation of the mother tongue when speaking in a non-native language. In the present invention, we mainly aim at the classification problem of mother tongue accent in Chinese. [0003] Commonly used methods for classifying Chinese accented speech mainly fall into two categories: text-dependent methods and text-independent methods. [0004] The text-related accent classification method refers t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/32G10L25/24G10L15/16
Inventor 刘文举陈明明张邯平高鹏董理科刘晓飞乔利玮王桐
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products