Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech class recognition method, apparatus, computer device, and storage medium

A recognition method and computer program technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as loss of characteristic information, affect emotion, effect of character recognition, affect affect, effect of character classification, etc. Effect

Pending Publication Date: 2019-01-25
CHINA PING AN LIFE INSURANCE CO LTD
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These features are mainly concentrated in the time domain or frequency domain alone. For speech signals with associated changes in the time domain and frequency domain features, the above features often lose part of the feature information, which in turn affects the effect of emotion and personality recognition; and the above acoustic features In the extraction process, it will be affected by some factors that have nothing to do with emotion and personality (such as speech content, speaker, environment, etc.)
These irrelevant factors are included in the extracted acoustic features, which will also greatly affect the effect of emotion and personality classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech class recognition method, apparatus, computer device, and storage medium
  • Speech class recognition method, apparatus, computer device, and storage medium
  • Speech class recognition method, apparatus, computer device, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0034] refer to figure 1 , providing a method for recognizing speech categories in an embodiment of the present application, comprising the following steps:

[0035] Step S1, acquiring the first speech information to be recognized, and converting the first speech information into a first spectrogram;

[0036] In this embodiment, the above-mentioned first spectrogram is a kind of spectrogram (including two-dimensional and three-dimensional), which is a graph representing the change of speech spectrum over time. The above-mentioned first voice information may be the custom...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application relates to the field of speech recognition, and provides a method, apparatus, computer device and storage medium for recognizing speech categories. The method comprises: acquiring first speech information to be recognized and converting the first speech information into a first language spectrum map; inputting the first language spectrum map into a preset speech classification model to obtain a classification result of the first language spectrum map, and taking the classification result as a class of the first speech information; wherein, the speech classification model is trained based on a depth convolution neural network by using a language map of a known emotion class or a character class; The speech recognition method, apparatus, computer device and storage medium provided in the present application are convenient to improve the effect of emotion and character classification in speech information.

Description

technical field [0001] The present application relates to the technical field of speech recognition, and in particular to a speech category recognition method, device, computer equipment and storage medium. Background technique [0002] At present, the focus of speech emotion and personality recognition is mainly on the extraction of acoustic features. In the existing speech emotion recognition technology, the acoustic features used for recognition include prosodic features, voice quality features, spectral correlation features, and fusion features formed by filtering the above features. These features are mainly concentrated in the time domain or frequency domain alone. For speech signals with associated changes in the time domain and frequency domain features, the above features often lose part of the feature information, which in turn affects the effect of emotion and personality recognition; and the above acoustic features In the extraction process, it will be affected ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/08G10L15/22G10L15/16
CPCG10L15/08G10L15/16G10L15/22
Inventor 易苗莫洋
Owner CHINA PING AN LIFE INSURANCE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products