Voice classification method and device, server and storage medium

A classification method and classifier technology, applied in the field of Internet technology applications, can solve the problems of ignoring the deep information of voice content, rough evaluation, etc., and achieve the effect of fast and effective classification processing.
CN108962231AActive Publication Date: 2018-12-07WUHAN DOUYU NETWORK TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
WUHAN DOUYU NETWORK TECH CO LTD
Publication Date
2018-12-07

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The embodiment of the invention discloses a voice classification method and device, a server and a storage medium, wherein the voice classification method comprises: acquiring the MFCC feature matrixof a target short voice by using a Mel frequency cepstrum coefficient MFCC algorithm, and converting the MFCC feature matrix into a target image; based on a deep learning model, extracting the targetimage feature of the target image; inputting the target image feature into a pre-trained voice classifier, and outputting the category of the target short voice. The embodiment of the invention solvesthe problem that the existing voice classification method ignores the deep information of the voice content, can only roughly evaluate the voices with a large content difference, and realizes an effect of classifying and processing the voice data quickly and effectively.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The embodiment of the present invention relates to the application field of Internet technology, and in particular to a voice classification method, device, server and storage medium. Background technique

[0002] With the rapid development of the Internet industry and the expansion of voice information, how to quickly and accurately classify voice data in massive information and save computing resources is a difficult point at present.

[0003] The existing speech classification method usually calculates the MFCC features of each frame in the speech data, and then stitches the MFCC features of each frame into the overall features of short speech, trains a classifier and performs feature classification, and then obtains classification labels. However, based on the general speech classification method, the deep information of the speech content is ignored, and only a rough assessment can be made on the speech with a large difference in content. Conten...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More