Audio scene recognition method and device based on long-term and short time feature extraction

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of feature extraction and recognition methods, applied in speech recognition, speech analysis, instruments, etc., to achieve strong robustness, good discrimination, and strong stability

Active Publication Date: 2018-07-20

NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT +1

View PDF10 Cites 29 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The purpose of the present invention is to overcome the problem in the prior art that it is difficult to find effective features that can fully characterize different audio scene information, and introduce a more robust and distinguishable feature extraction method, thereby providing a long-short time-based Audio scene recognition method and device based on feature extraction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0045] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0046] An embodiment of the present invention provides an audio scene recognition method based on long-short-time feature extraction, referring to figure 2 shown, including:

[0047] S201. Perform preprocessing on the input audio signal to be identified;

[0048] S202. Perform short-term audio feature extraction on the pre-processed audio signal to be recognized, and then perform long-term audio feature extraction;

[0049] S203...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to an audio scene recognition method and device based on long-term and short time feature extraction. The method comprises: preprocessing an inputted to-be-recognized audio signal; d carrying out short-time audio feature extraction on the pre-processed to-be-recognized audio signal and carrying out long-term audio feature extraction; and carrying out long-term and short-timeaudio feature combination of the to-be-recognized audio signal, inputting the features into a classification model and a fusion model, carrying out classification and identification, and outputting anidentification label of an audio scene. According to the invention, the long-term features of the audio scene are combined based on the conventional short-time feature extraction and complex audio scene information can be represented; the features are inputted into the classification model and the fusion model to carry out classification and identification; and the identification label of the audio scene is outputted. The method and device have advantages of high robustness and good distinguishing performance; the overall characteristics of the scene data can be represented to the great extent; and the recognition efficiency and the stability are high.

Description

technical field [0001] The invention relates to the field of audio scene recognition, in particular to an audio scene recognition method and device based on long-short-time feature extraction. Background technique [0002] With the development of the information society and the popularization of Internet technology, a large amount of digital audio content is flooding our daily life. Facing the rapid expansion of data volume, traditional analysis methods based on manual text annotation and structured prior knowledge are limited by efficiency and stability, and cannot achieve content analysis and information management of audio data, so that the information that is really concerned by people Or valuable knowledge is submerged in massive audio data. At the same time, specific and complex scenes under real sound acquisition conditions also limit people's effective management of digitized audio content and events. The complexity of the audio scene here is mainly reflected in th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/02G10L15/08G10L15/14G10L25/24

CPCG10L15/02G10L15/08G10L15/14G10L25/24

Inventor袁庆升白海钏张鹏远包秀国刘洋张翠汪立东杜翠兰时磊张鸿云晓春颜永红崔佳林绅文王钲淇

OwnerNAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT

Audio scene recognition method and device based on long-term and short time feature extraction

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology