Audio scene recognition method and device based on long-term and short time feature extraction

A technology of feature extraction and recognition methods, applied in speech recognition, speech analysis, instruments, etc., to achieve strong robustness, good discrimination, and strong stability

Active Publication Date: 2018-07-20
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT +1
View PDF10 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to overcome the problem in the prior art that it is difficult to find effective features that can fully characterize different audio scene informa

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio scene recognition method and device based on long-term and short time feature extraction
  • Audio scene recognition method and device based on long-term and short time feature extraction
  • Audio scene recognition method and device based on long-term and short time feature extraction

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0045] Hereinafter, exemplary embodiments of the present disclosure will be described in more detail with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. On the contrary, these embodiments are provided to enable a more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0046] The embodiment of the present invention provides an audio scene recognition method based on long and short-term feature extraction, refer to figure 2 Shown, including:

[0047] S201: Preprocessing the input audio signal to be recognized;

[0048] S202: Perform short-term audio feature extraction on the pre-processed audio signal to be recognized, and then perform long-term audio feature extraction;

[004...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an audio scene recognition method and device based on long-term and short time feature extraction. The method comprises: preprocessing an inputted to-be-recognized audio signal; d carrying out short-time audio feature extraction on the pre-processed to-be-recognized audio signal and carrying out long-term audio feature extraction; and carrying out long-term and short-timeaudio feature combination of the to-be-recognized audio signal, inputting the features into a classification model and a fusion model, carrying out classification and identification, and outputting anidentification label of an audio scene. According to the invention, the long-term features of the audio scene are combined based on the conventional short-time feature extraction and complex audio scene information can be represented; the features are inputted into the classification model and the fusion model to carry out classification and identification; and the identification label of the audio scene is outputted. The method and device have advantages of high robustness and good distinguishing performance; the overall characteristics of the scene data can be represented to the great extent; and the recognition efficiency and the stability are high.

Description

technical field [0001] The invention relates to the field of audio scene recognition, in particular to an audio scene recognition method and device based on long-short-time feature extraction. Background technique [0002] With the development of the information society and the popularization of Internet technology, a large amount of digital audio content is flooding our daily life. Facing the rapid expansion of data volume, traditional analysis methods based on manual text annotation and structured prior knowledge are limited by efficiency and stability, and cannot achieve content analysis and information management of audio data, so that the information that is really concerned by people Or valuable knowledge is submerged in massive audio data. At the same time, specific and complex scenes under real sound acquisition conditions also limit people's effective management of digitized audio content and events. The complexity of the audio scene here is mainly reflected in th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02G10L15/08G10L15/14G10L25/24
CPCG10L15/02G10L15/08G10L15/14G10L25/24
Inventor 袁庆升白海钏张鹏远包秀国刘洋张翠汪立东杜翠兰时磊张鸿云晓春颜永红崔佳林绅文王钲淇
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products