Audio scene recognition method based on acoustic events

A scene recognition and audio technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of less audio scene recognition, achieve good promotion, improve accuracy, and make reasonable and accurate judgments

Inactive Publication Date: 2013-07-31
SHANDONG NORMAL UNIV
View PDF6 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Audio scene recognition has such a broad application prospect and urgent market demand, but at present,

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio scene recognition method based on acoustic events
  • Audio scene recognition method based on acoustic events
  • Audio scene recognition method based on acoustic events

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be further described below in conjunction with the accompanying drawings and embodiments. figure 1 The flow chart of the audio scene recognition method based on acoustic events is given. The method is divided into four steps: Step 1: Carry out audio segmentation on the audio stream to be used for audio scene recognition to form audio scene fragments and audio frames; Step 2 : Classify the audio frames contained in each audio scene segment through the acoustic event model to obtain the probability relationship between the audio frame and each acoustic event class; Step 3: For each audio scene segment, synthesize the audio scene segment The information of all the included audio frames obtains the probability relationship between the audio scene segment and each acoustic event class; Step 4: For each audio scene segment, according to the probability relationship between it and each acoustic event class, the audio scene segment contains The main a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an audio scene recognition method based on acoustic events, which comprises the steps as follows: Step I, conducting audio division on an audio stream to be subjected to audio scene recognition, Step II, classifying audio frames included in audio scene segments with an acoustic event model and obtaining a probability relationship among the audio frames and acoustic event classes, Step III, obtaining a probability relationship among the audio scene segments and the acoustic event classes according to the information of all the audio frames included in the audio scene segments, and Step IV, determining main acoustic events included in the audio scene segments and determining semantic scenes of the audio scene segments. The method judges the main acoustic events more reasonably and accurately, so that the recognition accuracy rate of the semantic scenes can be increased, and the method is good in generalization performance, provides good assistance to the video scene recognition, and increases the accuracy rate of the video scene recognition.

Description

technical field [0001] The invention relates to the fields of pattern recognition and multimedia information processing, in particular to an audio scene recognition method based on acoustic events. Background technique [0002] At present, with the rapid development of the information society, multimedia information data has shown explosive growth. How to effectively use these multimedia data to serve people's daily life has become an urgent problem to be solved. Multimedia data includes images, audio and other forms. At present, the research and utilization of images has been very extensive, but the research on audio started relatively late, and there are still many technical problems to be solved urgently. [0003] A continuous audio stream usually contains a series of acoustic events, such as speech, laughter, music, etc., and an audio scene refers to an audio segment composed of several temporally adjacent and semantically related acoustic events. . Compared with acou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/06G10L25/48
Inventor 冷严徐新艳
Owner SHANDONG NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products