Audio segmentation method and device

An audio segmentation method and device in the field of audio processing. It addresses the problem that traditional segmentation does not make use of the structured information and semantic content that can be extracted from audio.

Active Publication Date: 2016-09-07
BEIJING QIYI CENTURY SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] Traditional audio segmentation methods generally first extract Mel-frequency cepstral coefficient (MFCC) features from the target audio, and then divide the audio into speech parts and silent parts according to those features and a preset Gaussian mixture classification model. This achieves a basic division of the audio. In practical applications, however, the content of the speech part is rich and varied; a broadcast audio stream, for example, contains complex and changeable audio signals. Because the traditional method distinguishes only speech parts and silent parts, it does not take advantage of the structural information and semantic content that can be extracted from the audio.
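The traditional speech/silence split described above can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the two-dimensional toy feature vectors stand in for real MFCC vectors, and the mixture parameters in `SPEECH_GMM` and `SILENCE_GMM` are hypothetical values chosen for the example. Each frame is assigned to whichever preset mixture gives it the higher likelihood.

```python
import math

def log_gaussian(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def log_gmm(x, components):
    """Log density of a mixture; components are (weight, mean, var) tuples."""
    logs = [math.log(w) + log_gaussian(x, m, v) for w, m, v in components]
    peak = max(logs)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(value - peak) for value in logs))

def classify_frames(frames, speech_gmm, silence_gmm):
    """Label each frame by the mixture model with the higher log-likelihood."""
    return [
        "speech" if log_gmm(f, speech_gmm) > log_gmm(f, silence_gmm) else "silence"
        for f in frames
    ]

# Hypothetical preset models over 2-D toy features (assumption: real MFCC
# vectors have more dimensions and the models would be trained on labeled audio).
SPEECH_GMM = [(0.5, [4.0, 1.0], [1.0, 1.0]), (0.5, [6.0, -1.0], [1.0, 1.0])]
SILENCE_GMM = [(1.0, [0.0, 0.0], [0.5, 0.5])]

frames = [[0.1, -0.2], [5.5, -0.8], [4.2, 1.1], [0.0, 0.3]]
labels = classify_frames(frames, SPEECH_GMM, SILENCE_GMM)
print(labels)
```

A classifier of this kind yields only two labels per frame, which is the limitation the paragraph above points out.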



Examples


Embodiment Construction

[0049] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0050] To solve the problems in the prior art, the embodiments of the present invention disclose an audio segmentation method and device that segment audio by integrating multiple features, distinguishing silent parts, music parts, and non-music parts.

[0051] The present invention will be described in detail below through specific examples.

[0052] Figure 1 is a schematic flow diagram of an audio segmenta...



Abstract

The embodiments of the invention disclose an audio segmentation method and device. The method comprises: extracting a target feature value of the target audio according to a preset feature extraction algorithm; segmenting the target audio into a target speech part and a target silent part according to the target feature value; using the target feature value as an input parameter of a preset Gaussian model to obtain a posterior probability for the target audio; segmenting the target speech part according to the posterior probability and a preset classification model to obtain a target music part and a target non-music part, wherein the preset classification model is based on multi-feature fusion and context association; and generating a segmentation result for the target audio from the target silent part, the target music part, and the target non-music part. The invention can thus segment audio into a silent part, a music part, and a non-music part.
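The data flow in the abstract can be illustrated with a small sketch. Everything specific here is an assumption made for the example: silence is detected with a simple energy threshold, the posterior comes from two one-dimensional Gaussians with hypothetical parameters, and a moving average over neighbouring frames stands in for the context association of the preset classification model. Multi-feature fusion is omitted; only one feature per frame is used.

```python
import math
from itertools import groupby

def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)

def music_posterior(x, music=(0.2, 0.01), other=(0.8, 0.04), prior_music=0.5):
    """P(music | x) by Bayes' rule; the (mean, var) pairs are hypothetical."""
    pm = prior_music * gaussian_pdf(x, *music)
    po = (1 - prior_music) * gaussian_pdf(x, *other)
    return pm / (pm + po)

def smooth(values, radius=1):
    """Moving average over neighbouring frames (stand-in for context association)."""
    out = []
    for i in range(len(values)):
        window = values[max(0, i - radius): i + radius + 1]
        out.append(sum(window) / len(window))
    return out

def segment(frames, silence_energy=0.05):
    """frames: list of (energy, feature). Returns (label, start, end), end exclusive."""
    labels = []
    voiced_flags = [energy >= silence_energy for energy, _ in frames]
    # Handle each contiguous silent or voiced run separately, so smoothing
    # never mixes frames from opposite sides of a silence gap.
    for is_voiced, run in groupby(zip(voiced_flags, frames), key=lambda t: t[0]):
        run = list(run)
        if not is_voiced:
            labels += ["silence"] * len(run)
            continue
        posteriors = smooth([music_posterior(feature) for _, (_, feature) in run])
        labels += ["music" if p >= 0.5 else "non-music" for p in posteriors]
    # Merge equal consecutive labels into segments.
    segments, start = [], 0
    for label, run in groupby(labels):
        length = len(list(run))
        segments.append((label, start, start + length))
        start += length
    return segments

frames = [
    (0.01, 0.00), (0.01, 0.00),                               # low energy
    (0.5, 0.20), (0.6, 0.25), (0.5, 0.60), (0.6, 0.20), (0.5, 0.22),
    (0.02, 0.00),                                             # low energy
    (0.7, 0.80), (0.7, 0.85), (0.6, 0.75),
]
result = segment(frames)
print(result)
```

In this toy input, the single frame with feature value 0.60 inside the first voiced run would be labeled non-music on its own posterior, but smoothing over its neighbours keeps the run as one music segment.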

Description

Technical field

[0001] The invention relates to the field of audio processing, in particular to an audio segmentation method and device.

Background

[0002] With the continuous development of Internet technology, multimedia data such as images, video, and audio have gradually become the main form of information media in Internet information processing. Among them, audio data occupies a very important position. Raw audio data is a non-semantic symbolic representation, an unstructured binary stream that lacks semantic description of its content and structured organization. Audio segmentation is an important means of extracting structured information and semantic content from audio, and is the basis for understanding, analyzing, and retrieving audio and video content. Essentially, audio classification is a pattern recognition problem comprising two basic processes: feature extraction and classification. Audio segmentation is to extract different...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/14, G10L15/02, G10L21/0272
Inventors: 谭应伟, 王涛
Owner: BEIJING QIYI CENTURY SCI & TECH CO LTD