A method and device for audio segmentation

A technology of audio and audio frames, applied in the field of audio segmentation methods and devices, capable of solving problems such as extracting audio structured information and semantic content without using

Active Publication Date: 2019-09-17
BEIJING QIYI CENTURY SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] At present, traditional audio segmentation methods generally first extract Mel-frequency cepstral coefficient (MFCC) features from the target audio, and then divide the audio into a speech part and a silent part according to those features and a preset Gaussian mixture classification model. This achieves a basic division of the audio. In practical applications, however, the content of the speech part is rich and varied; a broadcast audio stream, for example, contains complex and changeable audio signals. Dividing the audio only into a speech part and a silent part therefore fails to make use of the structural information and semantic content that can be extracted from the audio.



Embodiment Construction

[0049] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0050] In order to solve the problems existing in the prior art, the embodiments of the present invention disclose an audio segmentation method and apparatus, which perform segmentation processing on audio by a multi-feature audio segmentation method to distinguish silent parts, music parts and non-music parts.

[0051] The present invention will be described in detail below through specific embodiments.

[0052] figure 1 A schematic...


Abstract

An embodiment of the invention discloses an audio segmentation method and device. The method comprises: extracting a target feature value of the target audio according to a preset feature extraction algorithm; segmenting the target audio into a target voice part and a target silent part according to the target feature value; using the target feature value as the input parameter of a preset Gaussian model to obtain a posterior probability for the target audio; segmenting the target voice part according to the posterior probability and a preset classification model to obtain a target music part and a target non-music part, where the preset classification model is based on multi-feature fusion and context association; and generating a segmentation result for the target audio from the target silent part, the target music part and the target non-music part. The invention can thus segment audio into a silent part, a music part and a non-music part.
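The steps listed in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes a single scalar feature per frame (for example log energy) in place of the multi-feature fusion, a fixed threshold for the silent (mute) part, one Gaussian per class with hand-picked parameters, and a majority vote over neighbouring frames as a stand-in for context association. All function names and parameter values are hypothetical.

```python
import numpy as np

SILENCE, MUSIC, NON_MUSIC = 0, 1, 2  # hypothetical label codes


def gaussian_posterior(x, means, variances, priors):
    """Posterior P(class | x) for 1-D Gaussians; x has shape (n,)."""
    x = x[:, None]
    log_lik = -0.5 * (np.log(2 * np.pi * variances) + (x - means) ** 2 / variances)
    log_post = log_lik + np.log(priors)
    log_post -= log_post.max(axis=1, keepdims=True)  # numerical stability
    post = np.exp(log_post)
    return post / post.sum(axis=1, keepdims=True)


def smooth_labels(labels, context=2):
    """Majority vote over +/- `context` frames (assumed context model)."""
    out = labels.copy()
    for i in range(len(labels)):
        lo, hi = max(0, i - context), min(len(labels), i + context + 1)
        out[i] = np.bincount(labels[lo:hi]).argmax()
    return out


def segment(features, silence_threshold, means, variances, priors):
    """Label each frame as SILENCE, MUSIC or NON_MUSIC."""
    labels = np.full(len(features), SILENCE)
    voiced = features > silence_threshold  # step 1: voice vs. silent part
    if voiced.any():
        # step 2: posterior from the (assumed) Gaussian model
        post = gaussian_posterior(features[voiced], means, variances, priors)
        # step 3: classify voiced frames (class 0 = music, 1 = non-music);
        # smoothing runs over the voiced frames only and ignores silent gaps
        cls = smooth_labels(post.argmax(axis=1))
        labels[voiced] = np.where(cls == 0, MUSIC, NON_MUSIC)
    return labels


# Toy per-frame feature values; class 0 is centred at 1.0, class 1 at 5.0.
features = np.array([-20.0, -20.0, 1.0, 1.2, 0.9, 5.0, 5.1, 4.9, -20.0])
labels = segment(
    features,
    silence_threshold=-10.0,
    means=np.array([1.0, 5.0]),
    variances=np.array([1.0, 1.0]),
    priors=np.array([0.5, 0.5]),
)
print(labels.tolist())
```

Consecutive frames with the same label can then be merged into segments to form the final segmentation result.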

Description

Technical field

[0001] The present invention relates to the field of audio processing, in particular to an audio segmentation method and device.

Background

[0002] With the continuous development of Internet technology, multimedia data such as images, videos, and audios have gradually become the main forms of information media in the field of Internet information processing. Among them, audio data occupies a very important position. The raw audio data itself is a non-semantic symbolic representation and unstructured binary stream, which lacks content semantic description and structured organization. Audio segmentation technology is an important means of extracting structured information and semantic content in audio, and it is the basis for understanding, analysis and retrieval of audio and video content. Essentially, audio classification is a pattern recognition problem that includes two basic processes: feature extraction and classification. Audio segmentatio...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/14, G10L15/02, G10L21/0272
Inventor: 谭应伟, 王涛
Owner: BEIJING QIYI CENTURY SCI & TECH CO LTD