Audio segmentation method and device

A technology of audio and audio frames, applied in the field of audio segmentation methods and devices, capable of solving problems such as extracting audio structured information and semantic content without using

Active Publication Date: 2016-09-07

BEIJING QIYI CENTURY SCI & TECH CO LTD



Problems solved by technology

[0003] At present, the traditional audio segmentation method generally first extracts the Mel cepstral coefficient features of the target audio, and then divides the audio into speech parts and silent parts according to those features and a preset Gaussian mixture classification model. The traditional method achieves a basic division of audio. In practical applications, however, the content of the speech part is rich and varied; a broadcast audio stream, for example, contains complex and changeable audio signals. Dividing audio into only a speech part and a silent part therefore fails to make use of the structural information and semantic content that can be extracted from the audio.
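The traditional two-class split described in [0003] can be illustrated with a minimal sketch. It makes several simplifying assumptions that are not in the patent text: per-frame log-energy stands in for the Mel cepstral coefficient features, a single Gaussian per class stands in for the Gaussian mixture model, and the names `log_energy`, `classify_frames` and the values in `PRESET_MODELS` are hypothetical.

```python
import math

def log_energy(frame):
    """Log of the mean squared amplitude of one frame (stand-in for MFCCs)."""
    power = sum(x * x for x in frame) / len(frame)
    return math.log(power + 1e-10)  # small offset avoids log(0)

def gaussian_log_pdf(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

# Hypothetical preset class models: (mean, variance) of the log-energy feature.
PRESET_MODELS = {"silence": (-12.0, 4.0), "speech": (-3.0, 4.0)}

def classify_frames(samples, frame_len=4):
    """Label each non-overlapping frame with the class of highest likelihood."""
    labels = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        feat = log_energy(samples[start:start + frame_len])
        best = max(PRESET_MODELS,
                   key=lambda c: gaussian_log_pdf(feat, *PRESET_MODELS[c]))
        labels.append(best)
    return labels

# Eight near-silent samples followed by eight loud ones.
print(classify_frames([0.001] * 8 + [0.5] * 8))
```

With only two classes, every loud frame receives the same "speech" label regardless of whether it holds talk or music, which is the limitation the paragraph points out.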



Embodiment Construction

[0049] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0050] To solve the problems existing in the prior art, the embodiments of the present invention disclose an audio segmentation method and device that segment audio by fusing multiple features and distinguish silent parts, music parts and non-music parts.

[0051] The present invention will be described in detail below through specific examples.

[0052] Figure 1: a schematic flow diagram of an audio segmenta...


Abstract

The embodiments of the invention disclose an audio segmentation method and device. The method comprises: extracting a target feature value of the target audio according to a preset feature extraction algorithm; segmenting the target audio into a target voice part and a target silent part according to the target feature value; using the target feature value as an input parameter of a preset Gaussian model to obtain a posterior probability for the target audio; segmenting the target voice part according to the posterior probability and a preset classification model to obtain a target music part and a target non-music part, wherein the preset classification model is a classification model based on multi-feature fusion and context association; and generating a segmentation result for the target audio from the target silent part, the target music part and the target non-music part. The invention thus allows audio to be segmented into a silent part, a music part and a non-music part.
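The flow of steps in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the abstract does not specify the features, the Gaussian model parameters or the classification model, so the per-frame `(energy, tonality)` feature pair, the Gaussian parameters, the thresholds, and the neighbour averaging used as a stand-in for "context association" are all hypothetical.

```python
import math
from itertools import groupby

def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2 * math.pi * var)

def music_posterior(x, music=(0.8, 0.02), other=(0.3, 0.02), prior=0.5):
    """P(music | x) by Bayes' rule from two hypothetical (mean, variance) models."""
    pm = prior * gaussian_pdf(x, *music)
    po = (1 - prior) * gaussian_pdf(x, *other)
    return pm / (pm + po)

def smooth(values, radius=1):
    """Average each value with its neighbours (assumed stand-in for context)."""
    out = []
    for i in range(len(values)):
        window = values[max(0, i - radius): i + radius + 1]
        out.append(sum(window) / len(window))
    return out

def segment(features, silence_threshold=0.05):
    """features: one (energy, tonality) pair per frame.

    Returns a list of (label, first_frame, last_frame) tuples.
    """
    # Step 1: split frames into voice and silent parts by energy.
    voiced = [i for i, (energy, _) in enumerate(features)
              if energy >= silence_threshold]
    # Step 2: posterior probability per voice frame, then neighbour smoothing.
    # For brevity the smoothing ignores silent gaps between voice frames.
    posteriors = smooth([music_posterior(features[i][1]) for i in voiced])
    # Step 3: classify voice frames as music or non-music.
    labels = ["silence"] * len(features)
    for i, p in zip(voiced, posteriors):
        labels[i] = "music" if p >= 0.5 else "non-music"
    # Step 4: merge runs of equal labels into the segmentation result.
    segments, pos = [], 0
    for label, run in groupby(labels):
        n = len(list(run))
        segments.append((label, pos, pos + n - 1))
        pos += n
    return segments

frames = [(0.01, 0.0), (0.01, 0.0), (0.5, 0.8), (0.5, 0.82), (0.5, 0.3), (0.5, 0.28)]
print(segment(frames))
```

Smoothing the posteriors before thresholding is one simple way to let neighbouring frames influence a decision; the actual classification model in the patent may work quite differently.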

Description

Technical field

[0001] The invention relates to the field of audio processing, in particular to an audio segmentation method and device.

Background technique

[0002] With the continuous development of Internet technology, multimedia data such as images, video and audio have gradually become the main form of information media in the field of Internet information processing. Among them, audio data occupies a very important position. Raw audio data is itself a non-semantic symbolic representation and an unstructured binary stream, lacking semantic content description and structured organization. Audio segmentation technology is an important means of extracting structured information and semantic content from audio, and is the basis for understanding, analyzing and retrieving audio and video content. Essentially, audio classification is a pattern recognition problem that includes two basic processes: feature extraction and classification. Audio segmentation is to extract different...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L15/14, G10L15/02, G10L21/0272

Inventor: 谭应伟, 王涛

Owner: BEIJING QIYI CENTURY SCI & TECH CO LTD
