Audio segmentation method and system

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of audio and audio frames, which is applied in the field of audio segmentation methods and systems, can solve the problems of low accuracy of audio segmentation, and achieve the effect of high accuracy

Active Publication Date: 2019-01-04

GUANGZHOU SHIYUAN ELECTRONICS CO LTD

View PDF6 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] Based on this, it is necessary to provide an audio segmentation method and system for the problem of low audio segmentation accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0020] The technical solution of the present invention will be described below in conjunction with the accompanying drawings.

[0021] Such as figure 1 As shown, the present invention provides a kind of audio segmentation method, may comprise the following steps:

[0022] S1, reading each audio frame of the audio data to be segmented, and performing feature extraction on each audio frame respectively, to obtain the audio signal feature corresponding to each audio frame;

[0023] A piece of audio data to be divided can be obtained first. A piece of audio data can include multiple audio frames, and feature extraction can be performed on each audio frame to obtain the audio signal features corresponding to each audio frame. The audio signal features mentioned here can be existing Some typical audio signal features (such as spectral coefficients, etc.) may also be other types of audio signal features. Before feature extraction, a piece of audio data can be divided into multiple ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to an audio segmentation method and system. The method comprises following steps: reading each audio frame of the audio data to be segmented, respectively extracting features ofeach audio frame to obtain audio signal features corresponding to each audio frame; inputting the audio signal features into a pre-trained audio classifier, respectively calculating probability valuesof audio frames corresponding to the audio signal features belonging to audio classes, and obtaining a target audio category to which the audio frame corresponding to the audio signal feature belongsaccording to the probability values; performing audio-segmentation on the audio data according to a target audio category to which each audio frame belongs. The audio segmentation method and system can segment audio data into small pieces, and the audio segmentation accuracy is high.

Description

technical field [0001] The present invention relates to the technical field of audio signal processing, in particular to an audio segmentation method and system. Background technique [0002] The original audio data is not conducive to user viewing and retrieval. In order to solve this problem, one way is to perform audio segmentation on audio data. Through audio segmentation, the audio can be divided into small fragments, and each fragment represents a different meaning, such as continuous background sound, narrator's voice, cheers of the audience, etc., which can be used to establish an effective retrieval system in the future. [0003] Traditional audio segmentation methods are mostly divided into two types. One is to divide audio features into categories such as SVM (Support Vector Machine, Support Vector Machine) or Gaussian Mixture Model by extracting long-term and short-term features of audio; One is to extract audio features, divide the audio into target audio and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/08G10L15/02

CPCG10L15/02G10L15/04G10L15/063G10L15/08G10L25/54G10L2015/0631

Inventor雷延强

OwnerGUANGZHOU SHIYUAN ELECTRONICS CO LTD

Audio segmentation method and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology