Audio segmentation method and system

A technology of audio and audio frames, which is applied in the field of audio segmentation methods and systems, can solve the problems of low accuracy of audio segmentation, and achieve the effect of high accuracy

Active Publication Date: 2019-01-04
GUANGZHOU SHIYUAN ELECTRONICS CO LTD
View PDF6 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Based on this, it is necessary to provide an audio segmentatio

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio segmentation method and system
  • Audio segmentation method and system
  • Audio segmentation method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The technical solution of the present invention will be described below in conjunction with the accompanying drawings.

[0021] Such as figure 1 As shown, the present invention provides a kind of audio segmentation method, may comprise the following steps:

[0022] S1, reading each audio frame of the audio data to be segmented, and performing feature extraction on each audio frame respectively, to obtain the audio signal feature corresponding to each audio frame;

[0023] A piece of audio data to be divided can be obtained first. A piece of audio data can include multiple audio frames, and feature extraction can be performed on each audio frame to obtain the audio signal features corresponding to each audio frame. The audio signal features mentioned here can be existing Some typical audio signal features (such as spectral coefficients, etc.) may also be other types of audio signal features. Before feature extraction, a piece of audio data can be divided into multiple ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an audio segmentation method and system. The method comprises following steps: reading each audio frame of the audio data to be segmented, respectively extracting features ofeach audio frame to obtain audio signal features corresponding to each audio frame; inputting the audio signal features into a pre-trained audio classifier, respectively calculating probability valuesof audio frames corresponding to the audio signal features belonging to audio classes, and obtaining a target audio category to which the audio frame corresponding to the audio signal feature belongsaccording to the probability values; performing audio-segmentation on the audio data according to a target audio category to which each audio frame belongs. The audio segmentation method and system can segment audio data into small pieces, and the audio segmentation accuracy is high.

Description

technical field [0001] The present invention relates to the technical field of audio signal processing, in particular to an audio segmentation method and system. Background technique [0002] The original audio data is not conducive to user viewing and retrieval. In order to solve this problem, one way is to perform audio segmentation on audio data. Through audio segmentation, the audio can be divided into small fragments, and each fragment represents a different meaning, such as continuous background sound, narrator's voice, cheers of the audience, etc., which can be used to establish an effective retrieval system in the future. [0003] Traditional audio segmentation methods are mostly divided into two types. One is to divide audio features into categories such as SVM (Support Vector Machine, Support Vector Machine) or Gaussian Mixture Model by extracting long-term and short-term features of audio; One is to extract audio features, divide the audio into target audio and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/08G10L15/02
CPCG10L15/02G10L15/04G10L15/063G10L15/08G10L25/54G10L2015/0631
Inventor 雷延强
Owner GUANGZHOU SHIYUAN ELECTRONICS CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products