Summarizing digital audio data

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of voice data and abstract, applied in the field of data analysis, can solve the problem that the quality of the abstract cannot meet the needs and so on

Inactive Publication Date: 2006-01-11

AGENCY FOR SCI TECH & RES

View PDF1 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Especially when it is necessary to make summaries of various types of music, the quality of the summaries produced by this method cannot meet the needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] figure 1 It is a block diagram of components and / or modules of the system 100 for generating a sound abstract according to an embodiment of the present invention. The system receives a sound file such as music content 12 at a segmenter 114 . The music sequence 12 is segmented into frames and features are extracted from each frame at a feature extractor 116 . The classifier 118 classifies the frames whose features are extracted, such as the pure music sequence 140 or the vocal music sequence 160 , according to the classification parameters provided by the classification parameter generator 120 . When there is no singing voice in the music content, it is defined as pure music, and when there is singing voice, it is defined as vocal music. The sound summary is generated in the music summary device 122 or 124, and the music summary device makes a summary for the sound content specially set up for this category or makes a summary for the sound content classified by the cla...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

An embodiment is related to automatic summarization for digital audio raw data (12), more specifically, for identifying pure music and vocal music (40,60) from digital audio data by extracting distinctive features from music frames (73,74,75,76), designing a classifier and determining the classification parameters (20) using adaptive learning / training algorithm (36), and identifying music into pure music or vocal music according to the classifier. For pure music, temporal, spectral and cepstral features are calculated to characterise the musical content, and an adaptive clustering method is used to structure the musical content according to calculated features. The summary (22,24,26,48,52,70,72) is created according to clustered result and domain-based music knowledge (50,150). For vocal music, voice related features are extracted and used to structure the musical content, and similarly, the music summary is created in terms of structured content and heuristic rules related to music genres.

Description

technical field [0001] The present invention relates to data analysis, such as sound data indexing and classification. More specifically, the present invention relates to automatic summarization of digital music raw data for a variety of applications such as content-based music retrieval and web-based online music distribution. Background technique [0002] The rapid development of computer network and multimedia technology has made the scale of digital multimedia datasets grow rapidly. To accommodate development, there is a need to produce concise and informative summaries for large multimedia datasets, which should best capture important elements of the original content in large-scale information organization and processing. So far, many techniques for automatically creating summaries of text, audio and video have been proposed and are constantly being developed. However, making a music summary refers to determining the most popular and prominent main theme part of a pie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06F17/00G06F17/30G10H1/00G10L11/00

CPCG10H2240/155G10L25/48G10H1/0008G06F17/30743G10H2210/031G10H2210/046G06F17/30775G10H2210/061G06F16/64G06F16/683

Inventor徐常胜

OwnerAGENCY FOR SCI TECH & RES

Summarizing digital audio data

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology