Automatic audio summary generation method and device

Automatic generation of audio summaries, an audio technology applied in audio data retrieval, audio data browsing/visualization, character and pattern recognition, etc., that can solve the problem of inaccurate audio summary descriptions.

Active Publication Date: 2021-05-11

AISPEECH CO LTD

Cites: 3; Cited by: 2

AI Technical Summary

Problems solved by technology

While implementing the application, the inventor found that generated audio summary descriptions are often inaccurate, especially in how they describe sound events and acoustic scenes.

Embodiment Construction

[0016] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the protection scope of the present invention.

[0017] Refer to Figure 1, which shows a flowchart of an automatic audio summary generation method.

[0018] As shown in Figure 1, in step 101, a sound event detection model is pre-trained, wherein the sound event detection model includes an audio feature extraction part and an output part;

[0019] In step 102, the audio feature ...
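The two-part structure named in step 101 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the layer shapes, the ReLU activation, mean pooling over time, the per-class sigmoid output, and the binary cross-entropy objective are all assumptions, and the class names are hypothetical.

```python
import math
import random


class FeatureExtractor:
    """Audio feature extraction part: maps each input frame to a hidden vector."""

    def __init__(self, in_dim, hid_dim, rng):
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)]
                  for _ in range(hid_dim)]

    def __call__(self, frames):
        # frames: list of T frames, each a list of in_dim floats (assumed input layout)
        return [[max(0.0, sum(w * x for w, x in zip(row, f))) for row in self.w]
                for f in frames]


class OutputPart:
    """Output part: mean-pools over time, then one sigmoid per event class."""

    def __init__(self, hid_dim, n_events, rng):
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(hid_dim)]
                  for _ in range(n_events)]

    def __call__(self, hidden):
        pooled = [sum(h[j] for h in hidden) / len(hidden)
                  for j in range(len(hidden[0]))]
        logits = [sum(w * p for w, p in zip(row, pooled)) for row in self.w]
        return [1.0 / (1.0 + math.exp(-z)) for z in logits]


class SoundEventDetector:
    """Sound event detection model = feature extraction part + output part."""

    def __init__(self, in_dim, hid_dim, n_events, seed=0):
        rng = random.Random(seed)
        self.feature_extractor = FeatureExtractor(in_dim, hid_dim, rng)
        self.output_part = OutputPart(hid_dim, n_events, rng)

    def __call__(self, frames):
        return self.output_part(self.feature_extractor(frames))


def bce(probs, labels):
    """Binary cross-entropy averaged over event classes (assumed pre-training loss)."""
    return -sum(y * math.log(p) + (1 - y) * math.log(1 - p)
                for p, y in zip(probs, labels)) / len(labels)


sed = SoundEventDetector(in_dim=4, hid_dim=3, n_events=2)
clip = [[0.1, 0.2, 0.3, 0.4], [0.0, 0.5, 0.1, 0.2]]  # 2 frames, 4 features each
probs = sed(clip)                 # one probability per event class
loss = bce(probs, labels=[1, 0])  # pre-training would minimize this over a dataset
```

Keeping the feature extractor as its own object is what makes it easy to detach from the output part later.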

Abstract

The invention discloses a method and device for automatically generating audio summaries. The method comprises: pre-training a sound event detection model that comprises an audio feature extraction part and an output part; using the audio feature extraction part as the audio encoder of an automatic audio summary generation model; and training the automatic audio summary generation model end to end. In the scheme provided by the embodiments of the invention, pre-training and transfer learning on the sound event detection task yield a better audio encoder, so that more accurate audio summary descriptions are generated. A corresponding text description can then be generated for any new audio, so that an audio-text database is established automatically, which can support practical applications such as similar-audio retrieval engines driven by natural language in unrestricted form.
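The two-stage recipe in the abstract can be illustrated with a deliberately tiny scalar model. This is a sketch under stated assumptions, not the patented system: every weight is a single number, the decoder is a toy linear map trained with squared error (a real captioning decoder would predict tokens), and the function names, learning rates, and data are hypothetical. What it does show is the order of operations: pre-train on event labels, carry the feature extractor weight over as the encoder, then update encoder and decoder together.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def pretrain_sed(data, steps=200, lr=0.5):
    """Stage 1: train feature extractor weight w_e and output weight w_o on event labels."""
    w_e, w_o = 0.5, 0.5
    for _ in range(steps):
        for x, y in data:
            h = w_e * x            # audio feature extraction part
            p = sigmoid(w_o * h)   # output part
            g = p - y              # gradient of binary cross-entropy w.r.t. the logit
            w_e, w_o = w_e - lr * g * w_o * x, w_o - lr * g * h
    return w_e, w_o


def train_captioner(w_e, data, steps=200, lr=0.05):
    """Stage 2: the encoder starts from the pre-trained w_e; both weights are updated."""
    w_d = 0.0                      # toy decoder weight, trained from scratch
    for _ in range(steps):
        for x, target in data:
            h = w_e * x            # encoder = transferred feature extractor
            err = w_d * h - target # toy squared-error stand-in for a caption loss
            # End to end: the gradient reaches the encoder weight as well.
            w_e, w_d = w_e - lr * err * w_d * x, w_d - lr * err * h
    return w_e, w_d


sed_data = [(1.0, 1), (-1.0, 0)]            # (feature, event present?) toy pairs
caption_data = [(1.0, 2.0), (-1.0, -2.0)]   # (feature, target) toy pairs

w_e_pre, w_o = pretrain_sed(sed_data)
w_e_new, w_d = train_captioner(w_e_pre, caption_data)
```

The output weight `w_o` is discarded after stage 1; only the feature extractor weight is transferred. Because the encoder is not frozen in stage 2, `w_e_new` differs from `w_e_pre`.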

Description

Technical field

[0001] The invention belongs to the technical field of audio summarization, and in particular relates to a method and device for automatically generating audio summaries.

Background

[0002] In the related art, automated audio captioning (AAC) aims to generate a summary description of an audio clip. An audio summary describes many concepts, ranging from local information such as sound events to global information such as the acoustic scene. Currently, the mainstream AAC approach is an end-to-end encoder-decoder structure, in the hope that the encoder can automatically learn all the concepts embedded in the audio.

[0003] In the automatic audio summary generation task, given an input audio, an encoder encodes the audio into a series of vectors, and a decoder then decodes the encoded vectors into a natural-language summary. The inventor found in the process of implementing the present application that the gener...
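The encoder-decoder data flow described in the background (audio in, a series of vectors, then words out) can be sketched as below. This is an illustration only: the weights are random and untrained, so the generated words are arbitrary, and the vocabulary, the mean-pooled context, greedy word selection, and the reuse of `<eos>` as the start token are all assumptions rather than details from the patent.

```python
import random

VOCAB = ["<eos>", "a", "dog", "is", "barking", "loudly"]  # hypothetical vocabulary


class Encoder:
    """Encodes each audio frame into a vector, giving a series of vectors."""

    def __init__(self, in_dim, hid_dim, rng):
        self.w = [[rng.uniform(-1, 1) for _ in range(in_dim)]
                  for _ in range(hid_dim)]

    def __call__(self, frames):
        return [[sum(w * x for w, x in zip(row, f)) for row in self.w]
                for f in frames]


class Decoder:
    """Scores every vocabulary word given the encoded audio and the previous word."""

    def __init__(self, hid_dim, vocab_size, rng):
        self.w_ctx = [[rng.uniform(-1, 1) for _ in range(hid_dim)]
                      for _ in range(vocab_size)]
        self.w_prev = [[rng.uniform(-1, 1) for _ in range(vocab_size)]
                       for _ in range(vocab_size)]

    def step(self, encoded, prev_id):
        # Summarize the encoded sequence by its mean (assumed; attention is also common).
        ctx = [sum(v[j] for v in encoded) / len(encoded)
               for j in range(len(encoded[0]))]
        return [sum(w * c for w, c in zip(self.w_ctx[v], ctx)) + self.w_prev[v][prev_id]
                for v in range(len(self.w_ctx))]


def generate(encoder, decoder, frames, max_len=10):
    """Greedy decoding: pick the best-scoring word until <eos> or max_len."""
    encoded = encoder(frames)
    words, prev_id = [], 0  # id 0 (<eos>) doubles as the start token here
    for _ in range(max_len):
        scores = decoder.step(encoded, prev_id)
        prev_id = max(range(len(scores)), key=scores.__getitem__)
        if VOCAB[prev_id] == "<eos>":
            break
        words.append(VOCAB[prev_id])
    return words


rng = random.Random(0)
encoder = Encoder(in_dim=4, hid_dim=3, rng=rng)
decoder = Decoder(hid_dim=3, vocab_size=len(VOCAB), rng=rng)
caption = generate(encoder, decoder, [[0.1, 0.2, 0.3, 0.4], [0.0, 0.5, 0.1, 0.2]])
```

In a trained system the decoder scores would come from learned parameters, and the quality of the caption would depend on how well the encoder captures sound events and the acoustic scene.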

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G06F16/64, G06F16/683, G06K9/62

CPC: G06F16/64, G06F16/683, G06F18/214

Inventor: 俞凯, 吴梦玥, 徐薛楠, 丁翰林, 谢泽宇

Owner: AISPEECH CO LTD
