Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic audio summary generation method and device

An automatic generation, audio technology, applied in audio data retrieval, audio data browsing/visualization, character and pattern recognition, etc., can solve the problem of inaccurate audio summary description

Active Publication Date: 2021-05-11
AISPEECH CO LTD
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The inventor found in the process of implementing the application that the generated audio abstract descriptions are often inaccurate, especially the description of sound events and acoustic scenes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic audio summary generation method and device
  • Automatic audio summary generation method and device
  • Automatic audio summary generation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0017] Please refer to figure 1 , which shows a flow chart of an automatic audio summary generation method.

[0018] Such as figure 1 As shown, in step 101, the pre-training sound event detection model, wherein, the sound event detection model includes an audio feature extraction part and an output part;

[0019] In step 102, the audio feature ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic audio abstract generation method and device, and the method comprises the steps: pre-training a sound event detection model which comprises an audio feature extraction part and an output part; enabling the audio encoder to take the audio feature extraction part as an audio abstract automatic generation model; and training the audio abstract automatic generation model in an end-to-end manner. According to the scheme provided by the embodiment of the invention, a better audio encoder is obtained through pre-training and transfer learning on the sound event detection task, so that more accurate audio abstract description is generated, corresponding text description can be generated for any new audio, the audio-text database is automatically established, and practical application of similar audio retrieval engines based on natural languages in unlimited forms can be supported.

Description

technical field [0001] The invention belongs to the technical field of audio summaries, in particular to a method and device for automatically generating audio summaries. Background technique [0002] In related technologies, Automated audio captioning (Automated audio captioning, AAC) aims to generate a summary description of an audio clip. Many concepts are described in audio summarization, ranging from local information such as sound events to global information such as the acoustic scene. Currently, the mainstream method of AAC is an end-to-end encoder-decoder structure, and it is hoped that the encoder can automatically learn all the concepts embedded in the audio. [0003] The automatic audio summary generation task can be based on an input audio, an encoder encodes the audio into a series of vectors, and then a decoder decodes the encoded vectors into natural language summaries. The inventor found in the process of implementing the present application that the gener...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/64G06F16/683G06K9/62
CPCG06F16/64G06F16/683G06F18/214
Inventor 俞凯吴梦玥徐薛楠丁翰林谢泽宇
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products