Automatic audio summary generation method and device
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- AISPEECH CO LTD
- Publication Date
- 2021-05-11
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention belongs to the technical field of audio summaries, in particular to a method and device for automatically generating audio summaries. Background technique
[0002] In related technologies, Automated audio captioning (Automated audio captioning, AAC) aims to generate a summary description of an audio clip. Many concepts are described in audio summarization, ranging from local information such as sound events to global information such as the acoustic scene. Currently, the mainstream method of AAC is an end-to-end encoder-decoder structure, and it is hoped that the encoder can automatically learn all the concepts embedded in the audio.
[0003] The automatic audio summary generation task can be based on an input audio, an encoder encodes the audio into a series of vectors, and then a decoder decodes the encoded vectors into natural language summaries. The inventor found in the process of implementing the present application that the gener...