Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text-Audio Automatic Summarization Method Based on Improved PageRank Algorithm

An automatic abstract and text technology, applied in computing, speech analysis, speech recognition, etc., can solve problems such as poor readability, poor fluency of transcribed texts, grammatical errors, etc., to avoid incoherence, improve readability, and avoid grammatical errors Effect

Active Publication Date: 2019-07-19
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Compared with the method of obtaining information from the original text alone, the above method transcribes the relevant information in the voice information, and this information will not be completely covered by the original text document, which helps to increase the amount of information obtained. However, automatic speech recognition exists Many errors, resulting in a less fluent transcription, more grammatical errors, and thus less readability
Therefore, while ensuring as much important information as possible, improving the poor readability of the existing text-audio automatic summarization is a technical problem that needs to be solved at present.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text-Audio Automatic Summarization Method Based on Improved PageRank Algorithm
  • Text-Audio Automatic Summarization Method Based on Improved PageRank Algorithm
  • Text-Audio Automatic Summarization Method Based on Improved PageRank Algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The invention provides a text-audio automatic summarization method based on the improved PageRank algorithm. By guiding the sorting, the similarity matrix between the cross-modal sentences is modified, and when the original text sentence and the transcribed text sentence have the same semantics, according to The output of the original text sentences, while ensuring the acquisition of more important information, avoids grammatical errors and unsmooth problems caused by transcription, and improves the readability of text-audio automatic summarization.

[0030] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0031] In an exemplary embodiment of the present invention, a text-audio automatic summarization method based on an improved PageRank algorithm is provided.

[0032] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method of automatically abstracting text-audio based on the improved PageRank algorithm. The method comprises the steps that audios are transcribed into text, the similarities of the transcription text and the original text are computed; the importance of the sentences in the transcripts and the original texts is computed by utilizing the improved PageRank algorithm and based on the similarities; and according to the importance of the sentences, a text summary is generated. By the improved PageRank algorithm, the sorting of texts is guided, the similarity matrix between the cross-modal sentences is modified, so that the connection direction between two sentences with the same semantics coming from different modes is changed from the original bi-direction to the unidirection, so that when the semantics of the original text sentences and the transcribed text sentences are the same, the sentences are outputted in accordance with the original text, while more important information is assuredly obtained, the problems of grammatical errors and incoherencies brought about by the transcriptions are avoided, and the readability of the automatic abstracts of the text-audios is improved.

Description

technical field [0001] The invention belongs to the technical field of natural language processing and speech processing, and relates to a text-audio automatic summarization method based on an improved PageRank algorithm. Background technique [0002] With the advent of the Internet age, multimedia information including text documents, audio and other information can be seen everywhere. Effectively analyze relevant text and audio information, and obtain as much important information as possible, and present it to users in the form of text summaries, which helps users obtain key information only through text summaries, and realizes convenient and efficient Access to information. For a series of text and audio related to an information event, the current common practice is to first transcribe the audio into text through an automatic speech recognition system, and then combine it with the original text document to generate text through the traditional multi-document automatic ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27G10L15/26
CPCG06F40/211G06F40/30G10L15/26
Inventor 张家俊李浩然周玉宗成庆
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products