Method and system for extracting sentences

A sentence extraction technology, applied in the field of document summarization methods and systems, that addresses the problem that abstraction-based summarization does not ensure consistency with the original document or the accuracy of the summary.

Active Publication Date: 2017-03-09
UBERPLE
Cites: 3; Cited by: 8

AI Technical Summary

Benefits of technology

[0006]Document summarization methods may be divided into extraction methods and abstraction methods. An abstraction method may summarize an original document more effectively than an extraction method, but it does not ensure consistency with the original document or the accuracy of the summary.

Problems solved by technology

An abstraction method may summarize an original document more effectively than an extraction method, but it does not ensure consistency with the original document or the accuracy of the summary.

Examples

Embodiment Construction

[0040]Prior to setting forth the inventive concept in detail, certain terms used herein will be defined.

[0041]A graph is a data structure consisting of a finite set of one or more vertices and a finite set of edges, i.e., pairs of vertices. A graph therefore includes at least one vertex, but is not limited to one.

[0042]Graphs may be divided into undirected graphs and directed graphs. An undirected graph is a graph in which pairs of vertices representing edges are unordered. That is, each edge in the undirected graph has no orientation. A directed graph is a graph in which pairs of vertices representing edges are ordered. That is, each edge in the directed graph has an orientation.
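
The distinction between unordered and ordered vertex pairs can be illustrated with a short Python sketch. The helper name `make_edges` and the vertex labels are hypothetical, chosen only for this illustration; they do not come from the patent text.

```python
def make_edges(pairs, directed):
    """Build an edge set from vertex pairs.

    Directed edges are stored as ordered tuples, so (u, v) and (v, u) differ.
    Undirected edges are stored as frozensets, so {u, v} and {v, u} coincide.
    """
    if directed:
        return {(u, v) for u, v in pairs}
    return {frozenset((u, v)) for u, v in pairs}


# Hypothetical vertex labels standing in for sentences.
pairs = [("s1", "s2"), ("s2", "s1"), ("s2", "s3")]

undirected = make_edges(pairs, directed=False)  # s1-s2 is counted once
directed = make_edges(pairs, directed=True)     # s1->s2 and s2->s1 are distinct
```

With these inputs the undirected set holds two edges and the directed set holds three, because reversing a pair only creates a new edge when orientation matters.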

[0043]A complete graph is a graph in which the number of edges connecting n vertices is n(n−1) / 2. That is, every pair of distinct vertices in the complete graph is connected by an edge.
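
The edge count n(n−1) / 2 can be checked by enumerating every unordered pair of distinct vertices. The following is a minimal Python sketch for the undirected case; the function name `complete_graph_edges` is an assumption made for illustration.

```python
from itertools import combinations


def complete_graph_edges(vertices):
    """Return one undirected edge for every pair of distinct vertices."""
    return {frozenset(pair) for pair in combinations(vertices, 2)}


n = 5
edges = complete_graph_edges(range(n))
expected = n * (n - 1) // 2  # formula from the definition above
```

For n = 5 both the enumeration and the formula give 10 edges.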

[0044]Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly underst...

Abstract

Methods and systems for extracting sentences are provided. One of the methods comprises: receiving a keyword; parsing a document and identifying each of a plurality of sentences included in the parsed document; configuring a graph having vertices and edges, wherein each vertex corresponds to a sentence and each edge has a first weight corresponding to the similarity between a pair of the sentences; calculating the importance of each sentence by applying a modified PageRank algorithm to the graph, wherein the modified PageRank algorithm is designed to reflect a second weight corresponding to whether the keyword is included in the sentence of each vertex adjacent to a first vertex; and extracting important sentences from the document based on the calculated importance.
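
The steps in the abstract can be sketched in Python as follows. This is only an illustrative sketch, not the patented formula, which is not shown in this excerpt. Several choices are assumptions: Jaccard word overlap stands in for the first weight (sentence similarity), a multiplicative factor `keyword_boost` stands in for the second weight, sentences are split on end punctuation, and scores are renormalized after every iteration. All function and parameter names are hypothetical.

```python
import re


def split_sentences(document):
    """Naive sentence splitter (assumption): break after '.', '!' or '?'."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]


def tokenize(sentence):
    """Lowercased set of word tokens."""
    return set(re.findall(r"\w+", sentence.lower()))


def jaccard(a, b):
    """Assumed similarity measure used as the first (edge) weight."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_sentences(document, keyword, damping=0.85, keyword_boost=2.0, iterations=50):
    """Return (score, sentence) pairs sorted by descending importance."""
    sentences = split_sentences(document)
    tokens = [tokenize(s) for s in sentences]
    n = len(sentences)
    if n == 0:
        return []
    # First weight: similarity between each pair of sentences.
    w = [[jaccard(tokens[i], tokens[j]) if i != j else 0.0 for j in range(n)]
         for i in range(n)]
    out_sum = [sum(row) for row in w]
    # Second weight (assumed form): an adjacent vertex whose sentence
    # contains the keyword contributes more to its neighbours.
    k = [keyword_boost if keyword.lower() in tokens[j] else 1.0 for j in range(n)]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new = []
        for i in range(n):
            incoming = sum(k[j] * w[j][i] / out_sum[j] * scores[j]
                           for j in range(n) if out_sum[j] > 0)
            new.append((1 - damping) / n + damping * incoming)
        total = sum(new)  # renormalize so the boost cannot inflate the scores
        scores = [s / total for s in new]
    return sorted(zip(scores, sentences), reverse=True)


doc = ("Graphs model sentence similarity. "
       "PageRank scores each sentence in a graph. "
       "Yesterday was sunny. "
       "A keyword raises sentence scores.")
ranked = rank_sentences(doc, "keyword")
top_two = [sentence for _, sentence in ranked[:2]]
```

In this toy document the sentence "Yesterday was sunny." shares no words with the others, so it receives no incoming weight and ranks last. Taking the first few entries of `ranked` corresponds to the final extraction step.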

Description

[0001]This application claims the benefit of Korean Patent Application No. 10-2015-0127556, filed on Sep. 9, 2015, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein by reference in its entirety.

BACKGROUND

[0002]1. Field

[0003]The present inventive concept relates to a document summarization method and system, and more particularly, to a method and system for calculating the importance of each sentence included in a document and extracting important sentences from the document based on the calculated importance of each sentence.

[0004]2. Description of the Related Art

[0005]Document summarization is to generate a summary text that can represent a document. Document summarization is needed to quickly and accurately obtain necessary information from a flood of information.

[0006]A document summarization method may be divided into an extraction method and an abstraction method. The abstraction method may be more effective in summarizing an original do...

Application Information

Patent Type & Authority: Applications (United States)
IPC(8): G06F17/27, G06F17/30, G06F17/22
CPC: G06F17/2705, G06F17/30011, G06F17/2775, G06F17/2211, G06F40/211, G06F16/313, G06F16/93
Inventors: JEONG, JAEPIL; KIM, JAE YUN
Owner: UBERPLE