Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for acquiring document set abstracts and device

An acquisition method and technology for document sets, which are applied in the fields of instrumentation, computing, and electrical digital data processing, can solve the problems of poor acquisition of document set abstracts, and achieve the best acquisition effect.

Inactive Publication Date: 2010-06-23
PEKING UNIV +2
View PDF0 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The embodiment of the present invention provides a method and device for acquiring document summaries, which are used to solve the problem that the existing way of acquiring document summaries based on graph models is not effective in acquiring document summaries

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for acquiring document set abstracts and device
  • Method for acquiring document set abstracts and device
  • Method for acquiring document set abstracts and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Since the existing methods for acquiring summaries of document sets based on graph models cannot reflect the influence of the importance of the document where the sentence is located on the importance weight value of the sentence, the effect of acquiring summaries of document sets is not good. The embodiment of the present invention solves the above problem by constructing a bipartite graph model including relational information between sentences and documents when establishing the graph model, and provides a better solution for obtaining summaries of document collections.

[0025] The main realization principles, specific implementation modes and corresponding beneficial effects that can be achieved of the technical solutions of the embodiments of the present invention will be described in detail below in conjunction with each accompanying drawing.

[0026] Such as figure 1 As shown, the main implementation principle flow of the embodiment of the present invention is a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for acquiring document set abstracts and a device for improving the acquiring effect of the document set abstracts. The method extracts each sentence included in each document in the document set for forming a sentence set, the importance weighted value of each sentence in the sentence set is determined on the basis of the text similarity between documents in the document set and between the sentences in the sentence set, and the document set abstracts are formed by selecting the sentences in the specified number according to the determined importance weighted values in accordance with the selection sequence from higher importance weighted values to lower importance weighted values.

Description

technical field [0001] The present invention relates to the field of language and word processing and the field of information retrieval technology, in particular to a method and device for obtaining abstracts of document collections. Background technique [0002] With the rapid promotion and application of Internet technology, the acquisition technology of document collection abstract has been widely used in the field of text / website content retrieval. The document set summary acquisition technology refers to: the computer system automatically obtains information reflecting the main points of the document content in a document set containing multiple documents. This technology can provide users with concise and concise content descriptions of document sets, and facilitates users to consult a large number of document contents. For example, the basic realization principle of the news service provided by an Internet portal is to first collect various news information on the n...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 万小军杨建武肖建国
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products