Unlock instant, AI-driven research and patent intelligence for your innovation.

Document summary generation method and device

A document abstraction and generation device technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of explosive growth of data scale, exceeding the acceptability, etc., and achieve the effect of low burden and high accuracy

Inactive Publication Date: 2015-12-02
HITACHI CHINA RES & DEV CORP
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Although the advancement of database technology has made the collection and storage of information easier and easier, the explosive growth of data scale has far exceeded people's ability to accept it.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document summary generation method and device
  • Document summary generation method and device
  • Document summary generation method and device

Examples

Experimental program
Comparison scheme
Effect test

no. 1 approach 〕

[0019] figure 1 It shows the structural block diagram of the document summary extracting device of the present invention. Such as figure 1 As shown, the document abstract extraction device of this embodiment includes: a document input unit 104 , a corpus database 101 , a data storage unit (DB) 103 , an associated vocabulary processing unit 102 , a document processing unit 105 and a display unit 106 .

[0020] Wherein, the associative vocabulary processing unit 102 is used to analyze and process the corpus in the corpus database 101, and the data obtained after the analysis—the data representing the degree of association between words and words, that is, the associative vocabulary data—is saved to the data storage Unit 103. The processing performed by the associated vocabulary processing unit 102, that is, the acquisition of the associated vocabulary, will be described in detail later. In addition, the processing performed by the associated vocabulary processing unit 102 may...

no. 2 approach 〕

[0050] In the above first embodiment, if image 3 As shown, in step S307, judge whether there are other associated words in the summary word obtained in step S306 according to the associated word list 208, if exist, then these words are deleted in step S308, otherwise just the abstract obtained in step S306 word as the final abstract word (step S309), and then in step S310, extract the sentence that contains the final abstract word as the abstract. In this first embodiment, the above-mentioned associative vocabulary described above used is a bidirectional, reversible associative vocabulary, that is, if a certain word A has a relationship of A→B with word B (the symbol "→ The left side of " represents the word that appears on the left side of the associated word table, and the right side represents the word that appears on the right side of the same entry in the associated word table, see Table 1 for corresponding understanding), then there must be a B→A association, that is ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a document abstract generating method and device. The document abstract generating method comprises a step of inputting a document, a step of storing a related word list into a storage part, a step of extracting a title from the document, a step of extracting a first word from the extracted title, a step of extracting a plurality of second words which are related with the first word from the document based on the related word list, a step of detecting whether a third word which is related with the second words, except the first word, exists or not based on the related word list, a step of deleting the second words which are related with the third word from a plurality of second words under the condition that the third word which is related with the second words exists, and a step of extracting a sentence which contains the second words obtained by deleting the second words which are related with the third word from a plurality of second words from the input document to be used as an abstract.

Description

technical field [0001] The invention relates to a method and device for automatically extracting abstracts according to document content. Background technique [0002] The development of information technology has brought about a rapid increase in the ability to collect and store information. The advancement of data management technology has promoted the informatization of business and government affairs, and produced a large amount of data. Especially after the rise of the Internet, the information on the Internet has increased exponentially. To manage these data, large databases are being widely used in business and scientific engineering. [0003] Although the progress of database technology has made the collection and storage of information easier and easier, the explosive growth of data scale has far exceeded people's ability to accept it. Especially in recent years, with the widespread application of databases and computer networks, the amount of data stored in datab...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F17/27
Inventor 刘宏建周泉邓攀小林义行
Owner HITACHI CHINA RES & DEV CORP