Device for drawing document correlation diagram where documents are arranged in time series

A correlative diagram and time sequence technology, applied to computer parts, instruments, calculations, etc., can solve problems such as accumulation of deviations, inability to properly represent the temporal development of the field, and ambiguous branch meanings, and achieve the effect of improving misclassification

Inactive Publication Date: 2007-08-29
INTPROP BANK CORP (JP)
View PDF1 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, in the technology described in the above-mentioned Japanese Unexamined Patent Publication No. 11-53387 (Patent Document 1), deviations will accumulate when sequentially searching from a certain document to similar documents, and then to similar documents, and it may be found soon completely different file
In addition, there may be a case where one file is finally found from multiple routes branched from a certain file, and the meaning of the branch may become unclear.
Therefore, in the technology described in the above-mentioned Japanese Patent Application Laid-Open No. 11-53387 (Patent Document 1), there is a problem that temporal development in each field cannot be appropriately represented.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Device for drawing document correlation diagram where documents are arranged in time series
  • Device for drawing document correlation diagram where documents are arranged in time series
  • Device for drawing document correlation diagram where documents are arranged in time series

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 2

[0284]

[0285] In the Codimensional Reduction Method (Codimensional Reduction Method), as in Embodiment 1 (Balanced Cutting Method; BC Method), association rules are used to determine the cutting position of the dendrogram. In embodiment 1, the parameters that can be obtained according to the geometric shape of the dendrogram are used, and the combination height between elements is used as the cutting position, while in this embodiment 2, the index dimension that represents the difference between the document element vectors is used to determine Decide where to cut.

[0286] Since the basic description related to association rule analysis has been done in Embodiment 1, it will be omitted. First, the differences between Embodiment 2 and Embodiment 1 will be described for the parameters used in association rule analysis in Embodiment 2.

[0287]

[0288] When a certain node (node) c is given in the dendrogram, its combination level is represented by an integer i(c...

Embodiment 3

[0324]

[0325] In the Cell Division Method, after cutting the dendrogram at the cutting height α determined by a certain method and extracting the parent cluster, only the file elements belonging to each parent cluster are used in order to divide each parent cluster into sub-clusters , again making a dendrogram of that section. When creating the partial dendrogram, the index dimension for which the deviation value of the document element vector component in the parent cluster is smaller than the value determined by a predetermined method is removed and analyzed.

[0326]

[0327] Fig. 11 is a flowchart illustrating a cluster extraction procedure in Example 3 (cell division method; CD method). This flowchart shows the sequence of the third embodiment in more detail than in FIG. 3 . For the same steps as in Fig. 3, 300 is added to the step number in Fig. 3, and the last two digits are the same step numbers as in Fig. 3, and the description repeated with Fig. 3 is so...

Embodiment 6

[0444]

[0445] In Pole-and-Line Arrangement, for a cluster with a small number of file elements, the arrangement within the cluster is determined based on time data and dendrogram configuration data.

[0446]

[0447] Fig. 30 is a flow chart illustrating an arrangement process within a cluster in Embodiment 6 (rod and fishing arrangement; PLA). In this flow chart, the premise is that clusters are extracted through the processing before step S70 (cluster extraction) in FIG. 3, and the parts of step S80 (reading of configuration conditions) and step S90 (arrangement of elements in clusters) in FIG. 3 are shown in more detail. The sequence of the present embodiment 6 has been completed. For the same steps as in Fig. 3, 600 is added to the step number in Fig. 3, and the last two digits are the same as those in Fig. 3, and the description repeated with Fig. 3 is sometimes omitted.

[0448] FIG. 31 is a diagram showing an example of arranging a tree diagram in the in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A document correlation diagram drawing device includes extracting means (20, 30) for extracting content data and time data of document elements (E) each including one or more documents, dendrogram drawing means (50) for drawing a dendrogram showing a correlation between documents on the basis of the content data of the document elements, clustering means (70) for cutting the dendrogram in accordance with a predetermined rule and extracting clusters, and intra-cluster arranging means (90) for determining an intra-cluster arrangement of the document elements belonging to each cluster on the basis of the time data of the document elements. Accordingly, a dendrogram adequately showing the chronological development in each field can be automatically drawn.

Description

technical field [0001] The present invention relates to a technology for automatically creating a document correlation graph that expresses the relationship between documents and reflects the chronological order of documents, and particularly relates to a device, method, and program for creating such a document correlation graph. Background technique [0002] The number of technical documents and other documents, including patent documents, emerges in an endless stream. In order to present the interrelationships of these documents in a concise and easy-to-understand form, it is preferable to organize their temporal development by related content. Therefore, it is preferable to automatically create a file correlation graph that takes into account both the association of file contents and the chronological arrangement. [0003] Japanese Patent Application Laid-Open No. 11-53387 "Document Association Method and System" (Patent Document 1) discloses a method for associating doc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06K9/6219G06F18/231
Inventor 增山博昭佐藤晴正浅田诚莲子和巳堀田任晃
Owner INTPROP BANK CORP (JP)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products