Unlock instant, AI-driven research and patent intelligence for your innovation.

Device for drawing document correlation diagram where documents are arranged in time series

A correlative graph and time sequence technology, applied to computer components, instruments, calculations, etc., can solve problems such as accumulation of deviations, unclear branch meanings, and inability to properly represent the temporal development of the field, and achieve the effect of improving misclassification

Inactive Publication Date: 2009-02-18
INTPROP BANK CORP (JP)
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, in the technique described in the above-mentioned Japanese Patent Application Laid-Open No. 11-53387 (Patent Document 1), deviations will accumulate when sequentially searching from a certain document to similar documents, and then to similar documents, and it may be found soon completely different file
In addition, there may be a case where one file is finally found from multiple routes branched from a certain file, and the meaning of the branch may become unclear.
Therefore, in the technology described in the above-mentioned Japanese Patent Application Laid-Open No. 11-53387 (Patent Document 1), there is a problem that the temporal development of each field cannot be properly represented.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Device for drawing document correlation diagram where documents are arranged in time series
  • Device for drawing document correlation diagram where documents are arranged in time series
  • Device for drawing document correlation diagram where documents are arranged in time series

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 2

[0287]

[0288] In the Codimensional Reduction Method (Codimensional Reduction Method), as in Embodiment 1 (Balanced Cutting Method; BC Method), association rules are used to determine the cutting position of the dendrogram. In embodiment 1, the parameters that can be obtained according to the geometric shape of the dendrogram are used, and the combination height between elements is used as the cutting position, while in this embodiment 2, the index dimension that represents the difference between the document element vectors is used to determine Decide where to cut.

[0289] Since the basic description related to association rule analysis has been done in Embodiment 1, it will be omitted. First, the differences between Embodiment 2 and Embodiment 1 will be described for the parameters used in association rule analysis in Embodiment 2.

[0290]

[0291] When a certain node (node) c is given in the dendrogram, its combination level is represented by an integer i(c...

Embodiment 3

[0327]

[0328] In the Cell Division Method, after cutting the dendrogram at the cutting height α determined by a certain method and extracting the parent cluster, only the file elements belonging to each parent cluster are used in order to divide each parent cluster into sub-clusters , again making a dendrogram of that section. When creating the partial dendrogram, the dimension of the index term whose deviation value of the document element vector component in the parent cluster is smaller than the value determined by a predetermined method is removed and analyzed.

[0329]

[0330] Fig. 11 is a flowchart illustrating a cluster extraction procedure in Example 3 (cell division method; CD method). This flowchart is more image 3 The procedure of the third embodiment is shown in more detail. for with image 3 The same steps are in image 3 Add 300 to the step number, and take the last two digits and image 3 Same step number, sometimes omitted with image 3...

Embodiment 8

[0475]

[0476] Time Slice Analyzes is a method of performing cluster analysis within each time category after classifying a plurality of file elements to be analyzed based on time data. This is different from the sixth and seventh embodiments described above in that analysis is performed based on time data before clusters are extracted based on content data. After classification based on time data and analysis of clusters within each time classification are completed, lines are drawn between elements belonging to clusters before and after time, and the file correlation graph is completed.

[0477]

[0478] Figure 34 is better than figure 2 It is a diagram explaining the configuration and functions of the file-correlation graph creation device in Embodiment 8 (time-sectional analysis; TSA) in more detail. right with figure 2 The same parts are denoted by the same symbols, and explanations are omitted.

[0479] The file correlation diagram making device ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A document correlation diagram drawing device includes extracting means (20, 30) for extracting content data and time data of document elements (E) each including one or more documents, dendrogram drawing means (50) for drawing a dendrogram showing a correlation between documents on the basis of the content data of the document elements, clustering means (70) for cutting the dendrogram in accordance with a predetermined rule and extracting clusters, and intra-cluster arranging means (90) for determining an intra-cluster arrangement of the document elements belonging to each cluster on the basis of the time data of the document elements. Accordingly, a dendrogram adequately showing the chronological development in each field can be automatically drawn.

Description

technical field [0001] The present invention relates to a technology for automatically creating a document correlation graph that expresses the relationship between documents and reflects the chronological order of documents, and particularly relates to a device, method, and program for creating such a document correlation graph. Background technique [0002] The number of technical documents and other documents, including patent documents, emerges in an endless stream. In order to present the interrelationships of these documents in a concise and easy-to-understand manner, it is preferable to sort out the temporal development for each related content. Therefore, it is preferable to automatically create a file correlation graph that takes into account both the association of file contents and the chronological arrangement. [0003] Japanese Patent Laying-Open No. 11-53387 "Document Association Method and System" (Patent Document 1) discloses a method for associating documen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06K9/6219G06F18/231
Inventor 增山博昭佐藤晴正浅田诚莲子和巳堀田任晃
Owner INTPROP BANK CORP (JP)