A method and system for text analysis using a knowledge topographic map

A knowledge topographic map and text analysis technology, applied in the field of text analysis, that addresses the poor performance and complicated implementation of existing methods and offers simple implementation and strong scalability.

Active Publication Date: 2019-08-20
德稻全球创新网络(北京)有限公司
4 Cites, 0 Cited by
AI Technical Summary

Problems solved by technology

[0007] The present invention mainly solves the technical problems of poor performance and complex implementation in the prior art by providing a method and system for text analysis using knowledge topographic maps.

Examples

Embodiment 1

[0060] Step 1: Use a text mining method to extract the subject words from the text data and obtain a list of subject words.
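
As a rough illustration of Step 1, the sketch below picks subject words by raw frequency after dropping stop words. The source does not say which text mining method is used, so the frequency heuristic, the `STOP_WORDS` list, and the function name `extract_subject_words` are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "for", "on", "with"}

def extract_subject_words(documents, top_k=5):
    """Return the top_k most frequent non-stop-word tokens across all documents."""
    counts = Counter()
    for doc in documents:
        tokens = re.findall(r"[a-z]+", doc.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS)
    return [word for word, _ in counts.most_common(top_k)]

docs = [
    "Text mining of patent text",
    "Patent analysis with text visualization",
    "Visualization of patent maps",
]
subject_words = extract_subject_words(docs, top_k=3)
print(subject_words)
```

A production system would more likely rely on word segmentation, TF-IDF weighting, or a topic model; the frequency count is only the simplest stand-in.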

[0061] Step 2: Calculate the relationship strength of the subject terms. Compute the co-occurrence matrix of the subject terms, then derive the relationship strength matrix between the subject terms from the co-occurrence matrix. The strength can be calculated using inverse document frequency, information entropy, mutual information, and so on, or measured directly by the number of co-occurrences. Let the relationship strength matrix between n subject terms be Corr_{n×n}.
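
The following sketch shows one way Step 2 could look: a document-level co-occurrence matrix, followed by a strength matrix based on positive pointwise mutual information. PMI is used here as one possible reading of the "mutual information" option; the exact formula in the source is not shown, so the function names and the choice to clip negative values to zero are assumptions.

```python
import math

def cooccurrence_matrix(documents, terms):
    """Count, for each pair of terms, the documents that contain both."""
    n = len(terms)
    matrix = [[0] * n for _ in range(n)]
    for doc in documents:
        present = [i for i, t in enumerate(terms) if t in doc]
        for i in present:
            for j in present:
                if i != j:
                    matrix[i][j] += 1
    return matrix

def pmi_strength(documents, terms):
    """Positive PMI per term pair; 0.0 when the pair never co-occurs or PMI is negative."""
    n_docs = len(documents)
    co = cooccurrence_matrix(documents, terms)
    doc_freq = [sum(1 for doc in documents if t in doc) for t in terms]
    n = len(terms)
    corr = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j and co[i][j] > 0:
                pmi = math.log(co[i][j] * n_docs / (doc_freq[i] * doc_freq[j]))
                corr[i][j] = max(pmi, 0.0)
    return corr

# Each document is represented as the set of subject words it contains.
docs = [{"map", "text"}, {"map", "text", "color"}, {"color", "layout"}, {"layout"}]
terms = ["map", "text", "color", "layout"]
co = cooccurrence_matrix(docs, terms)
corr = pmi_strength(docs, terms)
```

Using `co` directly as the strength matrix corresponds to the simplest option the paragraph mentions, measuring strength by the number of co-occurrences.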

[0062]

[0063] Step 3: Layout of subject words

[0064] To draw a knowledge topographic map, the position coordinates of the subject words in the plane must be determined. The procedure is as follows:

[0065] a. Set the relationship strength threshold value, the node strength greater than the thres...

Embodiment 2

[0083] Step 1: Use the text word segmentation method to obtain the subject words in the text data set.

[0084] Step 2: Use the co-occurrence frequency of the subject terms as the relationship strength value of the subject terms.

[0085] Step 3: Apply the Fruchterman-Reingold layout algorithm and the VOS mapping algorithm to calculate the plane coordinates of the subject words.

[0086] As shown in Figure 1, suppose there are 12 subject words divided into three groups A, B, and C, with relationship strength matrices R1, R2, and R3. To lay out the 12 subject words, the three groups A, B, and C are first treated as three nodes with equal node distances (dotted lines in the figure). The Fruchterman-Reingold layout algorithm is used to lay out the three nodes, and the center position of each node is recorded. For the nodes inside each group, such as three nodes in group A, four nodes in group B, and five nodes in group C, the VOS mapping algorithm is used...
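
The two-level layout described above can be sketched as follows, under several assumptions. The Fruchterman-Reingold routine is a simplified textbook version (no frame clipping), not necessarily the variant used in the patent. Because the intra-group step is cut off in the source, `circle_layout` is a hypothetical placeholder that stands in for the VOS mapping step; only the final recentering of each group onto its group coordinate follows the abstract.

```python
import math
import random

def fruchterman_reingold(n, edges, iterations=200, size=1.0, seed=0):
    """Simplified Fruchterman-Reingold layout; edges is a list of (i, j) index pairs."""
    rng = random.Random(seed)
    pos = [[rng.uniform(0, size), rng.uniform(0, size)] for _ in range(n)]
    k = size / math.sqrt(n)   # ideal distance between nodes
    temp = size / 10.0        # cap on how far a node may move per iteration
    cooling = temp / iterations
    for _ in range(iterations):
        disp = [[0.0, 0.0] for _ in range(n)]
        # Repulsion between every pair of nodes, magnitude k^2 / d.
        for i in range(n):
            for j in range(i + 1, n):
                dx, dy = pos[i][0] - pos[j][0], pos[i][1] - pos[j][1]
                d = math.hypot(dx, dy) or 1e-9
                f = k * k / d
                disp[i][0] += dx / d * f
                disp[i][1] += dy / d * f
                disp[j][0] -= dx / d * f
                disp[j][1] -= dy / d * f
        # Attraction along each edge, magnitude d^2 / k.
        for i, j in edges:
            dx, dy = pos[i][0] - pos[j][0], pos[i][1] - pos[j][1]
            d = math.hypot(dx, dy) or 1e-9
            f = d * d / k
            disp[i][0] -= dx / d * f
            disp[i][1] -= dy / d * f
            disp[j][0] += dx / d * f
            disp[j][1] += dy / d * f
        # Move each node along its net displacement, limited by the temperature.
        for i in range(n):
            d = math.hypot(disp[i][0], disp[i][1]) or 1e-9
            step = min(d, temp)
            pos[i][0] += disp[i][0] / d * step
            pos[i][1] += disp[i][1] / d * step
        temp = max(temp - cooling, 0.0)
    return [(x, y) for x, y in pos]

def circle_layout(count, radius=0.05):
    """Hypothetical placeholder for the intra-group layout (VOS mapping in the source)."""
    return [(radius * math.cos(2 * math.pi * m / count),
             radius * math.sin(2 * math.pi * m / count)) for m in range(count)]

def recenter(points, target):
    """Translate points so that their centroid lands on the target coordinate."""
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    return [(x - cx + target[0], y - cy + target[1]) for x, y in points]

group_sizes = {"A": 3, "B": 4, "C": 5}
group_edges = [(0, 1), (1, 2), (0, 2)]   # the three groups are mutually connected
centers = fruchterman_reingold(len(group_sizes), group_edges)
layout = {name: recenter(circle_layout(size), centers[g])
          for g, (name, size) in enumerate(group_sizes.items())}
```

Laying out groups first keeps the expensive all-pairs force computation small: it runs over 3 group nodes here rather than all 12 subject words.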

Abstract

The invention relates to a text analysis method and system in the field of information processing, and in particular to a method and system for performing text analysis using a knowledge topographic map. The method for building the knowledge topographic map comprises a coordinate mapping step and a graph rendering step. The coordinate mapping step divides the topic words into m groups according to a preset rule, maps the m groups onto a plane to obtain m group coordinates, calculates the coordinates of the nodes within each group, and moves the center point of each group onto its group coordinate. The graph rendering step establishes a density function that reflects pixel color values according to the relationship strength of the topic words, establishes a color palette and a mapping between the palette and the density function, and renders the graph according to that mapping. A knowledge topographic map built with this method is simple, easy to implement, and intuitive; it allows large-scale text data to be browsed quickly and key information in the text data to be mined, and it is highly extensible.
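
To make the graph rendering step more concrete, here is a minimal sketch under stated assumptions. The abstract does not give the density function, so a Gaussian kernel weighted by each node's strength is assumed; `PALETTE`, `bandwidth`, and the function names are hypothetical choices for this example.

```python
import math

def density_grid(nodes, width, height, bandwidth=1.5):
    """nodes: list of (x, y, weight). Returns a height x width grid of kernel density."""
    grid = [[0.0] * width for _ in range(height)]
    for py in range(height):
        for px in range(width):
            grid[py][px] = sum(
                w * math.exp(-((px - x) ** 2 + (py - y) ** 2) / (2 * bandwidth ** 2))
                for x, y, w in nodes
            )
    return grid

# Hypothetical palette running from low density (blue) to high density (red).
PALETTE = [(0, 0, 255), (0, 255, 255), (255, 255, 0), (255, 0, 0)]

def render(grid, palette=PALETTE):
    """Map each density value to a palette color, scaled by the grid's peak density."""
    peak = max(max(row) for row in grid) or 1.0
    last = len(palette) - 1
    return [[palette[min(int(v / peak * len(palette)), last)] for v in row]
            for row in grid]

# Two topic words; the weight stands in for summed relationship strength.
nodes = [(2, 2, 3.0), (7, 5, 1.0)]
image = render(density_grid(nodes, width=10, height=8))
```

The palette lookup is the "mapping relationship" in miniature: density is normalized to the peak value and bucketed into one color per band, which produces the contour-like bands of a topographic map.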

Description

Technical field

[0001] The invention relates to a method and system for text analysis in the field of information processing, and in particular to a method and system for text analysis using a knowledge topographic map.

Background

[0002] A knowledge topographic map visualizes text data through a contour map similar to those used in geographic information systems, and distinguishes the amount of data and the relationships between data by color depth. Some of the literature also calls it a landscape map or theme map. Although the names and presentations differ, the basic ideas are the same.

[0003] The heat map is a simple variant of the knowledge topographic map. It is a computer simulation of the thermal imaging principle in nature. The amount of data is distinguished by the depth of the three colors red, yellow, and blue, and the density of data is distinguished by color blocks. The technical reali...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/27
CPC: G06F40/205
Inventor: 刘玉琴, 李军, 柳岸, 王金秋, 李韦, 朱东华, 李维
Owner: 德稻全球创新网络(北京)有限公司