A method and system for text analysis using a knowledge topographic map
Knowledge topographic map and text analysis technology, applied in the field of text analysis, addresses the poor performance and complicated implementation of existing methods, and is easy to implement and highly scalable.
Active Publication Date: 2019-08-20
德稻全球创新网络(北京)有限公司
Cites: 4. Cited by: 0.
AI Technical Summary
Problems solved by technology
[0007] The present invention mainly addresses the poor performance and complex implementation of the prior art, and provides a method and system for text analysis using knowledge topographic maps.
Examples
Embodiment 1
[0060] Step 1: Use a text mining method to extract the subject words from the text data and obtain the list of subject words.
[0061] Step 2: Calculate the relationship strength of the subject words. Compute the co-occurrence matrix of the subject words, then derive the relationship strength matrix between the subject words from the co-occurrence matrix. The strength can be measured with inverse document frequency, information entropy, mutual information, and so on, or directly by the number of co-occurrences. Let the relationship strength matrix between the n subject words be Corr_{n×n}.
[0062]
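Step 2 can be illustrated with a minimal sketch. It assumes each document is already reduced to a list of subject words, counts co-occurrence per document, and uses positive pointwise mutual information as one possible strength measure. The function names, the sample data, and the choice of positive PMI are illustrative assumptions, not details taken from the patent.

```python
import math
from itertools import combinations

def cooccurrence_matrix(docs, terms):
    """Count, for every pair of terms, the number of documents containing both."""
    index = {t: i for i, t in enumerate(terms)}
    n = len(terms)
    counts = [[0] * n for _ in range(n)]
    for doc in docs:
        present = sorted({index[w] for w in doc if w in index})
        for i in present:
            counts[i][i] += 1              # diagonal holds the document frequency
        for i, j in combinations(present, 2):
            counts[i][j] += 1
            counts[j][i] += 1
    return counts

def pmi_strength(counts, n_docs):
    """Positive pointwise mutual information (an assumed strength measure)."""
    n = len(counts)
    corr = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j or counts[i][j] == 0:
                continue
            p_ij = counts[i][j] / n_docs
            p_i = counts[i][i] / n_docs
            p_j = counts[j][j] / n_docs
            corr[i][j] = max(0.0, math.log(p_ij / (p_i * p_j)))
    return corr

# Hypothetical subject words and documents, for illustration only.
terms = ["map", "text", "color"]
docs = [["map", "text"], ["map", "text", "color"], ["color"], ["text"]]
counts = cooccurrence_matrix(docs, terms)
corr = pmi_strength(counts, len(docs))
```

Using the raw counts in `counts` directly as the strength matrix corresponds to the simplest option named in the step; the PMI variant additionally discounts words that are frequent everywhere.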
[0063] Step 3: Layout of subject words
[0064] To draw a knowledge topographic map, the position coordinates of the subject words in the plane must be determined. The procedure is as follows:
[0065] a. Set the relationship strength threshold value, the node strength greater than the thres...
Embodiment 2
[0083] Step 1: Use a word segmentation method to obtain the subject words in the text data set.
[0084] Step 2: Use the co-occurrence frequency of the subject words as their relationship strength value.
[0085] Step 3: Apply the Fruchterman-Reingold layout algorithm and the VOS mapping algorithm to calculate the plane coordinates of the subject words.
[0086] As shown in Figure 1, suppose there are 12 subject words divided into three groups A, B, and C, with relationship strength matrices R_1, R_2, and R_3. To lay out the 12 subject words, the three groups A, B, and C are first treated as three nodes whose pairwise distances (dotted lines in the figure) are equal. The Fruchterman-Reingold layout algorithm lays out these three nodes, and the center position of each node is recorded. For the nodes inside each group (three nodes in group A, four in group B, and five in group C), the VOS mapping algorithm is used...
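The two-level layout described in this embodiment can be sketched roughly as follows. This is a simplified illustration under stated assumptions: it uses a basic weighted Fruchterman-Reingold layout for the groups and, in place of the VOS mapping algorithm, reuses the same force-directed layout inside each group. The function names, the `scale` parameter, and the all-ones strength matrices are hypothetical; the patent's R_1, R_2, R_3 values are not given in this excerpt.

```python
import math
import random

def fruchterman_reingold(weights, iterations=200, seed=0):
    """Basic Fruchterman-Reingold layout; weights[i][j] > 0 links nodes i and j."""
    rng = random.Random(seed)
    n = len(weights)
    if n == 1:
        return [(0.0, 0.0)]
    pos = [[rng.random(), rng.random()] for _ in range(n)]
    k = math.sqrt(1.0 / n)                 # ideal edge length in the unit square
    temp = 0.1                             # maximum displacement per iteration
    for _ in range(iterations):
        disp = [[0.0, 0.0] for _ in range(n)]
        for i in range(n):
            for j in range(n):
                if i == j:
                    continue
                dx = pos[i][0] - pos[j][0]
                dy = pos[i][1] - pos[j][1]
                dist = max(math.hypot(dx, dy), 1e-9)
                force = k * k / dist                      # repulsion between all pairs
                force -= weights[i][j] * dist * dist / k  # attraction along links
                disp[i][0] += dx / dist * force
                disp[i][1] += dy / dist * force
        for i in range(n):
            length = max(math.hypot(disp[i][0], disp[i][1]), 1e-9)
            step = min(length, temp)
            pos[i][0] += disp[i][0] / length * step
            pos[i][1] += disp[i][1] / length * step
        temp *= 0.98                                      # cool down
    return [(x, y) for x, y in pos]

def two_level_layout(group_weights, inner_weights, scale=0.2):
    """Lay out the groups first, then centre each group's nodes on its group coordinate."""
    group_xy = fruchterman_reingold(group_weights)
    layout = []
    for (gx, gy), w in zip(group_xy, inner_weights):
        local = fruchterman_reingold(w)    # stand-in for the within-group algorithm
        cx = sum(x for x, _ in local) / len(local)
        cy = sum(y for _, y in local) / len(local)
        # Shift (and shrink) the group so that its centre lands on the group coordinate.
        layout.append([(gx + scale * (x - cx), gy + scale * (y - cy)) for x, y in local])
    return group_xy, layout

def full(n):
    """Hypothetical strength matrix: every pair linked with strength 1."""
    return [[0.0 if i == j else 1.0 for j in range(n)] for i in range(n)]

# Three equally related groups holding 3, 4 and 5 subject words.
group_xy, layout = two_level_layout(full(3), [full(3), full(4), full(5)])
```

Laying out groups before their members keeps related subject words clustered and avoids running one force simulation over all 12 nodes at once; `scale` only controls how tightly each group is drawn around its center.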
Abstract
The invention relates to a text analysis method and system in the field of information processing, and in particular to a method and system for text analysis using a knowledge topographic map. The method for building the knowledge topographic map comprises a coordinate mapping step and a graph rendering step. The coordinate mapping step divides the topic words into m groups according to a preset rule, maps the m groups onto a plane to obtain m group coordinates, calculates the coordinates of the nodes within each group, and moves the center point of each group onto its group coordinate. The graph rendering step establishes a density function that determines the color value of each pixel from the relationship strength of the topic words, establishes a color palette and a mapping between the palette and the density function, and renders the graph according to that mapping. A knowledge topographic map built this way is simple, easy to implement, and intuitive; it allows large-scale text data to be browsed quickly and key information in the text data to be mined, and it is highly scalable.
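The graph rendering step can be illustrated with a minimal sketch. It assumes the density at a pixel is a sum of Gaussian kernels centered on the topic-word coordinates and weighted by a per-node strength, and that the palette runs from blue through yellow to red. The kernel form, the `bandwidth` value, the palette colors, and the sample nodes are all assumptions made for illustration; the abstract does not specify them.

```python
import math

def density(x, y, nodes, bandwidth=0.15):
    """Sum of Gaussian kernels centred on the nodes, weighted by node strength."""
    total = 0.0
    for nx, ny, strength in nodes:
        d2 = (x - nx) ** 2 + (y - ny) ** 2
        total += strength * math.exp(-d2 / (2.0 * bandwidth ** 2))
    return total

PALETTE = [(0, 0, 255), (255, 255, 0), (255, 0, 0)]   # blue -> yellow -> red (assumed)

def palette_color(value, vmax):
    """Map a density value to a color by linear interpolation along the palette."""
    t = 0.0 if vmax <= 0 else min(max(value / vmax, 0.0), 1.0)
    pos = t * (len(PALETTE) - 1)
    lo = min(int(pos), len(PALETTE) - 2)
    frac = pos - lo
    a, b = PALETTE[lo], PALETTE[lo + 1]
    return tuple(round(a[c] + (b[c] - a[c]) * frac) for c in range(3))

def render(nodes, width=64, height=64):
    """Return a height x width grid of RGB tuples covering the unit square."""
    field = [[density((col + 0.5) / width, (row + 0.5) / height, nodes)
              for col in range(width)] for row in range(height)]
    vmax = max(max(r) for r in field)
    return [[palette_color(v, vmax) for v in r] for r in field]

# Hypothetical nodes as (x, y, strength); the strength could be, for example,
# the sum of a topic word's relationship strengths.
nodes = [(0.3, 0.3, 2.0), (0.7, 0.6, 1.0)]
image = render(nodes)
```

Normalizing by the maximum density keeps the full palette in use regardless of how many topic words are drawn; a fixed scale would instead make separate maps directly comparable.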
Description
Technical field
[0001] The invention relates to a method and system for text analysis, belonging to the field of information processing, and in particular to a method and system for text analysis using a knowledge topographic map.
Background art
[0002] The knowledge topographic map visualizes text data through a contour map similar to those of a geographic information system, and distinguishes the amount of data and the relationships between data by depth of color. Some literature also calls it a landscape map or theme map. Although the names and forms of expression differ, the basic idea is the same.
[0003] The heat map is a simple variant of the knowledge topographic map. It is a computer simulation of the thermal imaging principle in nature: the amount of data is distinguished by the depth of the three colors red, yellow and blue, and the density of data is distinguished by color blocks. The technical reali...