Knowledge map construction method based on GHSOM algorithm

A knowledge map and construction method technology, which is applied in computing, computer parts, semantic tool creation, etc., can solve the problems that SOM variants are difficult to handle maps, etc., and achieve easy search, accurate classification results, and improved precision and recall. Effect

Inactive Publication Date: 2020-01-31
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Among the existing text clustering algorithms, the SOM algorithm has a significant disadvantage, that is, its architecture must be defined in advance, and dynamically growing SOM variants often produce huge maps that are difficult to handle

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Knowledge map construction method based on GHSOM algorithm
  • Knowledge map construction method based on GHSOM algorithm
  • Knowledge map construction method based on GHSOM algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Embodiments of the invention are described in detail below, examples of which are illustrated in the accompanying drawings. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0042] The environment of the present invention is in the Anaconda experimental environment based on python 3.6 version.

[0043] The entire knowledge map construction process is as follows: figure 2 As shown, first, collect the text to be processed, generate a text set, and perform data preprocessing on the text set. The preprocessing content includes:

[0044] (1) Segment the text set with paragraphs as the basic unit to improve the precision and recall of the knowledge map construction results;

[0045] (2) Carry out Chinese participle processing by paragraph, remove stop word according to Chinese stop word table;

[0046] (3) According to the word segmentation re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a knowledge map construction method based on a GHSOM algorithm, the GHSOM is of a multi-layer hierarchical structure, each layer comprises a plurality of independent growth-type SOMs, and a data set is described to a certain degree of detail by the increasing scale. The method comprises the following steps of: when the knowledge map is constructed, performing data preprocessing on a text data set to be classified, generating an initial input vector for checking calculation of a GHSOM algorithm by combining technical means such as Chinese word segmentation, keyword extraction, file vector generation and the like, performing clustering analysis on texts by using the GHSOM algorithm, and finally establishing a knowledge map. The advancement of the method is mainly reflected in shorter calculation time, and richer ordered expression ability is provided. According to the method, the latest data mining technical result is adopted, the improved GHSOM algorithm is applied to the construction of the knowledge map, and the knowledge map in the special field is established by trying to use the method. Results show that the accuracy and recall rate of the professional domain knowledge map constructed by the method are remarkably improved.

Description

technical field [0001] The invention relates to a method for constructing a knowledge map based on a GHSOM algorithm, and belongs to the technical field of data mining. Background technique [0002] With the rapid development of computer technology, especially the continuous application of Internet technology, people's ability to use network information technology to generate and collect data has been greatly improved, and the data has shown a rapid growth trend. How to obtain the required information from massive data has become an urgent research problem. Faced with such a challenge, data mining (Data Mining) technology emerged as the times require, using data mining technology to obtain hidden useful information from these massive data. However, due to the explosive growth of data, how to use data mining technology to quickly and effectively obtain hidden useful information from massive data is an urgent problem to be solved. [0003] The "knowledge map" proposed by Bro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/36G06F16/35G06F40/289G06K9/62
CPCG06F16/367G06F16/355G06F18/23213
Inventor 张浩洋周良
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products