Document theme partitioning method based on domain knowledge map community structure

A technique of domain knowledge and document subject, applied in the field of document subject division

Inactive Publication Date: 2013-11-27
TAIYUAN UNIV OF TECH
View PDF7 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] In order to solve the subject division problem of various subject documents in existing large-scale network courses, the present invention provid

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document theme partitioning method based on domain knowledge map community structure
  • Document theme partitioning method based on domain knowledge map community structure
  • Document theme partitioning method based on domain knowledge map community structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The domain knowledge map is a complex network that describes the knowledge in a certain field (course or discipline) and the association between these knowledge; the knowledge unit refers to the basic knowledge fragment with complete expression ability in the knowledge map; the domain knowledge map library is a storage The database of knowledge units in the field records the detailed information of knowledge units, such as the name of the knowledge unit, the corresponding text segment of the knowledge unit, the core terms contained in the knowledge unit, and the relationship between the knowledge units. Usually, the knowledge map of a subject is constructed from the document resources of the subject, expressed as a network of knowledge units and their associated relationships; after using complex network community discovery algorithms to divide the domain knowledge map into community structures, each community has a relative separate subject. Therefore, the knowledge un...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a document theme partitioning method based on a domain knowledge map community structure, and the partitioning problem of document resources related to subject knowledge or document knowledge is mainly solved, so that documents related to a theme can be stored in a close logical place, and learning efficiency is improved. The document theme partitioning method is characterized in that a level community discovery algorithm based on the Fast Geedy algorithm and the GN algorithm is proposed, and a theme structure tree is built; in the process of feature extraction, knowledge units directly serve as feature vectors, and due to the fact that the knowledge units have semantic integrality, compared with a traditional method based on participles, the document theme partitioning method can reflect theme characteristics of the feature vectors better; in the process of calculating feature vector values, the method of combination of degree centrality and knowledge unit file frequency is proposed, wherein the concept of the degree centrality reflects the status of the knowledge units in a knowledge map whole situation. Through the method, accuracy of document theme partitioning is effectively improved, and the method is suitable for the document theme partitioning based on the domain knowledge map community structure in general scenes.

Description

technical field [0001] The invention relates to the division of document topics based on the domain knowledge map community structure, and mainly solves the problem of division of document resources related to subject or domain knowledge, so as to store subject-related documents in similar logical positions and improve storage and access efficiency. Background technique [0002] With the expansion of the network course platform, the scale of the various subject documents of the network course continues to expand. Documents with similar topics are stored in similar logical locations. When learners learn a resource, they can prefetch other resources associated with the topic. , reduce the time overhead of reading files, and improve storage and access efficiency. [0003] For the subject division method of documents, the following three patent documents provide different technical solutions: [0004] 1. Text classification feature selection and weight calculation method based...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/21
Inventor 郑庆华董博刘均徐海鹏李冰贺欢马天
Owner TAIYUAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products