Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and system for acquiring important knowledge points in field

A technology of knowledge points and fields, applied in the field of digital resource processing, can solve problems such as multi-manpower and material resources, difficult standards, poor objectivity, etc., to reduce workload, save time and labor costs, and improve efficiency and accuracy.

Inactive Publication Date: 2016-04-06
NEW FOUNDER HLDG DEV LLC +2
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] For this reason, the technical problem to be solved by the present invention is that in the prior art, it is necessary to manually determine the important entries in the field, it takes more manpower and material resources, the standard is not easy, and the objectivity is poor. The method of automatic acquisition of important knowledge points in the field of processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for acquiring important knowledge points in field
  • Method and system for acquiring important knowledge points in field
  • Method and system for acquiring important knowledge points in field

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] In this embodiment, a method for obtaining important knowledge points in the domain is provided, and the flow chart is as follows figure 1 shown. The knowledge points in the field refer to the words or entries in the field, which reflect the knowledge in the field. The method of obtaining important knowledge points in the field includes the following process:

[0034] S1: Segment the text to obtain the word segmentation result.

[0035] Some digital resources in the field are selected for the text here. In order to make the knowledge points covered by it broad enough, more electronic digital resources in the field are generally selected. For example, in the field of history, you can choose e-books in this field related to the history of five thousand years and the history of dynasties. After selecting the digital resources in the field, extract the text from it, and then segment the words. After the word segmentation, a large number of words are obtained. These words...

Embodiment 2

[0067] This embodiment provides a method for obtaining important knowledge points in the field, and its steps are the same as those in Embodiment 1. This embodiment provides a specific method for calculating the semantic vector of each candidate knowledge point in the above process, the specific process as follows:

[0068] The first step is to determine the number of occurrences of each candidate knowledge point in the candidate document, so that the text of each candidate knowledge point and its occurrence times is obtained. The candidate text is the text obtained after word segmentation from the selected digital resources, and the candidate knowledge point is the word obtained after the word segmentation in the candidate text except common words. This part is the same as that in Embodiment 1, and will not be repeated here.

[0069] The second step is to calculate the binary tree with the minimum weighted path length according to each candidate knowledge point and the number...

Embodiment 3

[0098] Field encyclopedias are an important digital publishing resource. Domain encyclopedias usually organize domain information in the form of entries. The domain encyclopedia needs to contain important entries in the domain. However, building a domain encyclopedia requires a lot of human input. This embodiment provides a method for acquiring important domain knowledge points, which are entries in domain encyclopedias. In this embodiment, the domain e-book text and newspaper text are used to calculate the semantic vector of the candidate entry through the skip-gram model. The semantic similarity of candidates is calculated through the semantic vector, and the semantic similarity matrix of all candidate entries is obtained. The semantic similarity matrix is ​​used to calculate the important entries in the candidate entries, and then the field encyclopedia can be built or the gaps can be checked and filled according to these important entries, which provides an objective an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a method for acquiring important knowledge points in a field. The method comprises: firstly, determining candidate knowledge points in the field; then calculating semantic vectors of the candidate knowledge points; carrying out calculation according to the semantic vector of every knowledge point to obtain a semantic similarity matrix; and calculating important knowledge points in the candidate knowledge points according to the semantic similarity matrix, wherein the knowledge points are the important knowledge points in this field. When a field encyclopedia is built or inspected, vocabulary entries can be established according to the important knowledge points, or the vocabulary entries are inspected whether to be perfect, and non-included important knowledge points are added into the vocabulary entries which need to be built. In the manner, inspection and building of the vocabulary entries of the encyclopedia in the field are completed. Manual workload is greatly reduced, time cost and labor cost are saved, subjectivity of manual inspection and inaccuracy brought by a non-uniform standard are avoided, thereby greatly improving efficiency and accuracy.

Description

technical field [0001] The invention relates to the field of digital resource processing, in particular to a method and system for acquiring important knowledge points in the field. Background technique [0002] Digital publishing resources have become one of the main ways of information provision. People have shifted from paper reading to electronic reading in large numbers. Digital publishing resources include e-books, digital encyclopedias, digital periodicals, digital newspapers, etc. The information provided by digital publishing resources is usually more authoritative and accurate than that of the Internet. Therefore, how to improve people's learning or reading experience according to the characteristics of digital publishing resources has become particularly important. [0003] Encyclopedia (Encyclopedia) is a reference book that introduces all human knowledge or a certain type of knowledge. They are often arranged in the form of dictionaries (with entries as the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 叶茂徐剑波汤帜张杰成洪甲
Owner NEW FOUNDER HLDG DEV LLC