Related knowledge point acquisition method and system

A technology of knowledge points and domain knowledge, which is applied in the field of electronic digital data processing, can solve problems such as poor objectivity, heavy workload, and artificial screening, so as to reduce workload, improve efficiency and accuracy, and save time and labor costs. Effect

Inactive Publication Date: 2016-05-25
PEKING UNIV FOUNDER GRP CO LTD +2
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] For this reason, the technical problem to be solved by the present invention lies in the problems of manual screening, heavy workload, and poor o

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Related knowledge point acquisition method and system
  • Related knowledge point acquisition method and system
  • Related knowledge point acquisition method and system

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0032] Example 1:

[0033] In this embodiment, a method for acquiring related knowledge points is provided, by which related knowledge points of all the knowledge points in the field are obtained, and then based on the obtained relevant knowledge points, for the entries in the established domain encyclopedia It has very good guiding value to conduct further improvement by checking for leaks. Knowledge points refer to the basic unit of information transmission. Research on the representation and association of knowledge points plays an important role in improving learning navigation, information recommendation, retrieval, and building thesaurus.

[0034] The method of obtaining the relevant knowledge points, the flowchart is as follows figure 1 As shown, the specific process is as follows:

[0035] First, obtain domain knowledge points and obtain all knowledge points in the field. For example, when building an encyclopedia, you can obtain all the entries in the field that have been ...

Example Embodiment

[0047] Example 2:

[0048] This embodiment provides a method for acquiring related knowledge points. The steps are the same as those in Embodiment 1. In this embodiment, a specific method for calculating the semantic vector of each candidate knowledge point in the above process is provided. The specific process is as follows :

[0049] The first step is to determine the number of occurrences of each candidate knowledge point in the candidate file, so that the text of each candidate knowledge point and its occurrence number is obtained. The candidate text is the text obtained after word segmentation from the selected digital resource, and the candidate knowledge points are the words obtained by subtracting common words from the words obtained after word segmentation in the candidate text. This part is the same as in Embodiment 1, and will not be repeated here.

[0050] The second step is to calculate the binary tree with the smallest weighted path length according to each candidate ...

Example Embodiment

[0065] Example 3:

[0066] The domain encyclopedia is an important digital publishing resource. Domain encyclopedias usually organize domain information in terms of entries. The domain encyclopedia needs to contain important entries in the domain. However, building an encyclopedia in the field requires a lot of manpower investment. In this embodiment, a method for obtaining related knowledge points is provided. Domain knowledge points are also entries in the domain encyclopedia. In this embodiment, the domain e-book text and newspaper text are used to calculate the semantic vector of the candidate term through the skip-gram model. The semantic similarity between the constructed domain entry and the obtained candidate entry is calculated through the semantic vector. Using the semantic similarity of the entries, discover the semantically related and missed entries in other fields to reduce the possibility of missing entries in certain fields. Specific steps are as follows.

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a related knowledge point acquisition method. The method comprises: firstly, acquiring domain knowledge points; then carrying out word segmentation on a text in a domain according to the domain knowledge points; obtaining candidate knowledge points after removing common words; obtaining semantic vectors of the candidate knowledge points; and obtaining candidate knowledge points, related to each domain knowledge point, as target knowledge points by calculating similarity between the domain knowledge points and the candidate knowledge points. Thus, a plurality of target knowledge points related to each domain knowledge point can be obtained. When constructing an encyclopedia directory entry, it may be determined, through searching, whether each domain knowledge point has a related knowledge point, and if not, a related knowledge point needs to be added. In this way, checking and construction of encyclopedia entries are completed, so that a manual workload is significantly reduced; time costs and labor costs are reduced; inaccuracy caused by subjectivity and non-uniform standards of manual checking is avoided; and efficiency and accuracy are greatly improved.

Description

technical field [0001] The invention relates to the field of electrical digital data processing, in particular to a method and system for acquiring relevant knowledge points. Background technique [0002] Digital publishing resources have become one of the main ways of information provision. People have shifted from paper reading to electronic reading in large numbers. Digital publishing resources include e-books, digital encyclopedias, digital periodicals, digital newspapers, etc. The information provided by digital publishing resources is usually more authoritative and accurate than that of the Internet. Therefore, how to improve people's learning or reading experience according to the characteristics of digital publishing resources has become particularly important. [0003] Encyclopedia (Encyclopedia) is a reference book that introduces all human knowledge or a certain type of knowledge. They are often arranged in the form of dictionaries (with entries as the basic u...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
Inventor 叶茂徐剑波汤帜杨亮卢菁
Owner PEKING UNIV FOUNDER GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products