Supercharge Your Innovation With Domain-Expert AI Agents!

Method and device for automatic construction of domain vocabulary based on classification system

A classification system and technology in the field, applied in text database clustering/classification, natural language data processing, unstructured text data retrieval, etc., can solve problems such as large amount of calculation, low efficiency, and unguaranteed accuracy, and achieve accurate sex high effect

Active Publication Date: 2021-10-01
BEIJING LANGUAGE AND CULTURE UNIVERSITY
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But both of these two methods have the disadvantages of low efficiency, and the amount of calculation is large, and the accuracy cannot be guaranteed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for automatic construction of domain vocabulary based on classification system
  • Method and device for automatic construction of domain vocabulary based on classification system
  • Method and device for automatic construction of domain vocabulary based on classification system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to make the technical problems, technical solutions and advantages to be solved by the present invention clearer, the following will describe in detail with reference to the drawings and specific embodiments.

[0050] An embodiment of the present invention provides a method for automatically constructing a domain vocabulary based on a classification system, and the method can be implemented by an electronic device, which can be a terminal or a server. like figure 1 The shown flow chart of the method for automatically constructing the domain vocabulary based on the classification system, the processing flow of the method may include the following steps:

[0051]S101. Obtaining data, the data includes domain node data in domain hierarchical tree structure and multiple articles corresponding to each domain node;

[0052] S102. Determine the parent node and each child node of the domain node, and obtain multiple associated words corresponding to the domain node; ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to the technical field of automatic construction of domain vocabulary, in particular to a method and device for automatic construction of domain vocabulary based on a classification system. The method includes: obtaining data; determining the parent node and each child node of a domain node, and obtaining A plurality of related words corresponding to the node; determine the probability that the related words appear in the average thousand words in the domain node and the probability that the related words appear in the average thousand words in the sub-node domain; determine the average probability that the related words appear in each article in the domain node The reciprocal and the reciprocal of the average probability of occurrence in each article in the parent node field; determine the first intermediate score and the second intermediate score of each associated word; determine the total score of the associated word according to the first intermediate score and the second intermediate score; Obtain the preset score threshold, and determine the professional vocabulary according to the associated words whose total scores of all associated words are greater than the preset score threshold. By adopting the invention, the professional vocabulary can be automatically constructed simply and efficiently.

Description

technical field [0001] The present invention relates to the technical field of automatic construction of domain vocabulary, in particular to a method and device for automatic construction of domain vocabulary based on a classification system. Background technique [0002] In recent years, domain vocabulary is an important resource for professional and vocational education, and it is also widely used in industries, such as solving classification problems. There are usually two ways to obtain the vocabulary in the current field: generating a vocabulary through word frequency statistics and expert identification, or constructing a classification model through machine learning, and using the model to classify a large amount of data. However, both of these two methods have the disadvantages of low efficiency, and a large amount of calculation, and the accuracy cannot be guaranteed. Contents of the invention [0003] The embodiment of the present invention provides a method and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33G06F16/35G06F40/216G06F40/284
CPCG06F16/3346G06F16/35G06F40/216G06F40/284
Inventor 殷晓君
Owner BEIJING LANGUAGE AND CULTURE UNIVERSITY
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More