
Method and device for automatically constructing field word list based on classification system

A classification-system and domain-vocabulary technology, applied in text database clustering/classification, natural language data processing, unstructured text data retrieval, etc., to achieve high efficiency.

Active Publication Date: 2021-08-13
BEIJING LANGUAGE AND CULTURE UNIVERSITY

AI Technical Summary

Problems solved by technology

However, both existing methods are inefficient and computationally expensive, and their accuracy cannot be guaranteed.



Examples


Embodiment Construction

[0049] To make the technical problems to be solved, the technical solutions, and the advantages of the present invention clearer, a detailed description follows with reference to the drawings and specific embodiments.

[0050] An embodiment of the present invention provides a method for automatically constructing a domain vocabulary based on a classification system. The method can be implemented by an electronic device, which can be a terminal or a server. As shown in the flow chart of Figure 1, the processing flow of the method may include the following steps:

[0051] S101. Obtain data, where the data includes domain node data in a hierarchical domain tree structure and multiple articles corresponding to each domain node;

[0052] S102. Determine the parent node and each child node of the domain node, and obtain multiple associated words corresponding to the domain node...
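Steps S101 and S102 can be pictured with a small data structure. The following is a minimal sketch only, assuming that each domain node stores its own articles together with links to its parent and children; the class name `DomainNode`, the helper `relatives`, and the toy hierarchy are hypothetical and do not come from the patent text.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class DomainNode:
    """One node of the domain hierarchy plus the articles filed under it (S101)."""
    name: str
    articles: list[str] = field(default_factory=list)
    # The parent link is excluded from repr/compare to avoid walking the tree in cycles.
    parent: Optional[DomainNode] = field(default=None, repr=False, compare=False)
    children: list[DomainNode] = field(default_factory=list)

    def add_child(self, child: DomainNode) -> DomainNode:
        """Attach `child` under this node and return it."""
        child.parent = self
        self.children.append(child)
        return child


def relatives(node: DomainNode) -> tuple[Optional[DomainNode], list[DomainNode]]:
    """Return the parent node and the child nodes of a domain node (S102)."""
    return node.parent, list(node.children)


# Hypothetical toy hierarchy; names and article texts are illustrative only.
root = DomainNode("science", ["science studies nature"])
physics = root.add_child(DomainNode("physics", ["classical mechanics basics"]))
optics = physics.add_child(DomainNode("optics", ["lenses bend light"]))

parent, children = relatives(physics)
print(parent.name, [c.name for c in children])
```

How the associated words themselves are obtained for a node is not stated in this excerpt, so the sketch stops at locating the parent and child nodes.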


Abstract

The invention relates to the technical field of automatic construction of domain vocabularies, and in particular to a method and device for automatically constructing a domain vocabulary based on a classification system. The method comprises: obtaining data; determining the parent node and each child node of a domain node, and obtaining a plurality of associated words corresponding to the domain node; determining the average per-thousand-word occurrence probability of each associated word in the domain node and in the child-node domains; determining the reciprocal of the average probability of the associated word appearing in each article of the domain node and the reciprocal of the average probability of the associated word appearing in each article of the parent-node domain; determining a first intermediate score and a second intermediate score for each associated word; determining a total score for the associated word from the first and second intermediate scores; and obtaining a preset score threshold and determining the professional vocabulary from the associated words whose total scores exceed the threshold. The method and device allow a professional vocabulary to be constructed simply, efficiently and automatically.
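The statistics named in the abstract can be sketched in a few lines. This is an illustration under stated assumptions, not the patented formulas: the per-article "probability" is read here as the fraction of articles containing the word, a small floor avoids division by zero, and the abstract does not say how the two intermediate scores are formed or combined, so `toy_total` is a placeholder. All function names and the toy corpus are hypothetical.

```python
from typing import Callable

Articles = list[list[str]]  # each article is a list of tokens


def per_thousand_rate(word: str, articles: Articles) -> float:
    """Occurrences of `word` per 1,000 tokens over all given articles."""
    total = sum(len(a) for a in articles)
    if total == 0:
        return 0.0
    return 1000.0 * sum(a.count(word) for a in articles) / total


def inverse_article_prob(word: str, articles: Articles, floor: float = 1e-6) -> float:
    """Reciprocal of the share of articles containing `word` (assumed reading)."""
    if not articles:
        return 1.0 / floor
    share = sum(1 for a in articles if word in a) / len(articles)
    return 1.0 / max(share, floor)


def build_vocabulary(candidates: list[str], node: Articles, children: Articles,
                     parent: Articles, total_score: Callable[[dict], float],
                     threshold: float) -> list[str]:
    """Keep the candidates whose total score exceeds the preset threshold."""
    kept = []
    for word in candidates:
        stats = {
            "node_rate": per_thousand_rate(word, node),
            "child_rate": per_thousand_rate(word, children),
            "node_inv": inverse_article_prob(word, node),
            "parent_inv": inverse_article_prob(word, parent),
        }
        if total_score(stats) > threshold:
            kept.append(word)
    return kept


def toy_total(stats: dict) -> float:
    """Placeholder combination (NOT from the patent): sum of two products."""
    first = stats["node_rate"] * stats["parent_inv"]   # assumed first score
    second = stats["child_rate"] * stats["node_inv"]   # assumed second score
    return first + second


# Hypothetical toy corpus, already tokenized.
node_articles = [["laser", "lens", "light"], ["laser", "beam", "the"]]
child_articles = [["laser", "laser", "optics", "the"]]
parent_articles = [["the", "study", "of", "matter"], ["the", "laser", "is", "new"]]

vocab = build_vocabulary(["laser", "the"], node_articles, child_articles,
                         parent_articles, toy_total, threshold=1000.0)
print(vocab)
```

Passing the scoring rule in as a callable keeps the four statistics separate from the combination step, which is the part this excerpt leaves unspecified.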

Description

Technical field

[0001] The present invention relates to the technical field of automatic construction of domain vocabularies, and in particular to a method and device for automatically constructing a domain vocabulary based on a classification system.

Background

[0002] Domain vocabularies are an important resource for professional and vocational education and are also widely used in industry, for example in solving classification problems. There are currently two common ways to obtain a domain vocabulary: generating one through word frequency statistics combined with expert identification, or constructing a classification model through machine learning and using the model to classify a large amount of data. However, both methods are inefficient and computationally expensive, and their accuracy cannot be guaranteed.

Summary of the invention

[0003] The embodiment of the present invention provides a method and...


Application Information

IPC(8): G06F16/33, G06F16/35, G06F40/216, G06F40/284
CPC: G06F16/3346, G06F16/35, G06F40/216, G06F40/284
Inventor: 殷晓君
Owner: BEIJING LANGUAGE AND CULTURE UNIVERSITY