Enterprise industry classification method and system based on domain ontology

A classification technology based on domain ontology, applied to enterprise industry classification, which addresses the short length of the texts to be classified, the difficulty of calculating text correlation, and the cumbersome work of manual classification.

Inactive Publication Date: 2021-01-05
ZHEJIANG UNIV OF TECH

AI Technical Summary

Problems solved by technology

Because short texts are brief and their features are sparse, it is difficult to calculate the correlation between them. Commonly used text classification methods, when applied to short texts, therefore usually fail to produce good classification results.
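The sparsity problem can be illustrated with a minimal sketch. The two business descriptions below are hypothetical and the whitespace tokenizer is an assumption; the point is only that two short texts about the same industry can share no words at all, so a plain bag-of-words similarity is zero.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity between the bag-of-words vectors of two texts."""
    vec_a = Counter(text_a.lower().split())
    vec_b = Counter(text_b.lower().split())
    # Dot product over the words that the two texts share.
    dot = sum(vec_a[w] * vec_b[w] for w in vec_a.keys() & vec_b.keys())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Two hypothetical descriptions of related businesses (illustrative only).
short_a = "wholesale of fresh fruit"
short_b = "apple and pear distribution"

# No word is shared, so the similarity is zero despite the related meaning.
print(cosine_similarity(short_a, short_b))
```

Expanding each text with related terms before comparison is one way to give such texts words in common.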
[0002] Therefore, in order to solve the cumbersom...



Examples


Embodiment Construction

[0062] To make the above objects, features, and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings and specific embodiments.

[0063] The present invention provides an enterprise industry classification method based on domain ontology. The main innovation of the method is that it makes comprehensive and effective use of a feature extension method built on the domain ontology and, working from the description of an enterprise's main business, performs short text classification with a BM25 classification model, thereby achieving enterprise industry classification.
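The combination of feature extension and BM25 scoring described above might look roughly like the following sketch. It is not the patent's implementation: the extension set, the category feature words, and the helper names (`extend_features`, `bm25_scores`, `classify`) are hypothetical, and the parameters `k1=1.5` and `b=0.75` are common defaults assumed here because the text does not state them.

```python
import math
from collections import Counter

# Hypothetical feature extension set derived from a domain ontology:
# each keyword maps to related terms (illustrative data only).
EXTENSION_SET = {
    "apple": ["fruit"],
    "pear": ["fruit"],
    "server": ["computer", "hardware"],
}

# Hypothetical category feature corpus: category name -> feature words.
CATEGORY_FEATURES = {
    "fruit wholesale": ["fruit", "wholesale", "fresh", "vegetable"],
    "computer manufacturing": ["computer", "hardware", "manufacturing", "assembly"],
}


def extend_features(keywords):
    """Append ontology-related terms to the keywords of a short text."""
    extended = list(keywords)
    for word in keywords:
        extended.extend(EXTENSION_SET.get(word, []))
    return extended


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each category document against the query with Okapi BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs.values()) / n_docs
    # Document frequency of each term across the category documents.
    df = Counter(term for d in docs.values() for term in set(d))
    scores = {}
    for name, doc in docs.items():
        tf = Counter(doc)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            denom = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores[name] = score
    return scores


def classify(keywords):
    """Return the category with the highest BM25 score for the extended text."""
    scores = bm25_scores(extend_features(keywords), CATEGORY_FEATURES)
    return max(scores, key=scores.get)


print(classify(["apple", "pear", "distribution"]))
```

In this toy data the raw keywords match no category feature word, so only the ontology-derived term "fruit" contributes to the score; that is the effect the feature extension is meant to have.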

[0064] To achieve the above object, the present invention provides the following technical solution, as shown in Figure 1:

[0065] Step 1: Construct a category feature vocabulary through domain ontology, which is used to expand ...



Abstract

The invention provides an enterprise industry classification method based on a domain ontology. The method comprises the steps of: (1) constructing a category feature word bank through the domain ontology, and forming a feature extension set used for extending the feature words of a short text; (2) extracting category feature words from the annotation information of the National Economy Industry Classification Annotation (2017 edition) by using a TF-IDF feature extraction method, the category feature words representing the basic features of each industry category, and forming a feature corpus in vector form; (3) extracting the main business keywords of an enterprise, removing useless words, performing feature extension on the keywords by using the feature extension set of the domain ontology, and thereby strengthening their feature information to obtain extended short texts to be classified; and (4) classifying the feature-extended short texts by using a BM25 classification model, and determining the industry category to which each text belongs according to text similarity. The invention further comprises a system for implementing the enterprise industry classification method based on the domain ontology. The invention solves the problem that manual classification is time-consuming, laborious, and tedious.
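Step (2), extracting category feature words with TF-IDF, can be sketched as follows. The annotation texts, the stop-word list, and the function names are hypothetical stand-ins, and the smoothed IDF used here is one common variant chosen as an assumption; the abstract does not specify the exact weighting.

```python
import math
from collections import Counter

# Hypothetical stand-ins for category annotation texts (illustrative only).
ANNOTATIONS = {
    "fruit wholesale": "wholesale of fresh fruit and fruit products",
    "computer manufacturing": "manufacturing of computer hardware and computer parts",
}

STOP_WORDS = {"of", "and", "the"}  # assumed list of useless words


def tokenize(text):
    """Lowercase, split on whitespace, and drop stop words."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]


def top_feature_words(annotations, top_k=3):
    """Return the top_k TF-IDF-weighted words for each category."""
    docs = {name: tokenize(text) for name, text in annotations.items()}
    n_docs = len(docs)
    # Number of category documents in which each term appears.
    df = Counter(term for tokens in docs.values() for term in set(tokens))
    features = {}
    for name, tokens in docs.items():
        tf = Counter(tokens)
        weights = {
            # Relative term frequency times a smoothed inverse document frequency.
            term: (count / len(tokens)) * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in tf.items()
        }
        # Sort by descending weight, breaking ties alphabetically.
        ranked = sorted(weights, key=lambda t: (-weights[t], t))
        features[name] = ranked[:top_k]
    return features


features = top_feature_words(ANNOTATIONS)
for category, words in features.items():
    print(category, words)
```

The resulting word lists play the role of the feature corpus that the extended short texts are later scored against.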

Description

Background technique

[0001] Economic industry classification refers to the standardized classification of all economic activities in society according to the national economic industry classification standard, following defined principles and classification methods. As China's economic development has entered a new normal, the economic structure has been continuously optimized, emerging industries such as new energy, new materials, and new medicines have developed rapidly, new technologies such as artificial intelligence, the Internet of Things, and robotics have emerged in force, and industry classification has become more complex. At the same time, since most statistical staff have not taken part in professional industry classification work, are not familiar with industry standards, and lack practical experience, it is difficult for them to accurately determine the industry category of an enterprise, which in effect increases the difficulty of industry classificat...


Application Information

IPC(8): G06F16/35, G06F16/335, G06F40/194, G06F40/216, G06F40/289, G06F40/30
CPC: G06F16/35, G06F40/194, G06F40/289, G06F40/216, G06F16/335, G06F40/30
Inventor: 郑晓辉, 季白杨
Owner ZHEJIANG UNIV OF TECH