ANN-based feature selection method in database text classification

A feature selection method and text classification technology, which are applied in text database clustering/classification, text database query, neural learning method, etc., can solve the problems of troublesome feature word extraction and cumbersome classification method, so as to improve the classification effect and classification result. accurate effect

Inactive Publication Date: 2019-11-08
厦门美域中央信息科技有限公司
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the database field existing in the background technology, it is essential to classify the database text, the classification method in the prior art is relatively cumbersome, and the technical problem of extracting the characteristic words in the text is relatively troublesome, the present invention proposes a method based on The feature selection method in the database text classification of ANN, obtains a plurality of classification models by ANN training in the present invention, and extracts the feature keyword of classification model and the feature item of text to be classified, selects the most suitable classification model by comparison, Classification results are more accurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • ANN-based feature selection method in database text classification
  • ANN-based feature selection method in database text classification
  • ANN-based feature selection method in database text classification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in combination with specific embodiments and with reference to the accompanying drawings. It should be understood that these descriptions are exemplary only, and are not intended to limit the scope of the present invention. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present invention.

[0032] Such as Figure 1-3 Shown, the feature selection method in a kind of ANN-based database text classification that the present invention proposes comprises the following specific steps:

[0033] S1. Using the text sample set and its text category, obtain a text classification model through ANN artificial neural network training;

[0034] S2. Establish a collection of text classification models, and set characterist...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An ANN-based feature selection method in database text classification is characterized by comprising the following specific steps of using a text sample set and text categories of the text sample set,and obtaining a text classification model through ANN artificial neural network training; establishing a set of text classification models, and setting feature keywords for comparison for different text classification models; obtaining to-be-classified texts, preprocessing the to-be-classified texts, and obtaining a feature item set of the to-be-classified texts; determining the entity attributeof each feature item in the feature item set and the occurrence frequency of each feature item in the to-be-classified text; setting a weight according to the occurrence frequency of the feature item;sorting the feature items according to the degree of association; calculating the similarity; inputting the text to be classified into the text classification model with the highest similarity. According to the method, a plurality of classification models are obtained through ANN training, feature keywords of the classification models and feature items of texts to be classified are extracted, andthe most suitable classification model is selected through comparison.

Description

technical field [0001] The invention relates to the technical field of computer text classification, in particular to a feature selection method in ANN-based database text classification. Background technique [0002] ANN refers to a complex network structure formed by the interconnection of a large number of processing units (neurons), which is a certain abstraction, simplification and simulation of the organizational structure and operating mechanism of the human brain. Artificial Neural Network (ANN for short), which simulates neuron activity with a mathematical model, is an information processing system based on imitating the structure and function of the brain's neural network; artificial neural networks can be divided into multi-layer and single-layer, each One layer contains several neurons, and each neuron is connected by a directed arc with variable weight. The network can process information, simulate The purpose of the relationship between input and output. It d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/35G06F17/27G06N3/08
CPCG06F16/353G06F16/3331G06N3/08
Inventor 肖清林
Owner 厦门美域中央信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products