User quality inspection requirements classification method and system based on rule matching and deep learning

A technology of deep learning and classification methods, applied in the field of inspection and detection services and machine learning, it can solve the problems of large impact of classification results, poor classification effect of deep learning models, and sparse text features.

Active Publication Date: 2021-10-08
HUNAN UNIV
View PDF19 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, existing user quality inspection requirements classification methods include: (1) The current industry classification of quality inspection requirements texts mainly relies on manual classification by users. Users need to select specific labels from many industry categories, and the classification efficiency is low and limited. Due to the user's knowledge reserve, misclassification often occurs, which affects the quality of inspection and testing services
(2) Text classification for quality inspection requirements based on machine learning methods, such as support vector machine, K nearest neighbor, naive Bayesian and other algorithms are suitable for small-scale data sets, and there are shortcomings such as cumbersome feature extraction for text and weak model generalization performance ;(3) The text classification method based on the convolutional neural network predicts the text category by setting the parameters of the input layer, convolution layer, pooling layer and output layer. The training speed is fast, but the word order and semantic information of the text cannot be obtained. The effect is average; (4) The text classification method based on the long-term short-term memory network considers the context information of the text, and the classification performance is better, but the classification effect in small-scale data sets is average; (5) fasttext: it is a fast text Classification algorithm, but it does not fully consider the characteristics of short text and many professional vocabulary required by quality inspection. It is necessary to consider the characteristics of the text to design a suitable classification model
[0004] Compared with other Chinese texts such as news and user comments, the quality inspection demand texts published by users have the following characteristics: (1) There are many professional words, such as "protein / radio frequency antenna / xenon lamp / ballast, etc.", the frequency of occurrence of these words It is low, but it has a great impact on the classification results. The commonly used text classification methods do not fully consider these text features, and the classification effect is average.
(2) There are more than 20 classified industries, and the amount of data in each industry is unbalanced
Common deep learning models have been fully trained in large-scale data sets. The amount of data required for quality inspection in various industries is unbalanced (thousands at most and hundreds at least). The classification effect of using deep learning models is relatively low. Difference
(3) The text features are sparse, and the quality inspection requirements are published by different users. The description is relatively colloquial, and the text features are sparse. The text length of the quality inspection requirements is generally between 15 and 150 characters. It consists of fields of "test content" and "test location requirements". Different fields have different classification capabilities, which are not considered in general text classification methods.
[0005] To sum up, based on the classification of quality inspection needs based on user experience, multiple industry categories are preset, such as more than 20 industry categories such as food, agricultural products, environment, medicine, electronic appliances, and biology. Users can classify according to their own knowledge reserves , it is easy to cause misclassification for products with vague industry attributes
Text classification methods based on machine learning, this type of method has a strong dependence on the quality of text feature extraction, and the extraction process is cumbersome, and the generalization ability of the model is weak

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • User quality inspection requirements classification method and system based on rule matching and deep learning
  • User quality inspection requirements classification method and system based on rule matching and deep learning
  • User quality inspection requirements classification method and system based on rule matching and deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] like figure 1 As shown, the user quality inspection requirements classification method based on rule matches and depth learning includes:

[0039] 1) The classification text is matched with the proprietary noun dictionary of each industry, and the industry probability P between the industry labels corresponding to each industry is calculated. i , Industry probability P for the largest value i Reconsible as the largest industry probability value P;

[0040] 2) If the maximum industry probability value P is greater than or equal to the preset threshold G, the industry tag corresponding to the largest industry probability value P is output as a classification result; otherwise, the jump is executed next;

[0041] 3) Treat the classification text to predict the industry label and calculate the industry label and calculate the predicted industry probability value h; for the largest industry probability value P, the predicted industry probability value H, select the industry labe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a user quality inspection demand classification method and system based on rule matching and deep learning. The method of the invention includes matching the text to be classified with the proper noun dictionary of each industry, and calculating the difference between the industry labels corresponding to each industry. If the industry probability value Pi matched between them is selected, the maximum industry probability value P is selected; if the maximum industry probability value P is greater than or equal to the preset threshold G, the industry label corresponding to the maximum industry probability value P is output as the classification result; otherwise, the text to be classified is used The deep learning model predicts the industry label and calculates the predicted industry probability value H; for both the maximum industry probability value P and the predicted industry probability value H, the industry label corresponding to the higher probability is selected as the final classification result output. The invention can realize the automatic and accurate classification of the user's quality inspection demand text according to the industry to which it belongs.

Description

Technical field [0001] The present invention relates to the field of inspection and detection services and the field of machine learning, which relates to a user quality inspection requirements classification method and system based on rule matching and depth learning. Background technique [0002] The domestic third-party inspection and testing website has various industry inspection and testing agencies, providing product quality inspection and testing services, and users can contact the organization by actively contacting or issuing quality inspection requirements. In the face of a large number of professional testing services, users tend to release product detection requirements in the website to let organizations that meet the inspection conditions. In order to realize the exact match of user needs and detection services, it is a key step by classifying quality inspection requirements. The classification of current quality inspection requirements is mainly manually classifie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33G06F16/35G06F40/216G06F40/242G06F40/289G06F40/30G06N3/04G06K9/62G06N3/08
CPCG06F16/3344G06F16/3346G06F16/35G06F40/216G06F40/242G06F40/289G06F40/30G06N3/08G06N3/047G06N3/048G06N3/044G06F18/2415G06F18/241
Inventor 滕召胜龚冬成杨智君唐求刘涛吴娟林海军朱坤志何民军杨唐胜欧阳博成达
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products