Domain concept extraction method based on Deep Learning

A concept and domain technology, applied in the field of domain concept extraction based on Deep Learning, can solve the problems of poor domain concept extraction and weak learning ability, and achieve high accuracy and good recognition performance.

Inactive Publication Date: 2014-09-10
EAST CHINA NORMAL UNIV
View PDF4 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a deep learning-based domain concept extraction method for the traditional unsupervised method with weak learning ability and poor domain concept extraction effect, and transform the domain concept extraction problem into a binary classification problem. Richer statistical features, using Deep Learning's domain concept extraction algorithm, combining Deep Learning and domain concept extraction tasks, conducting unsupervised pre-training by building a deep belief network, and then performing supervised adjustments with traditional neural network models , compared with the KNN and SVM models, the finally trained deep network model achieved the highest F value on the test data set

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Domain concept extraction method based on Deep Learning
  • Domain concept extraction method based on Deep Learning
  • Domain concept extraction method based on Deep Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0048] The present invention will be further described below by taking materials in the military field as an example in conjunction with the accompanying drawings.

[0049] refer to figure 1 , first extract samples from the training corpus, extract features from the samples, select feature vectors, and obtain the training model—DN model, and the obtained DN model automatically classifies and recognizes the test data. For the classification results, the correct domain concept set can be finally obtained through manual review.

[0050] In this example, if figure 2 As shown, to realize the conversion of the training corpus to the sample space, the present invention selects the above several features to construct the eigenvectors, and Table 1 lists the eigenvalues ​​of the part of the training samples extracted in the military field materials of the present invention.

[0051] Table 1 Characteristics of some training samples in the military field

[0052]

[0053] Model tra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a domain concept extraction method based on Deep Learning. The method includes extracting samples in a training corpus, adopting word frequency, document frequency, inverse document frequency, word length, word frequency variance and domain consensus as feature vectors, training and acquiring a deep network model, which is capable of representing the complex mapping correspondence between the word-type filed concept multi-dimensional feature vectors and class labels, on the basis of the Deep Learning technology, and finally comparing the deep network model established on the basis of the Deep Learning technology, an optimized BP neural network model and mainstream KNN and SVM models in the testing step. According to the tests, the optimal test effect is acquired through the deep network model established on the basis of the Deep Learning technology.

Description

technical field [0001] The present invention relates to domain concepts, automatic extraction of domain concepts, artificial neural network, Deep Learning and deep belief network technical fields, specifically a feature extraction method based on Deep Learning that is suitable for the characteristics of word-type domain concepts. Background technique [0002] Domain concept is a manifestation of domain knowledge. People use domain concept to describe certain objects in the domain and to disseminate domain information. For example, "SMS" and "CRBT" are concepts in the field of mobile communications, while "data structure" and "computer network" are concepts in the field of computers. In a sense, the domain concept is the abstraction of things in the process of human cognition, a form of expression of domain knowledge in texts, and reflects the development and changes of the domain to a certain extent. Domain concepts are usually used more frequently in specific domains and l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/08
CPCG06F16/35G06N3/088
Inventor 吕钊张青
Owner EAST CHINA NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products