Classifier construction method based on semantic computation, and classifier

A classification method and semantic computing technology, applied in the field of information retrieval and its database structure, can solve the problems of high requirements on labeling accuracy, high labor cost, poor classifier model, etc. fast effect

Active Publication Date: 2018-08-10
GLOBAL TONE COMM TECH
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) Manual labeling of data often requires heavy manual labor and requires high labeling accuracy, which often requires three people to label the same text, resulting in long labeling work cycles, h

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classifier construction method based on semantic computation, and classifier
  • Classifier construction method based on semantic computation, and classifier
  • Classifier construction method based on semantic computation, and classifier

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0036] In order to quickly build a classifier and gradually improve the classification effect during use, the present invention proposes a progressive classifier construction technology; only the user is required to define some heuristic keywords for each classification, and the classification task is automatically completed. It greatly reduces the workload of manual participation and speeds up the construction of classifiers.

[0037] Such as figure 1 As shown, the semantic computing-based classifier construction method provided by the embodiment of the present invention includes the following steps:

[0038] S101: In t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of information retrieval and database structure, discloses a classifier construction method based on semantic computation, and a classifier. A neural network model is used to train word vectors on Wikipedia data to obtain distributed representation of words; vector representation of a classification is obtained through a classification label, and a weighted average method is used to obtain vector representation of a text; and by calculating the semantic relationship between a classification vector and a text vector, the most likely classification towhich the text belongs is obtained. The unsupervised learning phase does not need to data labeling, and only needs a user to define a small number of feature words to complete the creation of the classifier, the online speed is fast, and there is no need to wait for long label data accumulation; the unsupervised learning stage can make full use of the existing limited label data, and guide and improve unsupervised classification by extracting valid feature words.

Description

technical field [0001] The invention belongs to the technical field of information retrieval and database structure thereof, and in particular relates to a semantic calculation-based classifier construction method and a classifier. Background technique [0002] At present, the existing technologies commonly used in the industry are as follows: With the continuous deepening of the globalization process and the rapid development of the Internet, text data is showing explosive growth, but the data sources and forms are diverse, which brings great challenges to the management and use of documents. Text classification technology uses machine learning methods to automatically classify and mark text sets according to a certain classification system or standard, so as to realize the classification, archiving and fast query and retrieval of massive data. At present, text classification technology is relatively mature and has been widely used in many fields. The most primitive metho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06F17/30
CPCG06F16/35G06F18/2155G06F18/2411
Inventor 宋俊平程国艮
Owner GLOBAL TONE COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products