A classifier construction method and classifier based on semantic computing

A classification method and algorithm technology, applied in the field of information retrieval and database structure, can solve the problems of high labeling accuracy, high labor cost, poor classifier model, etc., to improve field operation performance, classification ability, and online speed fast effect

Active Publication Date: 2022-04-01
GLOBAL TONE COMM TECH
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) Manual labeling of data often requires heavy manual labor and requires high labeling accuracy, which often requires three people to label the same text, resulting in long labeling work cycles, high labor costs, and slow production speed;
[0005] (2) In the absence of corpus, the model trained by the classifier is poor, which often cannot meet the effectiveness requirements of the system application

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A classifier construction method and classifier based on semantic computing
  • A classifier construction method and classifier based on semantic computing
  • A classifier construction method and classifier based on semantic computing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0036] In order to quickly build a classifier and gradually improve the classification effect during use, the present invention proposes a progressive classifier construction technology; only the user is required to define some heuristic keywords for each classification, and the classification task is automatically completed. It greatly reduces the workload of manual participation and speeds up the construction of classifiers.

[0037] like figure 1 As shown, the semantic computing-based classifier construction method provided by the embodiment of the present invention includes the following steps:

[0038] S101: In the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of information retrieval and its database structure, and discloses a semantic calculation-based classifier construction method and a classifier, which use a neural network model to train word vectors on Wikipedia data to obtain distributed representations of words; through classification The vector representation of the classification is obtained by the label, and the vector representation of the text is obtained by using the weighted average method; the most likely classification of the text is obtained by calculating the semantic relationship between the classification vector and the text vector. The unsupervised learning stage of the present invention does not need to label data, only needs the user to define a small number of feature words to complete the creation of the classifier, the online speed is fast, and there is no need to wait for the long accumulation of label data; the unsupervised learning stage can make full use of the existing limited labels Data, guide and improve unsupervised classification ability by extracting effective feature words.

Description

technical field [0001] The invention belongs to the technical field of information retrieval and database structure thereof, and in particular relates to a semantic calculation-based classifier construction method and a classifier. Background technique [0002] At present, the existing technologies commonly used in the industry are as follows: With the continuous deepening of the globalization process and the rapid development of the Internet, text data is showing explosive growth, but the data sources and forms are diverse, which brings great challenges to the management and use of documents. Text classification technology uses machine learning methods to automatically classify and mark text sets according to a certain classification system or standard, so as to realize the classification, archiving and fast query and retrieval of massive data. At present, text classification technology is relatively mature and has been widely used in many fields. The most primitive metho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06V10/774G06F16/35
CPCG06F16/35G06F18/2155G06F18/2411
Inventor 宋俊平程国艮
Owner GLOBAL TONE COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products