Supercharge Your Innovation With Domain-Expert AI Agents!

Text classification method and text classification system

A text classification and text technology, applied in the computer field, can solve the problems of high computational complexity, rough recommendation method, inaccurate classification, etc., and achieve the effect of simplifying calculation steps, saving calculation amount, and classifying accurately

Pending Publication Date: 2020-11-17
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the development of "big data" technology, more and more fields are beginning to use computers for text matching and classification, and with the rapid growth of text data sets, there are more and more algorithms for text classification, and the computational complexity is also increasing. Higher and higher
[0003] Commonly used text classification methods are based on association rules or keyword-based classification. These methods are based on keywords contained in the target text or according to the association relationship. When a target text contains a certain keyword , it can be better classified. If the target text does not contain pre-selected keywords, the text cannot be classified. Such a classification method causes many target texts to be unable to be matched to the appropriate classification, and the keyword-based The recommendation method is relatively rough, the semantic information is not well considered, and the classification results are not accurate
[0004] Therefore, the inventor believes that the above-mentioned text classification method has great limitations. Classifying the target text based on keywords has the problem of time-consuming and labor-intensive searching and inaccurate classification.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method and text classification system
  • Text classification method and text classification system
  • Text classification method and text classification system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] The present invention is described below based on examples, but the present invention is not limited to these examples. In the following detailed description of the invention, some specific details are set forth in detail. The present invention can be fully understood by those skilled in the art without the description of these detailed parts. In order to avoid obscuring the essence of the present invention, well-known methods, procedures, and flow charts are not described in detail. Additionally, the drawings are not necessarily drawn to scale.

[0065] figure 1 A flow chart of the text classification method in the embodiment of the present invention is shown, and the specific steps include S101-S103.

[0066] In step S101, the target text is obtained and encoded into a first text vector by using a convolutional network.

[0067] In step S102, the source texts in each text library are divided into multiple categories using a clustering model, and each category is r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a text classification method and a text classification system, and the text classification method comprises the steps: obtaining a target text, and employing aconvolution network to encode the target text into a first text vector; dividing a source text in each text library into a plurality of categories by adopting a clustering model, wherein each category is represented by a second text vector corresponding to the clustering center of the category; calculating the distances between the first text vector and the second text vectors respectively, withthe text library where the clustering centers corresponding to the nearest K second text vectors are located serves as a candidate text library of the target text through a sorting model, wherein K isa positive integer. A target text is encoded into the first text vector, distance calculation is carried out on the first text vector and the second text vectors corresponding to the multiple clustering centers in each text library, the K text libraries closest to one another serve as candidate text libraries of the target vector, the target text can be classified rapidly and efficiently, and thetext classification accuracy is improved.

Description

technical field [0001] The invention relates to the technical field of computers, in particular to a text classification method and a text classification system. Background technique [0002] With the development of "big data" technology, more and more fields are beginning to use computers for text matching and classification, and with the rapid growth of text data sets, there are more and more algorithms for text classification, and the computational complexity is also increasing. Higher and higher. [0003] Commonly used text classification methods are based on association rules or keyword-based classification. These methods are based on keywords contained in the target text or according to the association relationship. When a target text contains a certain keyword , it can be better classified. If the target text does not contain pre-selected keywords, the text cannot be classified. Such a classification method causes many target texts to be unable to be matched to the a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F16/33G06N3/04
CPCG06F16/35G06F16/3347G06N3/044G06N3/045
Inventor 孙金辉陈生泰
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More