Unlock instant, AI-driven research and patent intelligence for your innovation.

Text classification method and device

A text classification and text technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem that users are unwilling to actively select categories or provide labels, and achieve high classification accuracy and low redundancy Effect

Active Publication Date: 2013-09-25
HUAWEI TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, many forums have classification functions, but most of them rely on users to select categories or provide tags when posting posts. The problem with this method is that many users are unwilling to actively select categories or provide tags. The amount of reading, deliberately providing a lot of irrelevant tags
[0007] 2) May be satisfied "by chance"

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method and device
  • Text classification method and device
  • Text classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. It should be understood that the described embodiments are only some of the embodiments of the present invention, not all of them. example. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0022] Embodiments of the present invention provide a text classification method, such as figure 1 shown, including the following steps:

[0023] Step 10: segment the text to be classified into sentences, perform dependency syntactic analysis on each sentence, and extract all dependency pairs as the extracted decision-making unit;

[0024] The decision-making unit acquisition method described in the embodiment of the pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention relates to a text classification method in the field of communication. The method comprises the following steps of: phrasing a text to be classified, performing dependence syntax analysis on each sentence and extracting all dependence pairs serving as extracted decision units; searching the types of the extracted decision units from a knowledge base, wherein the knowledge base stores the decision units serving as classification bases and the types and weights of the decision units; accumulating the weight sums of the extracted decision units according to the types; and taking a type with a maximum weight sum as the type of the text to be classified. The embodiment of the invention also provides a corresponding text classification device. The text classification method and the text classification device provided by the embodiment of the invention have the advantages of high classification accuracy, small redundancy rate and capability of effectively resolving conflicts by using a syntactic distance.

Description

technical field [0001] The invention relates to the technical field of text mining, in particular to a text classification method and device. Background technique [0002] Online forums are one of the typical ways of participating in contemporary online life. With the increase in the number of posts, there is an increasing need for a mechanism to classify the published posts, which not only facilitates forum content management, but also greatly facilitates users to choose interesting topic posts. At present, many forums have classification functions, but most of them rely on users to select categories or provide tags when posting posts. The problem with this method is that many users are unwilling to actively select categories or provide tags. The amount of reading, intentionally provides a lot of irrelevant tags. [0003] Based on the above problems, it is necessary to perform text classification on posts in online forums. Text classification (Text Classification, Text Cat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 张翼陈儒王震高立琦刘桂平
Owner HUAWEI TECH CO LTD