Text classification method and device

A text classification method and device, applied in the computer field, that address the problems of ignored content similarity, heavy dependence on rules and templates, and imbalanced multi-class data, so as to reduce system resource consumption and improve the accuracy and efficiency of text classification.

Active Publication Date: 2021-08-24
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0003] The template-based text classification method relies heavily on rules and templates and consumes more system resources during classification; the constructed rules generalize poorly and lack versatility, resulting in low text classification accuracy.
The information-retrieval-based text topic classification method classifies by training a classification model; however, text classification is a multi-class problem, and the imbalance across classes lowers the model's prediction accuracy.
The text-similarity-based classification method uses the result of sentence similarity calculated directly on the text as the basis for classification. However, this method ignores the similarity between the contents of the text itself and does not distinguish the key information of the text, resulting in lower text classification accuracy.

Embodiment Construction

[0036] With continued research and progress, artificial intelligence (AI) has been studied and applied in many fields. AI is a comprehensive discipline that covers a wide range of areas, including both hardware-level and software-level technology. Basic AI technologies generally include sensors, dedicated AI chips, cloud computing, distributed storage, big data processing, operation/interaction systems, and mechatronics. AI software technology mainly includes several major directions such as computer vision, speech processing, natural language processing, machine learning, and autonomous driving.

[0037] Specifically, the embodiments of the present application relate to natural language processing (NLP) technology and machine learning technology in AI. Among them, NLP is an imp...

Abstract

The present application provides a text classification method and device, which relate to natural language processing technology and deep learning technology in the field of artificial intelligence. The method includes: acquiring text to be classified and a preset text library; performing word segmentation and part-of-speech tagging on the text to be classified to obtain multiple word segmentation results and a part-of-speech tagging result corresponding to each word segmentation result; matching each word segmentation result against the preset text library, and determining a target word segmentation result and first weight information corresponding to the target word segmentation result based on the matching result and the part-of-speech tagging result; determining, based on the first weight information, second weight information corresponding to the word segmentation results other than the target word segmentation result; obtaining text feature information of the text to be classified according to the first weight information and the second weight information; and performing correlation identification processing on the text feature information with a text classification model to obtain the text classification result of the text to be classified. The present application can improve the accuracy and efficiency of text classification.
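The steps listed in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the toy lexicon, the preset library contents, the part-of-speech weights, the rule that derives second weights from first weights, and the cosine-similarity stand-in for the text classification model are all hypothetical choices made for the example, since the excerpt does not specify them.

```python
import math
from collections import defaultdict

# Hypothetical toy resources; none of these values come from the patent.
POS_LEXICON = {"budget": "n", "approve": "v", "meeting": "n",
               "notice": "n", "annual": "a"}
PRESET_LIBRARY = {"budget", "meeting", "notice", "approve"}
POS_WEIGHT = {"n": 1.0, "v": 0.8}   # assumed first weights by part of speech
DEFAULT_WEIGHT = 0.5                # assumed weight for other parts of speech
SECOND_WEIGHT_RATIO = 0.25          # assumed scaling for non-target words


def segment_and_tag(text):
    """Whitespace segmentation plus lexicon lookup (stand-in for a real tagger)."""
    return [(w, POS_LEXICON.get(w, "x")) for w in text.lower().split()]


def first_weights(tagged):
    """Words found in the preset library become targets, weighted by POS."""
    return {w: POS_WEIGHT.get(pos, DEFAULT_WEIGHT)
            for w, pos in tagged if w in PRESET_LIBRARY}


def second_weights(tagged, first):
    """Assumed rule: other words get a fraction of the mean first weight."""
    base = sum(first.values()) / len(first) if first else DEFAULT_WEIGHT
    return {w: SECOND_WEIGHT_RATIO * base for w, _ in tagged if w not in first}


def text_features(text):
    """Combine first and second weights into a weighted bag-of-words vector."""
    tagged = segment_and_tag(text)
    first = first_weights(tagged)
    second = second_weights(tagged, first)
    features = defaultdict(float)
    for w, _ in tagged:
        features[w] += first.get(w, second.get(w, 0.0))
    return dict(features)


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def classify(text, prototypes):
    """Stand-in for the classification model: pick the nearest class prototype."""
    feats = text_features(text)
    return max(prototypes, key=lambda label: cosine(feats, prototypes[label]))


# Hypothetical class prototypes built from one example text per class.
prototypes = {
    "finance": text_features("approve the annual budget"),
    "meeting": text_features("notice for the meeting"),
}
print(classify("budget approve notice", prototypes))
```

In this sketch, words matched in the preset library dominate the feature vector, while unmatched words contribute only a small derived weight, which mirrors the abstract's distinction between first and second weight information. A real system would replace the whitespace splitter with a proper segmenter and tagger and the prototype comparison with a trained model.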

Description

Technical field

[0001] The present application belongs to the field of computer technology, and in particular relates to a text classification method and device.

Background

[0002] Text (e.g., official document) classification is a fundamental task in natural language processing. In related technologies, template-based text classification methods, text topic classification methods based on information retrieval, and text-similarity-based classification methods are usually used to classify texts.

[0003] The template-based text classification method relies heavily on rules and templates, consumes more system resources in the text classification process, and the constructed rules have low generalization ability and insufficient versatility, resulting in low text classification accuracy. The text topic classification method based on information retrieval retrieves and classifies by training the classification model, but text classification is a multi-classificati...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/117, G06F40/216, G06F40/258, G06F40/289, G06F16/33, G06F16/35, G06N3/02, G06N3/08
CPC: G06N3/02, G06N3/08, G06F16/3344, G06F16/353, G06F40/117, G06F40/216, G06F40/258, G06F40/289
Inventor: 刘志煌
Owner: TENCENT TECH (SHENZHEN) CO LTD