Text classification feature extraction method, text classification method and device

A text classification feature extraction and text classification technology, applied in the field of information processing, that addresses the inconvenience of searching among large volumes of text.

Active Publication Date: 2022-08-09
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0002] At present, with the development of Internet technology, there are more and more texts on the Internet. A large number of texts provide users with convenience, but also bring great inconvenience to users' search.


Examples


Embodiment Construction

[0037] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0038] This application relates to text classification technology, which can be applied in the system architecture shown in Figure 1. As shown in Figure 1, the system architecture includes a terminal 101, a text classification server 102, and an application server 104. The terminal 101, the text classification server 102, and the application server 104 communicate through the Internet 103.

[0039] The terminal 101 may be a s...


Abstract

The present application discloses a text classification feature extraction method. The method obtains a feature word set from multiple training texts in a training set and determines, for each feature word in the set, its attribute correlation with a given category and its word frequency in that category. Feature words whose attribute correlation meets a preset first condition, and feature words whose word frequency meets a preset second condition, are selected from the feature word set as the classification feature words of that category. The present application also proposes a corresponding text classification method, a text classification feature extraction device, and a text classification device.
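The selection step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the abstract does not define how attribute correlation is computed or what the two preset conditions are, so the sketch assumes correlation is the share of texts containing a word that belong to the category, and that both conditions are simple lower thresholds. The function name `extract_classification_features`, the parameters `min_relevance` and `min_frequency`, and the sample data are all hypothetical. The two selected groups are combined as a union, which is one possible reading of the abstract.

```python
from collections import Counter


def extract_classification_features(training_set, category,
                                    min_relevance=0.8, min_frequency=2):
    """Pick classification feature words for one category.

    training_set: iterable of (words, label) pairs, where words is a list
    of already-segmented tokens. All thresholds are illustrative assumptions.
    """
    doc_count = Counter()         # texts containing the word (any category)
    doc_count_in_cat = Counter()  # texts of `category` containing the word
    freq_in_cat = Counter()       # total occurrences within `category`

    for words, label in training_set:
        in_category = (label == category)
        for word in set(words):
            doc_count[word] += 1
            if in_category:
                doc_count_in_cat[word] += 1
        if in_category:
            freq_in_cat.update(words)

    # Assumed first condition: share of the word's texts that are in the category.
    by_relevance = {w for w in doc_count
                    if doc_count_in_cat[w] / doc_count[w] >= min_relevance}
    # Assumed second condition: word frequency within the category.
    by_frequency = {w for w in doc_count if freq_in_cat[w] >= min_frequency}

    # Union of both groups (an intersection is another plausible reading).
    return by_relevance | by_frequency


training = [
    (["goal", "match", "team"], "sports"),
    (["team", "goal", "goal"], "sports"),
    (["stock", "market", "team"], "finance"),
]
print(sorted(extract_classification_features(training, "sports", 0.8, 3)))
# prints ['goal', 'match']
```

Here "team" is dropped because it also appears in a finance text (correlation 2/3) and occurs only twice in sports texts, while "match" is kept on correlation alone even though it occurs once.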

Description

Technical Field

[0001] The present application relates to the technical field of information processing, and in particular to a text classification feature extraction method and device, and a text classification method and device.

Background

[0002] With the development of Internet technology, the volume of text on the Internet keeps growing. This abundance of text is convenient for users, but it also makes searching considerably harder. How to manage these texts effectively is a topic of active interest. Text classification was proposed in response: it assigns each text a category from a set of predefined subject categories and organizes texts by category, so that users can find them more easily. As a key technology for managing massive data, text classification is widely used.

Summary of the Invention

[0003] An example of this application provides a text classification feature extraction...


Application Information

Patent Type & Authority Patents(China)
IPC(8): G06F16/35
CPC: G06F16/35
Inventor 包恒耀苏可饶孟良陈益
Owner TENCENT TECH (SHENZHEN) CO LTD