Text classification feature extraction method and text classification method and device

A classification feature and text classification technology, applied in the field of information processing, can solve problems such as inconvenient search

Active Publication Date: 2017-06-27
TENCENT TECH (SHENZHEN) CO LTD
View PDF3 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, with the development of Internet technology, there are more and more texts on the Internet. A large number of texts provide users with convenience, but also bring great inconvenience to users' search.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification feature extraction method and text classification method and device
  • Text classification feature extraction method and text classification method and device
  • Text classification feature extraction method and text classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0038] This application involves text classification technology, which can be applied to figure 1 In the system architecture shown. Such as figure 1 As shown, the system architecture includes: a terminal 101 , a text classification server 102 , and an application server 104 . The terminal 101 , the text classification server 102 and the application server 104 communicate through the Internet 103 .

[0039] The terminal 101 may be a smart phone,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text classification feature extraction method. According to the method, a feature word set is acquired from multiple training texts in a training set, the property correlation between each feature word in the feature word set and a certain category and the word frequency of each feature word in the category are determined, and the feature word with the property correlation meeting a first preset condition and the feature word with the word frequency meeting a second preset condition are selected from the feature word set to serve as classification feature words of the corresponding category. The invention furthermore provides a corresponding text classification method, a text classification feature extraction device and a text classification device.

Description

technical field [0001] The present application relates to the technical field of information processing, and in particular to a text classification feature extraction method and device, and a text classification method and device. Background technique [0002] At present, with the development of Internet technology, there are more and more texts on the Internet. While providing convenience to users, a large number of texts also bring great inconvenience to users' search. How to effectively manage these texts is a current hot issue. In the face of this problem, text classification is proposed. Text classification can determine a category for the text according to the pre-defined topic category, and classify the text according to the category, so as to facilitate users. find. As a key technology for managing massive data, text classification has been widely used. Contents of the invention [0003] The application example provides a method for extracting text classification...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/35
Inventor 包恒耀苏可饶孟良陈益
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products