Supercharge Your Innovation With Domain-Expert AI Agents!

Text classification method and device

A text classification and text technology, applied in the field of computer networks, can solve the problems of server burden and consumption of large server resources, etc., achieve the effect of low algorithm complexity and ease the consumption of server resources

Active Publication Date: 2016-11-09
TENCENT TECH (SHENZHEN) CO LTD
View PDF12 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since background offline training and offline classification require server support, if online real-time classification is required, it will consume a lot of server resources and cause a certain burden on the server

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method and device
  • Text classification method and device
  • Text classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0039] The text classification method provided in the embodiment of the present invention can be applied to such as figure 1 shown in the application environment. The terminal 102 and the server 104 are connected via a network. A browser and a browser plug-in are run on the terminal 102, multiple pages of the server 104 are accessed through the browser, and texts to be classified are obtained in the pages through the browser plug-in. The terminal 102 obtains the characteristic vocabulary in the text to be classified by traversing the characters or strings of the text to be classified. The termin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a text classification method and device. The method comprises the following steps of: obtaining a to-be-classified text, wherein the to-be-classified text comprises a feature vocabulary; obtaining a classification model and feature weight vectors of a plurality of text categories corresponding to the classification model; calculating a voting score of a text category corresponding to the feature vocabulary according to the feature weight vectors of the plurality of text categories so as to obtain a text category with the highest voting score; and determining the text category with the highest voting score as a text category corresponding to the to-be-classified text. By adopting the method to carry out real-time online classification on the texts, the server resource consumption can be effectively relieved.

Description

technical field [0001] The invention relates to the technical field of computer networks, in particular to a text classification method and device. Background technique [0002] With the development of Internet technology, people can publish information online at any time. For example, comments on purchased products on a shopping website, and posting personal impressions after watching a movie, people can refer to these information for shopping or watching movies. Usually the amount of such information is large and exists in the form of text. If the information is classified, it is convenient for people to quickly understand the relevant content. [0003] In the traditional text classification method, it is necessary to perform word segmentation on the text. By using methods such as naive Bayesian or support vector machine, the classification model is obtained through offline training on big data in the background. In the background, the information released by people is ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/35G06F16/36
Inventor 梁锦全
Owner TENCENT TECH (SHENZHEN) CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More