Text information classification method and device

A text information and classification method technology, applied in the field of text information classification methods and devices, can solve the problems of low recall rate, long analysis period, low work efficiency, etc., and achieve the goal of reducing errors, improving accuracy and improving efficiency Effect

Pending Publication Date: 2018-03-06
ZTE CORP
View PDF5 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The text information classification method and its device provided by the embodiments of the present invention solve the problem of m

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text information classification method and device
  • Text information classification method and device
  • Text information classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Example

[0026] The first embodiment:

[0027] In the prior art, the classification of information is usually performed manually, which leads to the problems of low work efficiency and low accuracy. The embodiment of the present invention discloses a text information classification method and system, which is based on a preset The rule extracts the keyword information set from the acquired text information to be classified, and matches the key of the text information to be classified according to the extracted keyword information set and the preset correspondence between the sample keyword information set and the text classification information The text category information corresponding to the word information set is finally classified according to the text category information, thereby realizing the automatic classification operation of the text information, greatly improving the work efficiency, the accuracy of the classification, and so on.

[0028] See figure 1 , figure 1 This is a pro...

Example

[0065] The second embodiment:

[0066] Please refer to figure 2 , figure 2 This embodiment provides a processing flow chart for a user to classify using a text information classification method through a client.

[0067] This embodiment is a text information classification method obtained by combining a client and a specific application scenario, and the processing steps are as follows:

[0068] S201: Obtain sample text information on the client, perform labeling and classification, and establish a correspondence between the sample keyword information set and the text category information.

[0069] In this step, text information is collected in order to create a text category keyword information set, and the collected text information is used as the sample text information for creating the keyword information set. The text information may be historical text previously received by the client The information can also be short messages or chat text messages on certain applications or t...

Example

[0124] The third embodiment:

[0125] Please refer to Image 6 , Image 6 The processing flowchart for classifying a single piece of text information provided by this embodiment is as follows:

[0126] S601, input a single text message, for example, input a single text "tomorrow night in the Presidential Private Room of Jinling Hotel."

[0127] S602: Perform word segmentation on the input text information, and after removing punctuation marks, perform word segmentation on the text: "tomorrow / night / Jinling / restaurant / president / private room / dining / ".

[0128] S603, each piece of text is split into multiple words, and the above piece of text is split into multiple words: "tomorrow", "night", "Jinling", "restaurant", "president", "private room" and "dining".

[0129] S604: Perform keyword extraction on the text information, and perform vectorized analysis using the category keyword information set in the system. The existing keyword information set is represented by 1, and the non-existent ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a text information classification method and device. According to the method, a sample keyword information set of a text category is set in advance, and a corresponding relation between the sample keyword information set and text category information is established, so that a matching foundation is provided for subsequent classification of to-be-classifiedtext information; and during classification processing of the to-be-classified text information, keyword information is extracted from the to-be-classified text information according to preset rules,and the text category information corresponding to the to-be-classified text information is obtained through matching according to the corresponding relation between the sample keyword information setand the text category. Through the information classification mode, it is only needed to perform system automatic matching, classification processing efficiency is greatly improved, the analysis cycle is shortened, a manual allocation error is lowered, and matching accuracy is improved.

Description

technical field [0001] The invention relates to the technical field of text information classification, in particular to a text information classification method and a device thereof. Background technique [0002] With the development of information classification technology, the information processing departments in various enterprises receive or accumulate massive amounts of information every day. In some cases, it is necessary to extract a certain category of information from the information, but due to There is no direct corresponding relationship established, therefore, it is impossible to directly use search engines to retrieve and extract. The existing methods for categorizing information usually use a manual method to analyze item by item, which will cost a lot of manpower. At the same time, with the continuous increase in the amount of interactive information, or the continuous increase in the accumulation of related work every day, at this time, if the information...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/35G06F16/00
Inventor 周晶
Owner ZTE CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products