Text classification method and device

A text classification method and device, applied in the field of computer technology, that address the low accuracy and coverage, long construction time, and low construction efficiency of manually built keyword and matching rule libraries, and achieve high accuracy, simple operation, and improved coverage.

Active Publication Date: 2017-12-05
HUAWEI TECH CO LTD
12 Cites, 19 Cited by

AI Technical Summary

Problems solved by technology

[0004] Since the keyword library and the matching rule library of the business information base are constructed entirely by hand, building them takes a relatively long time, construction efficiency is low, and the labor cost of establishing and maintaining the keyword library and the matching rule library is relatively high.
In addition, since the keyword library and the matching rule library depend entirely on the experience of technicians, the accuracy and coverage of the above text classification method are relatively low.

Examples

Embodiment Construction

[0057] To make the objectives, technical solutions, and advantages of the present invention clearer, implementations of the present invention are described in further detail below with reference to the accompanying drawings.

[0058] Before explaining and describing the embodiments of the present invention in detail, the application scenarios involved in the embodiments of the present invention will be described first.

[0059] The customer service platform is often the most important service window for telecom operators or Internet operators, for example the China Mobile 10086 platform or the Taobao customer service platform. Taking the China Mobile 10086 platform as an example, it handled about 5.5 million customer service calls per day on average in the first half of 2015, and millions of pieces of customer service data, in the form of conversation logs, are stored in the platform every day. Since the customer service data is often stored in the form of recordings, in order to facilit...

Abstract

The invention discloses a text classification method and device, pertaining to the technical field of computers. The method comprises the following steps: for each keyword of a plurality of keywords included in a keyword library of a business information library, determining a word vector corresponding to the keyword according to a word vector model; determining potential expansion words of the keyword on the basis of the word vector corresponding to the keyword; when an expansion rule inputted for a potential expansion word is received and an addition instruction for the potential expansion word is detected, adding the potential expansion word to the keyword library and adding the expansion rule to a matching rule library; determining, by means of a mode distribution classifier and on the basis of the keyword library and the matching rule library, a first probability that a to-be-classified text belongs to each preset class of multiple preset classes; and determining, on the basis of the first probability, the class to which the to-be-classified text belongs from the multiple preset classes. The method and device can reduce the labor cost of building a business information library and improve the coverage and accuracy of text classification.
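
The keyword expansion step in the abstract can be illustrated with a minimal sketch. It assumes that "potential expansion words" are words whose vectors lie close to a keyword's vector under cosine similarity; the excerpt does not state the similarity measure or threshold, so both are assumptions. The function names, the threshold of 0.8, and the toy vectors are hypothetical and are not taken from the patent.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def potential_expansion_words(keyword, vectors, keyword_library, threshold=0.8):
    """Return words outside the library whose vector is close to the keyword's.

    Assumption: closeness means cosine similarity >= threshold.
    Results are ordered from most to least similar.
    """
    keyword_vec = vectors[keyword]
    scored = []
    for word, vec in vectors.items():
        if word in keyword_library:
            continue  # already a keyword, nothing to expand
        sim = cosine(keyword_vec, vec)
        if sim >= threshold:
            scored.append((word, sim))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [word for word, _ in scored]


def confirm_expansion(word, rule, keyword_library, rule_library):
    """Apply a reviewer's decision: add the word and its expansion rule.

    Mirrors the abstract's condition that both an expansion rule and an
    addition instruction are received before the libraries change.
    """
    keyword_library.add(word)
    rule_library.append(rule)


# Toy, made-up vectors purely for illustration.
TOY_VECTORS = {
    "data": [1.0, 0.0, 0.0],
    "traffic": [0.9, 0.1, 0.0],
    "refund": [0.0, 1.0, 0.0],
    "invoice": [0.0, 0.2, 1.0],
}

keyword_library = {"data"}
rule_library = []
candidates = potential_expansion_words("data", TOY_VECTORS, keyword_library)
for word in candidates:
    # In practice a technician would supply the rule; here it is hard-coded.
    confirm_expansion(word, {word, "slow"}, keyword_library, rule_library)
```

In this sketch the technician only reviews candidates instead of inventing keywords from scratch, which is where the abstract's reduction in labor cost would come from.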

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a text classification method and device.

Background technique

[0002] With the rapid development of computer technology, massive information resources exist in the form of text. Because massive information resources often contain a lot of real and valuable information, for example, when the information resource is the dialogue data between users and customer service in a certain business, the dialogue data often reflects the business development situation, business problem feedback, etc. Therefore, in order to quickly and effectively discover valuable information from massive information resources, it is necessary to classify the text corresponding to the information resources.

[0003] At present, a text classification method is provided, specifically: technicians manually construct the keyword database and matching rule database of the business information base. After the c...
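
To make the roles of a keyword library and a matching rule library concrete, the following is a minimal sketch of rule-based classification. This excerpt does not describe the classifier or how the abstract's "first probability" is computed, so everything here is an assumption: each rule is modeled as a set of keywords that must all appear in the text, and a class's probability is its share of all matched rules. The names `class_probabilities`, `classify`, and `RULES` are hypothetical.

```python
def class_probabilities(text, rule_library):
    """Estimate, per class, the share of matched rules that belong to it.

    rule_library maps a class name to a list of rules; each rule is a set
    of keywords that must all occur in the text (an assumed rule format).
    """
    tokens = set(text.lower().split())
    scores = {
        label: sum(1 for rule in rules if rule <= tokens)
        for label, rules in rule_library.items()
    }
    total = sum(scores.values())
    if total == 0:
        # No rule matched: report zero evidence for every class.
        return {label: 0.0 for label in scores}
    return {label: count / total for label, count in scores.items()}


def classify(text, rule_library):
    """Pick the class with the highest probability, or None if nothing matched."""
    probs = class_probabilities(text, rule_library)
    if not probs:
        return None
    best = max(probs, key=probs.get)
    return best if probs[best] > 0 else None


# Hypothetical rule library for a customer service scenario.
RULES = {
    "billing": [{"refund"}, {"invoice", "wrong"}],
    "network": [{"data", "slow"}],
}

network_label = classify("my data connection is slow", RULES)
billing_label = classify("refund for wrong invoice", RULES)
unknown_label = classify("hello there", RULES)
```

A text that matches no rule is left unclassified in this sketch, which illustrates the coverage problem of a purely hand-built library: any wording the technicians did not anticipate falls through.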

Application Information

IPC(8): G06F17/30
CPC: G06F16/353, G06F16/374, G06F16/355, G06F16/00, G06N7/01, G06N3/08
Inventors: 刘炳源, 张旭
Owner: HUAWEI TECH CO LTD