Text classification method and device

A text classification method and device, applied in the computer field, that address the low accuracy and coverage, long construction time, and low construction efficiency of manually built keyword and matching rule libraries, and achieve high accuracy, simple operation, and improved coverage.

Active Publication Date: 2017-12-05
HUAWEI TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] Because the keyword library and the matching rule library of the business information library are constructed entirely by hand, building them takes a long time and is inefficient, and the establishment and maintenance...


Examples


Example Embodiment

[0057] In order to make the objectives, technical solutions and advantages of the present invention clearer, the embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings.

[0058] Before explaining the embodiments of the present invention in detail, the application scenarios involved in the embodiments of the present invention will be described first.

[0059] The customer service platform is often the most important service window for telecom operators or Internet operators; examples include the China Mobile 10086 platform and the Taobao customer service platform. Taking the 10086 platform as an example, it handled about 5.5 million customer service calls per day on average in the first half of 2015, so millions of customer service conversation records are stored on the platform every day. Because customer service data is often stored in the form of recordings, in order to facilitate the processing of...


Abstract

The invention discloses a text classification method and device, pertaining to the technical field of computers. The method comprises: for each keyword among a plurality of keywords included in the keyword library of a business information library, determining the word vector corresponding to the keyword according to a word vector model; determining potential expansion words of the keyword on the basis of that word vector; when an expansion rule input for a potential expansion word is received and an addition instruction for the potential expansion word is detected, adding the potential expansion word to the keyword library and adding the expansion rule to the matching rule library; determining, by a pattern matching classifier and on the basis of the keyword library and the matching rule library, a first probability that the to-be-classified text belongs to each of a plurality of preset classes; and determining, on the basis of the first probability, the class to which the to-be-classified text belongs from among the preset classes. The method and device reduce the labor cost of building a business information library and improve the coverage and accuracy of text classification.
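The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy vectors in WORD_VECTORS, the cosine-similarity threshold, the rule form (a set of words that must all appear in the text), and every function name are assumptions made for illustration. The abstract does not say how closeness between word vectors is measured or how the first probability is computed, so cosine similarity and a normalised count of matched rules stand in for both.

```python
import math

# Hypothetical toy vectors standing in for a trained word vector model.
WORD_VECTORS = {
    "bill":    [0.90, 0.10, 0.00],
    "invoice": [0.85, 0.15, 0.05],
    "charge":  [0.80, 0.20, 0.10],
    "network": [0.10, 0.90, 0.10],
    "signal":  [0.15, 0.85, 0.20],
    "hello":   [0.30, 0.30, 0.90],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def potential_expansion_words(keyword, keyword_library, threshold=0.95):
    """Words not yet in the library whose vectors lie close to the keyword's.

    The threshold is an assumed value, not one taken from the source.
    """
    known = set().union(*keyword_library.values())
    base = WORD_VECTORS[keyword]
    return sorted(
        word for word, vec in WORD_VECTORS.items()
        if word not in known and cosine(base, vec) >= threshold
    )


def add_expansion(keyword_library, rule_library, cls, word, rule):
    """Apply a confirmed expansion: the word joins the keyword library and
    the reviewer-supplied rule joins the matching rule library."""
    keyword_library[cls].add(word)
    rule_library[cls].append(rule)


def first_probabilities(text, rule_library):
    """Assumed scoring: share of matched rules per class, normalised to 1."""
    tokens = set(text.lower().split())
    hits = {
        cls: sum(1 for rule in rules if rule <= tokens)
        for cls, rules in rule_library.items()
    }
    total = sum(hits.values())
    if total == 0:
        # No rule matched: fall back to a uniform distribution.
        return {cls: 1.0 / len(rule_library) for cls in rule_library}
    return {cls: n / total for cls, n in hits.items()}


def classify(probabilities):
    """Pick the preset class with the highest first probability."""
    return max(probabilities, key=probabilities.get)


# Usage: expand the "billing" keywords, then classify a short text.
keyword_library = {"billing": {"bill"}, "network": {"network"}}
rule_library = {
    "billing": [frozenset({"bill"})],
    "network": [frozenset({"network"})],
}
candidates = potential_expansion_words("bill", keyword_library)
for word in candidates:
    # A human reviewer would confirm each word and supply its rule here.
    add_expansion(keyword_library, rule_library, "billing", word, frozenset({word}))
probs = first_probabilities("my invoice shows a wrong charge", rule_library)
print(candidates, probs, classify(probs))
```

In this sketch the review step is reduced to a loop that accepts every candidate. In the method described above, a word and its rule are added only after the expansion rule has been input and an addition instruction has been detected.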

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a text classification method and device.

Background technique

[0002] With the rapid development of computer technology, massive information resources exist in the form of text. Because massive information resources often contain a lot of real and valuable information, for example, when the information resource is the dialogue data between users and customer service in a certain business, the dialogue data often reflects the business development situation, business problem feedback, etc. Therefore, in order to quickly and effectively discover valuable information from massive information resources, it is necessary to classify the text corresponding to the information resources.

[0003] At present, a text classification method is provided, specifically: technicians manually construct the keyword database and matching rule database of the business information base. After the c...


Application Information

IPC(8): G06F17/30
CPC: G06F16/353, G06F16/374, G06F16/355, G06F16/00, G06N7/01, G06N3/08
Inventor: 刘炳源, 张旭
Owner: HUAWEI TECH CO LTD