Text classification method and device

A text classification method and device, applied in the computer field, addresses the problems of low accuracy and coverage, long construction time, and low construction efficiency, and achieves high accuracy, simple operation, and improved coverage.

Active Publication Date: 2017-12-05

HUAWEI TECH CO LTD

12 Cites, 19 Cited by


AI Technical Summary

Problems solved by technology

[0004] Since the keyword library and the matching rule library of the business information base are built entirely by hand, constructing them takes a long time and is inefficient, and the labor cost of establishing and maintaining the keyword library and the matching rule library is relatively high.

In addition, since the keyword library and the matching rule library depend entirely on the experience of technicians, the accuracy and coverage of the above text classification method are relatively low.

Embodiment Construction

[0057] In order to make the object, technical solution and advantages of the present invention clearer, the implementation manner of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0058] Before explaining and describing the embodiments of the present invention in detail, the application scenarios involved in the embodiments of the present invention will be described first.

[0059] The customer service platform is often the most important service window for telecom operators or Internet operators, for example the China Mobile 10086 platform or the Taobao customer service platform. Taking the China Mobile 10086 platform as an example, customer service calls averaged about 5.5 million per day in the first half of 2015, so millions of customer service conversation records are stored on the platform every day. Since the customer service data is often stored in the form of recordings, in order to facilit...


Abstract

The invention discloses a text classification method and device, pertaining to the technical field of computers. The method comprises the following steps: for each of a plurality of keywords included in the keyword library of a business information library, determining the word vector corresponding to the keyword according to a word vector model; determining potential expansion words of the keyword on the basis of that word vector; when an expansion rule input for a potential expansion word is received and an instruction to add the potential expansion word is detected, adding the potential expansion word to the keyword library and adding the expansion rule to the matching rule library; determining, by a pattern matching classifier and on the basis of the keyword library and the matching rule library, a first probability that the to-be-classified text belongs to each of multiple preset classes; and determining, on the basis of the first probability, the class to which the to-be-classified text belongs from the multiple preset classes. The method and device can reduce the labor cost of building a business information library and help improve the coverage and accuracy of text classification.
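
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the two-dimensional word vectors, the class names, the cosine-similarity ranking, the `top_k` and `min_sim` parameters, and the rule format (a rule is a set of words that must all occur in the text) are all assumptions made for illustration, and the hit-share scorer is only a toy stand-in for the classifier that produces the first probability.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def potential_expansion_words(keyword, vectors, keyword_library, top_k=2, min_sim=0.8):
    """Words outside the library whose vectors lie closest to the keyword's vector.

    top_k and min_sim are hypothetical tuning knobs, not taken from the patent.
    """
    kv = vectors[keyword]
    scored = [(w, cosine(kv, v)) for w, v in vectors.items() if w not in keyword_library]
    scored = [(w, s) for w, s in scored if s >= min_sim]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [w for w, _ in scored[:top_k]]


def add_expansion(word, cls, rule, keyword_library, rule_library):
    """Apply a reviewer-approved expansion: store the word and its matching rule."""
    keyword_library.add(word)
    rule_library.setdefault(cls, []).append(frozenset(rule))


def first_probabilities(tokens, rule_library):
    """Toy scorer: a rule hits when all of its words occur in the text.

    Each class receives its share of the hits; the result is uniform when
    no rule hits at all.
    """
    present = set(tokens)
    hits = {c: sum(1 for r in rules if r <= present) for c, rules in rule_library.items()}
    total = sum(hits.values())
    if total == 0:
        return {c: 1.0 / len(hits) for c in hits}
    return {c: h / total for c, h in hits.items()}


def classify(tokens, rule_library):
    """Pick the class with the highest first probability."""
    probs = first_probabilities(tokens, rule_library)
    return max(probs, key=probs.get)


# Hypothetical word vectors; a real system would load them from a trained model.
vectors = {
    "bill": [1.0, 0.0],
    "invoice": [0.9, 0.1],
    "charge": [0.8, 0.3],
    "signal": [0.0, 1.0],
    "network": [0.1, 0.9],
}
keyword_library = {"bill", "signal"}
rule_library = {
    "billing": [frozenset({"bill"})],
    "connectivity": [frozenset({"signal"})],
}

text = ["my", "invoice", "is", "wrong"]
before = first_probabilities(text, rule_library)  # no rule hits yet

# Candidates are only suggestions; a reviewer supplies the rule and confirms.
candidates = potential_expansion_words("bill", vectors, keyword_library)
add_expansion("invoice", "billing", {"invoice"}, keyword_library, rule_library)

after = first_probabilities(text, rule_library)
predicted = classify(text, rule_library)
print(candidates, before, after, predicted)
```

In this toy run the text matches no rule before the expansion, so the scorer cannot separate the classes; once the approved word and its rule are added, the same text is assigned to the billing class. The review step is kept as an explicit function call because the abstract adds a word only after an expansion rule has been input and an addition instruction detected.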

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a text classification method and device.

Background

[0002] With the rapid development of computer technology, massive information resources exist in the form of text. These resources often contain a great deal of real and valuable information. For example, when the information resource is the dialogue data between users and customer service in a certain business, the dialogue data often reflects how the business is developing, feedback on business problems, and so on. Therefore, in order to quickly and effectively discover valuable information in massive information resources, it is necessary to classify the text corresponding to the information resources.

[0003] At present, a text classification method is provided, specifically: technicians manually construct the keyword library and matching rule library of the business information base. After the c...


Application Information


IPC(8): G06F17/30

CPC: G06F16/353, G06F16/374, G06F16/355, G06F16/00, G06N7/01, G06N3/08

Inventors: 刘炳源, 张旭

Owner: HUAWEI TECH CO LTD
