Text classification method and device based on heterogeneous neural network

A text classification technology based on a heterogeneous neural network, applied in the fields of artificial intelligence and natural language processing. It addresses problems such as complex business logic, differences, and very large classification systems, and improves classification accuracy.

Pending Publication Date: 2022-02-22
CHINA TELECOM DIGITAL INTELLIGENCE TECH CO LTD

AI Technical Summary

Problems solved by technology

However, due to the complex business logic, the classification system in this scena



Examples


Embodiment

[0096] This embodiment takes the classification of call center user complaint work order text as an example. The call center user complaints are classified as follows:

[0097] Step S0: obtain a training data set of 100,000 records, a verification data set of 20,000 records, and a test data set of 4,000 records to be classified.

[0098] Step S1: based on the tree structure of the actual classification system in the training data set, construct an N-layer text classification multi-way tree corresponding to that tree structure. Following the structure of the N-layer multi-way tree, write the training data in the training data set into the classification files corresponding to each level, naming the files according to the file naming rule R. The naming rules for the category numbers and the corresponding Chinese category names are also saved as a classification file, which is convenient for reading and selects the corresponding word se...
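The grouping described in step S1 can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the sample texts, the category labels, the helper names `build_tree` and `group_by_level`, and the key format `L<level>_<parent path>` are all assumptions made for this example. In particular, the source does not spell out the file naming rule R, so the keys here are only a hypothetical stand-in for it, and the groups are kept in memory instead of being written to files.

```python
from collections import defaultdict


def build_tree(samples):
    """Build a nested-dict multi-way tree from (text, category_path) pairs."""
    tree = {}
    for _text, path in samples:
        node = tree
        for label in path:
            # Create the child node on first sight, then descend into it.
            node = node.setdefault(label, {})
    return tree


def group_by_level(samples):
    """Group training texts by the tree node at which they must be classified.

    Returns {key: [(text, child_label), ...]}. The key format is hypothetical
    and only stands in for the unspecified file naming rule R.
    """
    groups = defaultdict(list)
    for text, path in samples:
        for level, label in enumerate(path, start=1):
            parent = "/".join(path[:level - 1]) or "ROOT"
            groups[f"L{level}_{parent}"].append((text, label))
    return dict(groups)


# Hypothetical complaint texts with two-level category paths.
samples = [
    ("charged twice this month", ("billing", "duplicate_charge")),
    ("roaming fee too high", ("billing", "roaming")),
    ("no signal at home", ("network", "coverage")),
]

tree = build_tree(samples)
groups = group_by_level(samples)
```

Each key then corresponds to one classification file: `L1_ROOT` holds every text with its top-level label, while `L2_billing` holds only the billing texts with their second-level labels, so a separate model can be trained per node.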



Abstract

The invention discloses a text classification method and device based on a heterogeneous neural network. The method comprises the following steps:

S1: based on the tree structure of the actual classification system in a training data set, construct an N-layer text classification multi-way tree corresponding to that tree structure; following the structure of the tree, write the training data into the classification files corresponding to each level; perform word segmentation on the Chinese text of each classification file; perform feature selection; and store the selected features in the corresponding feature files.

S2: construct a text heterogeneous neural network corresponding to the tree structure.

S3: set the heterogeneous neural network algorithm parameters and tune them, iteratively generate and store each classification and sub-classification model from the training data, and use a verification data set to judge accuracy.

The categories and the hierarchical relations between them are also added to the heterogeneous neural network, so that vector representations of the categories are learned explicitly, which addresses the link data sparsity problem.
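One way the stored classification and sub-classification models might be chained at prediction time, and scored against a verification set, is sketched below. This is an assumption for illustration only: the source does not describe the inference procedure, and the heterogeneous neural network itself is not reproduced here. The helper names `predict_path`, `keyword_model`, and `accuracy`, the keyword rules, and the sample texts are all hypothetical, with toy keyword matchers standing in for trained models.

```python
def predict_path(text, models, root="ROOT"):
    """Walk down the category tree, applying one model per visited node."""
    path = []
    key = root
    while key in models:
        label = models[key](text)
        path.append(label)
        # The next model, if any, is keyed by the path predicted so far.
        key = "/".join(path)
    return tuple(path)


def keyword_model(rules, default):
    """Toy stand-in for a trained classifier: the first matching keyword wins."""
    def classify(text):
        for keyword, label in rules:
            if keyword in text:
                return label
        return default
    return classify


def accuracy(pairs, models):
    """Fraction of verification samples whose full category path is predicted."""
    hits = sum(predict_path(text, models) == path for text, path in pairs)
    return hits / len(pairs)


# One hypothetical model per tree node, keyed by the node's path.
models = {
    "ROOT": keyword_model([("fee", "billing"), ("signal", "network")], "other"),
    "billing": keyword_model([("roaming", "roaming")], "duplicate_charge"),
    "network": keyword_model([("signal", "coverage")], "speed"),
}

verification = [
    ("roaming fee too high", ("billing", "roaming")),
    ("no signal at home", ("network", "coverage")),
    ("charged twice", ("billing", "duplicate_charge")),
]

score = accuracy(verification, models)
```

Scoring the whole path is a strict choice: an error at the top level makes every lower level wrong as well, which is one reason per-level accuracy is often also tracked when tuning parameters.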

Description

Technical field

[0001] The present invention relates to the fields of natural language processing and artificial intelligence, and in particular to a text classification method and device based on a heterogeneous neural network.

Background art

[0002] Text classification is the process by which a computer automatically classifies natural language text according to a given classification system, and it has applications in many fields. As application scenarios multiply, scenarios with large classification systems are becoming more common. Taking a customer service center platform as an example, merchants hope to use automatic text classification to classify and record customer work orders accurately and quickly, which speeds up the service response of on-site operators and improves overall service quality; it also optimizes management by making the analysis of the hotline's overall operation data more intelligent. ...


Application Information

IPC(8): G06F40/289, G06K9/62, G06N3/04, G06N3/08
CPC: G06F40/289, G06N3/08, G06N3/045, G06F18/24323
Inventor: 王彦君, 贺同泽, 郭彩丽, 李忠, 张宇峰, 李金岭, 杜忠田, 王欣, 肖天宇
Owner: CHINA TELECOM DIGITAL INTELLIGENCE TECH CO LTD