Unlock instant, AI-driven research and patent intelligence for your innovation.

Single-label text classification method, computing device and computer readable storage medium

A text classification and text technology, applied in the field of machine learning, can solve problems such as lack of robustness, complex structure, and many parameters

Active Publication Date: 2020-12-18
震坤行网络技术(南京)有限公司 +1
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In addition, conventional classification methods usually use a deep network model. Due to the deep network layer, complex structure, and various parameters, the deployment of the model online will cause a certain degree of performance loss. This part of the loss is sensitive to response time systems (such as search , recommendation system, etc.) can not be ignored
[0004] Furthermore, in the case of a small data volume, the single-model classification scheme makes the entire classification algorithm completely dependent on the quality of a single model, which lacks robustness

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Single-label text classification method, computing device and computer readable storage medium
  • Single-label text classification method, computing device and computer readable storage medium
  • Single-label text classification method, computing device and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] Preferred embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although preferred embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

[0020] In the following description, for the purposes of illustrating various inventive embodiments, certain specific details are set forth in order to provide a thorough understanding of various inventive embodiments. One skilled in the relevant art will recognize, however, that an embodiment may be practiced without one or more of these specific details. In other instances, well-known devices, structures and techniques associated with the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a single-label text classification method, computing equipment and a computer readable storage medium. The method comprises the steps: constructing a sample set based on user behavior data and commodity data, wherein the sample set comprises a plurality of samples, and each sample comprises a first field indicating text and a second field indicating a single label; performing word segmentation on the first field of each sample to obtain at least one feature word, and obtaining a formatted sample of the sample based on the second field and the at least one feature word; training a first neural network model based on a plurality of formatted samples of the plurality of samples; dividing the plurality of formatted samples into a plurality of groups based on the second field; and for each group, determining a second neural network model based on the average word vector of at least one feature word in the group. A large-data-volume training sample set is constructed by utilizing data from multiple sources, and the performance loss of an online system is reduced by utilizing a combined model.

Description

technical field [0001] The present disclosure generally relates to the field of machine learning, and more specifically, relates to a single-label text classification method, a computing device, and a computer-readable storage medium. Background technique [0002] Currently, various machine learning-based methods have been proposed to classify objects such as text and pictures. These classification methods usually require large amounts of data to train neural network models to classify various objects into multiple categories. However, in some cases, such as e-commerce search scenarios, the text to be classified (search term) is usually a single-label text, which often faces problems such as insufficient data volume and excessive data noise. Especially in scenarios such as system cold start, user behavior data is insufficient. Under such conditions, how to construct an effective data set is the cornerstone of producing an effective model and achieving accurate text classif...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/33G06N3/04G06N3/08G06Q30/06
CPCG06F16/353G06F16/3334G06N3/08G06Q30/0625G06N3/045
Inventor 陈赵阳郭相林郑学坤
Owner 震坤行网络技术(南京)有限公司