Scene-based text classification model and text classification method and device

A text classification and scene technology, applied in the field of scene-based text classification models, can solve the problems of training models, ignoring task commonality, and complicated development and maintenance.

Active Publication Date: 2020-12-18
ZHIZHESIHAIBEIJINGTECH CO LTD
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

2. The tasks in each scene are similar, but not exactly the same
For example, for unfriendly business, although unfriendly related content is identified in each scenario, the definition of unfriendly in different scenarios is slightly different, the target labels are different, and the data distribution in each scenario is quite different
[0003] The existing technology trains models, goes online, and optimizes each scene separately, but the development and maintenance are relatively complicated, and the commonality of tasks in each scene is ignored. When the number of samples in a single scene is extremely insufficient, it is even more difficult to train an effective model. Make the results of text classification based on the model inaccurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene-based text classification model and text classification method and device
  • Scene-based text classification model and text classification method and device
  • Scene-based text classification model and text classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Hereinafter, embodiments of the present invention will be described with reference to the drawings. It should be understood, however, that these descriptions are illustrative only and are not intended to limit the scope of the present invention. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present invention.

[0033] The terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting of the invention. The words "a", "an" and "the" used herein shall also include the meanings of "plurality" and "multiple", unless the context clearly indicates otherwise. In addition, the terms "comprising", "comprising", etc. used herein indicate the existence of stated features, steps, operations and / or components, but do not exclude the existence or addition of one or more other features, steps, operations or components . ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a scene-based text classification model and a text classification method and device, belongs to the technical field of natural language processing, and aims to realize text classification in multiple scenes and improve the accuracy of a text classification result. The model comprises an input layer used for obtaining a short text and a scene label, and the scene label is used for representing the type of a scene where the short text is located; an embedding layer used for adding a scene code and a position code to the short text to obtain a code representation of the short text in the scene; an encoding layer which comprises a multi-head self-attention layer and is used for extracting text features in the short text according to the encoding representation obtainedby the embedding layer to obtain feature vectors of the short text; a decoding layer which comprises multiple layers of sensors and is used for outputting scores of all service types according to thefeature vectors, obtained by the encoding layer, of the short text; and an output layer which is used for outputting a classification result according to the score of each service type.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, and more specifically, to a scene-based text classification model, text classification method and device. Background technique [0002] At present, in the business of various social platforms (such as Weibo, Zhihu, etc.), there are a large number of similar scenarios and the same task. The tasks in different scenarios are similar but not completely consistent, and there may be differences in data sources, target categories, and discrimination definitions. The characteristics of multi-scene similar tasks are mainly 1. There are many scenes. For example, scenes can contain questions, answers, comments, articles, bullet chats, ideas, etc. 2. The tasks in each scene are similar, but not completely consistent. For example, for unfriendly business, although unfriendly related content is identified in each scenario, the definition of unfriendly in different scenarios is sl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F40/284G06N3/04G06N3/08
CPCG06F16/35G06F40/284G06N3/08G06N3/045
Inventor 李博徐英杰
Owner ZHIZHESIHAIBEIJINGTECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products