Text labeling method and device, electronic equipment and storage medium

A text labeling model technology, applied in the field of artificial intelligence, that addresses problems such as the inability to label text.

Pending Publication Date: 2021-09-14
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present application provide a text labeling method and device, electronic equipment, and a storage medium, to solve the existing technical problem that cross-language text labeling cannot be performed.




Detailed Description of Embodiments

[0055] The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only some of the embodiments of this application, not all of them. All other embodiments obtained by those skilled in the art based on the embodiments in this application, without creative effort, fall within the scope of protection of this application.

[0056] The terms "first", "second", "third", etc. (if any) in the description and claims of the present application and in the above drawings are used to distinguish similar objects, and not necessarily to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances, so that the embodiments of the application described herein can be practiced in sequences other than those illustrated or described herein....


Abstract

The invention relates to the field of artificial intelligence, and in particular to a text labeling method and device, electronic equipment, and a storage medium. The method comprises the following steps: obtaining a to-be-labeled text and the actual language of the to-be-labeled text; obtaining a trained text labeling model corresponding to the actual language and a target language; performing semantic space conversion on the actual semantic features of the to-be-labeled text through the trained text labeling model, to obtain target semantic features of the to-be-labeled text in the target language; clustering the target semantic features through the trained text labeling model to obtain a clustering result for the to-be-labeled text; classifying the clustering result through the trained text labeling model to obtain text type information for the to-be-labeled text; and labeling the to-be-labeled text with its text type according to the text type information. With the method and the device, the text type of a text in any language can be labeled automatically on the basis of text in a target language that has a large number of labeled samples, which solves the cross-language text labeling problem.
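As an illustration only, the flow of steps in the abstract might be sketched in Python as follows. The abstract does not say how the conversion, clustering, or classification are implemented, so the linear conversion matrix, the nearest-centroid clustering, the cluster-to-type lookup, the toy values, and all names (`MODELS`, `convert`, `cluster`, `classify`, `label_text`) are assumptions made for this sketch, not the method of the application.

```python
from math import dist

# Hypothetical toy "trained models", keyed by (actual language, target language).
# All values below are made up for illustration; a real model would learn them.
MODELS = {
    ("en", "zh"): {
        # Assumed linear map from the actual-language semantic space
        # to the target-language semantic space.
        "conversion": [[0.0, 1.0], [1.0, 0.0]],
        # Assumed cluster centres in the target-language space.
        "centroids": [[1.0, 0.0], [0.0, 1.0]],
        # Assumed text type for each cluster.
        "cluster_types": {0: "sports", 1: "finance"},
    }
}


def convert(model, actual_features):
    """Semantic space conversion, sketched as a matrix-vector product."""
    return [sum(w * x for w, x in zip(row, actual_features))
            for row in model["conversion"]]


def cluster(model, target_features):
    """Clustering, sketched as nearest-centroid assignment."""
    centroids = model["centroids"]
    return min(range(len(centroids)),
               key=lambda i: dist(centroids[i], target_features))


def classify(model, cluster_id):
    """Classification, sketched as a lookup from cluster to text type."""
    return model["cluster_types"][cluster_id]


def label_text(actual_features, actual_language, target_language):
    """Return the text type label for the features of a to-be-labeled text."""
    model = MODELS[(actual_language, target_language)]
    target_features = convert(model, actual_features)
    return classify(model, cluster(model, target_features))


print(label_text([0.1, 0.9], "en", "zh"))
```

The sketch keeps the three stages as separate functions so that each one can be swapped for a learned component; feature extraction from raw text is left out and the input is assumed to be an already computed feature vector.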

Description

Technical field

[0001] The present application relates to the field of artificial intelligence, in particular to a text labeling method and device, electronic equipment, and a storage medium.

Background

[0002] Text classification is widely used in content-related products, for example in news classification, article classification, and intent classification. Normally, text classification targets texts in one particular language, such as Chinese or English, but when a product needs to expand its business into other languages, it runs into the problem of insufficient labeled text in the initial stage of the product.

[0003] Over the longer term, a certain amount of labeled data for these other languages can be accumulated gradually through manual work and other methods, and then used for model training. In the early stage, however, it is very time-consuming and a waste of manpower to annotate these other language texts only m...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F16/33, G06F16/35, G06F40/30
CPC: G06F16/3344, G06F16/35, G06F40/30
Inventor: 缪畅宇
Owner: TENCENT TECH (SHENZHEN) CO LTD