Language sequence labeling method and device, storage medium and computer equipment

A technology of sequence labeling and storage medium, applied in the field of device storage medium and computer equipment, language sequence labeling method, and can solve the problems of lack of labeling resources, incompleteness, inaccurate labeling, etc.

Active Publication Date: 2020-06-12
ALIBABA GRP HLDG LTD
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present invention provides a language sequence labeling method, device storage medium and computer equipment to at least solve the techn

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Language sequence labeling method and device, storage medium and computer equipment
  • Language sequence labeling method and device, storage medium and computer equipment
  • Language sequence labeling method and device, storage medium and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033]According to an embodiment of the present invention, a method embodiment of a language sequence tagging method is also provided. It should be noted that the steps shown in the flow chart of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions, Also, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0034] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. figure 1 It shows a hardware structure block diagram of a computer terminal (or mobile device) for realizing the language sequence tagging method. Such as figure 1 As shown, the computer terminal 10 (or mobile device 10) may include one or more (shown by 102a, 102b, ..., 102n in the figure) processor 102 (the processor 102 may ...

Embodiment 2

[0110] According to another aspect of the application, the application also provides another language sequence tagging method, such as Image 6 The language sequence tagging method shown. Image 6 It is a flow chart of a language sequence tagging method according to Embodiment 2 of the present invention, such as Image 6 As shown, the method includes the following steps:

[0111] Step S602, receiving a target language sequence annotation request.

[0112] As an optional embodiment, the subject of the above-mentioned execution steps may be an execution terminal for performing target language sequence tagging, which may be a server, a computer, or other intelligent terminals, and the above-mentioned terminal sending the target language sequence tagging request may be Servers, computers, or other intelligent terminals.

[0113] As an optional embodiment, before receiving the target language sequence labeling request, the target language sequence labeling request may also be pr...

Embodiment 3

[0126] According to an embodiment of the present invention, a language sequence tagging device for implementing the above-mentioned embodiment 1 is also provided, Figure 7 is a schematic diagram of a language sequence tagging device according to Embodiment 3 of the present invention, such as Figure 7 As shown, the device includes: a first generation module 702, a second generation module 704, a conversion module 706, a training module 708 and a labeling module 710, and the device will be described in detail below.

[0127] The first generation module 702 is used to generate cross-language vectors based on the source language vector and the target language vector; the second generation module 704 is connected to the above-mentioned first generation module 702, and is used to generate language correspondences according to the cross-language vectors, where language The corresponding relationship includes the relationship between the source language and the target language; the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a language sequence labeling method and device, a storage medium and computer equipment. The method comprises the steps of generating a cross-language vector based on a sourcelanguage vector and a target language vector; generating a language correspondence relationship according to the cross-language vector, the language correspondence relationship comprising a relationship corresponding to the source language and the target language; converting the source language sequence annotation data into conversion data according to the language corresponding relation; trainingthe source language sequence labeling data and the conversion data to obtain a cross-language sequence labeling model; and performing sequence labeling on the target language based on the cross-language sequence labeling model. According to the method and the device, the technical problem of inaccurate and incomplete annotation caused by lack of annotation resources of the target language in a language sequence annotation method in related technologies is solved.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a language sequence labeling method, device storage medium and computer equipment. Background technique [0002] In some application scenarios, it is necessary to perform sequence annotation on various kinds of languages. For example, the input text (for example, I went to She County, Anhui today) recognizes the entity (for example, She County, Anhui is a place name); another example, the input text (for example, I bought a She inkstone) recognizes the entity (for example, She County Inkstone is a commodity), but instead of inputting "I went to She County, Anhui today, or I bought a She inkstone" in another language (for example, English, Thai, Vietnamese, Arabic, etc.), in this language is In the case of languages ​​without manual labeling data (resource-poor languages, for example, Vietnamese, Thai), it is also impossible to identify the product name "Anhui Shexian is a place na...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/295G06N3/04G06N3/08
CPCG06N3/088G06N3/049G06N3/045
Inventor 黄睿李辰王涛包祖贻李林琳司罗
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products