Corpus labeling method and device

A corpus labeling technology, applied in natural language data processing, special data processing applications, instruments, and similar fields, that addresses the problem of repeated position calculation and thereby improves the speed of corpus labeling.

Active Publication Date: 2019-04-26
BEIJING GRIDSUM TECH CO LTD

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a corpus tagging method and device to at least solve the technical problem in the prior art that, because the tagging result is inserted into the sentence or displayed at the end of the sentence, sentence positions are calculated repeatedly when the corpus is tagged multiple times.



Examples


Embodiment 1

[0021] According to an embodiment of the present invention, a method embodiment of a corpus tagging method is provided. It should be noted that the steps shown in the flowchart of the accompanying drawings can be executed in a computer system, for example as a set of computer-executable instructions. Although a logical order is shown in the flowchart, in some cases the steps shown or described may be performed in a different order.

[0022] Figure 1 shows a corpus tagging method according to an embodiment of the present invention. As shown in Figure 1, the method includes the following steps:

[0023] Step S102: detect a text selection operation on the sentence to be tagged.

[0024] Specifically, a corpus tagging platform can be built in advance, and detection of the text selection operation on the sentence to be tagged can be implemented on that platform and displayed on the page, wherein the text selection ...

Embodiment 2

[0052] According to an embodiment of the present invention, a product embodiment of a corpus tagging device is provided. Figure 4 shows a corpus tagging device according to an embodiment of the present invention. As shown in Figure 4, the device includes a detection module, a first determination module, and a processing module. The detection module detects a text selection operation on the sentence to be tagged. The first determination module determines the label corresponding to the selected text once the text selection operation has ended and the selected text has been obtained. The processing module displays the label corresponding to the selected text at a position outside the node of the sentence to be tagged, and stores in a preset database both the position information of the tagged sentence and the position information of the selected text within that sentence.
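The three-module split described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the class names, the `Selection` structure, the `side_panel` list standing in for "a position outside the node of the sentence", the in-memory `database` list standing in for the preset database, and the `choose_label` callback are all assumptions introduced here. The source does not state how a label is chosen, so that step is left to a caller-supplied function.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Selection:
    """A finished text selection inside one sentence (hypothetical structure)."""
    sentence_pos: int  # position of the sentence within the corpus
    start: int         # offset of the first selected character
    end: int           # offset one past the last selected character
    text: str          # the selected text itself


class DetectionModule:
    """Tracks one selection from its first offset until the operation ends."""

    def __init__(self, sentence_pos: int, sentence: str) -> None:
        self.sentence_pos = sentence_pos
        self.sentence = sentence
        self._anchor: Optional[int] = None

    def begin(self, offset: int) -> None:
        self._anchor = offset

    def finish(self, offset: int) -> Optional[Selection]:
        anchor, self._anchor = self._anchor, None
        if anchor is None or anchor == offset:
            return None  # no selection in progress, or nothing selected
        start, end = sorted((anchor, offset))  # allow right-to-left drags
        return Selection(self.sentence_pos, start, end, self.sentence[start:end])


class FirstDeterminationModule:
    """Maps selected text to a label via a caller-supplied rule (an assumption)."""

    def __init__(self, choose_label: Callable[[str], str]) -> None:
        self._choose_label = choose_label

    def label_for(self, selection: Selection) -> str:
        return self._choose_label(selection.text)


@dataclass
class ProcessingModule:
    """Shows the label outside the sentence and records positions only."""
    side_panel: List[str] = field(default_factory=list)
    database: List[Dict[str, object]] = field(default_factory=list)

    def process(self, selection: Selection, label: str) -> None:
        # The sentence text is never modified; the label lives beside it.
        self.side_panel.append(f"{selection.text} -> {label}")
        self.database.append({
            "sentence_pos": selection.sentence_pos,
            "start": selection.start,
            "end": selection.end,
            "label": label,
        })
```

A caller would create a `DetectionModule` per sentence, call `begin` and `finish` with the offsets reported by the page, and pass any resulting `Selection` through the other two modules.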

[00...

Embodiment 3

[0066] According to an embodiment of the present invention, a product embodiment of a storage medium is provided, on which a program is stored. When the program runs, it controls the device where the storage medium is located to execute the above corpus labeling method; alternatively, when the program is executed by a processor, it implements the above corpus labeling method.



Abstract

The invention discloses a corpus annotation method and device. The method comprises the following steps: detecting a text selection operation on a sentence to be labeled; after the text selection operation is finished and the selected text is obtained, determining a label corresponding to the selected text; and displaying the label corresponding to the selected text at a position outside the node of the to-be-labeled sentence, and storing in a preset database the position information of the sentence labeled with that label and the position information of the selected text within the to-be-labeled sentence. This solves the technical problem in the prior art that, when a corpus is annotated, annotation results are inserted into sentences or displayed at the ends of sentences, so that sentence positions are repeatedly calculated across multiple annotations.
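To make the storage step concrete, the following is a minimal sketch of how position information might be kept in a database while the sentence text stays untouched. It assumes SQLite as the "preset database" and a single hypothetical `annotation` table; the table name, column names, and helper functions are illustrative choices, not details given in the source.

```python
import sqlite3
from typing import List, Tuple

# Hypothetical schema: only positions and the label are stored, never a
# modified copy of the sentence.
SCHEMA = """
CREATE TABLE IF NOT EXISTS annotation (
    id            INTEGER PRIMARY KEY,
    sentence_pos  INTEGER NOT NULL,  -- position of the labeled sentence
    start_offset  INTEGER NOT NULL,  -- start of the selected text in the sentence
    end_offset    INTEGER NOT NULL,  -- one past the end of the selected text
    label         TEXT    NOT NULL
)
"""


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the annotation store."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def save_annotation(conn: sqlite3.Connection, sentence_pos: int,
                    start: int, end: int, label: str) -> int:
    """Record one label by position and return its row id."""
    if not 0 <= start < end:
        raise ValueError("selection must be a non-empty span")
    cur = conn.execute(
        "INSERT INTO annotation (sentence_pos, start_offset, end_offset, label) "
        "VALUES (?, ?, ?, ?)",
        (sentence_pos, start, end, label),
    )
    conn.commit()
    return cur.lastrowid


def labels_for(conn: sqlite3.Connection, sentence_pos: int,
               sentence: str) -> List[Tuple[str, str]]:
    """Rebuild (selected text, label) pairs for display beside the sentence."""
    rows = conn.execute(
        "SELECT start_offset, end_offset, label FROM annotation "
        "WHERE sentence_pos = ? ORDER BY start_offset, id",
        (sentence_pos,),
    )
    return [(sentence[start:end], label) for start, end, label in rows]
```

Because every row refers to offsets in the original sentence, a second or third label can be added without re-reading or adjusting the rows saved earlier.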

Description

Technical field

[0001] The present invention relates to the field of computer internet, in particular to a corpus tagging method and device.

Background technique

[0002] In the current era of big data, data is the foundation of all big data work, and the ability to collect data effectively and quickly is a competitive advantage for a big data team. In the process of collecting data, it may be necessary to tag the corpus.

[0003] The display schemes for corpus annotation in the prior art are mainly aimed at sentiment annotation and mainly comprise two methods. The first method stores the tagged results directly in the database and displays them at the end of the sentence. The second method disrupts the sentence structure: it inserts the tagged result directly into the sentence and stores the resulting sentence and the starting position of the tagged sentence in the database; when marking the sentence, it is often necessary to ...
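The cost of inserting a label into the sentence can be illustrated with a small sketch. The bracketed `[label]` notation and both helper functions below are assumptions made for illustration; the source does not specify how the prior art encodes inserted labels. The point is only that, once text is inserted, every offset at or after the insertion point has to be recomputed before the next label can be placed.

```python
def insert_inline(sentence: str, end: int, label: str) -> str:
    """Inline style: write the label into the sentence right after the span
    that ends at offset `end`."""
    return sentence[:end] + f"[{label}]" + sentence[end:]


def shift_after_insert(offset: int, insert_at: int, inserted_len: int) -> int:
    """Recompute an offset from the original sentence after `inserted_len`
    characters were inserted at `insert_at`."""
    return offset + inserted_len if offset >= insert_at else offset


if __name__ == "__main__":
    original = "The screen is great"
    # Label "screen" (offsets 4..10) inline.
    tagged = insert_inline(original, 10, "aspect")
    # "great" started at offset 14 in the original sentence; that offset is
    # stale in the tagged sentence and must be shifted before reuse.
    new_start = shift_after_insert(14, 10, len("[aspect]"))
    print(tagged)
    print(repr(tagged[14:19]), "vs", repr(tagged[new_start:new_start + 5]))
```

With labels kept outside the sentence, `original` never changes, so the offset 14 for "great" remains valid no matter how many labels are added.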


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F3/0484, G06F17/27
CPC: G06F3/0484, G06F3/04842, G06F40/295
Inventor: 杜志娟
Owner: BEIJING GRIDSUM TECH CO LTD