Text data annotation method

A text data annotation method in the field of data management. It addresses three problems of existing approaches: labeling time grows as databases grow, labeling platform equipment cannot finish labeling in time, and security is low. The method shortens labeling time, reduces the pressure on labeling platforms, and improves security.

Pending Publication Date: 2020-12-08
安徽聚戎科技信息咨询有限公司

AI Technical Summary

Problems solved by technology

[0003] Existing text data labeling methods have drawbacks in use. First, they usually hand the text to a labeling platform, which compares it directly against all the data. As databases grow larger, the time required for labeling keeps increasing, and the demands on labeling platform equipment, which may be unable to finish labeling in time, keep rising. Second, all files are aggregated in one place, which makes file leakage more likely and lowers security. We therefore propose a text data annotation method.

Examples

Embodiment 2

[0044] A text data labeling method, comprising the following steps:

[0045] (1) Text information extraction: determine the text search range, the text annotation data, and the text annotation standard.

[0046] The text search range is defined according to the subject or type of the text annotation data. The text annotation data is the annotation data that needs to be displayed. The text annotation standard is divided into levels according to the similarity of the text annotation data; for example, similarity may be divided into 5 levels, each marked with a different color.
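
The level-based annotation standard described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes similarity is a score in [0, 1] and that the 5 levels are equal-width bands, and the color names and the function names `similarity_level` and `level_color` are hypothetical.

```python
# Hypothetical color scheme: the source only says each level gets a different color.
LEVEL_COLORS = ["red", "orange", "yellow", "green", "blue"]


def similarity_level(similarity: float, levels: int = 5) -> int:
    """Map a similarity score in [0, 1] to a level 1..levels.

    Equal-width bands are an assumption; the source does not define the boundaries.
    """
    if not 0.0 <= similarity <= 1.0:
        raise ValueError("similarity must be in [0, 1]")
    # min() keeps a score of exactly 1.0 inside the top band.
    return min(int(similarity * levels), levels - 1) + 1


def level_color(similarity: float) -> str:
    """Return the display color for the level that a similarity score falls into."""
    return LEVEL_COLORS[similarity_level(similarity, len(LEVEL_COLORS)) - 1]
```

A different number of levels or uneven band boundaries would only require changing `LEVEL_COLORS` and the mapping inside `similarity_level`.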

[0047] (2) Text information segmentation and numbering: segment the files within the text search range by dividing the total file directly into parts containing equal amounts of data, number the resulting parts in the order of division, and keep a record of the text search range as it was before division.
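
As a rough sketch of the segmentation and numbering step, the code below splits a list of records into contiguous parts of near-equal size and numbers them in division order. The `Segment` class and `split_and_number` function are hypothetical names, and spreading any remainder over the first parts is an assumption, since the source only asks for parts with equal data.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    number: int          # 1-based number assigned in the order of division
    records: List[str]   # the slice of the original data held by this segment


def split_and_number(records: List[str], parts: int) -> List[Segment]:
    """Divide records into `parts` contiguous, near-equal segments and number them."""
    if parts < 1:
        raise ValueError("parts must be at least 1")
    size, extra = divmod(len(records), parts)
    segments = []
    start = 0
    for i in range(parts):
        # The first `extra` segments take one additional record (an assumption).
        end = start + size + (1 if i < extra else 0)
        segments.append(Segment(number=i + 1, records=records[start:end]))
        start = end
    return segments
```

Because the segments are contiguous and numbered in order, concatenating them by number reproduces the original data, which is what makes the later summarizing step straightforward.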

[0048] (3) Text information release: distribute numbered data, te...

Embodiment 1

[0054] Embodiment 1 segments the text annotation data, and the labeling platforms search for, extract, and label the segmented data within the complete text search range. Embodiment 2 segments the text search range, and the labeling platforms search the segmented text search range for the text annotation data, then extract and label it. Compared with Embodiment 2, Embodiment 1 offers higher file security; compared with Embodiment 1, Embodiment 2 does not require multiple rounds of numbering, so integration is more convenient.
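
To illustrate the Embodiment 2 variant, the sketch below searches every piece of a divided search range for the full set of annotation data and merges the per-piece results. Several things here are assumptions rather than the patent's method: `difflib.SequenceMatcher` stands in for the unspecified similarity measure, the merge rule keeps the highest-similarity match per item, and `best_match` and `annotate_by_range_pieces` are hypothetical names.

```python
from difflib import SequenceMatcher


def best_match(item, refs):
    """Return (similarity, ref) for the closest entry in refs, or (0.0, None) if refs is empty."""
    scored = ((SequenceMatcher(None, item, ref).ratio(), ref) for ref in refs)
    return max(scored, default=(0.0, None))


def annotate_by_range_pieces(items, range_pieces):
    """Search each piece of the search range for all items and keep the best match per item."""
    merged = {}
    for piece in range_pieces:  # each piece would be sent to a different labeling platform
        for item in items:
            candidate = best_match(item, piece)
            # Assumed merge rule: keep the match with the highest similarity.
            if item not in merged or candidate[0] > merged[item][0]:
                merged[item] = candidate
    return merged
```

Here only the search range carries numbering, and the annotation data is never split, which is consistent with the simpler integration attributed to Embodiment 2.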

[0055] In summary, the present invention is a text data labeling method. It segments the text labeling data, or the data within the text search range, before labeling, distributes the segments to different labeling platforms for synchronous labeling, and finally summarizes the labeled results. This speeds up text data labeling, shortens the labeling time, and reduces the pressure on the labeling platforms; this patent divides the text labeling data...

Abstract

The invention discloses a text data annotation method. The method comprises the steps of text information extraction, which determines a text search range, text annotation data, and a text annotation standard; and text information segmentation and numbering, which segments the text annotation data and numbers the segments in the order of segmentation. The text annotation data, or the data within the text search range, is segmented before annotation, the segments are issued to different annotation platforms for synchronous annotation, and the annotated results are finally summarized. This increases annotation speed, shortens annotation time, and reduces the pressure on the annotation platforms. Because the text annotation data is divided, combined, and then issued to different annotation platforms, the data received by each platform is incomplete, which reduces the risk of data leakage and improves the security of text data annotation.
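
The segment, distribute, and summarize flow described in the abstract can be sketched as below. This is an illustration under stated assumptions: worker threads stand in for separate annotation platforms, `difflib.SequenceMatcher` stands in for the unspecified similarity measure, and `label_segment` and `annotate_in_parallel` are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor
from difflib import SequenceMatcher


def label_segment(number, items, search_range):
    """Stand-in for one annotation platform: score each item against the search range."""
    labels = []
    for item in items:
        best = max(
            (SequenceMatcher(None, item, ref).ratio() for ref in search_range),
            default=0.0,
        )
        labels.append((item, best))
    return number, labels


def annotate_in_parallel(items, search_range, platforms=3):
    """Segment and number the items, label the segments concurrently, then merge in order."""
    if platforms < 1:
        raise ValueError("platforms must be at least 1")
    size = max(1, -(-len(items) // platforms))  # ceiling division, at least 1
    segments = [
        (n + 1, items[start:start + size])
        for n, start in enumerate(range(0, len(items), size))
    ]
    with ThreadPoolExecutor(max_workers=platforms) as pool:
        futures = [pool.submit(label_segment, n, seg, search_range) for n, seg in segments]
        results = [future.result() for future in futures]
    # Summarize: the segment numbers restore the original order of the data.
    merged = []
    for _, labels in sorted(results, key=lambda result: result[0]):
        merged.extend(labels)
    return merged
```

Each worker sees only its own segment of the annotation data, which mirrors the abstract's point that no single platform receives the complete data.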

Description

Technical field

[0001] The invention relates to the field of data management, in particular to a text data labeling method.

Background technique

[0002] With the rapid development of society and the continuous improvement of living standards, information has become an important part of every industry. When people design products, they usually check the text they have written, and the relevant content needs to be marked. To make labeling text data easier, a number of text data labeling methods have been invented.

[0003] Existing text data annotation methods have certain drawbacks in use. First, they usually hand the text to an annotation platform, which compares it directly against all the data. As databases grow larger, the time required for labeling keeps increasing, and the requirements for labeling platform equipment are getting higher and higher if labe...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/117, G06F40/189, G06F16/33
CPC: G06F16/3331, G06F40/117, G06F40/189
Inventor: 江灏, 汤智, 曾东
Owner: 安徽聚戎科技信息咨询有限公司