Django-based text labeling platform

A platform and text technology, applied in the field of Django-based text annotation platforms, can solve the problems of deviation in data format, prone to errors and omissions, and labor-intensive problems.

Active Publication Date: 2021-01-08
BEIJING INST OF COMP TECH & APPL
View PDF10 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Traditional manual labeling is done by distributing data to individual labelers, but there are the following defects: 1. The data format marked by different personnel may be biased, which is not convenient for unified processing; 2. The labeling speed is slow and prone to mislabeling 3. Lack of collaboration between personnel and no cross-validation; 4. Low management efficiency, unable to adjust projects in real time; 5. No pre-labeling function, which consumes more manpower

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Django-based text labeling platform
  • Django-based text labeling platform
  • Django-based text labeling platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] In order to make the purpose, content, and advantages of the present invention clearer, the specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0014] This system is based on Django text annotation platform, figure 1 Shown is the algorithm pre-labeling flow chart of the text labeling platform system; figure 2 Shown is a block diagram of the text annotation platform system architecture; image 3 Shown is the flow chart of the user login module; Figure 4 Shown is the permission management diagram; Figure 5 Shown is the block diagram of the algorithm structure.

[0015] Such as figure 1 As shown, the algorithm pre-labeling process includes: upload the packaging algorithm program to the algorithm module, select the algorithm in the project through the project module, and perform pre-labeling model training. The project module transfers the project’s labeled...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Django-based text labeling platform, which comprises an algorithm module, a project module, a user module and a label module, the user module is used for storing user information and performing login verification; the label module adds, deletes and modifies labels of projects, different labels are set for different projects, and the same project label cannot be repeatedand is used for labeling according to label types; and the algorithm module selects an algorithm in a project through the project module according to a packaging algorithm program, pre-annotation model training is carried out, the project module transmits annotated data of the project to the algorithm module, after a pre-annotation model is trained, pre-annotation is carried out through the project module, and unannotated data in the project is transmitted to the pre-annotation model. The algorithm module stores the labeled data into a database, the project module pre-labels all files which donot reach the standard in a project, and the pre-labeled data is checked on a system interface after pre-labeling is completed.

Description

technical field [0001] The invention relates to artificial intelligence natural language technology, in particular to a Django-based text labeling platform. Background technique [0002] In recent years, with the rapid development of the field of artificial intelligence, natural language processing has received widespread attention as an important research direction of artificial intelligence. Natural language processing technology is mainly used to solve problems such as sequence labeling and classification, most of which belong to supervised learning, and need to use labeled data to train corresponding models. The data labeling process requires a lot of manpower to classify data through manual judgment. Traditional manual labeling is done by distributing data to individual labelers, but there are the following defects: 1. The data format marked by different personnel may be biased, which is not convenient for unified processing; 2. The labeling speed is slow, and it is pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F8/30G06F21/60G06Q10/10G06N3/04G06N3/08
CPCG06F8/315G06F21/604G06Q10/103G06N3/084G06N3/049G06F2221/2141G06N3/045
Inventor 孙科汪兆川任文波
Owner BEIJING INST OF COMP TECH & APPL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products