Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

64results about How to "Improve annotation quality" patented technology

Data annotation management method and device, electronic equipment and readable storage medium

The invention discloses a data annotation management method and device, electronic equipment and a readable storage medium. The method comprises the steps: obtaining a reference annotation data set according to to-be-annotated data corresponding to a to-be-annotated task and historical annotation behavior data corresponding to a target annotator; obtaining a first annotation result of the target annotator for the evaluation annotation data and first reference annotation data distributed in the evaluation annotation data, the evaluation annotation data being a part of the to-be-annotated data and having a correct annotation answer, and the first reference annotation data belonging to a reference annotation data set; If the accuracy corresponding to the first annotation result is greater than or equal to a preset accuracy threshold, determining whether to allow the target annotator to continue to execute the to-be-annotated task or not according to a second annotation result of the target annotator for second reference annotation data already distributed in the to-be-annotated data, the second reference annotation data belonging to a reference annotation data set. According to the embodiment of the invention, the quality and efficiency of data annotation can be improved.
Owner:北京云聚智慧科技有限公司

Semi-automatic word segmentation corpus labeling and training device

The invention discloses a semi-automatic word segmentation corpus labeling and training device, which aims to overcome the defects of the corpora used during the word segmentation corpus labeling and training process. The device of the invention is realized through the following technical schemes of using a text corpus annotation preparation module for managing the to-be-annotated corpora and the segmented word corpora; based on a plurality of word segmentation algorithms, such as the bidirectional maximum matching word segmentation based on an integrated dictionary, CRF, JIEBA, etc., submitting the word segmentation annotation work of the raw corpus to a semi-automatic corpus word segmentation annotation module; creating the segmented word tagging tasks, selecting a labeling applicable algorithm model, carrying out the automatic annotations, on the basis of automatic labeling result fusion, feeding back a training model corpus and a labeling model generated by the text corpus labeling preparation module to the feedback model learning training module; selecting and carrying out model learning training, calling a unified training model interface to generate a core dictionary, updating a word segmentation training model table, establishing a labeling algorithm comprehensive evaluation model to evaluate a model labeling effect, so that a new word segmentation labeling task is completed.
Owner:10TH RES INST OF CETC

Point cloud entity labeling system, method and device and electronic equipment

The invention discloses a point cloud entity labeling system, method and device, a point cloud entity labeling task setting method and device and electronic equipment. The point cloud entity labelingmethod comprises the following steps: displaying a three-dimensional scene graph corresponding to point cloud data to be labeled; determining two-dimensional bounding box information of a target entity in the three-dimensional scene graph; determining point cloud data corresponding to the target entity according to the two-dimensional bounding box; determining entity type information of the targetentity; and marking the point cloud data corresponding to the target entity as the entity type information. With application of the processing mode, labeling personnel only need to frame the target entity through the two-dimensional bounding box at a certain view angle of the three-dimensional scene graph, the system can automatically analyze the point cloud position which the two-dimensional bounding box hopes to frame, and the labeling personnel are automatically helped to locate the point cloud data corresponding to the target entity. Therefore, the point cloud entity labeling efficiency can be effectively improved, and the point cloud entity labeling quality can be effectively improved.
Owner:浙江菜鸟供应链管理有限公司

Model processing method and device, storage medium and electronic equipment

The invention relates to the technical field of artificial intelligence, in particular to a model processing method, a model processing device, a computer readable storage medium and electronic equipment. The model processing method provided by the embodiment of the invention comprises the steps of adding an initial annotation corpus into an initial training set, and performing training by utilizing the initial training set to obtain a language model; obtaining a prediction result output by the language model, and extracting feature information of an error prediction result in the prediction result; when it is judged that the feature information is related to the initial annotation corpus, generating a preset number of simulation annotation corpuses according to the feature information; and adding the simulation annotation corpus into the initial training set, and continuing to train the language model by using the initial training set added with the simulation annotation corpus. According to the model processing method provided by the embodiment of the invention, the data annotation quality can be improved, and the prediction effect of the language model is improved.
Owner:BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1

Power grid image intelligent annotation crowdsourcing platform and working method

The invention relates to a power grid image intelligent annotation crowdsourcing platform and a working method, and belongs to the technical field of power grid image data processing. The working method comprises the following steps: collecting to-be-annotated picture collection, performing initial annotation, performing manual adjustment annotation, performing difference re-annotation and data storage. The power grid image intelligent annotation crowdsourcing platform comprises a to-be-annotated image collection module, an initial annotation module, a manual adjustment annotation module, a difference re-annotation module and a data storage module, and is used for executing the power grid image intelligent annotation crowdsourcing platform working method. According to the invention, the preset model is used to carry out initial annotation on the data; meanwhile, platform crowdsourcing is used for manually adjusting annotation; multi-person cooperation is achieved, the annotation efficiency is improved, the unqualified annotation result is modified according to IOU parameters, the annotation quality is effectively improved, meanwhile, the data classification and arrangement functionmeets the requirement for specific model training for a certain hidden danger, and powerful data support is provided for model training and model precision improvement in the future.
Owner:SHANDONG ZHIYANG ELECTRIC

Data screening method and device, storage medium and electronic equipment

The embodiment of the invention discloses a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: obtaining a sample identification of a datasample marked with a category, and obtaining an identification vector corresponding to the sample identification; then, taking the category number of the labeled categories as a clustering category number to carry out clustering processing on the identification vector; then, for each clustering category, obtaining the similarity between the clustering center identification vector and the non-clustering center identification vector; then, determining a target non-clustering center identification vector of which the similarity with the clustering center identification vector does not reach a preset similarity in each clustering category, and judging the labeled category of the data sample represented by the target non-clustering center identification vector as labeling noise; and finally, filtering out the data sample corresponding to the target non-clustering center identification vector in each clustering category, thereby achieving the purpose of improving the labeling quality of thedata sample, and providing a high-quality data sample for machine learning.
Owner:OPPO CHONGQING INTELLIGENT TECH CO LTD

System and method for revising samples for neural network training

PendingCN112070224ASpeed ​​up the revision processSave time and costNeural learning methodsData setProcessing element
The invention relates to a system and a method for revising samples for neural network training. The system comprises a service terminal and a client front end; the service terminal is set to store samples, distribute the samples to the client front end, receive and store a processing result of the client front end, and generate statistical display according to the processing result; and the client front end is configured to receive the sample, execute revision processing and transmit a processing result to the service terminal. The service terminal comprises a storage module and a statisticsmodule; the client front end comprises an annotation module and an auditing module; the annotation module comprises a preprocessing unit and a fine processing unit; the auditing module can also be setto score the revision quality. According to the invention, a plurality of client front ends can revise samples in the same data set at the same time, so the revising progress of the samples is accelerated, and the time cost is saved; an automatic pretreatment unit is arranged, so that the workload of subsequent fine treatment is reduced; and an auditing and scoring mechanism is set, so that the enthusiasm of the annotator is not struck, and the reliability of the data set sample is improved.
Owner:CHENDU PINGUO TECH CO LTD

Data set establishing method, vehicle and storage medium

The invention relates to the technical field of automatic driving, in particular to a data set establishment method and a vehicle. The method comprises the steps: obtaining a millimeter-wave radar image and a laser radar image; performing space and time calibration on the millimeter-wave radar and the laser radar; constructing a deep neural network for speculating the laser radar image, and generating a target speculation result of the laser radar by using the deep neural network; projecting a target speculation result to the matched millimeter-wave radar image to serve as a pseudo label of the millimeter-wave radar image; generating a radar target confidence coefficient according to the millimeter wave radar local signal of the area where the pseudo label is located; and establishing a millimeter wave radar data set according to the target confidence and the pseudo label. Compared with traditional manual labeling, automatic labeling can be achieved by means of the laser radar, so thatthe labeling efficiency is improved; and target recognition with the high recall rate can be achieved based on the deep neural network of model integration, and a false positive target detection boxis filtered out, so that the labeling quality is improved.
Owner:GUANGZHOU XIAOPENG CONNECTIVITY TECH CO LTD

Data labeling system and method based on intelligent distribution algorithm

PendingCN110188800AScientific and efficient "data + people" processingScientific and efficient "data + people" processing mechanismCharacter and pattern recognitionNeural architecturesComputer moduleData labeling
The invention discloses a data labeling system and method based on an intelligent distribution algorithm, and specifically relates to the field of data processing. The system comprises a data analysismodule, a feature acquisition module and an intelligent distribution module. The output end of the data analysis module is connected with the input end of the feature acquisition module, and the output end of the feature acquisition module is connected with the input end of the intelligent distribution module. The method comprises the following specific processing steps: screening small-scale representative and instructive key data as advanced data by using a data analysis module; carrying out trial labeling, accurate labeling and analysis on the 'leading data' by a labeling person to obtaina 'standard answer', dynamically matching, and then taking the exclusive labeling feature of each labeling person; and using an intelligent distribution module to intelligently distribute the remaining data. According to the method, the manual processing error rate of the data can be reduced by utilizing an intelligent distribution algorithm, and in the manual processing task of the text type data, the manual error rate can be reduced by about 20-30%.
Owner:武汉黑松露科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products