Data set acquisition method and device based on artificial intelligence, equipment and medium

A technology of artificial intelligence and acquisition method, applied in data processing applications, character and pattern recognition, calculation models, etc., can solve problems such as inability to determine data from log data, failure of data labeling, and inability to obtain data sets.

Pending Publication Date: 2020-09-29
CHINA PING AN LIFE INSURANCE CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the process of training the model, the data needs to be marked. At present, after the log data is obtained, the computer is used to extract the content and review the data. However, when the log

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data set acquisition method and device based on artificial intelligence, equipment and medium
  • Data set acquisition method and device based on artificial intelligence, equipment and medium
  • Data set acquisition method and device based on artificial intelligence, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0050] The artificial intelligence-based data set acquisition method provided by this application can be applied to such as figure 1 shown in the application environment. Wherein, the terminal 102 communicates with the server 104 through the network. The server 104 obtains the initial sample set; uses the initial language model to mark the initial sample set to obtain a model labeling reference index; filters the initial sample set according to the model labeling reference index to obtain a correction set; uses the correction set to continue training the initial language ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a data set acquisition method and device based on artificial intelligence, equipment and a medium. The method comprises the steps of obtaining an initial sample set; labelingthe initial sample set by using an initial language model to obtain a model labeling reference index; filtering the initial sample set according to the model annotation reference index to obtain a corrected set; training the initial language model by using the corrected set to obtain a corrected initial language model; when the precision of the corrected initial language model does not reach a preset threshold, expanding the data volume of the corrected set to update the corrected set, continuing to train the initial language model by using the corrected set to obtain a corrected initial language model, and when the precision of the initial language model reaches the preset threshold, obtaining a target language model; and processing to-be-processed service data according to the target language model to obtain a data set. By adopting the method, the data set acquisition efficiency can be improved. In addition, the invention also relates to a block chain technology, and the initial sample set, the corrected set and the data set can be stored in a block chain.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, in particular to an artificial intelligence-based data set acquisition method, device, computer equipment and storage medium. Background technique [0002] In the development process of artificial intelligence, the industry generally adopts a data-driven approach, so data quality is the top priority. Data with a large quantity, good quality, and complete coverage can help developers develop models with better effects faster, thereby improving customer satisfaction. [0003] In the process of training the model, the data needs to be marked. At present, after the log data is obtained, the computer is used to extract the content and review the data. However, when the log data is processed, the machine cannot know the correct information. Determine the correct data from a large amount of log data, so that the failure of data labeling makes it impossible to obtain the correc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06F40/30G06N20/00G06Q40/08
CPCG06F40/30G06N20/00G06Q40/08G06F18/22G06F18/214
Inventor 陆林炳刘志慧金培根何斐斐林加新李炫
Owner CHINA PING AN LIFE INSURANCE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products