Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

A method and apparatus for data selection

A technology of data selection and data generation, applied in the field of big data, it can solve the problem that it is difficult for individual users to resist the risk of personal privacy being fully exposed, and achieve the effect of protecting privacy.

Active Publication Date: 2019-02-01
ADVANCED NEW TECH CO LTD
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The overall change brought about by big data makes it difficult for individual users to fight against the risk of personal privacy being fully exposed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and apparatus for data selection
  • A method and apparatus for data selection
  • A method and apparatus for data selection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045]In order to enable those skilled in the art to better understand the technical solutions in one or more embodiments of this specification, the following will describe the technical solutions in one or more embodiments of this specification in conjunction with the drawings in one or more embodiments of this specification The technical solutions are clearly and completely described, and obviously, the described embodiments are only some of the embodiments, not all of the embodiments. Based on one or more embodiments in this specification, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the scope of protection of this application.

[0046] In actual business, you may encounter such a scenario: data party A has its own data, and wants to evaluate whether it can improve its own model effect with the help of data party B's data. For example, assume that data party A uses its own data to train a machine learning ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present specification provide a data selection method and apparatus, wherein the method may include training a machine learning model based on input variables and tags in a trainingsample; The training sample also includes non-modular variables; Input the input variables of the test sample into the machine learning model to obtain the predicted value; The test sample also includes a label; According to the label and the predicted value of the test sample, the corresponding residual error of the test sample is obtained. Sending the residuals to at least two second data parties, respectively, so that each second data party regresses the residuals using the possessed second data, respectively, and obtains a regression evaluation index; The regression evaluation indicatorsreturned by the at least two second data parties are received to select a portion of the second data party by comparing the regression evaluation indicators of the at least two second data parties.

Description

technical field [0001] The present disclosure relates to the technical field of big data, in particular to a data selection method and device. Background technique [0002] With the rapid development of Internet technology, the whole society is forced into the era of "big data". Whether people want it or not, our personal data is being collected and used by companies and individuals inadvertently and passively. The networking and transparency of personal data has become an irresistible trend. At the same time, user data is also a dangerous "Pandora's box". Once the data is leaked, the user's privacy will be violated. In recent years, there have been many user privacy leakage incidents, and the protection of citizens' personal privacy data has encountered severe challenges. The overall changes brought about by big data make it difficult for individual users to fight against the risk of personal privacy being fully exposed. In the face of frequent privacy leaks, the issue ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/2458G06F21/62
CPCG06F21/6245
Inventor 方文静王力周俊
Owner ADVANCED NEW TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products