Federal learning-based data matching method and device

A matching method and federated technology, applied in integrated learning and other directions, can solve problems such as poor data matching effect, and achieve the effect of improving comprehensiveness, enriching feature dimensions, and improving model effect.

Pending Publication Date: 2022-05-13
UNIV OF SCI & TECH OF CHINA +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention provides a data matching method and device based on federated learning, which is used to solve the defect of poor data matching effect in the process of vertical federated learning in the prior art and realize high-quality data matching

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Federal learning-based data matching method and device
  • Federal learning-based data matching method and device
  • Federal learning-based data matching method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the present invention. Obviously, the described embodiments are part of the embodiments of the present invention , but not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0039] During the research and development process, the inventor found that in related technologies, when performing vertical federation multi-party data matching, the following methods are mainly used:

[0040] (1) Partial search, this method refers to the originator of federation data, which selects data providing objects among organizations with existing business contacts. The size of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a data matching method and device based on federated learning, and the method comprises the steps: respectively calculating the similarity between a first data label corresponding to a first data set and a plurality of second data labels corresponding to a plurality of second data sets, and generating a plurality of label matching degrees; respectively calculating the similarity between a first data feature set corresponding to the first data set and a plurality of second data feature sets corresponding to the plurality of second data sets, and generating a plurality of data content matching degrees; and based on the label matching degree and the data content matching degree between the first data set and the same second data set, determining a target data set from the plurality of second data sets as federal matching data of the first data set. According to the data matching method based on federated learning, the comprehensiveness, precision and accuracy of a longitudinal federated matching result are remarkably improved, and the subsequent model effect can be improved.

Description

technical field [0001] The present invention relates to the technical field of federated learning, in particular to a data matching method and device based on federated learning. Background technique [0002] When conducting vertical federated learning training, the initiator and data provider must first perform data alignment, and then complete subsequent model training with the participation of the coordinator. Before that, the federated data must be matched. Related matching technologies mainly include matching strategies such as local search, three-party recommendation, and fixed participants. However, the data matched by these methods has problems such as limited data volume, poor data quality, and serious data homogeneity. Contents of the invention [0003] The present invention provides a data matching method and device based on federated learning, which is used to solve the defect of poor data matching effect in the process of longitudinal federated learning in the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/20
CPCG06N20/20
Inventor 徐生束柬
Owner UNIV OF SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products