Disclosed herein are a system and a method for record linkage that use machine learning to link records, such that many users can contribute training data to a shared repository and use the accumulated training data without any user having to share their actual data. The system includes a record linkage server, which in turn includes a record linkage repository, a domain classifier, and a domain classification trainer. The record linkage server is connected to a record linkage client, which includes a field comparator and a manual label prompter. Further disclosed is a method for record linkage by which two structured data sets can be matched. The method includes searching domains, loading the data sets, loading a domain, matching fields, and iterating record linking over all record pairs. Each iteration includes selecting a record pair, calculating a comparison vector, calculating label probabilities, determining a label, optionally setting the label manually, updating prior probabilities, optionally confirming the selected label, and updating the training data.
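
The per-pair iteration named in the abstract can be illustrated with a minimal sketch. The abstract does not name a classifier or a comparison function, so the following assumes a naive Bayes model over binary agreement flags and a case-insensitive exact-match field comparison. All names (`LinkageModel`, `comparison_vector`, `link`, `ask_user`, `threshold`) are hypothetical, and domain search, domain loading, and the server side are omitted.

```python
from collections import Counter
from itertools import product

LABELS = ("match", "non-match")


def comparison_vector(rec_a, rec_b, field_pairs):
    """Return one 0/1 agreement flag per matched field pair.

    Assumption: fields agree when equal after trimming and lowercasing.
    """
    return tuple(
        int(str(rec_a[fa]).strip().lower() == str(rec_b[fb]).strip().lower())
        for fa, fb in field_pairs
    )


class LinkageModel:
    """Hypothetical naive Bayes model over binary comparison vectors."""

    def __init__(self, n_fields):
        self.label_counts = Counter()  # training vectors seen per label
        self.agree_counts = {lab: [0] * n_fields for lab in LABELS}

    def label_probabilities(self, vector):
        """Return P(label | vector) using Laplace-smoothed counts."""
        total = sum(self.label_counts.values())
        scores = {}
        for lab in LABELS:
            n = self.label_counts[lab]
            p = (n + 1) / (total + len(LABELS))  # smoothed prior
            for i, agrees in enumerate(vector):
                p_agree = (self.agree_counts[lab][i] + 1) / (n + 2)
                p *= p_agree if agrees else 1 - p_agree
            scores[lab] = p
        norm = sum(scores.values())
        return {lab: s / norm for lab, s in scores.items()}

    def update(self, vector, label):
        """Add one labeled comparison vector to the training counts."""
        self.label_counts[label] += 1
        for i, agrees in enumerate(vector):
            self.agree_counts[label][i] += agrees


def link(data_a, data_b, field_pairs, model, ask_user=None, threshold=0.9):
    """Label every record pair, prompting the user when the model is unsure."""
    results = []
    for rec_a, rec_b in product(data_a, data_b):       # select record pair
        vec = comparison_vector(rec_a, rec_b, field_pairs)
        probs = model.label_probabilities(vec)
        label = max(probs, key=probs.get)               # determine label
        confirmed = probs[label] >= threshold
        if not confirmed and ask_user is not None:
            label = ask_user(rec_a, rec_b, probs)       # manual label
            confirmed = True
        if confirmed:
            model.update(vec, label)                    # update training data
        results.append((rec_a, rec_b, label))
    return results
```

In this sketch the model stores only comparison vectors and labels, never field values, which is one way the accumulated training data could be shared without exposing the underlying records. Whether the disclosed repository stores training data in this form is an assumption.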