Entity matching method and computer program based on non-principal attribute outlier detection
A technology of outlier detection and non-primary attributes, applied in the Internet field, can solve problems such as weak supervision and large workload, and achieve the effect of improving accuracy and recall.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0049] In order to further understand the invention content, characteristics and effects of the present invention, the following examples are given, and detailed descriptions are as follows in conjunction with the accompanying drawings:
[0050] see figure 1 , an entity matching method based on non-main attribute outlier detection, including the following steps:
[0051] Step 1: Data preprocessing, that is, processing the original data entity and generating the input data set of EM. According to the difference between input data and output data, data preprocessing mainly includes two parts:
[0052] Data extraction: According to the goal of the experiment, find out the common non-primary attributes of different source data, use incremental extraction, and save the extracted data to another table. And use regular expressions or natural language processing technology to remove obviously wrong or meaningless field information.
[0053] Data archiving and cleaning: Use archivin...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


