Algorithm for quickly matching mass data
A technique for matching massive data sets, applied in the field of data matching, that addresses the problem of inaccurate matching results and improves matching accuracy.
Embodiment
[0024] Referring to Figure 1, an algorithm for fast matching of massive data includes the following steps:
[0025] S1. HubbleDotNet integrates full-text search with a relational database, and full-text and relational queries are run against the data in the database through SQL statements;
[0026] S2. Based on the TF-IDF algorithm, the position function fp(t,d,q) is added:
[0027] S3. After accurate data has been obtained through HubbleDotNet, the system applies the edit distance algorithm, combined with its own specific recursive algorithm, to perform the matching operations on the data.
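The edit distance step in S3 can be illustrated with a minimal Python sketch. It assumes the standard Levenshtein definition (insert, delete, and substitute each cost 1). It does not reproduce the system's "own specific recursive algorithm", which the text does not describe. The names edit_distance and best_match are hypothetical, chosen only for this illustration.

```python
from __future__ import annotations


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between a and b, keeping only two rows in memory."""
    # Keep the shorter string in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep on a match
            ))
        previous = current
    return previous[-1]


def best_match(query: str, candidates: list[str]) -> tuple[str, int]:
    """Return the candidate closest to query together with its distance."""
    scored = ((c, edit_distance(query, c)) for c in candidates)
    return min(scored, key=lambda pair: pair[1])
```

In a pipeline like the one described, the candidates would be the rows returned by the full-text query in S1, so the quadratic-time distance computation runs only on a small, pre-filtered set rather than on the whole table. best_match raises ValueError when the candidate list is empty.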
[0028] The HubbleDotNet component itself builds the inverted index over the full-text data and stores that index in the directory specified for the table; storage of the data itself is handled by the relational database associated with Hubble.net.
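For readers unfamiliar with the term, an inverted index maps each term to the documents (and positions) in which it occurs. The following Python sketch shows the general idea only. It is not HubbleDotNet's implementation or on-disk format, and the whitespace tokenizer and the function names build_inverted_index and lookup are assumptions made for this illustration.

```python
from __future__ import annotations

from collections import defaultdict


def build_inverted_index(docs: dict[int, str]) -> dict[str, dict[int, list[int]]]:
    """Map each term to {doc_id: [positions]} using a naive whitespace tokenizer."""
    index: dict[str, dict[int, list[int]]] = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for position, token in enumerate(text.lower().split()):
            index[token][doc_id].append(position)
    # Convert the nested defaultdicts to plain dicts so lookups never create entries.
    return {term: dict(postings) for term, postings in index.items()}


def lookup(index: dict[str, dict[int, list[int]]], term: str) -> list[int]:
    """Return the sorted ids of the documents that contain term."""
    return sorted(index.get(term.lower(), {}))
```

Storing term positions alongside document ids is what makes a position-sensitive score such as fp(t,d,q) possible, since the scorer can see where in the document each query term occurs.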
[0029] The basic scoring algorithm formula of HubbleDotNet is as follows:
[0030]
[0031] FieldRank is the field weight;
[0032] Rank(t,q) is the weight o...


