Intelligent recommendation method and device for data standard
A recommendation method and data technology, applied in the field of data processing, can solve the problem of high data standard error rate, reduce the number of comparison operations, reduce the workload, and improve the comparison efficiency and accuracy.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0067] Please refer to figure 1 and image 3 , a data standard intelligent recommendation method, characterized in that it includes the steps:
[0068] S1. Obtain source table information, and perform word segmentation on the source table information to obtain at least one group of keywords;
[0069] Since the table name of the source table information may be very different from the standard name, extracting keywords from the table name of the source table information for comparison can improve the accuracy;
[0070] S2, match the data standard names related to the keywords in the database according to the keywords, and obtain a plurality of the data standard names and groups of data standard tables corresponding to the data standard names;
[0071] S3, filter the data standard name according to the preset scoring rule, and obtain the data standard table of the preset number of groups, specifically:
[0072] S31, compare the length of the data standard name and the keyword,...
Embodiment 2
[0089] The difference between this embodiment and the first embodiment is that the total similarity is calculated to obtain a more accurate similarity ranking;
[0090] After completing step S43, step S44 is also included:
[0091]S441. Obtain the number of data items in the data standard table; if the first data standard table "merchant transaction records" contains 10 data items, record the number of data items in the first data standard table as 10; The second data standard table "WeChat transaction record" contains 13 data items, then the number of data items recorded in the second data standard table is 13;
[0092] S442. Obtain the average data item similarity of each of the data standard tables according to the number of data items in the data standard table and the total similarity; If the total similarity of transaction records” is 5, the corresponding average data item similarity is 1 / 2; the total similarity of the second data standard table “WeChat transaction reco...
example 3
[0101] The difference between this embodiment and the first or second embodiment is that the data standard table is preprocessed;
[0102] Please refer to Figure 4 , step S02 is included before step S2:
[0103] S021, perform word segmentation on each of the data standard names to obtain word segmentation phrases; if the data standard table is "resident population data", the data standard names are divided into "resident", "population" and "resident" by a word segmentation tool data";
[0104] S022, filter the segmented phrases to obtain related phrases; in the three phrases "resident", "population" and "data", "data" is a popular phrase, which will appear in most data standard names , so it is removed when the phrase is filtered, and the two phrases "population" and "resident" are retained;
[0105] S023, obtain a series of phrases associated with the related phrases; according to the two phrases "population" and "resident", obtain corresponding series of phrases such as ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


