Data extraction method, device, server and storage medium for distributed system
A distributed system and data extraction technology, applied in the field of data extraction of distributed systems, can solve the problems of long processing time, inability to complete a large amount of incremental data extraction, and inability to complete full data extraction, etc., to achieve the effect of reducing time consumption
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0031] figure 1 It is a flow chart of a method for extracting data in a distributed system provided by Embodiment 1 of the present invention. This embodiment is applicable to the case of merging new data and historical data.
[0032] Specifically include the following steps:
[0033] S101. Extract first data and a first data relationship from newly added data, where the first data relationship is a first association relationship between the first data.
[0034] The association relationship in this embodiment is the interdependence and influence relationship of data. Exemplarily, for example, there are three data tables in an existing school: student (student number, name), course (course name, course number), course selection (student number , course number, grade), the "student number" and "course number" in the course selection table must correspond to the student's student number, name, and course name and number in the course. When the student's name is deleted or the cou...
Embodiment 2
[0045] Such as figure 2 As shown, this embodiment provides a data extraction method for a distributed system. On the basis of the above embodiments, specific steps for matching new data and historical data are added, as follows:
[0046] S201. Extract first data and a first data relationship from newly added data, where the first data relationship is a first association relationship between the first data;
[0047]S202. Acquire historical data, where the historical data includes second data and a second data relationship, where the second data relationship is a second association relationship between the second data;
[0048] S2031. Compare the first data with the second data in sequence, and determine whether each of the first data and the second data is repeated;
[0049] S2032. If repeated, delete the first data, and save the second data as the third data;
[0050] S2033. If not repeated, merge the first data and the second data into the third data;
[0051] The third d...
Embodiment 3
[0061] Such as image 3 As shown, this embodiment provides a data extraction method for a distributed system, matching the first data with the second data in the above embodiment, and matching the first data relationship with the second data relationship to generate a matching result It has been refined and realized by drawing a relationship diagram. The specific steps are as follows:
[0062] S301. Extract first data and a first data relationship from newly added data, where the first data relationship is a first association relationship between the first data.
[0063] S302. Acquire historical data, where the historical data includes second data and a second data relationship, where the second data relationship is a second association relationship between the second data.
[0064] S3031. Use the first data as a first node, and use the first data relationship as a first connection line.
[0065] S3032. Compose the first node and the first connecting line into a first relati...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



