A multi-party joint data collection method based on a blockchain
By employing a blockchain-based multi-party collaborative data aggregation method and utilizing asymmetric encryption and role-based access control models, the problem of data leakage across departments, regions, and levels is solved, thereby improving data security and confidentiality and making it suitable for cross-domain data sharing.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- NANJING UNIV OF POSTS & TELECOMM
- Filing Date
- 2023-02-27
- Publication Date
- 2026-06-23
AI Technical Summary
In the aggregation of datasets across departments, regions, and levels, there is a problem of dataset leakage among different stakeholders, resulting in losses for data owners, and data users do not care about the security and confidentiality of the data itself.
A blockchain-based multi-party collaborative data aggregation method is adopted. By setting the operation permissions of alliance member nodes on the blockchain node and using asymmetric encryption technology to protect metadata, combined with the traditional role-based access control model, data security and confidentiality are ensured.
It achieves information security and confidentiality of shared information during the multi-party data collection process, breaks the status quo of "information silos", improves data security, and is suitable for data sharing across departments, regions, and levels.
Smart Images

Figure CN116366294B_ABST
Abstract
Description
Technical Field
[0001] This invention belongs to the field of big data fusion technology, specifically relating to a multi-party joint data collection method based on blockchain. Background Technology
[0002] In the process of advancing the modernization of the national governance system and governance capabilities, a large number of innovative applications across departments, regions, and levels have emerged. At the same time, problems such as "information silos" and "data chimneys" among government data have become increasingly apparent. To ensure the rational allocation of resources, save social costs, and create more wealth through the flow of data, the application of multi-party collaborative data collection methods has become increasingly important. However, in actual multi-party data collection, datasets often belong to different stakeholders, and the leakage of datasets will cause losses to data owners. Furthermore, data users often only care about the "results" derived from data collection, and not the data itself. To address issues such as information security and the confidentiality of shared information, this invention proposes a blockchain-based multi-party collaborative data collection method.
[0003] Blockchain, a novel decentralized protocol, securely stores data that is tamper-proof. At the application level, it can be understood as a distributed database system that uses a blockchain-based data structure to verify and store data, maintaining a continuously growing chain. Cryptographic methods are employed to ensure that data on the chain cannot be tampered with, guaranteeing the security of data transmission and access. Furthermore, blockchain can utilize smart contracts composed of automated script code to program and manipulate data. Therefore, blockchain-based multi-party collaborative data aggregation methods are of significant importance. Summary of the Invention
[0004] Technical problem to be solved: When processing datasets across departments, regions, and levels, the leakage of datasets from different stakeholders can cause losses to data owners. This invention provides a blockchain-based multi-party collaborative data aggregation method, which can effectively solve the problems of data security for information providers and confidentiality of shared information during the multi-party data aggregation process.
[0005] This invention leverages the privacy protection advantages of blockchain. Targeting the structural characteristics of the data layer in the blockchain model, it encrypts metadata information during the multi-party collaborative data aggregation process. Then, it sets operational permissions for alliance member nodes at the blockchain node level and distributes decryption private keys for the aggregated data metadata to authorized members. Alliance members with permissions and private keys can parse the metadata information, write aggregation algorithms on their nodes, package the operation information into blocks, and broadcast them across the network for consensus mechanism verification. After execution states such as Preprepare, Prepare, and Commit, a new block is generated, and the block content is parsed and stored.
[0006] Technical solution: A blockchain-based multi-party collaborative data aggregation method, comprising the following steps:
[0007] S1. Based on the basic data algorithm of the blockchain data layer, obtain the distributed node IP, and map the distributed node IP with the corresponding functional permissions in the block to form the scheduling of data in the node block.
[0008] S2. Map the IP value obtained in step S1 to the on-chain member identity ID, set the permission allocation for the node, and divide the permissions into data management, data modification, smart contract deployment and open contract interface permissions, and bind the identity of the data sender and the on-chain information.
[0009] S3. Extend nodes through consensus algorithm to generate blocks, and encrypt the metadata in the node blocks using a specified public key; construct corresponding extended fields for the function of encrypting metadata, and perform asymmetric encryption on the metadata;
[0010] S4. Distribute the metadata decryption private key according to the permission level of the generated block, and parse the metadata keywords; process the keyword data to form the required aggregated data model; determine the correspondence between the fields in the data source table and the aggregated data model, construct the aggregated table, generate the aggregated statement, query the relevant data storage address, and store it.
[0011] Preferably, in step S2, the mapping relationship between member identity ID and node IP is one-to-many, and the identity of the data sender is bound to the on-chain information through data signature and timestamp.
[0012] Preferably, the metadata in step S3 includes identity attributes and general attributes. The identity attributes include a type field and a configuration field. The type field is used to represent the account name. The configuration field is used to represent the IP address and database address. The general attributes include a name field, an identifier field, and a collection mode field. The name field is used to represent the naming of the called function. The identifier field is used to set a unique identifier for special data to facilitate computer recognition. The collection mode field is used to find the configuration file of the collection method and determine the collection method of the collected data.
[0013] Furthermore, the aggregation methods include interface function aggregation, query data storage address aggregation, and generation aggregation statement aggregation.
[0014] Preferably, the asymmetric encryption in step S3 uses either a public key or a private key for encryption and the other for decryption.
[0015] Preferably, in step S4, the encrypted information is parsed using the private key to obtain metadata keywords; the metadata includes table structure fields, contract interface function names, or database configuration file information; the relevant data storage address can be stored in a certain format, such as a Word table, stored in a database, or downloaded and saved locally.
[0016] Preferably, the blockchain system includes a data layer, a network layer, a consensus layer, an incentive layer, a contract layer, and an application layer. The data layer encapsulates the underlying data blocks and related basic data and algorithms such as data encryption. The network layer encapsulates the P2P peer-to-peer network protocol and network management rules for monitoring, testing, configuring, analyzing, and controlling network resources. The consensus layer encapsulates the consensus algorithm for network nodes. The incentive layer encapsulates the issuance and allocation mechanism for application incentives. The contract layer encapsulates smart contracts containing various script codes, algorithms, and processing logic function codes. The application layer encapsulates a programmable operating system oriented towards data aggregation.
[0017] Beneficial effects: The method of this invention helps to ensure information security and confidentiality of shared information during multi-party data collection, thereby improving data security; it facilitates data collection by blockchain members from multiple nodes while ensuring the security of privacy data, and helps to break down the "information silos" in different fields; it can enhance the utilization of blockchain information, and for subsequent cross-departmental, cross-regional, and cross-level applications, this method can serve as an important supplement to traditional data collection methods. Attached Figure Description
[0018] Figure 1 This is a flowchart of a multi-party collaborative data collection method based on blockchain according to the present invention;
[0019] Figure 2 This is a diagram illustrating the relationship between role access control, nodes, and keys in one embodiment. Detailed Implementation
[0020] The present invention will now be described in detail with reference to the accompanying drawings and specific embodiments: Example 1
[0021] like Figures 1-2 As shown, a blockchain-based multi-party collaborative data aggregation method includes the following steps:
[0022] S1. Based on the basic data algorithm of the blockchain data layer, obtain the distributed node IP, and map the distributed node IP with the corresponding functional permissions in the block to form the scheduling of data in the node block.
[0023] S2. Map the IP value obtained in step S1 to the on-chain member identity ID, set the permission allocation for the nodes, and divide the permissions into data management, data modification, smart contract deployment, and open contract interface permissions. The mapping relationship between member identity ID and node IP is one-to-many, and one ID can set different permissions for multiple nodes. After mapping multiple member identity IDs to functional permissions, the node IP and functional permissions are mapped again according to the block information of the aggregation method. The identity of the data sender is bound to the on-chain information through data signature and timestamp, which can prevent information tampering and ensure information traceability.
[0024] S3. Extend nodes through consensus algorithms to generate blocks, and encrypt metadata within node blocks using a specified public key. For the function of encrypting metadata, construct corresponding extended fields and perform asymmetric encryption on the metadata, i.e., use one of the public and private keys for encryption and the other for decryption. If the data provider's public key is leaked, the data user will not be able to obtain the data provider's data information through the public key. The block consensus process is a decentralized process. The main purpose of block consensus is to generate blocks that are recognized by most nodes according to a certain agreement in the consortium blockchain. Different agreements are different, and the basic principle is to ensure that different nodes produce the same blocks to guarantee data consistency.
[0025] The aforementioned metadata includes identity attributes and general attributes. Identity attributes include a type field and a configuration field. The type field represents the account name; the configuration field represents IP addresses and database addresses, etc. General attributes include a name field, an identifier field, and a collection mode field. The name field represents the name of the called function; the identifier field is used to set a unique identifier for special data to facilitate computer recognition; the collection mode field is used to find the configuration file for the collection method and determine the collection method of the collected data. The collection method will be selected according to business needs, and corresponding extended fields will be constructed for the decrypted information. These methods include interface function collection, query data storage address collection, and generating collection statements collection.
[0026] S4. Distribute the metadata decryption private key according to the permission level of the generated block, parse the metadata keywords, the metadata being table structure fields, contract interface function names, or database configuration file information; process the keyword data to form the required aggregated data model; determine the correspondence between the fields in the data source table and the aggregated data model, construct the aggregated table, generate the aggregated statement, query the relevant data storage address, and store it in a certain format, such as a Word table, storing it in a database, or downloading it for local saving.
[0027] The method of this invention focuses on the distributed database characteristics and privacy protection of blockchain, and combines the traditional role-based access control model with asymmetric encryption as a dual verification of role function operation permissions and access to database resources, establishing a working mechanism for information confidentiality during the multi-party data collection process.
[0028] A blockchain system comprises a data layer, a network layer, a consensus layer, an incentive layer, a contract layer, and an application layer. The data layer encapsulates the underlying data blocks and related basic data and algorithms, including data encryption. The network layer encapsulates the P2P peer-to-peer network protocol and network management rules for monitoring, testing, configuring, analyzing, and controlling network resources. The consensus layer encapsulates the consensus algorithm for network nodes. The incentive layer encapsulates the issuance and allocation mechanisms for application incentives. The contract layer encapsulates smart contracts containing various script codes, algorithms, and processing logic functionalities. The application layer encapsulates a programmable operating system oriented towards data aggregation.
[0029] Based on the underlying data area of the data layer and the consensus algorithm of the consensus layer, blockchain possesses the characteristics of a distributed node consensus mechanism. Each node has a unique IP address, and these distributed nodes connect data blocks in chronological order to form a distributed database, where multiple data owners store their data. In traditional blockchain technology, members verify through a consensus algorithm that they can write or read data from any node. This invention, based on the distributed database characteristics of blockchain, proposes to utilize a traditional role-based access control model to divide permissions among nodes, setting data collection permissions for members on each node, and setting permissions such as managing data, modifying data, deploying smart contracts, and opening contract interfaces. Simultaneously, asymmetric encryption (using public-key encryption) is applied to metadata information within blocks. Based on role permissions, decryption private keys are distributed to members according to the principle of "key, member, permission, node" correspondence, used to parse key metadata information during the collection process, thus solving the data security problem for data owners in traditional data collection processes. Data extractors use their private keys to parse metadata, obtain keywords shared by data providers, process the keyword data to form the required collection data model, determine the correspondence between fields in the data source table and the collection data model, construct a collection table, generate collection statements, query the storage address values of relevant data, and store them locally.
[0030] The above description is merely a preferred embodiment of the present invention and is not intended to limit the present invention. Any modifications, equivalent substitutions, and improvements made within the spirit and principles of the present invention should be included within the protection scope of the present invention.
Claims
1. A blockchain-based multi-party collaborative data aggregation method, characterized in that, Includes the following steps: S1. Based on the basic data algorithm of the blockchain data layer, obtain the distributed node IP, and map the distributed node IP with the corresponding functional permissions in the block to form the scheduling of data in the node block. S2. Map the IP value obtained in step S1 to the on-chain member identity ID, set the permission allocation for the node, and divide the permissions into data management, data modification, smart contract deployment and open contract interface permissions, and bind the identity of the data sender and the on-chain information. S3. Extend nodes through a consensus algorithm to generate blocks, and encrypt the metadata within the node blocks using a specified public key; construct corresponding extended fields for the function of encrypting metadata, and perform asymmetric encryption on the metadata; the metadata includes identity attributes and general attributes, the identity attributes include a type field and a configuration field, the type field is used to represent the account name; the configuration field is used to represent the IP and database address; the general attributes include a name field, an identifier field, and a collection mode field, the name field is used to represent the naming of the called function, the identifier field is used to set a unique identifier for special data to facilitate computer recognition, and the collection mode field is used to look up the configuration file of the collection method and determine the collection method of the collected data; S4. Distribute the metadata decryption private key according to the permission level of the generated block, and parse the metadata keywords; The keyword data is processed to form the required aggregated data model; Determine the correspondence between the fields in the data source table and the aggregated data model, construct the aggregated table, generate the aggregated statement, query the relevant data storage address, and store it.
2. The method for multi-party collaborative data collection based on blockchain according to claim 1, characterized in that: In step S2, the member identity ID and node IP mapping relationship is one-to-many. The identity of the data sender is bound to the on-chain information through data signature and timestamp.
3. The method for multi-party collaborative data collection based on blockchain according to claim 1, characterized in that: The aggregation methods include interface function aggregation, query data storage address aggregation, and generation aggregation statement aggregation.
4. The method for multi-party collaborative data collection based on blockchain according to claim 1, characterized in that: The asymmetric encryption described in step S3 uses either a public key or a private key for encryption and the other for decryption.
5. The blockchain-based multi-party collaborative data aggregation method according to claim 1, characterized in that: In step S4, the private key is used to parse the encrypted information to obtain metadata keywords; Metadata includes table structure fields, contract interface function names, or database configuration file information; related data storage addresses can be stored in a certain format, such as a Word table, stored in a database, or downloaded and saved locally.
6. The blockchain-based multi-party collaborative data aggregation method according to any one of claims 1-5, characterized in that: The blockchain system comprises a data layer, a network layer, a consensus layer, an incentive layer, a contract layer, and an application layer. The data layer encapsulates the underlying data blocks and related basic data encryption data and algorithms. The network layer encapsulates the P2P peer-to-peer network protocol and network management rules for monitoring, testing, configuring, analyzing, and controlling network resources. The consensus layer encapsulates the consensus algorithm for network nodes. The incentive layer encapsulates the issuance and allocation mechanisms for application incentives. The contract layer encapsulates smart contracts containing various script codes, algorithms, and processing logic function codes. The application layer encapsulates a programmable operating system oriented towards data aggregation.