Distributed information security storage method for software platform

By employing differentiated encryption, dual key management, node trust assessment, and multi-technology integration, this technology addresses the issues of single encryption strategies, imperfect key management, and coarse-grained access control found in existing technologies. It achieves high-security distributed information storage and improves data recovery efficiency and compliance.

CN122221284APending Publication Date: 2026-06-16SHANXI ZHONGJIA DIGITAL INTELLIGENCE TECHNOLOGY CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
SHANXI ZHONGJIA DIGITAL INTELLIGENCE TECHNOLOGY CO LTD
Filing Date
2026-03-18
Publication Date
2026-06-16

AI Technical Summary

Technical Problem

Existing distributed information storage methods in software platforms suffer from problems such as single encryption strategies, imperfect key management, lack of trust assessment in node management, coarse-grained access control, and insufficient integration of multiple technologies, making it difficult to meet the data storage requirements for high security levels.

Method used

A distributed information security storage method is constructed by adopting differentiated encryption strategies, a dual-key management system, node trust value evaluation, real-time monitoring and anomaly handling, fine-grained access control, and the integration of multiple technologies, including quantum security technology, zero-knowledge proofs, and improved RS erasure codes.

🎯Benefits of technology

It achieves precise protection for data of different sensitivity levels, improves encryption strength and data recovery efficiency, reduces the risk of key leakage, ensures data integrity and traceability of access behavior, reduces storage resource waste, and adapts to the needs of software platforms of different sizes.

✦ Generated by Eureka AI based on patent content.
Patent Text Reader

Abstract

The application relates to the technical field of data storage and information security, and discloses a distributed information security storage method for a software platform, which is characterized by comprising the following steps: S1, a software platform receives user-uploaded or self-generated information to be stored, performs sensitive level grading and data type classification processing on the information to be stored, and obtains target information after grading and classification. The distributed information security storage method for the software platform adopts a differential double encryption strategy based on a sensitive level, combines AES-256-GCM, SM4, differential privacy and other technologies, provides accurate protection for data of different sensitive levels, avoids the problem of insufficient encryption strength of core sensitive data, reduces the encryption resource consumption of non-sensitive data, and effectively resists quantum computing cracking in combination with quantum random number generation technology and a quantum key distribution network, so that the encryption strength is much higher than that of the prior art.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This invention relates to the field of data storage and information security technology, specifically to a distributed information security storage method for software platforms. Background Technology

[0002] With the rapid development of the digital age, the application scope of software platforms is becoming increasingly widespread, and the amount of data they generate and store is growing exponentially. This includes a large amount of sensitive information, such as user personal information, enterprise core data, and government secrets. The security of this data is directly related to user rights, enterprise interests, and even national information security. Distributed storage technology has become the mainstream method for data storage on software platforms due to its advantages such as strong scalability, large storage capacity, high access efficiency, and good fault tolerance. By distributing data across multiple nodes, it avoids the single point of failure problem of centralized storage and improves the reliability of data storage. However, existing distributed information storage methods in software platforms still have many technical defects, making it difficult to meet the data storage requirements of high security levels. Specifically, the encryption strategies are simplistic, with most using fixed encryption algorithms to uniformly encrypt all data without differentiating protection based on the data's sensitivity level. This results in insufficient encryption strength for core sensitive data, making it vulnerable to cracking, while excessive encryption of non-sensitive data leads to resource waste. The key management system is imperfect, key generation often uses pseudo-random numbers, which are easily cracked by quantum computing. There is a risk of leakage during the key distribution process, and the key update mechanism is inflexible and difficult to deal with new malicious attacks. There are vulnerabilities in node management. The selection of storage nodes often only considers load balancing and does not fully assess the trustworthiness and security of the nodes. This can easily lead to data being allocated to untrusted nodes. At the same time, the response to node anomaly detection is slow and the data recovery efficiency is low. Access control is coarse-grained, making it difficult to achieve fine-grained authorization based on user attributes, and access behavior lacks an effective audit and tracing mechanism, making it difficult to trace the source after data leakage; The data sharding and redundancy strategy is unreasonable. Too high a redundancy rate leads to waste of storage resources, while too low a redundancy rate cannot guarantee data integrity when a node fails. At the same time, the sharding allocation does not take into account the node topology location, which poses a risk of centralized data leakage. Furthermore, while some existing distributed storage methods incorporate encryption and redundancy technologies, they fail to achieve deep integration of multiple technologies. For instance, they do not effectively combine quantum security technology, attribute proxy re-encryption technology, and zero-knowledge proof technology, making it difficult to address new security threats such as quantum computing and collusion attacks. This results in a lack of creativity. For example, the invention patent application with publication number CN117111854A, although it achieves distributed encrypted storage, does not involve quantum security protection and fine-grained access control, and its key management and node trust evaluation mechanisms are incomplete. The invention patent with publication number CN117318942B, although it incorporates quantum security technology, does not achieve differentiated encryption and traceable auditing of access behavior, and the data recovery efficiency and storage resource utilization need to be improved. Therefore, developing a distributed information security storage method for software platforms that possesses high encryption strength, fine-grained access control, efficient node management, and reliable data recovery capabilities, while integrating multiple security technologies and having sufficient creativity, has become an urgent technical problem to be solved. Summary of the Invention

[0003] (a) Technical problems to be solved To address the shortcomings of existing technologies, this invention provides a distributed information security storage method for software platforms, thus resolving the problems mentioned in the background. (II) Technical Solution To achieve the above objectives, the present invention provides the following technical solution: a distributed information security storage method for a software platform, comprising the following steps: S1. The software platform receives information to be stored uploaded by users or generated by itself, performs sensitivity level classification and data type classification on the information to be stored, and obtains target information after classification. S2. Based on the sensitivity level of the target information, a differentiated encryption strategy is used to perform double encryption on the target information to obtain encrypted target information; S3. Adaptive fragmentation processing is performed on the encrypted target information. Based on the importance and size of the fragments, an improved RS erasure code is used to generate redundant codes. The target data redundancy matrix is ​​constructed by combining the encrypted target information fragments and the redundant codes. S4. Obtain real-time operating status data of all storage nodes in the distributed storage cluster, calculate the trust value and load value of each node, and filter out candidate storage nodes that meet the preset security conditions. S5. Based on the topological location of the candidate storage nodes, calculate the secure storage distance corresponding to each encrypted target information fragment, and allocate the encrypted target information fragments and corresponding redundant codes to different candidate storage nodes for storage according to the secure storage distance and node load balancing principle. S6. Establish a dual-key management system, combine quantum random number generation technology to generate encryption and decryption keys, complete the secure distribution of keys through a quantum key distribution network, and regularly perform key update and discard operations; S7. Perform real-time status monitoring on all storage nodes, identify node failures, data tampering or malicious attacks, trigger an anomaly warning when an anomaly is detected, perform rapid data recovery based on redundant coding, and isolate the abnormal node. S8. Introduce a zero-knowledge proof mechanism to audit and track the information access behavior of software platform users throughout the entire process, generate tamper-proof access audit logs, and realize the traceability of access behavior. S9. Combining attribute proxy re-encryption with ABAC and RBAC dual access control models, a re-encryption key is generated based on user attributes and access policies to achieve fine-grained access control and dynamic authorization of encrypted data. S10. When a user of the software platform initiates an information reading request, the system verifies the user's identity and access permissions, obtains the corresponding decryption key through the key management system, retrieves the corresponding encrypted target information fragments and redundant codes from each storage node, completes the fragment reconstruction and decryption process, and returns the original information to be stored to the user. Preferably, the sensitivity level classification in step S1 includes core sensitivity level, general sensitivity level and non-sensitive level, and the data type classification includes text information, binary files, structured data and unstructured data; The hierarchical classification process involves: extracting the content features and attribute information of the information to be stored; automatically determining the sensitivity level and data type of the information to be stored based on a preset sensitive word library and data type recognition algorithm; and adding hierarchical classification tags to each target information. Preferably, the differentiated encryption strategy in step S2 is as follows: core sensitive target information is encrypted using a dual encryption method combining the AES-256-GCM encryption algorithm and differential privacy technology; Generally, sensitive target information is encrypted once using the SM4 encryption algorithm. Non-sensitive target information is encrypted using a lightweight hash encryption algorithm, while differential privacy technology adds adaptive noise to achieve anonymization protection of sensitive information, meeting EU GDPR compliance requirements. Preferably, the adaptive fragmentation process in step S3 specifically involves: automatically adjusting the fragmentation size according to the data type and size of the target information; using fixed-size fragmentation for text information and structured data, and using dynamic-size fragmentation for binary files and unstructured data. The improved RS erasure coding reduces data recovery time to less than 15 minutes and redundancy rate to below 5% by optimizing the polynomial coefficient generation method. Preferably, the parameters for calculating the node trust value in step S4 include the node's historical operational stability, data transmission security, fault recovery efficiency, and data integrity verification results, which are calculated using a weighted summation algorithm. The load value calculation parameters include node CPU utilization, memory utilization, disk storage space utilization, and network bandwidth utilization. Nodes with a trust value lower than a preset threshold or a load value higher than a preset threshold are not included in the candidate storage node range. Preferably, the secure storage distance in step S5 is calculated by a dynamic distance algorithm, which combines the shortest hop count of a node with the node's storage capacity. The calculation formula is: d=mind(B, B), u≠v, i≠k, where d(B, B) is the distance parameter between nodes B and B. Different fragments of the same encrypted target information and their corresponding redundant codes are allocated to different storage nodes with a secure storage distance greater than a preset distance threshold to avoid data loss due to single point of failure. Preferably, the dual-key management system in step S6 includes a key generation module, a key distribution module, a key update module, and a key discard module; Quantum random number generation technology uses a quantum random number generator to generate truly random numbers as key seeds, and quantum key distribution networks use end-to-end encrypted transmission to prevent key transmission from being stolen. The key update cycle is dynamically adjusted according to the sensitivity level of the target information, and the key update cycle for core sensitive information shall not exceed 72 hours. Preferably, the real-time status monitoring in step S7 adopts a distributed node monitoring protocol, which collects node operation data at preset time intervals and verifies the consistency of stored data through a data integrity verification algorithm. When a node failure is detected, the data is reconstructed using the Lagrange interpolation algorithm based on the redundancy coding in the redundancy matrix, and the reconstructed data is allocated to a new candidate storage node. When a malicious attack is detected, the connection between the abnormal node and the distributed storage cluster is immediately cut off, and the intrusion prevention mechanism is activated. Preferably, the fine-grained access control in step S9 specifically involves: the attribute management center generating a user attribute set and access policy; the authorization node generating a re-encryption key based on the user attributes and access policy; and the proxy server, after receiving the access policy update request, using the re-encryption key to re-encrypt the stored ciphertext fragments, ensuring that only users who meet the attribute requirements can decrypt and access the data, thereby achieving dynamic adjustment and rapid revocation of user permissions. Preferably, the user authentication in step S10 adopts a multi-factor authentication method, including account password authentication, quantum-secure UKey authentication and biometric authentication; Access permission verification compares user attributes with access policies. Once verification is successful, the key management system uses a segmented key distribution method to distribute the decryption key to the user, thus preventing the key from being leaked at once. Shard reconstruction integrates the sharded data and redundant coding from each storage node, completes data integrity verification, performs decryption, and returns the original information. (III) Beneficial Effects Compared with existing technologies, this invention provides a distributed information security storage method for software platforms, which has the following beneficial effects: 1. This distributed information security storage method for software platforms adopts a differentiated dual encryption strategy based on sensitivity levels, combined with technologies such as AES-256-GCM, SM4, and differential privacy, to provide precise protection for data of different sensitivity levels. It avoids the problem of insufficient encryption strength for core sensitive data, reduces the encryption resource consumption for non-sensitive data, and effectively resists quantum computing cracking by combining quantum random number generation technology and quantum key distribution network. The encryption strength far exceeds that of existing technologies. 2. This distributed information security storage method for software platforms establishes a dual-key management system to achieve secure key generation, distribution, updating, and disposal. The application of quantum random numbers ensures the unpredictability of keys, and the segmented key distribution and periodic update mechanism further reduces the risk of key leakage, thus solving the defects of imperfect key management in existing technologies. 3. This distributed information security storage method for software platforms introduces a node trust value evaluation mechanism and combines it with load value to screen candidate storage nodes, ensuring the security and reliability of storage nodes; it allocates sharded data based on secure storage distance to avoid data loss caused by failure in the same area; and it has a real-time node monitoring and fast data recovery mechanism, which shortens the data recovery time to less than 15 minutes, reduces the single point of failure rate of nodes to less than 0.05%, and effectively guarantees data integrity. 4. This distributed information security storage method for software platforms combines attribute proxy re-encryption with ABAC and RBAC dual access control models to achieve fine-grained access control and dynamic authorization of encrypted data. The authorization is flexible and low-cost. By introducing a zero-knowledge proof mechanism and blockchain audit logs, the access behavior can be traced and tamper-proof throughout the entire process. The source can be quickly traced after data leakage. At the same time, it meets compliance audit requirements and reduces compliance costs by more than 65%. 5. This distributed information security storage method for software platforms, through the application of adaptive sharding and improved RS erasure coding, reduces the redundancy rate to below 5% while ensuring data integrity, thereby reducing storage resource waste; the distributed storage architecture supports the dynamic addition and removal of storage nodes, and can be flexibly expanded according to the data volume growth needs of the software platform, adapting to software platform applications of different scales. 6. This invention provides a distributed information security storage method for software platforms. It deeply integrates multiple technologies such as differentiated encryption, quantum security technology, attribute proxy re-encryption, zero-knowledge proof, improved RS erasure coding, and node trust assessment. It breaks through the limitations of single-technology applications in existing technologies, solves multiple technical defects in existing distributed storage methods, and forms a complete, efficient, and secure distributed information security storage solution. It has significant creativity and practicality and can be widely used in various software platforms with high security requirements, such as government affairs, finance, and healthcare. Detailed Implementation The technical solutions of the present invention will be clearly and completely described below with reference to the embodiments of the present invention. Obviously, the described embodiments are only some embodiments of the present invention, and not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative effort are within the scope of protection of the present invention. A distributed information security storage method for software platforms includes the following steps: S1. The software platform receives information to be stored uploaded by users or generated by itself, performs sensitivity level classification and data type classification on the information to be stored, and obtains target information after classification. Specifically, the sensitivity level classification includes core sensitive level (such as government secrets, financial transaction data, and medical privacy data), general sensitive level (such as user mobile phone numbers and email addresses) and non-sensitive level (such as public announcements and ordinary documents). The data type classification includes text information, binary files (such as images and videos), structured data (such as database tables) and unstructured data (such as log files). When processing information by hierarchy and classification, the content features and attribute information of the information to be stored are first extracted. Based on the preset sensitive word library and data type recognition algorithm, the sensitivity level and data type of the information to be stored are automatically determined. Hierarchical classification tags are added to each target information to facilitate the adoption of differentiated processing strategies in the future and avoid resource waste and security vulnerabilities. S2. Based on the sensitivity level of the target information, a differentiated encryption strategy is used to perform double encryption on the target information to obtain encrypted target information; The design of differentiated encryption strategies is one of the innovations of this invention. Different encryption methods are used for target information with different sensitivity levels, taking into account both security and resource utilization: core sensitive target information adopts a dual encryption method combining AES-256-GCM encryption algorithm and differential privacy technology. The AES-256-GCM encryption algorithm has high-strength encryption capabilities and can effectively resist traditional cracking attacks. Differential privacy technology adds adaptive noise to achieve anonymization protection of sensitive information, avoids malicious speculation after data leakage, and meets the compliance requirements of EU GDPR and my country's Personal Information Protection Law. Generally, sensitive target information is encrypted once using the SM4 encryption algorithm. The SM4 algorithm is a block cipher algorithm independently developed in my country, which has high security and compatibility and is suitable for the protection of general sensitive data. Non-sensitive target information is encrypted using a lightweight hash encryption algorithm (such as SHA-256), which reduces the resource consumption of encryption and decryption and improves processing efficiency while ensuring basic security. S3. Adaptive fragmentation processing is performed on the encrypted target information. Based on the importance and size of the fragments, an improved RS erasure code is used to generate redundant codes. The target data redundancy matrix is ​​constructed by combining the encrypted target information fragments and the redundant codes. Adaptive sharding processing automatically adjusts the shard size based on the data type and size of the target information, avoiding resource waste or data leakage risks caused by fixed shard sizes: text information and structured data use fixed-size shards (such as 1MB / shard) for easy management and reconstruction; Binary files and unstructured data are dynamically sized for fragmentation, which automatically allocates fragment size (e.g., 10MB-100MB / fragment) based on file size, thus improving fragmentation efficiency. The improved RS erasure code is an optimization of the traditional RS erasure code. By optimizing the polynomial coefficient generation method, it solves the shortcomings of the traditional RS erasure code, such as high computational overhead and slow data recovery. It shortens the data recovery time to less than 15 minutes and reduces the redundancy rate to less than 5%. At the same time, it adjusts the number of redundant codes according to the importance of the fragment, with core sensitive fragments corresponding to more redundant codes, further ensuring data integrity. The construction of the target data redundancy matrix involves splicing encrypted target information fragments and corresponding redundancy codes according to preset rules, which facilitates subsequent encryption and storage allocation and improves data processing efficiency. S4. Obtain real-time operating status data of all storage nodes in the distributed storage cluster, calculate the trust value and load value of each node, and filter out candidate storage nodes that meet the preset security conditions. The calculation of node trust value is one of the innovations of this invention. It breaks through the limitation of existing technologies that only consider load balancing, and comprehensively evaluates the security and reliability of nodes. The calculation parameters include the node's historical operating stability (such as the number of failures and the duration of operation), data transmission security (such as the strength of transmission encryption and whether data leakage has occurred), fault recovery efficiency (such as the historical fault recovery time) and data integrity verification results (such as the integrity verification pass rate of sharded storage). The node trust value is calculated using a weighted summation algorithm, and the weight coefficients are dynamically adjusted according to the security requirements of the software platform. The parameters for calculating node load include node CPU utilization, memory usage, disk storage space utilization, and network bandwidth utilization. After normalization, the load value is obtained. The selection criteria are as follows: the node trust value is higher than the preset threshold (e.g., 0.8) and the load value is lower than the preset threshold (e.g., 70%). At the same time, nodes with historical malicious attack records and severe hardware aging are excluded to ensure the security and stability of candidate storage nodes. S5. Based on the topological location of the candidate storage nodes, calculate the secure storage distance corresponding to each encrypted target information fragment, and allocate the encrypted target information fragments and corresponding redundant codes to different candidate storage nodes for storage according to the secure storage distance and node load balancing principle. The secure storage distance is calculated using a dynamic distance algorithm, which combines the shortest hop count of a node with the node's storage capacity. The calculation formula is: d=mind(B,B),u≠v,i≠k, where d(B,B) is the distance parameter between nodes B and B. The larger the secure storage distance, the farther the physical or logical distance between the two nodes, which can effectively prevent the loss of all data due to a failure in the same area. During storage allocation, different fragments of the same encrypted target information and their corresponding redundant codes are allocated to different storage nodes with a secure storage distance greater than a preset distance threshold (such as 3 hops). At the same time, node load balancing is taken into account to avoid access delays caused by excessive load on some nodes, thus ensuring the security and access efficiency of data storage. S6. Establish a dual-key management system, combine quantum random number generation technology to generate encryption and decryption keys, complete the secure distribution of keys through a quantum key distribution network, and regularly perform key update and discard operations; The dual-key management system includes a key generation module, a key distribution module, a key update module, and a key discard module, which solves the defects of existing key management technology, such as imperfection and vulnerability to cracking: The key generation module uses a quantum random number generator to generate true random numbers as key seeds. Compared with traditional pseudo-random numbers, quantum random numbers are unpredictable and uncopyable, which can effectively resist quantum computing cracking and improve key security. The key distribution module uses a quantum key distribution network (QKD) to complete the end-to-end encrypted transmission of the key, preventing the key from being stolen or tampered with during transmission. The key update module dynamically adjusts the update cycle according to the sensitivity level of the target information. The key update cycle for core sensitive information shall not exceed 72 hours, while the key update cycle for general sensitive and non-sensitive information may be appropriately extended. At the same time, when a potential key leak is detected, an emergency update is immediately triggered. The key disposal module securely destroys expired, leaked, or discarded keys to ensure their uniqueness and security. S7. Perform real-time status monitoring on all storage nodes, identify node failures, data tampering or malicious attacks, trigger an anomaly warning when an anomaly is detected, perform rapid data recovery based on redundant coding, and isolate the abnormal node. Real-time status monitoring adopts a distributed node monitoring protocol, which collects node operation data (such as CPU, memory, and disk status), data storage status (such as fragment integrity and data checksum) and network transmission status (such as transmission rate and connection stability) at preset time intervals (such as 10 seconds). The consistency of stored data is verified by data integrity verification algorithms (such as hash check) to identify data tampering behavior. Malicious attack behaviors (such as DDoS attacks and data theft attacks) can be identified through intrusion detection algorithms (such as abnormal behavior pattern recognition). When a node failure is detected, the data is quickly reconstructed using the Lagrange interpolation algorithm based on the redundancy coding in the target data redundancy matrix. The reconstructed data is then allocated to a new candidate storage node, and the data recovery time is no more than 15 minutes. When data tampering or malicious attacks are detected, an anomaly warning is immediately triggered (such as sending a warning message to the administrator or a platform pop-up warning). At the same time, the connection between the abnormal node and the distributed storage cluster is cut off for isolation to prevent the attack from spreading. After the anomaly is resolved, the node is restored and reconnected. S8. Introduce a zero-knowledge proof mechanism to audit and track the information access behavior of software platform users throughout the entire process, generate tamper-proof access audit logs, and realize the traceability of access behavior. The introduction of zero-knowledge proof mechanism is one of the innovations of this invention, which solves the defects of existing technology that access behavior is difficult to trace and audit is unreliable. It allows the legality of access behavior to be verified without disclosing the original data. The information recorded in the access audit log includes user identity information, access time, access content (such as target information identifier), access operations (such as reading, modifying, and deleting), access IP address, and permission verification results. The audit log is stored using blockchain technology to ensure that the log is tamper-proof and undeletable, which facilitates the tracing and investigation of subsequent data leakage, unauthorized access and other issues, while meeting compliance audit requirements and reducing compliance audit costs to below $0.02 / GB. S9. Combining attribute proxy re-encryption with ABAC and RBAC dual access control models, a re-encryption key is generated based on user attributes and access policies to achieve fine-grained access control and dynamic authorization of encrypted data. This step overcomes the shortcomings of existing technologies in terms of coarse-grained access control and inflexible authorization, and realizes fine-grained control and dynamic authorization of encrypted data: The attribute management center generates user attribute sets (such as user identity, position, and permission level) and access policies (such as core sensitive data can only be accessed by administrators), and the authorization node generates a re-encryption key based on user attributes and access policies; After receiving the access policy update request, the proxy server uses the re-encryption key to re-encrypt the stored ciphertext fragments, ensuring that only users who meet the attribute requirements can decrypt and access them. The combination of ABAC and RBAC dual access control models provides dual protection for the rationality of access permissions. The RBAC model enables coarse-grained authorization based on roles, while the ABAC model enables fine-grained authorization based on user attributes. It also supports dynamic adjustment and rapid revocation of user permissions. When a user's role or permissions change, there is no need to re-encrypt the data; only the re-encryption key needs to be updated, reducing authorization costs and lowering the permission misjudgment rate to below 0.03%. S10. When a user of the software platform initiates an information reading request, the system verifies the user's identity and access permissions, obtains the corresponding decryption key through the key management system, retrieves the corresponding encrypted target information fragments and redundant codes from each storage node, completes the fragment reconstruction and decryption process, and returns the original information to be stored to the user. User authentication employs a multi-factor authentication method, including account password authentication, quantum-secure UKey authentication, and biometric authentication (such as fingerprints and faces). Multiple authentication methods ensure the legitimacy of user identities and prevent identity forgery. Access permission verification verifies whether a user has the necessary access permissions by comparing user attributes with access policies. If the verification fails, access is denied and the violation is recorded. After successful verification, the key management system uses a segmented key distribution method to distribute the decryption key to the user, thus preventing the key from being leaked at once. By integrating the fragmented data and redundant coding of each storage node, fragment reconstruction completes data integrity verification. Based on the sensitivity level of the target information, it uses the corresponding decryption algorithm to perform decryption operations and finally returns the original information to be stored to the user, ensuring that the data obtained by the user is complete and accurate. Example 1: Distributed information security storage method applicable to government software platforms This embodiment is applied to a government affairs software platform. This platform needs to store a large amount of core sensitive data (such as confidential government documents and citizens' privacy data), generally sensitive data (such as basic information of staff), and non-sensitive data (such as government public announcements). The security, reliability, and compliance requirements for data storage are extremely high. The specific implementation steps are as follows: S1. The government affairs software platform receives government documents uploaded by staff, service data submitted by citizens, and log data generated by the platform itself. It extracts the content characteristics and attribute information of each piece of information to be stored. Based on the government affairs sensitive word library (such as words containing confidential, secret, privacy, etc.) and data type recognition algorithm, it automatically determines the sensitivity level and data type: government confidential documents and citizen privacy data are determined to be core sensitive level, staff basic information is determined to be general sensitive level, government public announcements and log data are determined to be non-sensitive level, and graded classification tags are added to each target information. S2. Double encryption using a differentiated encryption strategy: Core sensitive government secrets and citizens' privacy data are encrypted using a combination of AES-256-GCM encryption algorithm and differential privacy technology. The AES-256-GCM algorithm encrypts the data, while the differential privacy technology adds adaptive noise to prevent malicious speculation after the leakage of citizens' privacy data. Basic information of staff members that is generally sensitive is encrypted once using the SM4 encryption algorithm. Non-sensitive government information disclosure announcements and log data are encrypted using the SHA-256 hash encryption algorithm to obtain the encrypted target information; S3. Adaptive fragmentation processing: Text information such as government information disclosure announcements and log data is fragmented into fixed-size fragments of 1MB / fragment; Binary files and structured data, such as confidential government documents and citizen service data, are dynamically sized and fragmented (50MB / fragment). An improved RS erasure code is used to generate three redundant codes for the core sensitive level fragments and one redundant code for the general sensitive level and non-sensitive level fragments. The target data redundancy matrix is ​​constructed by combining the encrypted target information fragments and the redundant codes. S4. Candidate Storage Node Screening: Obtain real-time operating data of 10 storage nodes in the distributed storage cluster, calculate the trust value (weight allocation: historical operating stability 30%, data transmission security 30%, fault recovery efficiency 20%, data integrity verification result 20%) and load value of each node, and screen out 6 nodes with a trust value ≥ 0.85 and a load value ≤ 65% as candidate storage nodes, and exclude nodes with too low a trust value or too high a load. S5. Shard Allocation: Based on the topological location of candidate storage nodes, calculate the secure storage distance corresponding to each encrypted target information shard. The preset secure storage distance threshold is 3 hops. Different shards of the same core sensitive level encrypted target information and their corresponding redundant codes are allocated to different candidate storage nodes with a secure storage distance ≥ 3 hops. Generally, sensitive and non-sensitive data are partitioned and allocated to different candidate storage nodes with a safe storage distance of ≥2 hops to ensure load balancing and data security. S6. Key Management: Generating true random numbers using a quantum random number generator as key seeds to generate encryption and decryption keys; The quantum key distribution network distributes the keys to the staff's quantum-safe UKeys and storage nodes. The keys for core sensitive information are updated every 48 hours, while the keys for general sensitive and non-sensitive information are updated every 7 days. Expired keys are securely discarded. S7. Node monitoring and anomaly handling: Collect the running data of candidate storage nodes every 10 seconds, verify data integrity through hash verification, and detect malicious attacks through abnormal behavior pattern identification; When a node failure is detected, based on redundancy coding, the data is reconstructed within 10 minutes using the Lagrange interpolation algorithm. The reconstructed data is then allocated to new candidate storage nodes, while the failed node is isolated. When data tampering is detected, an alert is immediately sent to the administrator, the abnormal node is isolated, and the cause of the tampering is investigated. S8. Access Audit: Introducing a zero-knowledge proof mechanism to audit staff's information access behavior throughout the entire process, generating audit logs that include staff identity, access time, access content, access operation, and access IP, and using blockchain technology to store the audit logs to ensure immutability and traceability; S9. Access Control and Dynamic Authorization: The attribute management center generates the attribute set (identity, position, permission level) and access policy (e.g., administrators can access all data, while ordinary staff can only access general sensitive and non-sensitive data). Authorized nodes generate re-encryption keys based on attributes and access policies; When staff positions are adjusted, the proxy server uses a new re-encryption key to re-encrypt the ciphertext fragments, enabling dynamic adjustment of permissions without the need to re-encrypt the data. S10. Information Reading: When staff initiate an information reading request, they shall complete account password verification, quantum security UKey verification and fingerprint verification in sequence; After successful verification, the key management system uses a segmented key distribution method to distribute the decryption key to the staff. The corresponding encrypted target information fragments and redundant codes are retrieved from each storage node, fragment reconstruction and decryption are completed, and the original government information is returned to the staff. After this embodiment was applied to the government software platform, the risk of data leakage was reduced to below 0.005%, the data recovery time was shortened to 10 minutes, and the compliance audit cost was reduced by 70%, effectively ensuring the security and reliability of government data and meeting the high security requirements of the government software platform. Example 2: Distributed Information Security Storage Method Applicable to Financial Software Platforms This embodiment is applied to a financial software platform, which mainly stores financial transaction data, user account information, risk control data, etc. It has a high level of sensitivity and extremely high requirements for data encryption strength, access control, and traceability. The implementation steps are basically the same as in Embodiment 1, with the following differences: Sensitivity Level Classification: Financial transaction data, user account passwords, and core risk control data are classified as core sensitive; user mobile phone numbers, email addresses, and transaction record summaries are classified as general sensitive. The information and announcements regarding financial products publicly available on the platform are classified as non-sensitive. Encryption strategy: Core sensitive data is encrypted using a combination of AES-256-GCM encryption algorithm and differential privacy technology, and an additional layer of national cryptographic SM9 algorithm is added to further enhance encryption strength; Generally, sensitive data uses the SM4 encryption algorithm, while non-sensitive data uses the SHA-256 encryption algorithm. Redundant coding: The core sensitive level fragment generates 4 redundant codes to ensure data integrity under extreme conditions; Generally, sensitive-level fragmentation generates 2 redundant codes, while non-sensitive-level fragmentation generates 1 redundant code. Key update: Keys for core sensitive information are updated every 24 hours, and keys for general sensitive information are updated every 3 days. When an abnormal transaction is detected, an emergency key update is triggered immediately. Access verification: User information reading requests use four verification methods: account password, SMS verification code, quantum-safe UKey, and facial recognition. Staff access uses job-related permission verification plus biometric verification to ensure the legitimacy of access. When applied to a financial software platform, this embodiment effectively resists security threats such as quantum computing cracking and malicious attacks. The efficiency of data tampering detection is improved to the millisecond level, the time for retrieving cross-border payment data is shortened from 48 hours to 8 minutes, and compliance costs are reduced by 62%, meeting the high security and high compliance requirements of the financial industry. Although embodiments of the invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made to these embodiments without departing from the principles and spirit of the invention, the scope of which is defined by the appended claims and their equivalents.

Claims

1. A distributed information security storage method for software platforms, characterized in that, Includes the following steps: S1. The software platform receives information to be stored uploaded by users or generated by itself, performs sensitivity level classification and data type classification on the information to be stored, and obtains target information after classification. S2. Based on the sensitivity level of the target information, a differentiated encryption strategy is used to perform double encryption on the target information to obtain encrypted target information; S3. Adaptive fragmentation processing is performed on the encrypted target information. Based on the importance and size of the fragments, an improved RS erasure code is used to generate redundant codes. The target data redundancy matrix is ​​constructed by combining the encrypted target information fragments and the redundant codes. S4. Obtain real-time operating status data of all storage nodes in the distributed storage cluster, calculate the trust value and load value of each node, and filter out candidate storage nodes that meet the preset security conditions. S5. Based on the topological location of the candidate storage nodes, calculate the secure storage distance corresponding to each encrypted target information fragment, and allocate the encrypted target information fragments and corresponding redundant codes to different candidate storage nodes for storage according to the secure storage distance and node load balancing principle. S6. Establish a dual-key management system, combine quantum random number generation technology to generate encryption and decryption keys, complete the secure distribution of keys through a quantum key distribution network, and regularly perform key update and discard operations; S7. Perform real-time status monitoring on all storage nodes, identify node failures, data tampering or malicious attacks, trigger an anomaly warning when an anomaly is detected, perform rapid data recovery based on redundant coding, and isolate the abnormal node. S8. Introduce a zero-knowledge proof mechanism to audit and track the information access behavior of software platform users throughout the entire process, generate tamper-proof access audit logs, and realize the traceability of access behavior. S9. Combining attribute proxy re-encryption with ABAC and RBAC dual access control models, a re-encryption key is generated based on user attributes and access policies to achieve fine-grained access control and dynamic authorization of encrypted data. S10. When a user of the software platform initiates an information reading request, the system verifies the user's identity and access permissions, obtains the corresponding decryption key through the key management system, retrieves the corresponding encrypted target information fragments and redundant codes from each storage node, completes the fragment reconstruction and decryption process, and returns the original information to be stored to the user.

2. The distributed information security storage method for a software platform according to claim 1, characterized in that: The sensitivity level classification in step S1 includes core sensitivity level, general sensitivity level and non-sensitive level, and the data type classification includes text information, binary files, structured data and unstructured data; The hierarchical classification process involves: extracting the content features and attribute information of the information to be stored; automatically determining the sensitivity level and data type of the information to be stored based on a preset sensitive word library and data type recognition algorithm; and adding hierarchical classification tags to each target information.

3. The distributed information security storage method for a software platform according to claim 1, characterized in that: The differentiated encryption strategy mentioned in step S2 is as follows: core sensitive target information adopts a dual encryption method combining the AES-256-GCM encryption algorithm and differential privacy technology; Generally, sensitive target information is encrypted once using the SM4 encryption algorithm. Non-sensitive target information is encrypted using a lightweight hash encryption algorithm, while differential privacy technology adds adaptive noise to achieve anonymization protection of sensitive information, meeting EU GDPR compliance requirements.

4. The distributed information security storage method for a software platform according to claim 1, characterized in that: The adaptive fragmentation process described in step S3 is as follows: the fragmentation size is automatically adjusted according to the data type and size of the target information. Text information and structured data are fragmented with a fixed size, while binary files and unstructured data are fragmented with a dynamic size. The improved RS erasure coding reduces data recovery time to less than 15 minutes and redundancy rate to below 5% by optimizing the polynomial coefficient generation method.

5. The distributed information security storage method for a software platform according to claim 1, characterized in that: The parameters for calculating the node trust value in step S4 include the node's historical operational stability, data transmission security, fault recovery efficiency, and data integrity verification results, which are calculated using a weighted summation algorithm. The load value calculation parameters include node CPU utilization, memory utilization, disk storage space utilization, and network bandwidth utilization. Nodes with a trust value lower than a preset threshold or a load value higher than a preset threshold are not included in the candidate storage node range.

6. The distributed information security storage method for a software platform according to claim 1, characterized in that: The secure storage distance mentioned in step S5 is calculated by a dynamic distance algorithm. The dynamic distance algorithm combines the shortest hop count of a node with the storage capacity of a node. The calculation formula is: d=mind(B, B), u≠v, i≠k, where d(B, B) is the distance parameter between nodes B and B. Different fragments of the same encrypted target information and their corresponding redundant codes are allocated to different storage nodes with a secure storage distance greater than a preset distance threshold to avoid data loss due to single point of failure.

7. The distributed information security storage method for a software platform according to claim 1, characterized in that: The dual-key management system described in step S6 includes a key generation module, a key distribution module, a key update module, and a key discard module; Quantum random number generation technology uses a quantum random number generator to generate truly random numbers as key seeds, and quantum key distribution networks use end-to-end encrypted transmission to prevent key transmission from being stolen. The key update cycle is dynamically adjusted according to the sensitivity level of the target information, and the key update cycle for core sensitive information shall not exceed 72 hours.

8. The distributed information security storage method for a software platform according to claim 1, characterized in that: The real-time status monitoring described in step S7 adopts a distributed node monitoring protocol, which collects node operation data at preset time intervals and verifies the consistency of stored data through a data integrity verification algorithm. When a node failure is detected, the data is reconstructed using the Lagrange interpolation algorithm based on the redundancy coding in the redundancy matrix, and the reconstructed data is allocated to a new candidate storage node. When a malicious attack is detected, the connection between the abnormal node and the distributed storage cluster is immediately cut off, and the intrusion prevention mechanism is activated.

9. The distributed information security storage method for a software platform according to claim 1, characterized in that: The fine-grained access control described in step S9 is as follows: the attribute management center generates a user attribute set and access policy; the authorization node generates a re-encryption key based on the user attributes and access policy; after receiving the access policy update request, the proxy server uses the re-encryption key to re-encrypt the stored ciphertext fragments, ensuring that only users who meet the attribute requirements can decrypt and access the data, thereby realizing the dynamic adjustment and rapid revocation of user permissions.

10. The distributed information security storage method for a software platform according to claim 1, characterized in that: The user authentication method described in step S10 adopts a multi-factor authentication method, including account password authentication, quantum-secure UKey authentication and biometric authentication; Access permission verification compares user attributes with access policies. Once verification is successful, the key management system uses a segmented key distribution method to distribute the decryption key to the user, thus preventing the key from being leaked at once. Shard reconstruction integrates the sharded data and redundant coding from each storage node, completes data integrity verification, performs decryption, and returns the original information.