A method and apparatus for processing duplicate data, an electronic device, and a medium
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- JINAN INSPUR DATA TECH CO LTD
- Filing Date
- 2026-03-25
- Publication Date
- 2026-06-30
AI Technical Summary
In distributed storage systems, duplicate data leads to cross-node reference scenarios, increasing network bandwidth consumption and transmission latency, causing uneven node load, forming performance bottlenecks, and making it difficult to balance deduplication rate and node access.
By receiving write requests, it determines whether the data to be written is duplicated, counts the number of logical data blocks in the candidate physical data blocks, selects the target physical data block for mapping, prioritizes the use of local storage resources, avoids cross-node access, and achieves load balancing.
It effectively reduces network bandwidth consumption and transmission latency caused by cross-node data access, optimizes access distribution, avoids performance bottlenecks caused by excessive reference to a single physical data block, and improves overall system performance and storage efficiency.
Smart Images

Figure CN122308735A_ABST