An optimization method and device for improving the global memory access efficiency of the Triton compiler
By adding first axis attribute information and integer-divisibility attribute values to the Triton compiler, the method optimizes the axis analysis module and the memory access merging (coalescing) module. This addresses the low global memory access efficiency of the Triton compiler and achieves efficient memory access behavior.
Patent Information
- Authority / Receiving Office: CN · China
- Patent Type: Applications (China)
- Current Assignee / Owner: HANGZHOU ADVANCED COMPILATION TECHNOLOGY CO LTD
- Filing Date: 2026-03-13
- Publication Date: 2026-06-19
AI Technical Summary
In global memory access optimization, the Triton compiler suffers from inaccurate calculation results and low memory access efficiency, particularly in the axis analysis optimization pass and in thread layout calculation. As a result, memory accesses cannot be effectively merged (coalesced) or vectorized.
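As a rough illustration of why merged accesses matter, the Python sketch below counts how many memory sectors one group of threads touches for a given access stride. It is a simplified model, not part of the patent: the 32-thread group, the 4-byte element, the 32-byte sector size, and the function name `sectors_touched` are all assumptions chosen for illustration.

```python
def sectors_touched(stride_elems: int,
                    elem_bytes: int = 4,
                    threads: int = 32,
                    sector_bytes: int = 32) -> int:
    """Count distinct memory sectors read when thread t loads element t * stride.

    Assumed model: each thread loads one element, and memory is fetched
    in fixed-size sectors. Fewer sectors means better-merged access.
    """
    sectors = set()
    for t in range(threads):
        byte_addr = t * stride_elems * elem_bytes
        sectors.add(byte_addr // sector_bytes)
    return len(sectors)


if __name__ == "__main__":
    for stride in (1, 2, 32):
        print(f"stride={stride}: {sectors_touched(stride)} sectors")
```

Under these assumptions, a unit stride lets the threads share a handful of sectors, while a large stride forces one sector per thread. An analysis that wrongly reports a non-unit stride therefore blocks merging even when the data is laid out contiguously.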
First axis attribute information is added to the axis analysis module of the Triton compiler to record the continuity of tensors in different scenarios. In the memory access merging module, the memory access granularity is aligned. Attribute values with integer divisibility are added as preset values, and a unique identifier is generated for each attribute value.
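The summary does not define the new attribute or the preset values in detail, so the following Python sketch only illustrates the general idea under stated assumptions; it is not the patented implementation. It assumes a simplified per-axis record holding a continuity length and a byte divisibility, a 16-byte maximum vector access, and a hypothetical preset list `PRESET_DIVISORS`. The names `AxisRecord`, `vector_width`, and `divisibility_tag` are invented for this example.

```python
from dataclasses import dataclass

# Hypothetical preset divisibility values, checked from largest to smallest.
PRESET_DIVISORS = (16, 8, 4, 2)


@dataclass(frozen=True)
class AxisRecord:
    """Simplified per-axis facts an axis analysis might track (assumed model)."""
    continuity: int          # number of consecutive elements along the axis
    divisibility_bytes: int  # known divisor of the starting byte address


def largest_pow2_at_most(n: int) -> int:
    """Largest power of two that is <= n, for n >= 1."""
    return 1 << (n.bit_length() - 1)


def vector_width(info: AxisRecord, elem_bytes: int, max_vector_bytes: int = 16) -> int:
    """Elements per thread that can be loaded in one vectorized access.

    The width is bounded by the continuity, by the alignment implied by
    the divisibility, and by the assumed maximum vector size.
    """
    aligned_elems = max(info.divisibility_bytes // elem_bytes, 1)
    hardware_limit = max(max_vector_bytes // elem_bytes, 1)
    width = min(info.continuity, aligned_elems, hardware_limit)
    return largest_pow2_at_most(max(width, 1))


def divisibility_tag(value: int) -> str:
    """Map an integer to an identifier for the largest preset divisor it satisfies."""
    for d in PRESET_DIVISORS:
        if value % d == 0:
            return f"div{d}"
    return "div1"


if __name__ == "__main__":
    print(vector_width(AxisRecord(continuity=128, divisibility_bytes=16), elem_bytes=4))
    print(vector_width(AxisRecord(continuity=128, divisibility_bytes=4), elem_bytes=4))
    print(divisibility_tag(48), divisibility_tag(24), divisibility_tag(7))
```

In this model, a 4-byte element with 16-byte divisibility allows a 4-element vector access, while 4-byte divisibility falls back to scalar accesses. Giving each preset divisor its own identifier lets values such as 8 or 4 be distinguished instead of being collapsed into a single "divisible or not" flag.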
The method improves the global memory access efficiency of the Triton compiler, ensures that the axis analysis module produces correct calculation results, and gives operators efficient memory access behavior.