Image processing method, and apparatus
By transforming image features to the frequency domain and using time steps for adaptive modulation, the DiT model with a U-shaped transformer architecture solves the problem of poor image processing performance in existing technologies, achieving better image enhancement effects and efficiency.
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- HUAWEI TECH CO LTD
- Filing Date
- 2025-07-04
- Publication Date
- 2026-06-25
AI Technical Summary
Existing diffusion models have poor output performance in image processing, especially in image super-resolution tasks, and the traditional DiT architecture lacks multi-scale feature extraction capabilities, resulting in unsatisfactory image enhancement effects.
Image features are converted to the frequency domain for modulation. The frequency components are adaptively modulated using time steps. The DiT model with a U-shaped transformer architecture is adopted. The conversion between the spatial and frequency domains is achieved through Fourier transform, and feature modulation is performed in the frequency domain.
It improves the image processing performance, especially in image super-resolution, dehazing, and deblurring tasks, achieving better image enhancement results and higher efficiency.
Smart Images

Figure CN2025107096_25062026_PF_FP_ABST