A non-intrusive software operation agent method based on adaptive image reconstruction
By employing adaptive image reconstruction and semantic-level matching technologies, the problems of response latency and low recognition rate under high resolution and complex software interfaces are solved, enabling a fast, accurate, and non-intrusive software operation agent that adapts to software version updates and natural language commands.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- WUXI CHANGSHENG VISION TECHNOLOGY CO LTD
- Filing Date
- 2026-03-17
- Publication Date
- 2026-06-30
AI Technical Summary
Existing non-intrusive interaction technologies suffer from high response latency and low recognition rates when faced with high-resolution screens and complex professional software interfaces. They also struggle to effectively handle high pixel density and low-contrast interfaces, limiting the application of automated software in these scenarios.
An adaptive image reconstruction method is adopted, including image compression, adaptive binarization, and pseudo-color three-channel data reconstruction. Combined with semantic-level matching technology, the image processing and recognition process is optimized, reducing the amount of computation and improving the recognition accuracy.
Achieving high-precision text recognition within millisecond-level response time reduces hardware costs and improves the accuracy, response speed, and robustness of non-intrusive software operation agents. It can handle complex low-contrast interfaces and adapt to software version updates and natural language commands.
Smart Images

Figure CN122308677A_ABST