An agent evaluation method and device, electronic equipment and storage medium
By using general and domain-specific large models to conduct multi-dimensional reviews of reports generated by intelligent agents, the problem of unreliability of existing evaluation methods in professional fields is solved, and an accurate assessment of the report generation capabilities of intelligent agents is achieved.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- ZHEJIANG LAB
- Filing Date
- 2026-03-18
- Publication Date
- 2026-06-16
AI Technical Summary
Existing intelligent agent evaluation methods cannot accurately reflect an agent's ability to generate research reports in a specific field, leading to unreliable evaluation results.
Cross-review using general large models and domain large models is employed. Literature is reviewed in multiple dimensions through general review dimensions and professional review dimensions, generating a first review report and a second review report. The report score is determined by combining the literature and the review report, thereby evaluating the report generation capability of the intelligent agent.
It provides an effective and reliable evaluation method that can truly reflect the report generation capabilities of intelligent agents in professional fields, thereby improving the accuracy and applicability of the evaluation results.
Smart Images

Figure CN121859944B_ABST