Machine learning device, inference device, machine learning method, recording medium, and method for generating trained model
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- ENEOS HLDG INC
- Filing Date
- 2025-11-07
- Publication Date
- 2026-07-02
AI Technical Summary
Existing reinforcement learning methods that use neural networks determine optimal actions inefficiently and inaccurately, because they give insufficient consideration to preferable actions during the learning process.
The machine learning device performs reinforcement learning by acquiring the current state of the environment, determining the agent's action based on a probability distribution, estimating the resulting change in a predetermined evaluation index, calculating a loss, and updating the learning model based on that loss, which improves the efficiency and accuracy of learning.
The approach enhances the efficiency and accuracy of learning by refining the probability distribution so that it aligns with favorable actions, thereby improving the performance of the learning model.
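The five steps named in the summary (acquire state, sample an action from a probability distribution, estimate the change in the evaluation index, compute a loss, update the model) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the two-state toy environment, the tabular softmax policy standing in for the learning model, the running-average estimator, and the policy-gradient style loss `-log p(action) * estimated_change`. The summary does not disclose the actual loss function or model architecture.

```python
import math
import random


def softmax(logits):
    """Turn raw scores into a probability distribution over actions."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train(steps=2000, lr=0.1, seed=0):
    """Toy training loop; returns the learned action distribution per state."""
    rng = random.Random(seed)
    n_states, n_actions = 2, 2
    # Hypothetical "learning model": one row of logits per state.
    logits = [[0.0] * n_actions for _ in range(n_states)]
    # Hypothetical estimator of the change in the evaluation index.
    est_change = [[0.0] * n_actions for _ in range(n_states)]
    last_loss = 0.0

    for _ in range(steps):
        # 1. Acquire the current state of the (toy) environment.
        state = rng.randrange(n_states)

        # 2. Determine the action by sampling from a probability distribution.
        probs = softmax(logits[state])
        action = rng.choices(range(n_actions), weights=probs)[0]

        # Toy environment (assumption): the evaluation index rises by 1
        # when the action matches the state and falls by 1 otherwise.
        observed = 1.0 if action == state else -1.0

        # 3. Estimate the change in the evaluation index (running average).
        est_change[state][action] += 0.1 * (observed - est_change[state][action])
        delta = est_change[state][action]

        # 4. Calculate a loss (assumed form: policy-gradient surrogate).
        last_loss = -math.log(probs[action]) * delta

        # 5. Update the model by gradient descent on that loss.
        #    d(loss)/d(logit_k) = -(1[k == action] - probs[k]) * delta
        for k in range(n_actions):
            indicator = 1.0 if k == action else 0.0
            grad = -(indicator - probs[k]) * delta
            logits[state][k] -= lr * grad

    return [softmax(row) for row in logits]


if __name__ == "__main__":
    for state, dist in enumerate(train()):
        print(state, [round(p, 3) for p in dist])
```

In this toy setting the distribution for each state shifts toward the action that raises the evaluation index, which is the "refining the probability distribution to align with favorable actions" effect the summary describes, under the assumptions stated above.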
Figure JP2025039067_02072026_PF_FP_ABST