Unlock instant, AI-driven research and patent intelligence for your innovation.

Inference device, device control system, and learning device

A learning device and equipment technology, applied in the general control system, control/regulation system, adaptive control, etc., can solve the problems that the efficiency of learning and reasoning cannot be improved, and achieve the effect of improving efficiency

Pending Publication Date: 2022-04-01
MITSUBISHI ELECTRIC CORP
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, in the reinforcement learning related to the motion control of the device, since the same feature extractor as the existing feature extractor is used, there is a problem that the efficiency of learning and inference cannot be improved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Inference device, device control system, and learning device
  • Inference device, device control system, and learning device
  • Inference device, device control system, and learning device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach 1

[0036] figure 1 It is a block diagram showing main parts of the device control system of the first embodiment. figure 2 It is an explanatory diagram showing an example of a robot controlled by the facility control system of the first embodiment. image 3 It is an explanatory diagram showing a main part of a feature quantity extractor and a controller in the plant control system according to the first embodiment. Figure 4A It is an explanatory diagram showing the structure of each layer in the feature quantity extractor in the plant control system according to the first embodiment. Figure 4B It is an explanatory diagram showing another structure of each layer in the feature quantity extractor in the plant control system according to the first embodiment. refer to figure 1 ~ FIG. 4, the equipment control system of Embodiment 1 is demonstrated.

[0037] Such as figure 1 As shown, the environment E includes the control device 1 and the robot 2 . The control device 1 contr...

Embodiment approach 2

[0089] Figure 9 It is a block diagram showing main parts of the reinforcement learning system of Embodiment 2. Figure 10 It is an explanatory diagram showing main parts of a first feature quantity extractor, a second feature quantity extractor, a first controller, and a learner in the reinforcement learning system according to the second embodiment. refer to Figure 9 and Figure 10 A reinforcement learning system according to Embodiment 2 will be described.

[0090] Such as Figure 9 As shown, a loop composed of the environment E, the first feature quantity extractor 41 and the first controller 51 is formed. The environment E outputs a state value (hereinafter referred to as "first state value") s indicating the state in the environment E. t . The first feature quantity extractor 41 accepts the output first state value s t input of. The output of the first feature quantity extractor 41 and the first state value s of the input t The corresponding eigenvector (herein...

Embodiment approach 3

[0146] Figure 14 It is a block diagram showing main parts of the reinforcement learning system of Embodiment 3. refer to Figure 14 A reinforcement learning system according to Embodiment 3 will be described. In addition, in Figure 14 in, right with Figure 9 Blocks that are the same as those shown are assigned the same reference numerals and description thereof will be omitted.

[0147] Such as Figure 14 As shown, the reinforcement learning system 500 according to the third embodiment includes a storage device 81 in addition to the inference device 100 and the learning device 400 . Stored in the storage device 81 is the first state value s t , the corresponding action value a t and the corresponding second state value s t+1 formed group. More specifically, storing multiple sets of values ​​(s t 、a t , s t+1 ). These values ​​(s t 、a t , s t+1 ) is collected using a controller different from the first controller 51 (hereinafter referred to as "second control...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An inference device (100) is provided with: a feature value extractor (3) that receives an input of a state value (st) relating to an environment (E) including a control device (1) and a device (2) controlled by the control device (1), and outputs a feature vector (vt) that corresponds to the state value (st) and is higher than the state value (st); and a controller (4) that receives the input of the feature vector (vt) and outputs a control quantity (At) corresponding to the feature vector (vt).

Description

technical field [0001] The present invention relates to a reasoning device, a device control system and a learning device. Background technique [0002] Conventionally, techniques for applying so-called "reinforcement learning" to image processing and the like have been developed (for example, refer to Patent Document 1). Generally, in reinforcement learning involving image processing and the like, the number of state values ​​obtained from images and the like is large. That is, the dimensionality of a feature vector obtained from an image or the like is large. Therefore, a feature quantity extractor is used from the viewpoint of reducing the dimensionality of a feature vector input to an agent compared to the dimensionality of a feature vector obtained from an image or the like. This is to avoid the reduction of learning efficiency and inference efficiency due to the excessively large dimension of the feature vector input to the agent. In other words, this is to improve ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/02
CPCG06N3/02B25J9/163G05B13/027
Inventor 老木智章
Owner MITSUBISHI ELECTRIC CORP