Web service combination method based on depth reinforcement learning

A technology of reinforcement learning and combined methods, applied in the computer field, can solve problems such as inability to fully perceive environmental information, inability to converge, difficult network training, etc., and achieve the effect of improving adaptability

Active Publication Date: 2017-10-10
SOUTHEAST UNIV
View PDF3 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the actual environment, the agent cannot fully perceive the environmental information. This kind of partial perception problem belongs to the non-Markovian environment. If it is not processed for the reinforcement learning algorithm, this kind of learning will not be able to converge.
(2) The theory of reinforcement learning focuses on small-scale and discrete problems, but in the real service composition problem, the scale of the service composition problem is not to be underestimated and the states are continuous
The recurrent neural network (RNN) is suitable for serialized data and can simulate these data more accurately. It records the activation value at each moment, and enhances the time correlation of the network by adding a self-connected hidden layer at cross-domain time points. , but on the contrary, this also makes it difficult to train the entire network and the phenomenon of gradient disappearance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Web service combination method based on depth reinforcement learning
  • Web service combination method based on depth reinforcement learning
  • Web service combination method based on depth reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further clarified below in conjunction with the drawings and specific embodiments.

[0042] The present invention improves the RNN-based improved network LSTM network structure model to improve the process of service composition using reinforcement learning, and constructs an innovative adaptive deep Q-learning and RNN Composition Network (ADQRC) method. )Such as figure 2 Shown. Recurrent neural network is a function that gives the neural network the ability to display and model time by adding self-connected hidden layers that cross-domain time points. In other words, the feedback of the hidden layer not only enters the output terminal, but also enters the hidden layer of the next time step. RNN can connect the previous information with the current task. For example, in the process of a service combination, the status of each service changes, but it is regular, not completely random. For example, in the past performance of a service, the re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a web service combination method based on depth reinforcement learning for overcoming the problems of long time consumption, poor flexibility and non-ideal combination effect of the traditional service combination method in large-scale service scenes. The depth reinforcement learning technology and the heuristic thought are applied to the service combination problem. In addition, by considering the partial observability of the real environment, the service combination process is converted into a partially-observable Markov decision process POMDP, the solution problem of the POMDP is solved by using a recurrent neural network, and the method still expresses high efficiency encountering the challenge of curse of dimensionality. By adoption of the method provided by the invention, the solution speed can be effectively improved, the dynamic service combination environment is automatically adapted on the basis of ensuring the quality of the service combination scheme, and the adaptability and the flexibility of the service combination efficiency is effectively improved in a large-scale dynamic service combination scene.

Description

Technical field [0001] The invention belongs to the computer field, and particularly relates to a Web service composition method based on deep reinforcement learning. Background technique [0002] With the continuous development of network information technology, users' functional requirements for software systems are becoming more and more diversified, complicated and varied. This trend has given birth to a new design or architecture concept for software products: Service-Oriented Architecture (SOA), which requires applications to be an independent collection of interactive services that provide good interfaces. As a new platform for establishing interoperable distributed applications, Web services are network-based, distributed, self-describing, and modular components. They perform specific tasks and follow certain technical specifications. Publish, locate and call on the Internet, thus becoming the most promising technical means to realize the SOA architecture. [0003] Nowada...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/24H04L29/08G06N3/08
CPCH04L41/145H04L67/02G06N3/08H04L67/51
Inventor 王红兵顾明珠
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products