A Web Service Composition Method Based on Deep Reinforcement Learning

A technology of reinforcement learning and combined methods, applied in the computer field, can solve problems such as inability to fully perceive environmental information, gradient disappearance, and network training is difficult, and achieve the effect of improving adaptability

Active Publication Date: 2020-05-05
SOUTHEAST UNIV
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the actual environment, the agent cannot fully perceive the environmental information. This kind of partial perception problem belongs to the non-Markovian environment. If it is not processed for the reinforcement learning algorithm, this kind of learning will not be able to converge.
(2) The theory of reinforcement learning focuses on small-scale and discrete problems, but in the real service composition problem, the scale of the service composition problem is not to be underestimated and the states are continuous
The recurrent neural network (RNN) is suitable for serialized data and can simulate these data more accurately. It records the activation value at each moment, and enhances the time correlation of the network by adding a self-connected hidden layer at cross-domain time points. , but on the contrary, this also makes it difficult to train the entire network and the phenomenon of gradient disappearance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Web Service Composition Method Based on Deep Reinforcement Learning
  • A Web Service Composition Method Based on Deep Reinforcement Learning
  • A Web Service Composition Method Based on Deep Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0042] The present invention will be based on the improved network structure model of RNN LSTM, improve the process of service composition using reinforcement learning, and construct an innovative adaptive deep reinforcement learning method (Adaptive Deep Q-learning and RNN Composition Network, ADQRC )like figure 2 shown. Recurrent Neural Networks are a neural network that endows neural networks with the ability to explicitly model time by adding self-connected hidden layers across domain time points. That is, the feedback from the hidden layer, not only goes to the output, but also goes to the hidden layer at the next time step. RNN can connect the previous information with the current task. For example, in the process of service composition, the state of each service changes, but it is regular and not completely random. For example, in th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a web service combination method based on deep reinforcement learning. Aiming at the problems of long time consumption, poor flexibility and unsatisfactory combination results of the traditional service combination method in the face of large-scale service scenarios, the deep reinforcement learning technology and heuristic Ideas applied to the problem of service composition. In addition, considering the characteristics of partial observability of the real environment, the present invention transforms the service composition process into a partially observable Markov decision process (Partially-Observable Markov Decision Process, POMDP), and utilizes a recurrent neural network to solve the problem of POMDP , so that the method can still show high efficiency in the face of the "curse of dimensionality" challenge. The method of the present invention can effectively improve the solution speed, and independently adapt to the dynamic service composition environment on the basis of ensuring the quality of the service composition scheme, effectively improving the efficiency and self-adaptability of the service composition in the large-scale dynamic service composition scene and flexibility.

Description

technical field [0001] The invention belongs to the field of computers, and in particular relates to a Web service combination method based on deep reinforcement learning. Background technique [0002] With the continuous development of network information technology, users' functional requirements for software systems are becoming more and more diverse, complex and changing. This trend has given rise to a new concept of design or architecture of software products: Service-Oriented Architecture (SOA), which requires that applications must be developed as an independent collection of interactive services that can provide a good interface. As a new platform for building interoperable distributed applications, Web services are network-based, distributed, self-describing, and modular components that perform specific tasks and follow certain technical specifications. Publish, locate and invoke on the Internet, thus becoming the most promising technical means to realize the SOA a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04L12/24H04L29/08G06N3/08
CPCH04L41/145H04L67/02G06N3/08H04L67/51
Inventor 王红兵顾明珠
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products