Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

168 results about "Decision networks" patented technology

Deep reinforcement learning method and device based on environment state prediction

The invention discloses a deep reinforcement learning method and device based on environment state prediction. The method comprises the following steps that: establishing a deep reinforcement learningnetwork based on the environment prediction, and selecting a proper strategy decision method according to the characteristics of tasks; initializing network parameters, and establishing a storage area which meets a storage condition as an experience replaying area; according to the output of a strategy decision network, selecting a proper strategy to interact with environment, and continuously storing the interaction information of an interaction process into the experience replaying area; sampling a first sample sequence from the experience replaying area, utilizing a supervised learning method to train an environment prediction part, and repeating a first preset frequency; sampling a second sample sequence from the experience replaying area, fixing the parameter of the environment prediction part to be constant, utilizing a reinforcement learning method to train the strategy decision part, and repeating a second preset frequency; when network convergence meets a preset condition, obtaining a reinforcement learning network. By use of the method, learning efficiency can be effectively improved.
Owner:TSINGHUA UNIV

Power grid multi-section power automatic control method based on distributed multi-agent reinforcement learning

The invention discloses a power grid multi-section power automatic control method based on distributed multi-agent reinforcement learning. The method can achieve the autonomous learning of a proper multi-section power control strategy for a complex power grid through the interaction of multiple agents and a power simulation environment. The method comprises the following steps of firstly, N target sections are selected according to the need of power grid control, and basic elements such as an environment, an intelligent agent, an observation state, an action and a reward function of the reinforcement learning method are constructed; secondly, a multi-section power control task interaction environment is operated, and an initial power flow data set is created; then, a decision network and an estimation network based on a deep neural network are constructed for each agent, an MADDPG (multi-agent deep deterministic strategy gradient) model is constructed, and a distributed method is introduced to train an autonomous learning optimal control strategy; and finally, the trained strategy network is applied to perform automatic section control. The method is advantaged in that a complex power grid multi-section power control problem is solved through the multi-agent reinforcement learning method, the control success rate is high, expert experience is not needed, and meanwhile agent training efficiency is greatly improved by introducing the distributed method.
Owner:ZHEJIANG UNIV

Deep network model constructing method, and facial expression identification method and system

The invention discloses a deep network model constructing method, which comprises the steps of: step S1) establishing a deep network model used for facial expression identification, and initializing parameters of the deep network model, wherein the deep network model comprises a convolutional neural network used for extracting high-level features of pictures, a reconstructed network used for extracting low-level features of the pictures and a joint decision network used for identifying facial expressions; step S2) dividing all training pictures into N groups; step S3) sequentially inputting each group of the pictures into the deep network model, and training parameters in the deep network model based on a gradient descent method; step S4) regarding the parameters of the deep network modelobtained in the step S3) as initial values of model parameters, and re-dividing all the training pictures into N groups, then jumping to the step S3), and carrying out the process repeatedly until allthe trained model parameters no longer change when compared to the initial values of the model parameters. The invention further discloses a facial expression identification method and a facial expression identification system.
Owner:INST OF ACOUSTICS CHINESE ACAD OF SCI

Heterogeneous wireless network selection method based on utility function

The invention discloses a heterogeneous wireless network selection method based on a utility function. The method includes the following specific steps: (1) acquiring network attribute data and a current user service type, wherein the network attribute data includes available bandwidth, network costs, energy consumption and network load, and the user service type includes voice, data and videos; (2) constructing a utility function for each decision network attribute, and solving a utility value of the corresponding attribute according to the utility function of the network attribute, wherein the relationship between the attribute value and the user satisfaction determines the utility function that is used; (3) constructing a utility matrix according to the solved utility value of the network attribute in each network; (4) constructing a comprehensive weight according to a subjective weight and an objective weight; and (5) calculating the comprehensive utility of networks, and selectingan optimal network for access. According to the scheme of the invention, the user preferences, network states, energy consumption of equipment, network costs and network load are comprehensively considered, and the selected optimal network can achieve an optimal balance between the QoS and the network load.
Owner:SOUTH CHINA UNIV OF TECH

Surface defect detection method based on multi-scale convolution and trilinear global attention

PendingCN112465790AAlleviate the problem of imbalanceAnnotated data is lowImage enhancementImage analysisPattern recognitionActivation function
The invention discloses a surface defect detection method based on multi-scale convolution and trilinear global attention, and the method comprises the steps: carrying out the convolution and poolingof trunk features in an encoding module, and extracting shallow feature maps of an image under different scales; obtaining a deep feature map through up-sampling and convolution operations in a decoding module; fusing the shallow feature maps and the deep feature map together through four times of splicing operation in the middle; converting the shallow feature map into a shallow attention map bya first branch of the trilinear global attention module through linear operation, activating the deep feature map by a second branch through compression to obtain a deep feature weight of the deep feature map, and then weighting the deep feature weight to the shallow feature map; in a decision-making network module, processing an output feature map of the decoding module by using global average pooling and global maximum pooling, outputting the probability that a surface defect image has defects through an activation function, and outputting a grey-scale map of a potential position of the defect through 1*1 convolution operation for visually explaining a neural network.
Owner:TIANJIN UNIV

Adaptive transmission in multi-access asynchronous channels

A hybrid transmission cycle (HTC) unit of bandwidth on a shared transmission medium is defined to include an adaptive, time division multiplexing transmission cycle (ATTC), which is allocated in portions sequentially among all participating network entities, and a residual transmission cycle (RTC), which is allocated in portions, as available, to the first network entity requesting access to the shared medium during each particular portion. The ratio of logical link virtual channels, or D-Channels, to data payload virtual channels, or B-Channels, within the ATTC is adaptive depending on loading conditions. Based on transmission profiles transmitted on the D-Channels during the ATTC, each network entity determines how many B-Channels it will utilize within the current HTC. This calculation may be based on any decision network, such as a decision network modelling the transmission medium as a marketplace and employing microeconomic principles to determine utilization. The ratio of the duration of the ATTC segment to the duration of the RTC segment is also adaptive depending on loading conditions, to prevent unacceptable latency for legacy network entities employing the shared transmission medium. During the RTC, utilization of the shared medium preferably reverts to IEEE 802.3 compliant CSMA/CD transmission, including transmissions by HTC-compliant network entities.
Owner:GLOBAL COMM INVESTMENT LLC

Database query optimization method and system

The invention discloses a database query optimization method. The database query optimization method comprises a connection sequence selector and a self-adaptive decision network, wherein the connection sequence selector is used for selecting an optimal connection sequence in the query plan and comprises a new database query plan coding scheme, and codes are in one-to-one correspondence with the connection sequence; a value network which is used for predicting the execution time of the query plan, is trained by the query plan and the corresponding real execution time, and is used for reward feedback in Monte Carlo tree search; a Monte Carlo tree search method which is used for simulating and generating multiple different connection sequences, evaluating the quality of the connection sequences through a connection sequence value network, and returning a recommended connection sequence after preset exploration times are reached. And the adaptive decision network is used for distinguishing whether the query statement uses the connection sequence selector or not, so that the overall performance of the optimization system is improved. According to the method and the system, the limitation of a traditional query optimizer can be effectively avoided, and the database query efficiency is improved.
Owner:HUAZHONG UNIV OF SCI & TECH +1

Identification method of oil depot target in remote sensing image

The present invention discloses an identification method of an oil depot target in a remote sensing image. The method comprises the steps of firstly calculating the phase spectrum significance of a whole scene, and extracting all interested areas possibly containing the target in the scene according to the phase spectrum significance; in the feature extraction, adopting a local regression nuclear model to calculate the local structural features of the interested areas point by point, and generating a feature descriptor capable of describing the structure of the target; at a target detection stage, carrying out the similarity measurement by the cosine similarity, calculating the similarities of the interested areas and an oil depot sample image, utilizing the positive and negative sample distinguishing ability of the feature descriptor and the features of the similarity surfaces to construct a decision network having the adaptive capability, obtaining the preliminary results of the target detection by the decision network, and removing the redundant preliminary results by a non-maxima suppression algorithm, thereby obtaining a final target detection result. A universal detection method of the oil depot target in the remote sensing image provided by the present invention is good in multi-scale and multi-view angle target identification effect.
Owner:HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products