Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

167 results about "Multi layered perceptron" patented technology

MULTI LAYER PERCEPTRON. Multi Layer perceptron (MLP) is a feedforward neural network with one or more layers between input and output layer. Feedforward means that data flows in one direction from input to output layer (forward).

Machine translation method and system based on generative adversarial neural network

The invention belongs to the technical field of computers, and discloses a machine translation method and system based on a generative adversarial neural network. The method comprises the following steps that: on the basis of an original machine translation generation network, a discrimination network which generates network countermeasure with the original machine translation generation network is imported; a translation used for judging a target language is from a training parallel corpus and is a network machine translation result of the original machine translation generation network; and the discrimination network adopts a multi-layer sensor feedforward neural network model to realize binary classification. The system comprises the discrimination network, a generation network, a mono-lingual corpus and a parallel corpus. While manually annotated bilingual parallel corpus resources are fully utilized, and mono-lingual corpus resources also can be fully utilized to carry out semi-supervised learning; and the mono-lingual corpus resources are very rich and can be easily obtained, and the problem that required training corpora required by the neural network machine translation model are not sufficient is solved.
Owner:GLOBAL TONE COMM TECH

Method for establishing sentiment classification model

The invention provides a sentiment classification method for generating a model deep-convinced-degree network on the basis of the probability of depth study. According to the technical scheme of the method, a plurality of Boltzmann machine layers are stacked, namely, output of this layer is used as input of the next layer. By the adoption of the mode, input information can be expressed in a grading mode, and abstraction can be conducted layer by layer. A multi-layer sensor containing a plurality of hidden layers is the basic study structure of the method. More abstract high layers are formed through combining the characteristics of lower layers and are used for expressing attribute categories or characteristics, so that the distribution type character presentation of data can be discovered. The method belongs to monitoring-free study, and a mainly-used model is the deep-convinced-degree network. The method enables a machine to conduct characteristic abstract better so as to improve the accuracy of sentiment classifications.
Owner:BEIJING UNIV OF POSTS & TELECOMM

Method for determining fatigue state according to electroencephalogram

The invention provides a method for determining fatigue state according to electroencephalogram (EEG) which adopts a plurality of electroencephalographs and connecting electrodes for realizing the real time acquisition of electroencephalogram. The method comprises the following steps: running interface programs of a PC and the electroencephalographs; realizing the synchronous acquisition of data by using a VC++ to compile visual interface program of the electroencephalographs under the Windows platform, and displaying EEG waveforms acquired in real-time; pre-processing the acquired data; carrying out the low-pass filtering at 0Hz to 30Hz to the data by an FIR (Finite Impulse Response) filter, so as to eliminate the power frequency noise and external interference; decomposing the filtered EEG waveforms by the blind-source separation method, so as to acquire each component of the mixed signal comprising electro-oculogram (EOG) and left and right brain EEGs; carrying out the fast Fourier transform (FFT) on the left and right brain EEGs, and converting the time-domain signals to the frequency-domain signals; working out the energy of alpha, beta, theta and delta waves in the EEGs and classifying the BP (back propagation) neural network of the multi-layer perceptron. The invention has the characteristics of directness and rapidness.
Owner:BEIJING UNIV OF TECH

Non-standard character recognition method based on convolution neural network and support vector machine

The invention discloses a non-standard character recognition method based on a convolution neural network and a support vector machine. The method comprises steps of 1, acquiring image signals of non-standard characters to serve as sample data; 2, establishing a convolution neural network and carrying out initialization; 3, passing the trained sample data through the convolution neural network so as to finish forward propagation; 4, carrying out error calculation and gradient calculation on a multi-layered perceptron in the step 3, and if errors are converged, extracting characteristic data and entering the step 6, or else, entering the step 5; 5, using a back propagation algorithm to propagate the errors and the gradients obtained in the step 4 to a network base layer through the convolution neural network layers by layers, judging whether the grid base layer is an input layer, and if yes, entering the step 3, or else, continuing to judge whether the next layer is the input layer until the input layer is determined and entering the step 3; 6, transmitting the characteristic data to a support vector machine for training and establishing a non-standard character recognition training model; and 7, inputting to-be-recognized non-standard character signals into the non-standard character recognition training model for recognition.
Owner:昆山遥矽微电子科技有限公司

Point cloud registration model and method combining attention mechanism and three-dimensional graph convolutional network

The invention relates to a point cloud registration model and method combining an attention mechanism and a three-dimensional graph convolutional network, and the model is a three-branch Siamese architecture, and comprises a Dejector model and a Descriptor model. The Detector model is used for extracting attention features of points and constructing an attention mechanism; the Descriptor model isused for generating an expression of a three-dimensional depth feature to represent the three-dimensional depth feature of the point, and learning and judging the depth feature of the point cloud. Themethod comprises the following steps: carrying out model training, and constructing a loss function to train a model by using a failure align triplet loss, so as to effectively extract attention features and descriptor features from a point cloud; after model training, carrying out the point cloud registration. According to the method, the key points and the three-dimensional depth features of each key point can be automatically extracted, in the three-dimensional graph convolutional network, the multi-layer perceptron MLP is combined with the graph convolutional network GCN, a new point cloud feature extraction module is designed, more point cloud features with identification significance can be extracted, and the accuracy of point cloud registration is improved.
Owner:CAPITAL NORMAL UNIVERSITY

Bayesian Decision Theory foreground extraction method combined with reflected illumination

ActiveCN103164855AIncrease foreground brightnessImage edge information is obviousImage enhancementImage analysisPattern recognitionPoint light
The invention provides a Bayesian Decision Theory foreground extraction method combined with reflected illumination. The Bayesian Decision Theory foreground extraction method comprises the steps of appointing a point light source located on a foreground object by a user, carrying out gray level matching on an image, converting and imitating point light source illumination, strengthening image edge information, obtaining an illumination function according to before-after conversion comparison, filtering waves, reducing noise, dividing the image through a watershed algorithm, calculating a sectional drawing parameter through a Bayes formula, imitating an alpha value function curve through a multi-layer perception device, integrating the illumination function and a color distribution function, and completing extraction of the foreground object. The user is only required to appoint the position of the point light source and not required to preset edge information of a foreground and a background, the requirement for user interaction is reduced, meanwhile, time complexity of the used algorithms is series, and the defects that a common sectional drawing algorithm is large in calculated quantity and low in processing speed are avoided. Due to the facts that the illumination function is introduced and the alpha value is matched by the perception device, an accurate and complete extraction result can be obtained for the foreground object with complicated edges, and particularly for the foreground object similar colors of the edge and the ground.
Owner:SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV

Airfoil flow field rapid prediction method based on deep learning

The invention provides an airfoil flow field rapid prediction method based on deep learning. The method comprises the steps of generating a sample data set; building a deep learning neural network model based on the data set; and applying the built deep neural network to rapid prediction of the airfoil flow field. According to the method, only the grids with obvious airfoil near-field flow parameter changes are intercepted to be used for training and testing the neural network model, and compared with the prior art, data points and time consumption can be reduced as much as possible while flow field feature extraction and flow field parameter prediction precision are guaranteed, and efficiency is improved; compared with the prior art, the multi-layer perceptron neural network model established by the invention can describe a more complex nonlinear relationship, improve the resolution of airfoil flow field features, and contribute to accurate identification of the flow field features. The method aims to construct and train the neural network for a series of airfoils derived from the same reference airfoil, and has high pertinence, so that the same series of airfoil flow fields can be quickly and accurately predicted.
Owner:NORTHWESTERN POLYTECHNICAL UNIV

Method for detecting pulmonary artery blood pressure by using heart sound analysis method of multilayer feedforward network

The invention relates to a method for detecting the pulmonary artery blood pressure by using a heart sound analysis method of a multilayer feedforward network. The method comprises the steps of: extracting a heart sound signal; preprocessing the heart sound signal by fast Fourier transform and normalized average Shannon energy distribution to obtain heart sound signal features; filtering the heart sound signal features by using a principal component analysis method; and performing training learning on the filtered hear sound signal features by using a perceptron neural network or a multilayer perceptron feedforward neural network to obtain an optimal neural network between the heart sound signal features and a pulmonary artery blood pressure value. The heart sound signal features are signal features of a first heart sound and a second heart sound; the heart sound signal features comprise crest frequency, average frequency, crest amplitude, average amplitude, duration and time interface of heart sounds. By adopting the method for detecting the pulmonary artery blood pressure by using the heart sound analysis method of the multilayer feedforward network, provided by the invention, the pulmonary artery blood pressure can be easily and conveniently detected at low cost in a non-intrusive and no risk manner.
Owner:THE HONG KONG POLYTECHNIC UNIV

Video character recognition method based on multi-modal feature fusion deep network

The invention discloses a video character recognition method based on a multi-modal feature fusion deep network. The video character recognition method is a deep learning target recognition multi-modal fusion algorithm provided specially for the target recognition problem of multi-modal character video feature data. According to the video character recognition method, a network structure of the algorithm is composed of a plurality of single-modal multi-layer sensor identification modules and a multi-modal feature fusion module. The video character recognition method comprises the steps of: preprocessing multi-modal data generated by a video, training a plurality of deep networks by using the preprocessed different modal data, on the basis, subjecting features generated by a plurality of sub-networks to weighted fusion, and combining a feature weighted fusion module with models of different modals to achieve a better identification result. By adopting the video character recognition method, a multi-modal feature set weight fusion strategy is used for constructing a video figure target recognizer on a public video figure data set (iQIYI-VID-2019) for multi-modal features generated bythe preprocessed video, multi-model integration is not needed, and the average precision mean value of a single model reaches 89.52%.
Owner:NANJING UNIV +1

Speech emotion recognition method through fusion of feature assessment and multi-layer perceptron

The present invention discloses a speech emotion recognition method through fusion of feature assessment and a multi-layer perceptron. The method comprises the steps: S1, extracting multi-dimensional emotion feature parameters of a training speech set corresponding to various emotion states, and obtaining an original feature set; S2, performing rating ordering of various emotion feature parameters in an original feature set, and obtaining a feature set after ordering; S3, obtaining a plurality of feature subsets with different quantities from the feature set after ordering, using a multi-layer perceptron to perform classification of each feature subset, and selecting an optimal feature subset according to a classification result; and S4, using the multi-layer perceptron to train an emotion classification model for the optimal feature subset, and performing emotion recognition of the speech to be recognized through the classification model obtained through training. The realization method is simple, the speech emotion recognition method through fusion of feature assessment and the multi-layer perceptron can fuse the feature assessment and multi-layer perceptron to realize emotion recognition, and the emotion recognition precision and the efficiency are high.
Owner:HUNAN UNIV

Process data fault classification method based on pseudo label method and weak supervised learning

The invention discloses an industrial process data fault classification method based on a pseudo label method and weak supervised learning; a supervised classification network is composed of a multi-layer sensor, a Batch Normalization layer, a Dropout layer and a Softmax output layer, and the Gaussian mixture model is used for obtaining the inaccurate condition of a pseudo label. The multi-layer sensor can learn feature representation of the data from the labeled data; the BatchNormalization layer is used for accelerating convergence of a multi-layer perceptron model, the Dropout layer is usedfor preventing training overfitting of a multi-layer perceptron, and the Softmax output layer is used for carrying out fault classification according to fault sample features extracted by the multi-layer sensor. According to the invention, modeling can be carried out in a scene that the obtained labeled sample is inaccurate in label and has no label sample; label probability transfer matrix evaluation is carried out on a labeled sample label and a pseudo label predicted for a label-free sample based on a pseudo label method, and the label probability transfer matrix evaluation is used for correcting a loss function of a classification network to complete weak supervised learning, so that the classification precision of the model on the sample is improved.
Owner:ZHEJIANG UNIV

Complex chemical process fault diagnosis method based on deep learning multi-model fusion

InactiveCN111665819ATo overcome the large amount of calculationOvercome the technical problem of being unable to accurately diagnose faults in complex chemical processesProgramme controlElectric testing/monitoringEngineeringTerm memory
The invention discloses a complex chemical process fault diagnosis method based on deep learning multi-model fusion. According to the invention, two neural networks automatically extract fault features from two aspects respectively, then fuse the features and input the features into a multi-layer perceptron (MLP) for further feature compression and extraction, and finally output a diagnosis result. Features are extracted through a convolutional neural network (CNN) and a long-term and short-term memory (LSTM) respectively, and the features finally extracted by the network have space and time characteristics at the same time; and the final diagnosis is carried out by integrating the characteristics of the two aspects, so that the problem of large calculated amount of the existing traditional diagnosis technology is solved, and the technical problem that the fault diagnosis cannot be accurately carried out in the complex chemical process due to the fact that the characteristics extractedby a single network are too one-sided is also solved.
Owner:HANGZHOU DIANZI UNIV

Many-to-many voice conversion method and system based on speaker style feature modeling

The invention discloses a many-to-many voice conversion method and system based on speaker style feature modeling. Firstly, a multi-layer sensor and a style encoder are added to a StarGAN neural network, effective extraction and constraint of speaker style features are realized, the defect of limited speaker information carried by one-hot vectors in a traditional model is overcome, then, an adaptive instance normalization method is adopted to realize full fusion of semantic features and speaker personality features, so that the network can learn more semantic information and speaker personality information, furthermore, a lightweight network module SKNet is introduced into the generator residual error network, so that the network can adaptively adjust the size of a receptive field according to multiple scales of input information, the weight of each feature channel is adjusted through an attention mechanism, the learning ability of frequency spectrum features is enhanced, and the details of the frequency spectrum features are refined.
Owner:NANJING UNIV OF POSTS & TELECOMM

Traditional Chinese medicine syndrome diagnosis method and device based on deep learning and attention mechanism

PendingCN111834012AImprove the accuracy of diagnosis and predictionMedical data miningAlternative medicinesMedical recordNerve network
The invention discloses a traditional Chinese medicine syndrome diagnosis method and device based on deep learning and an attention mechanism. The method comprises the steps of obtaining medical record data with symptoms of a to-be-diagnosed patient; converting the medical record data into vector data; inputting the vector data into a trained traditional Chinese medicine syndrome diagnosis model,and outputting a diagnosis result. The traditional Chinese medicine syndrome diagnosis model comprises an attention mechanism module constructed by using a matrix mapping layer, an activation functiontan h and softmax, and a prediction deep neural network constructed by using a multilayer perceptron and an activation function sigmoid. The method comprises the following steps: firstly, performingone-hot coding mapping on medical record data with symptoms of a patient, and converting the medical record data into vector data; adopting a traditional Chinese medicine syndrome diagnosis model of amultilayer perceptron combining deep learning and supervised learning to diagnose the syndrome of a patient, so that the diagnosis and prediction accuracy of the traditional Chinese medicine syndromeof the patient can be improved.
Owner:INST OF INFORMATION ON TRADITIONAL CHINESE MEDICINE CACMS

Interest point recommendation method and device based on space-time sequence and social embedding ranking

The invention discloses an interest point recommendation method based on a space-time sequence and social embedding ranking, and the method comprises the steps: firstly proposing a mixed depth model based on a horizontal convolution filter, a vertical convolution filter and a multilayer perceptron model, and capturing the user preference and the impact of a space-time sequence mode; secondly, modeling the social relationship of the user by adopting a metric learning method; then, adopting a unified framework based on matrix decomposition to integrate personal interests of the users, sign-in sequence modes and social information of the users; and finally, optimizing a target loss function by adopting a BPR standard, fitting a partial order relationship of the user on the interest point pair, and finally generating an interest point recommendation list. According to the invention, the interest point recommendation method integrating the hybrid depth model and metric learning is constructed, so that the final recommendation is more accurate.
Owner:WUHAN UNIV

Method of predicting gas composition

The method of predicting gas composition in a multistage separator includes solutions to the regression problem of gas composition prediction that are developed using an ensemble of hybrid computational intelligence (CI) models. Three separate homogeneous and one heterogeneous ensemble of hybrid computational intelligence (EHCI) models are developed using a parallel scheme. The homogeneous models have the same types of CI models used as base learners, and the heterogeneous model has of different types of CI models used as base learners. Various popular CI models, including multi-layer perceptron (MLP), support vector regression (SVR) and adaptive neuro-fuzzy inference system (ANFIS), are used as base learners of ensemble models.
Owner:KING FAHD UNIVERSITY OF PETROLEUM AND MINERALS

High-quality face voice driving method based on neural radiation field

The invention provides a high-quality face voice driving method based on a neural radiation field, and the method comprises the following steps: carrying out the feature extraction on video-synchronized voice information through a text-based voice recognition model, and obtaining the extracted voice features; segmenting an initial face speaking video set frame by frame; estimating attitude information of each frame of face by using a pre-trained three-dimensional face reconstruction model; learning a neural radiation field model for a target picture by using a multi-layer perceptron; and taking the voice features as condition information, and generating a picture under the current visual angle and voice condition by using a neural rendering mode. A face speaking model trained based on the neural radiation field has the capability of implicitly representing three-dimensional face displacement, including rigid and non-rigid motion. The neural rendering supports sampling setting of different ray angles and different densities, so that the generated face speaking video has the characteristics of high quality and stability.
Owner:UNIV OF SCI & TECH OF CHINA

Graph convolutional network system and 3D object detection method based on graph convolutional network system

The invention discloses a graph convolutional network system and a 3D object detection method based on the graph convolutional network system. The system comprises a shape semantic extraction module used for modeling geometric positions of points in point cloud features of an image; a multi-layer perceptron which is connected with the shape semantic extraction module and is used for extracting multi-level semantic features by using a multi-layer graph convolutional neural network and filtering the multi-level semantic features by using an attention mechanism; a proposal generator which is connected with the multi-layer perceptron and is used for summarizing the multi-level semantic features and weighting to generate a primary proposal; and a proposal reasoning module which is connected with the proposal generator and is used for predicting a 3D bounding box and a semantic category of the object in the image by utilizing the global semantic features and the primary proposal. According to the method, the detection performance of the whole graph convolutional network system is effectively improved, the precision of 3D object detection is improved, and the interpretability of the deep network is higher.
Owner:广东众聚人工智能科技有限公司

Three-dimensional point cloud processing method, device and equipment

ActiveCN111028327AMitigate missing dataAlleviate Redundancy IssuesNeural architectures3D-image renderingInformation sharingAlgorithm
The invention provides a three-dimensional point cloud processing method, device and equipment. The method comprises the following steps: acquiring point cloud data comprising a plurality of points; inputting the point cloud data into a pre-trained convolutional neural network; the convolutional neural network comprises a convolution module for geometric feature information sharing; for each pointin the point cloud data, obtaining a neighbor point of the point in the Euclidean space based on the convolution module, and determining a neighbor point of the point in the feature value space basedon the neighbor point of the point in the Euclidean space; aggregating the neighbor points of the points in the Euclidean space and the neighbor points of the points in the feature value space to obtain aggregated features; and performing feature learning on the aggregated features by using a multi-layer perceptron, and performing maximum pooling operation on the dimensions of the neighbor pointsto obtain output features. A convolution structure is directly constructed to process three-dimensional point cloud data, so that the problems of data missing and data redundancy are effectively relieved.
Owner:SHENZHEN INST OF ADVANCED TECH

Dynamic linearization system of power amplifier based on mode recognition, and method

The invention discloses a dynamic linearization system of a power amplifier based on mode recognition. The dynamic linearization system comprises a feature vector generation module, a multilayer sensor module, a predistortion coefficient lookup table module, a vector signal module, a transmitting link module, a power amplifier module and a feedback loop module. The invention further discloses a method using the dynamic linearization system. The dynamic linearization system disclosed by the invention adopts a new standard, that is, an amplitude modulation to amplitude modulation (AM / AM) mode, to recognize a working state of the power amplifier, and uses a recognition result of a multilayer sensor neural network on the power amplifier AM / AM to index different predistortion coefficients so asto achieve the dynamic linearization of the power amplifier. Compared with the traditional power lookup tables and power indexing algorithms, the method has the advantages of being able to provide better linearization performance. In addition, the dynamic linearization system disclosed by the invention is also capable of solving the linearization problem under a dynamic condition of the power amplifier caused by many other non-power factor changes such as temperature, frequency, bandwidth, signal type, etc.
Owner:SOUTHEAST UNIV

Optimization method of hybrid bat algorithm and optimization method of multi-layer perceptron

The invention discloses an optimization method of a hybrid bat algorithm and a multilayer perceptron optimization method, and the method comprises the steps: firstly, improving an original reverse learning strategy in the aspect of population reconstruction, and proposing disturbance multi-strategy reverse learning, so as to reconstruct an effective diversified search space; secondly, in the aspect of global exploration and search, by introducing a whale optimization algorithm and improving the whale optimization algorithm, providing a self-adaptive constraint step length whale optimization algorithm so as to make up for the defects of an original bat algorithm in the aspect of global exploration capacity; thirdly, in the aspect of local mining search, providing a bat algorithm based on Cauchy variation and dynamic correction by introducing Cauchy variation and designing a dynamic correction strategy, so that the local search capability of the original bat algorithm is improved; and fourthly, under the synergistic effect of the three strategies, effectively improving the precision and stability of an optimization result by the new algorithm obtained by optimization; finally, applying the new algorithm to the multi-layer perceptron training problem, and obtaining relatively high classification precision.
Owner:WUHAN UNIV

Method for synthesizing virtual viewpoint image based on implicit neural scene representation

PendingCN114666564AReduce data volumeEffective Supervised LearningNeural architecturesNeural learning methodsNeural fieldsData set
The invention discloses a method for synthesizing a virtual viewpoint image by using implicit neural scene representation on the basis of multi-view three-dimensional cross-view loss, and is suitable for the field of computer vision. The method comprises the following steps: acquiring an image data set which needs to generate a virtual viewpoint; preprocessing the training image data set, and performing feature point extraction and matching on the input training image data set based on a feature matching algorithm Sift in the preprocessing stage; processing the obtained training image data and the extracted feature point information, and inputting the processed training image data and extracted feature point information into a multi-layer perceptron network for training; inputting test image data into the trained multi-layer perceptron network, and then obtaining a tested rendered image through volume rendering; and generating a virtual viewpoint image based on the trained multilayer perceptron network. Therefore, the data volume of the neural network during training fitting scene representation is reduced, and centralized sampling is performed in combination with the image depth information, so that the operation speed and performance of the neural scene representation can be improved, and a high-quality virtual viewpoint image is generated.
Owner:NANJING UNIV OF POSTS & TELECOMM

System and method for unsupervised anomaly prediction

ActiveUS11269752B1Improve anomaly prediction ability of MAD-GANAccurately risk of failureFault responseHardware monitoringManufacture execution systemEngineering
Some embodiments are associated with a system and method for deep learning unsupervised anomaly prediction in Internet of Things (IoT) sensor networks or manufacturing execution systems. The system and method use an unsupervised predictive GAN model with multi-layer perceptrons (MLP) as generator and discriminator.
Owner:EUGENIE AI INC

Internet of Things intrusion detection method, system and device and medium

The embodiment of the invention provides an Internet of Things intrusion detection method, system and device and a medium, and belongs to the technical field of data processing, and the method specifically comprises the steps: obtaining a target traffic data packet; extracting feature information in the target traffic data packet; inputting the initial node feature, the routing graph and the adjacent matrix into a graph convolutional neural network to obtain a source node feature corresponding to a source ip address in the target traffic data packet and a target node feature corresponding to a target ip address; splicing the source node features, the data features and the target node features to obtain a target vector; and inputting the target vector into the multi-layer sensor, and outputting an attack type corresponding to the target traffic data packet. Through the scheme of the invention, the flow packet is analyzed and processed, the network node information and the network structure information are spliced, the semantic information of the edge is combined with the node representation output by the graph convolution and provided with the structure information, and the node representation is input into the multi-layer perceptron for intrusion attack detection, so that the detection efficiency, accuracy and security are improved.
Owner:湖南工商大学

Numerically-controlled machine tool thermal error compensation method and system based on workpiece size data

The invention discloses a numerically-controlled machine tool thermal error compensation method and system based on workpiece size data. The method comprises the steps: carrying out the analysis of aprocess capability index (Cpk) based on a machining process, and obtaining a key temperature point from the actual measurement temperature data of cutting work; according to the key temperature pointand workpiece size data machined under the machine tool load condition, a numerical control machine tool thermal error model based on workpiece size detection data is constructed through a multi-layerperceptron neural network (MLP), then obtaining the motion compensation amount of a motion axis corresponding to workpiece size features under the actual machining condition, and achieving thermal error compensation through the external coordinate zero offset function of a numerical control machine tool. According to the method, the thermal deformation of the machine tool and the workpiece causedby the machining process is considered, and the thermal error of the machine tool under the actual machining condition is effectively compensated.
Owner:SHANGHAI JIAO TONG UNIV

Motion recognition method

The invention provides an action recognition method, and the method comprises the following steps: segmenting an action video sample, and calculating a dynamic image of each video clip; inputting the dynamic and static images of each video clip into a feature extractor; extracting a motion feature vector of the dynamic image; constructing a feature center group to obtain histogram expression; inputting the histogram expression into a histogram connection layer to obtain a complete histogram expression of the action video sample; inputting a multi-layer perceptron to form a motion feature quantization network; performing training to converge; inputting the dynamic and static images into a feature extractor, an enhancer and a soft quantizer in the trained motion feature quantization network to obtain histogram expression; inputting the histogram expression into a significant motion feature extractor to obtain a significant motion feature map; inputting the significant motion feature map into a convolutional neural network to form an action classifier; performing training to converge; calculating a dynamic image and a static image of each fragment of the test action video sample, and inputting the dynamic image and the static image into the trained action classifier to realize action recognition.
Owner:SUZHOU UNIV

Deep reinforcement learning method and device based on visual converter

The invention belongs to the technical field of artificial intelligence, and provides a deep reinforcement learning method and device based on a visual converter, and the method comprises the steps: constructing a deep reinforcement learning network structure based on the visual converter, wherein the visual converter comprises a multi-layer perceptron and a conversion encoder, and the conversion encoder comprises a multi-head attention layer and a feedforward network; initializing the weight of the deep reinforcement learning network, and constructing an experience playback pool according to the memory capacity; generating empirical data and putting the empirical data into an empirical playback pool through interaction between a greedy strategy and an operating environment; when the number of samples in the experience playback pool meets a preset value, randomly extracting a batch of training sample images from the training sample images, preprocessing the training sample images, and inputting the preprocessed training sample images into a deep reinforcement learning network for training; and when the deep reinforcement learning network satisfies a convergence condition, obtaining a reinforcement learning model. According to the method and device, the blank of application of the visual converter in the reinforcement learning field can be filled, the interpretability of the reinforcement learning method is improved, and learning training is more effectively carried out.
Owner:INFORMATION SCI RES INST OF CETC

Embryo development potential prediction method and system, equipment and storage medium

ActiveCN113469958AReal-time prediction of euploid probabilityImage enhancementImage analysisAnembryonic gestationEngineering
The invention relates to the technical field of medical artificial intelligence, in particular to an embryo development potential prediction method and system, equipment and a storage medium. The method comprises the steps: inputting an embryo initial image into a blastocyst prediction model, and obtaining an embryo feature vector; inputting the embryo feature vector into a bidirectional long-short-term memory network to obtain embryo development features; based on a cross-modal feature fusion mechanism, obtaining fusion features according to clinical data and the embryonic development features; and inputting the fusion features into a first multi-layer sensor, and predicting to obtain the embryo pregnancy rate. According to the invention, multi-focal-segment embryo videos shot in the early stage are analyzed, and the fusion features with the space-time characteristic is obtained by utilizing a multi-focal-segment selection model and a time transfer model, so that the pregnancy rate of the embryo cultured in vitro is predicted in real time, and the prediction accuracy is improved; meanwhile, by predicting the blastocyst forming probability and the euploid probability, doctors are assisted in early embryo screening, and therefore the labor cost is reduced.
Owner:THE FIRST AFFILIATED HOSPITAL OF SUN YAT SEN UNIV

Pedestrian re-identification three-dimensional data set construction method and device based on neural radiation field

InactiveCN114758081AThe way to obtain data is straightforwardLow costDetails involving processing stepsNeural architecturesPattern recognitionData set
The invention discloses a pedestrian re-identification three-dimensional data set construction method and device based on a neural radiation field. The method comprises the following steps: S1, carrying out image acquisition on a pedestrian to be input through a group of cameras with different visual angles; s2, sampling through camera rays in a scene to generate a three-dimensional space position point set, and converting the observation direction of a camera corresponding to the three-dimensional space position point set into a three-dimensional Cartesian unit vector; s3, inputting the three-dimensional space position point set and the observation direction, converted into the three-dimensional Cartesian unit vector, of the three-dimensional space position point set into a multi-layer sensor, and outputting corresponding density and color; according to the pedestrian re-identification three-dimensional data set construction method and device based on the neural radiation field, a brand-new pedestrian re-identification data set construction method is provided, and a new thought for data set construction is provided. Compared with a traditional data set construction method, the data acquisition mode is more direct and clear through images and spatial positions acquired by multiple devices.
Owner:ZHEJIANG LAB
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products