Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

515results about How to "Improve discrimination ability" patented technology

Human skeleton behavior recognition method and device based on deep reinforcement learning

The invention discloses a human skeleton behavior recognition method and device based on deep reinforcement learning. The method comprises: uniform sampling is carried out on each video segment in a training set to obtain a video with a fixed frame number, thereby training a graphic convolutional neural network; after parameter fixation of the graphic convolutional neural network, an extraction frame network is trained by using the graphic convolutional neural network to obtain a representative frame meeting a preset condition; the graphic convolutional neural network is updated by using the representative frame meeting the preset condition; a target video is obtained and uniform sampling is carried out on the target video, so that a frame obtained by sampling is sent to the extraction frame network to obtain a key frame; and the key frame is sent to the updated graphic convolutional neural network to obtain a final type of the behavior. Therefore, the discriminability of the selectedframe is enhanced; redundant information is removed; the recognition performance is improved; and the calculation amount at the test phase is reduced. Besides, with full utilization of the topologicalrelationship of the human skeletons, the performance of the behavior recognition is improved.
Owner:TSINGHUA UNIV

Semi-supervised mechanical fault diagnosis method based on adaptive migration neural network

The invention discloses a semi-supervised mechanical fault diagnosis method based on an adaptive migration neural network, and the method comprises the steps: firstly obtaining a plurality of source domain fault data sets composed of source domain fault training samples and corresponding tags, and a plurality of target domain fault data sets composed of target domain fault data without tags, wherein the target domain fault data is divided into a target domain fault training sample and target domain fault test data; normalizing the data; constructing an adaptive migration neural network diagnosis model, supervising the training model and constructing a classifier loss function by using the source domain fault data set, constructing a classifier discrimination loss function, and performing adversarial training on the feature extractor and the classifier by using the target domain fault training sample; inputting the target domain fault test data into the trained model, and summing and averaging the two output probability values to obtain a final classification diagnosis result. The method can improve the discrimination capability of the fault data of the target domain, and effectively improves the intelligent fault diagnosis task under the actual variable working condition.
Owner:SOUTH CHINA UNIV OF TECH

A behavior recognition method of depth supervised convolution neural network based on training feature fusion

The invention provides a behavior recognition method of depth supervised convolution neural network based on training feature fusion, belonging to the artificial intelligence computer vision field. This method extracts multi-layer convolution features of target video, designs local evolutionary pooling layer, and maps video convolution features to a vector containing time information by using local evolutionary pooling layer, thus extracts local evolutionary descriptors of target video. The local evolutionary descriptors of target video are extracted by using local evolutionary pooling layer.By using VLAD coding method, multiple local evolutionary descriptors are coded into meta-action based video level representations. Based on the complementarity of the information among the multiple levels of convolution network, the final classification results are obtained by integrating the results of the multiple levels of convolution network. The invention fully utilizes the time information to construct the video level representation, and effectively improves the accuracy of the video behavior recognition. At the same time, the performance of the whole network is improved by integrating the multi-level prediction results to improve the discriminability of the middle layer of the network.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Hyper-spectral image classification method based on recurrent neural network

The invention discloses a hyper-spectral image classification method based on recurrent neural network with the object to solving the problems that in prior art, the input characteristic determination ability is weak and that the extraction of local spatial characteristics is not complete. The method comprises the following steps: 1) extracting the spatial texture characteristics and the sparse representation characteristics of a hyper-spectral image and piling and combining them as the low-level characteristics; 2) extracting from the low-level characteristics the sample local spatial sequence characteristics; 3) according to the local spatial sequence characteristics, creating a recurrent neural network model; and utilizing the training sample local spatial sequence characteristics to train the recurrent neural network model parameters; and 4) inputting the testing sample local spatial sequence characteristics into the well-trained recurrent neural network model; obtaining the highly abstract high-level semantic characteristics and obtaining the classification information of the testing sample. According to the deep learning method of the invention, the correct efficiency for hyper-spectral image classification is increased and the method can be used for vegetation investigation, disaster monitoring, map making and intelligence obtaining.
Owner:XIDIAN UNIV

Target tracking method based on multi-characteristic adaptive fusion and kernelized correlation filtering technology

The invention provides a target tracking method based on multi-characteristic adaptive fusion and kernelized correlation filtering technology. The method comprises steps of according to target position and the dimension of the previous frame tracking, acquiring a candidate region of target motion; extracting histogram characteristics and color characteristics in the gradient direction of the candidate region, fusing the two kinds of characteristics, carrying out Fourier transform so as to obtain a characteristic spectrum and then calculating kernelized correlation; determining the position and the dimension of the target at the current frame, and acquiring a target region; extracting histogram characteristics and color characteristics in the gradient direction of the target region, fusing the two kinds of characteristics, carrying out Fourier transform so as to obtain a characteristic spectrum and then calculating kernelized self-correlation; designing the adaptive target correlation and training a position filter model and a dimension filter model; and using a linear interpolation method to update the characteristic spectrums and the related filters. According to the invention, the discrimination capability of the models is improved; robustness of the target tracking of the target in a complex scene and the appearance change is improved; calculation complexity is reduced; and tracking timeliness is improved.
Owner:NANJING UNIV OF SCI & TECH

Unsupervised cross-domain pedestrian re-identification method and system

The invention discloses an unsupervised cross-domain pedestrian re-identification method and an unsupervised cross-domain pedestrian re-identification system. The method comprises the following steps:constructing a source domain training set and a target domain training set; converting the training images in the source domain training set into a target domain, and generating image data related tothe target domain; training an initial pedestrian re-identification model by utilizing the generated image data; extracting local features of each training image in the target domain training set based on the trained pedestrian re-identification model; performing clustering analysis on training image data in a target domain training set by using the extracted local features; determining an optimal training sample in the target domain training set based on a clustering analysis result; utilizing the generated image data and the determined optimal training sample to retrain the pedestrian re-identification model, and repeating in sequence until an iteration stop condition is reached to obtain a final pedestrian re-identification model; and obtaining to-be-identified image data in the targetdomain, and identifying the to-be-identified image data by using the finally obtained pedestrian re-identification model.
Owner:中科人工智能创新技术研究院(青岛)有限公司

Facial expression identification method based on multi-task convolutional neural network

The invention discloses a facial expression identification method based on multi-task convolutional neural network. The expression identification method comprises the following steps: firstly, designing a multi-task convolutional neural network structure, and sequentially extracting low-level semantic features shared by all expressions and a plurality of single-expression distinguishing characteristics in the network; then adopting multi-task learning and simultaneously learning learning tasks of the plurality of single-expression distinguishing characteristics and multi-expression identification tasks; monitoring the all tasks of the network by using combined loss, and balancing the loss of the network by using the two loss weights; finally, acquiring a final facial expression identification result from a maximum flexible classification layer arranged at the last of a model according to the trained network model. Characteristic extraction and expression classification are put in an end-to-end framework to be learned, the distinguishing characteristics are extracted from input images, and expression identification on the input images are reliably carried out. Experimental analysisshows that the algorithm is excellent in performance, complicated facial expressions can be effectively distinguished, and good identification performance on a plurality of published data sets can beachieved.
Owner:XIAMEN UNIV

Voice generation method and device based on generative adversarial network

The invention discloses a voice generation method based on a generative adversarial network. According to the method, randomly-generated noise data meeting Gaussian distribution is converted into a simulation sample through a generative model; as the simulation sample does not have the language content, when the generative model and a discrimination model are circularly updated, generative capacities required to be learned by the generative model and discrimination capacities required to be learned by the discrimination model are correspondingly increased, and accordingly the generative capacities of the generative model and the discrimination capacities of the discrimination model are improved; when a contrast value between a training sample and the simulation sample is smaller than or equal to a preset threshold value, it is thought that the generative model has the capacity of generating real data; a voice database generated through the generative model has enough reality, and the recognition rate can be increased when the generative model is applied to identity recognition. Correspondingly, the voice generation method, a voice generation device and voice generation equipment based on the generative adversarial network and a computer readable storage medium have the same advantages.
Owner:SPEAKIN TECH CO LTD

Fine-grained image classification method and system, computer equipment and storage medium

The invention discloses a fine-grained image classification method and system, computer equipment and a storage medium. The fine-grained image classification method comprises the steps of: establishing a fine-grained image classification network which is a double-branch network for attention inhibition and attention enhancement and comprises a residual network and an attention layer; acquiring a training set, wherein the training set is composed of a plurality of training images; training the fine-grained image classification network by using the training set, and acquiring a fine-grained image classification model by using a gradient-propelled maximum value and minimum value cross entropy loss function; acquiring a to-be-classifiedimage; and inputting the to-be-classified image into the fine-grained image classification model, so that the to-be-classified image flows in the residual network and does not pass through the attention layer, and a category prediction result is obtained. The fine-grained image classification method and the system is implemented based on weak supervised learning and an attention mechanism, and the fine-grained image classification model obtained throughtraining can realize a good fine-grained image classification effect.
Owner:SOUTH CHINA UNIV OF TECH

CT image super-resolution reconstruction method based on generative adversarial network

The invention belongs to the technical field of computed tomography image processing. According to the specific technical scheme, the CT image super-resolution reconstruction method based on the generative adversarial network comprises the following specific steps: 1, establishing a dense connection relationship among different residual blocks based on a multi-stage dense residual block generatornetwork; 2, adding a bottleneck layer to the front end of each dense residual block; 3, optimizing the global network by adopting the Wasserstein distance loss and the VGG feature matching loss; 4, arranging a multi-path generator based on the sequence from thick to thin; 5, generating an image based on conditional expression generative adversarial learning; 6, reconstructing a CT image super-resolution reconstruction framework of the generative adversarial network based on multiple paths of conditions from coarse to fine; 7, reconstructing a loss function. According to the method, network redundancy is reduced, feature multiplexing among different residual blocks is realized, the maximum information transmission of the network is realized, the feature utilization rate is improved, and thereconstructed image quality is greatly improved.
Owner:TAIYUAN UNIVERSITY OF SCIENCE AND TECHNOLOGY

Intelligent driving local track fault tolerance planning method based on lane lines and GPS following

The invention discloses an intelligent driving local track fault tolerance planning method based on lane lines and GPS following. The method includes steps: firstly initializing a following mode and establishing an intelligent driving vehicle coordinate system; secondly discriminating the recognition states of GPS data and the lane lines according to GPS and lane line information; then calculating a fault tolerance deviation according to the discriminated recognition states, and updating the following mode; and finally performing local path and track planning based on the new following mode. According to the method, the validity of various data is determined, the accuracy of subsequent calculation is improved, the fault tolerance deviation is designed based on various data states, the fault tolerance deviation is dynamically updated in real time, the system complexity is simplified, the practical application is easy, and the robustness of data processing is improved; and real-time state transition of the following mode is performed according to multi-sensing data including the GPS and the lane lines, continuous and smooth control of multiple following states is realized, and the comfort and the stability of intelligent-driving vehicles are improved.
Owner:CENT SOUTH UNIV

Similarity-weight-semi-supervised-dictionary-learning-based human behavior identification method

The invention discloses a similarity-weight-semi-supervised-dictionary-learning-based human behavior identification method. With the method, a problem of low human behavior identification rate of the existing supervision method in the prior art can be solved. The identification method comprises: (1), an inputted data set is divided into test samples and training samples; (2), local feature detection is carried out on all samples and local features with the labeled samples are selected randomly to obtain an initialized dictionary; (3), according to the initialized dictionary, dictionary learning is carried out by using a semi-supervised method; (4), group sparse coding is carried out on all samples by using the learned dictionary to obtain a coding matrix of each sample; (5), vectorization is carried out on the coding matrix of each sample to obtain a final expression; and (6), testing sample classification is carried out by using the final expression of each sample and a sparse representation classification method to complete human behavior identification in the testing samples. Therefore, discrimination of dictionary learning is enhanced; the human behavior identification rate is improved; and the method can be used for target detection in a video.
Owner:XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products