Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

63 results about "Concept drift" patented technology

In predictive analytics and machine learning, the concept drift means that the statistical properties of the target variable, which the model is trying to predict, change over time in unforeseen ways. This causes problems because the predictions become less accurate as time passes.

Data stream anomaly detection system based on empirical features and convolution neural network

The invention discloses a data stream anomaly detection system based on empirical characteristics and convolution neural network. The system includes an empirical feature extraction module, which is used to identify statistical features and header features as features based on artificial experience, which play a more important role in data packet anomaly recognition; a bit stream conversion picture module used to convert the data stream into the form of two-dimensional gray-scale picture, and then through the convolutional neural network perception, the global high-level perception features are extracted; a fusion splicing module used for fusing the above modules as the data stream characteristics and identifying abnormal data streams by using the full connection layer of the neural network; a distillation model module that replaces complex networks in actual deployment; a concept drift fine-tuning module updates the detection model of concept drift; an update experience database module adding new network attacks or hidden attack instructions to the artificial experience database. The invention accurately and efficiently detects abnormal behaviors such as network failure, user misoperation, network attack and the like.
Owner:ARMY ENG UNIV OF PLA

Selective up-sampling combined method for weighted ensemble classification prediction of unbalanced data flows

The invention relates to the technical field of data mining, and discloses a selective up-sampling combined method for weighted ensemble classification prediction of unbalanced data flows. The method comprises the following steps of: screening minority class samples of history data blocks according to a similarity, and selecting samples closest to the current training data block in the aspect of concept; synthesizing the selected samples into new samples in a decision boundary area so as to selectively implement up-sampling; and carrying out weighted ensemble classification on the new sample by adoption of a probability distribution relevancy-based weight distribution strategy. According to the method, the minority class sample information is effectively increased through selecting history data with high similarities and synthesizing new data at the boundary area, so that the decision domain of the minority class is enlarged; and meanwhile, in order to adapt the dynamic data with concept drift and use an ensemble classification thought, the probability distribution relevancy-based weight distribution strategy is designed, so that the overall classification precision is enhanced. Experiment results show that the method is capable of effectively improving the minority class identification rate and the overall classification performance, and has the advantage of better processing the unbalanced data flows.
Owner:NORTHEASTERN UNIV

Self-adaptive trojan communication behavior detection method on basis of dynamic feedback

ActiveCN103532949AEliminate redundancyReduce false positive informationTransmissionRelevant informationSimilarity analysis
The invention discloses a self-adaptive trojan communication behavior detection method on the basis of dynamic feedback, which comprises the steps of processing trojan detection alarm information, constructing a sample set for dynamic feedback learning by utilizing the alarm information, and determining updating opportunity of detection by detecting concept drift of a data stream, wherein the step of processing the trojan detection alarm information comprises the sub-steps of carrying out merging and association processing on the alarm information which is subjected to standard description, then establishing an intrusion track event and storing the intrusion track event into an intrusion event table. According to the invention, aiming at the problem of self-adaption of information stealing trojan detection, the information stealing trojan detection alarm information is analyzed, methods of similarity analysis, clustering analysis and the like are combined, related information of a target IP (Internet Protocol) is acquired additionally by driving detection, the sample set for dynamic feedback learning is constructed by the alarm information, an increment support vector machine algorithm is used as an algorithm for dynamic feedback learning, and the updating opportunity of a detection system is determined by detecting the concept drift of the data stream.
Owner:PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU

Multi-model malicious code detection method based on reliability probability interval

The invention provides a multi-model malicious code detection system based on reliability probability interval. Each machine learning detection model corresponds to a distribution of the underlying data, and various threshold-based detection models can be integrated into the statistical platform, so that the distribution of the semantic code data is detected from the multi-angle view, and the model degradation problem caused by the concept drift is relieved. The detection system changes the prediction mode of 0 or 1 of the existing machine learning detection model, calculates the score based on the existing detection model, carries out statistical analysis, and establishes a isotonic regression function for the score distribution of the sample and the label of the sample. For an unknown sample, according to the score given by the existing detection model, the calculated isotonic regression function is input, the reliability probability interval of a certain label can be given, and theprobability interval can relieve the problem of over-fitting of the fixed threshold to the training data set, the adaptive ability of the detection model to the current dynamic data is improved, and the concept drift phenomenon is found in advance.
Owner:NANKAI UNIV

Big data stream type cluster processing system and method for on-demand clustering

ActiveCN103353883AEfficient use ofSolve problems that are processed quicklySpecial data processing applicationsComputer moduleConcept drift
The invention discloses a big data stream type cluster processing system for on-demand clustering. The system comprises a fast computation module, a data concept drift detection module and a clustering module, wherein an output end of the fast computation module is connected to a first input end of the clustering module through the data concept drift detection module, and the clustering module is connected to the fast computation module. According to the invention, aiming at characteristics of mass, similarity and repetition of the big data, an on-demand clustering model based on data concept drift detection adopts a triggered type clustering processing mode, the accuracy is guaranteed, and on-demand clustering and real-time clustering result services are provided; and secondly, a resource monitoring module and an independent module are provided for clustering processing, the prior traditional clustering algorithms are effectively utilized, expansibility and sensitivity of the system can be enhanced, and quick processing of the data stream in a big data environment is efficiently realized. The big data stream type cluster processing system for on-demand clustering can be widely applied to the field of data processing.
Owner:SOUTH CHINA NORMAL UNIVERSITY

Unbalanced-like network traffic classification method and device and computer equipment

The invention relates to the technical field of network traffic classification, and relates to an unbalanced-like network traffic classification method and device and computer equipment. The method comprises the steps of obtaining to-be-classified network traffic data, and extracting features of network traffic; deleting irrelevant features and redundant features by adopting a feature selection algorithm, and performing dimension reduction on the remaining features so as to select an optimal feature subset; and inputting the optimal feature subset into a weight-based multi-classifier, performing network traffic classification training in an incremental learning mode, optimizing classifier performance, and classifying the network traffic. Aiming at the problem of unbalanced distribution ofnetwork traffic samples, irrelevant features and redundant features are deleted, and the recognition rate of small categories is effectively improved on the premise of ensuring the overall classification accuracy; an incremental learning thought is introduced, the flexibility of model updating training is improved, and the model updating period is shortened; and by utilizing multiple classifiers based on weight, the influence caused by concept drift is reduced.
Owner:CHONGQING UNIV OF POSTS & TELECOMM

Concept drift detection method based on classification error rate and consistency prediction

ActiveCN112131575ATimely identification of degradation phenomenaEfficiently assess sustainabilityCharacter and pattern recognitionPlatform integrity maintainanceMisclassification errorComputational model
The invention provides a concept drift detection method based on a classification error rate and consistency prediction, and belongs to the technical field of computer machine learning and informationsecurity. According to the method, mutation type concept drift is detected by calculating a change of the classification error rate of a model, and then the progressive concept drift is detected by calculating a consistency degree of the samples with wrong classification and the samples with correct classification so that the mutation type concept drift and the progressive concept drift can be detected in time, and relatively low calculation overhead is kept. According to the method, detection of mutation type concept drift and progressive concept drift is achieved at low calculation cost, and a model degradation phenomenon is recognized in time. The method is mainly used for concept drift detection, can effectively act on early judgment of a degradation phenomenon of a machine learning classification model, and can be used as a performance monitoring method in various application fields such as automatic analysis and decision in a big data environment.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products