Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

784 results about "Cross-validation" patented technology

Cross-validation, sometimes called rotation estimation or out-of-sample testing, is any of various similar model validation techniques for assessing how the results of a statistical analysis will generalize to an independent data set. It is mainly used in settings where the goal is prediction, and one wants to estimate how accurately a predictive model will perform in practice. In a prediction problem, a model is usually given a dataset of known data on which training is run (training dataset), and a dataset of unknown data (or first seen data) against which the model is tested (called the validation dataset or testing set). The goal of cross-validation is to test the model's ability to predict new data that was not used in estimating it, in order to flag problems like overfitting or selection bias and to give an insight on how the model will generalize to an independent dataset (i.e., an unknown dataset, for instance from a real problem).

Encrypted traffic analysis feature extraction method and system, storage medium and safety equipment

The invention belongs to the technical field of network security and communication, and discloses an encrypted traffic analysis feature extraction method and system, a storage medium and security equipment. The method comprises the steps: collecting original traffic data ; preprocessing the collected original data packet, and filtering traffic data of SSL / TLS encryption communication; performing deep packet analysis on the streaming data to generate a traffic analysis log; performing log aggregation according to the connection quadruple information and the index information in each log to forma flow feature call chain, and performing feature extraction according to the call chain to obtain an initial data set; and determining an optimal supervised learning classification algorithm in thecurrent environment, determining an optimal parameter by using a grid parameter optimization method, and evaluating the feature extraction accuracy by using a ten-fold cross validation method. According to the method, the classification effect is optimal by adopting the random forest algorithm, the obtained accuracy is as high as 99.96%, the result shows that SSL / TLS encryption features used by all malicious families are different, and the classification effect is remarkable.
Owner:XIDIAN UNIV

Data-based qualitative analysis method of influence factors of circuit breaker faults

The invention discloses a data-based qualitative analysis method of influence factors of circuit breaker faults. The method solves prior problems of insufficient utilization of circuit breaker fault data, too subjective positioning of fault causes, insufficient stability of a qualitative analysis model and the like. According to the method, the key influence factors of all types of the circuit breaker faults are found through mining and analyzing the circuit breaker fault data. The method includes the realization steps of defining an original fault data set; cleaning the fault data; transforming the fault data; reducing the fault data; constructing a qualitative analysis model of the influence factors of the faults, and carrying out ten times of ten-fold cross-validation; and obtaining association rules between the faults and attributes, and obtaining the influence factors of all the types of the circuit breaker faults by qualitative analysis. According to the method, a supervised learning algorithm of CMAR is utilized for modeling, and with data amount increasing, the accuracy of the model will be continuously improved. At the same time, the robustness of the qualitative analysis model is guaranteed by the ten times of the ten-fold cross-validation, and the influence factors of the circuit breaker faults can be effectively analyzed.
Owner:无锡启工数据科技有限公司

Treated sewage quality prediction method based on combination of support vector classification and GRU neural network

The invention discloses a treated sewage quality prediction method based on combination of support vector classification and a GRU neural network, and belongs to the technical field of sewage treatment. Missing value processing, abnormal value elimination and data standardization are carried out on the collected sewage historical data, a PCA principal component analysis method is adopted to carryout dimension reduction on the data, and the selected auxiliary variable is used as an input variable of a sewage quality prediction model; a sewage effluent key prediction model is established by adopting a GRU neural network suitable for processing time series data, a support vector machine model is firstly introduced to classify sewage quality data, and then the classified data is respectivelymodeled through the GRU neural network algorithm to predict effluent quality. When the SVM model is trained, a grid search method and a cross validation method are used for optimizing model parameters, the prediction precision of the obtained joint prediction model is more accurate, the model effect is better, the network performance can meet the actual application requirements, and accurate prediction of the effluent quality of the sewage treatment system can be realized.
Owner:HEFEI UNIV

Customs declaration data processing method and device, computer equipment and storage medium

The invention relates to the field of big data processing, in particular to a customs declaration data processing method and device, computer equipment and a storage medium. The method comprises the following steps: receiving customs declaration data sent by a declaration main body, wherein the customs declaration data carries an order identifier, a payment order identifier and a waybill identifier; obtaining encrypted order data, encrypted payment order data and encrypted waybill data from corresponding nodes of the block chain according to the order identifier, the payment order identifier and the waybill identifier; obtaining a directionally authorized decryption key corresponding to the encrypted order data, the encrypted payment bill data and the encrypted waybill data; decrypting thecorresponding data through the obtained decryption key of the directional authorization; performing cross validation on the decrypted order data, payment data and waybill data to obtain a cross validation result; and when the cross validation result shows that the order data, the payment order data and the waybill data are inconsistent, increasing the check rate of customs clearance articles corresponding to the customs clearance data. By adopting the method, the data security can be ensured.
Owner:ONE CONNECT SMART TECH CO LTD SHENZHEN

Abnormity detection method for enterprise industry classification

The invention discloses an abnormity detection method for enterprise industry classification, which comprises the following steps of: firstly, extracting to-be-mined text and non-text information in taxpayer industry information, and carrying out feature processing and coding processing; Secondly, constructing a deep network structure conforming to the industry classification abnormity detection problem, and determining the number of neurons of an input layer and an output layer of the network according to the characteristic dimension of the coded data; Thirdly, on the basis of the constructeddeep network structure, adopting different training strategies to train the industry large-class network and the industry detail network through cross validation; And finally, carrying out abnormitydetection on the industry large class by using dimension reduction characteristics of the industry large class network in combination with an SOS abnormity detection algorithm, and carrying out abnormity detection on industry details according to reconstruction characteristics of the industry detail network. According to the invention, the TADM model is utilized to carry out abnormal detection onthe original data, and macroscopic management work such as national statistics, tax collection and industrial and commercial management can be analyzed more reasonably and accurately.
Owner:XI AN JIAOTONG UNIV

Method for determination of PCTFE content in PBX explosive by near infrared spectrum

The invention discloses a method for determination of the polychlorotrifluoroethylene (PCTFE) content in a polymer-bonded explosive (PBX) by near infrared spectroscopy. The method includes the steps of: preparing and collecting 260 PBX samples, taking 180 samples of them as a calibration set for establishing a calibration model, taking the remaining 80 samples as a validation set for model validation, and acquiring the near infrared spectrum data of all the samples; using a standard method to determine the PCTFE content in the samples; subjecting the spectrum data of the validation set samples in the wave bands of 6102.0cm<-1>-5697.0cm<-1> and 4680.2cm<-1>-4242.9cm<-1> to a first order derivative treatment, correlating the treated spectrum data with the PCTFE content by a partial least squares method, and establishing the calibration model by cross validation; employing the calibration model to predetermine the PCTFE content of the validation set samples, and selecting an optimal model according to a minimum root mean squared error of prediction (RMSEP) of the validation set; and acquiring the near infrared spectrum data of the samples to be determined, and making use of the optimal model to obtain the PCTFE content directly. Being suitable for determination of the PCTFE content in a PBX explosive, the method has the characteristics of convenient operation, and rapid and accurate analysis.
Owner:XIAN MODERN CHEM RES INST

A method for carrying out transformer area user identification based on optimized supervised learning

InactiveCN109816033AReliable Identification ResultsReduce hardware costsCharacter and pattern recognitionTransformerEngineering
The invention relates to the field of data analysis, and discloses a method for carrying out transformer area user identification based on optimized supervised learning. The method comprises the following steps of determining a user with a known station user topological relation and a station area and a phase to which the user belongs, determining a corresponding tag of user data according to thestation area and the phase to which the user belongs, establishing a training set, a verification set and a test set, determining k parameters in a KNN model by adopting a cross verification mode, andcompleting model training; and identifying and classifying the voltage data to be identified by adopting the trained model and the determined k value, thereby realizing the identification of the users in the transformer area to be identified. According to the invention, conversion from unsupervised learning to supervised learning is realized; a training set, a verification set and a test set arereasonably set, and k parameters are determined by adopting a cross verification mode, so that the transformer area and the phase of a user are accurately and effectively identified, the problem of cross-transformer-area user ownership is thoroughly solved, and a foundation is laid for comprehensively guiding work in the fields of operation, maintenance, first-aid repair, technical improvement, planning and the like of a low-voltage transformer area.
Owner:SICHUAN ENERGY INTERNET RES INST TSINGHUA UNIV

Method for solving collaborative filtering recommendation data sparsity based on neural network

The invention provides a method for solving collaborative filtering recommendation data sparsity based on a neural network. The method for solving collaborative filtering recommendation data sparsity based on the neural network adopts generalized regression of neural network (GRNN) and conducts full filling on sparse data by a train network model and score prediction. The method for solving collaborative filtering recommendation data sparsity based on the neural network comprises the following steps: before conducting the GRNN training, conducting screening on input variables of the neural network by adopting mean impact value (MIV), choosing characteristic values having great impact on output as valid input variables; using the valid input variable to construct the input matrix of the GRNN; adopting Kfold cross validation circulation to find out an optimal spread value of the GRNN; using the optimal spread value and the corresponding input matrix and output matrix to conduct GRNN training; using the trained GRNN to conduct score prediction on a sparse score matrix; and replacing non-scored data of the sparse score matrix with predicted score values. The method for solving collaborative filtering recommendation data sparsity based on the neural network can conduct fully filling on sparse recommendation data, solve the data height sparsity problem most outstanding in existing collaborative technology, and enable recommendation result to be accurate.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Microgrid energy storage SOC estimation method and system based on least square support vector machine

The invention discloses a microgrid energy storage SOC estimation method and a system based on a least square support vector machine. Firstly, a microgrid experiment platform comprising an energy storage system is used, and through measurement, voltage, current, temperature and an SOC data sequence of the energy storage system during a charging and discharging process are obtained; according to the measurement data, training set and test set samples for the least square support vector machine are obtained; a cross validation method is then used for selecting the optimal training parameters for the training set of the least square support vector machine, and a training model is further obtained; and the testing set is then used for testing the training model, the result is evaluated, and finally, the training model can be used for SOC estimation determination on the corresponding type of energy storage system in the microgrid. Defects in the traditional SOC calculation method can be overcome, reference is provided for charging and discharging control on the energy storage system, deep charging and discharging of a battery can be prevented, and the service life of the battery and the operation safety of the microgrid can be improved.
Owner:NARI TECH CO LTD +5

Scanning signal feature extraction method based on independent component analysis and recognition method

The invention discloses a scanning signal feature extraction method based on independent component analysis. The method comprises the steps that six-lead scanning eye electric signals are collected and subjected to band-pass filtering treatment, an airspace filter set corresponding to different scanning task backgrounds is built through an ICA method for filtered data, linear protection is carried out, and airspace feature parameters of scanning signals are obtained. The invention further discloses a recognition method of the scanning signal feature extraction method based on independent component analysis. An ICA airspace filter set is built for each experiment sample in an eye movement database, feature parameters are extracted, cross validation is carried out through a support vector machine, and the optimal ICA filter set and SVM model parameters are determined; the optimal ICA airspace filter set is used for filtering, and then the result is fed into an SVM classifier to be recognized. The scanning signal feature extraction method based on independent component analysis and the recognition method have the advantages of being higher in recognition accuracy rate, higher in expansibility, good in application prospect and the like.
Owner:ANHUI UNIVERSITY

Shopping mall building air conditioner cooling load prediction method based on GBDT, storage medium and equipment

PendingCN112001439AFlexible handlingSolve the problem of requiring a large amount of data trainingForecastingCharacter and pattern recognitionSimulationEngineering
The invention discloses a shopping mall building air conditioner cooling load prediction method based on GBDT, a storage medium and equipment, and the method comprises: collecting cooling load data, and carrying out the normalization processing to serve as the cooling load energy consumption prediction; establishing a load prediction model based on a gradient lifting decision tree algorithm; inputting the preprocessed data into a prediction model for training, selecting a grid search-cross validation mode, and optimizing the three hyper-parameters with the maximum influence on the performanceof the GBDT model; establishing a final cold load prediction model by completing parameter optimization of the prediction model, and obtaining a predicted cold load curve according to the parameters and the structure of the prediction model; and evaluating the prediction performance of the prediction model, adopting the prediction error for evaluation, enabling the deviation between the true valueand the prediction value to form the prediction error, and completing mall building air conditioner cooling load prediction. The method has good prediction precision, universality and applicability,and is especially suitable for large public buildings with periodically changing cold loads.
Owner:XI'AN UNIVERSITY OF ARCHITECTURE AND TECHNOLOGY

Cutter residual life prediction method based on machine learning regression algorithm

The invention relates to the field of machine tool cutter remaining life prediction, and discloses a cutter residual life prediction method based on a machine learning regression algorithm. The cutterresidual life prediction method comprises two parts including model training and online life prediction. The model training comprises the steps of collecting original data of a complete life cycle and establishing a corresponding relation with the actual life of a cutter, preprocessing signals, extracting signal features to form feature vectors, performing cross validation to obtain an optimal cutter life model, and performing hyper-parameter adjustment and optimization. The online life prediction comprises real-time data acquisition, signal preprocessing, signal feature extraction to form afeature vector, input of THE optimal cutter life model based on optimal hyper-parameters and output of the residual life of the cutter. The number of eigenvalues extracted from each channel during model training is large, so that the training precision is high, the residual life of the cutter is accurately predicted, a residual life intelligent prediction model of the cutter is established, different regression models can be intelligently selected according to different working condition environments, and the model is good in generalization performance and high in portability.
Owner:南京凯奥思数据技术有限公司

Protein lysine malonylation site prediction method based on deep learning

The invention discloses a protein lysine malonylation site prediction method based on deep learning, and relates to the technical field of biological information. The method comprises the steps: converting the character information of a protein sequence into a numerical vector by adopting an enhanced amino acid composition, a grouped enhanced amino acid composition, a dipeptide deviation expectedaverage value, a K neighbor score and a BLOSUM62 matrix feature extraction algorithm, performing fusion and obtaining a feature space, wherein the influence of each potential feature on a prediction result is fully considered; performing calculating by using a linear convolutional neural network to obtain malonyl site specificity characteristics; selecting related features and reducing feature dimensions through a maximum pooling layer, classifying malonylation sites and non-malonylation sites in combination with a multi-layer deep neural network, constructing a protein malonylation site prediction model DeepMal, and evaluating prediction performance by using 10-fold cross validation and an independent test data set. The model DeepMal is remarkably improved in evaluation indexes, and further promotion of application of deep learning in protein function prediction is facilitated.
Owner:QINGDAO UNIV OF SCI & TECH

Protein-protein interaction site prediction method based on deep map convolutional network

PendingCN113192559AInteraction site prediction problem solvingImprove the accuracy of interaction predictionNeural architecturesHybridisationBiologyNeutral network
The invention discloses a protein-protein interaction site prediction method based on a deep map convolutional network; the method comprises the following steps: extracting a node feature matrix and an adjacent matrix containing side information according to the sequence and structure information of protein, and forming protein map representation together; carrying out deep map convolution based on an initial residual error and identical mapping; inputting the output of the last layer of image convolution layer of the deep image convolution into a multi-layer perceptron, and completing construction of the deep image convolution neural network; extracting training data to obtain protein map representation, and training the deep map convolutional neural network by adopting a five-fold cross validation method; and extracting to-be-detected data to obtain protein map representation, and inputting the protein map representation into the trained deep map convolutional neural network to realize prediction of protein-protein interaction sites. According to the method, protein space structure information can be fully utilized, and the accuracy of protein-protein interaction site prediction is further improved.
Owner:SUN YAT SEN UNIV

Flow prediction and carrier turn-off method and system based on time sequence

The invention relates to a carrier turn-off method and system based on time sequence prediction. The flow prediction method comprises the following steps: acquiring historical data of a target base station cell, and extracting features based on a time sequence through an associated field; activating a multi-model fusion algorithm, selecting a plurality of models as a stacked first layer accordingto the extracted features to perform multi-fold cross validation, and combining output results; and based on the prediction algorithm model of the designated stacking second layer, retraining an output result after multi-fold cross validation to obtain a target prediction value and a precision index. According to the carrier turn-off method, on the basis of a flow prediction step, a specific cellbase station is enabled to carry out carrier turn-off or turn-off operation at a predicted time node by utilizing prediction data of a certain time period. The system in the scheme of the invention comprises a computing device and a module, and is used for realizing the method. According to the scheme of the invention, an accurate flow prediction result can be obtained, the energy consumption of the base station can be greatly reduced, and the operation cost of a communication system is reduced.
Owner:FUJIAN NANHONG COMM TECH

Real-time prediction method for drilling leakage layer position based on decision tree data mining

The embodiment of the invention provides a real-time prediction method for a drilling leakage layer position based on decision tree data mining. The method comprises the steps that related historicaldrilling data, historical drilling leakage layer position true values and real-time drilling data of a target block are collected, and data preprocessing is conducted; dividing the preprocessed historical drilling data into a training set and a test set, selecting the historical drilling data in the training set as input and a historical drilling leakage layer position true value as a standard, and performing data mining by adopting a decision-making tree algorithm to form a leakage layer position prediction initial decision-making tree; pruning the initial decision tree through the data of the test set, selecting an optimal sub-tree through cross validation and performing precision evaluation, and regenerating the decision tree if the precision requirement is not met; and generating a real-time prediction model of the drilling leakage layer position according to the classification rule determined by the final decision tree, and connecting the model with a drilling data real-time collection platform, thereby realizing an effect of real-time prediction of the drilling leakage layer position.
Owner:SOUTHWEST PETROLEUM UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products