Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

101 results about "Variable screening" patented technology

For variable screening, the variables that induce larger output variances are selected as important variables. To determine important variables, hypothesis testing is used in this paper so that possible errors are contained a user-specified error levelin .

Bearing state noise diagnosis algorithm facing network variable screening and characteristic entropy fusion

The invention discloses a bearing state noise diagnosis algorithm facing network variable screening and characteristic entropy fusion. A sensor collects bearing operation noise signals. The noise signals are segmented according to a time sequence and form a sample set. A time-frequency domain characteristic of a sample is extracted so as to acquire a time-frequency-domain one-dimensional characteristic row vector. An average influence value algorithm is adopted to realize first characteristic variable screening so as to acquire a sensitive characteristic set, and through calculating a characteristic entropy of the sensitive characteristic set, characteristic secondary screening and dimensionality reduction are performed on an average influence value similarity characteristic so as to acquire a final characteristic set. A PSO or GA optimization support vector machine is used to carry out training and establish a fault diagnosis model so as to determine a bearing fault type and output aresult. In the invention, complementarity of a characteristic average influence value and the characteristic entropy based on a network in characteristic selection and characteristic classification isused; and a disadvantage that the characteristic selection and a neural network classification algorithm are mutually isolated in bearing noise diagnosis is overcome so that a time-frequency domain characteristic index well reflects a bearing operation state and a classification network characteristic.
Owner:CHINA UNIV OF MINING & TECH

Organic compound explosive characteristic prediction method based on genetic algorithm

The invention relates to a prediction method of organic compound blast characteristic based on genetic algorithm, and the method is determined by the molecular structure of organic compounds based on various blast characteristics of the organic compounds, and the molecular structure can be described by various parameters which reflect the characteristics of the molecular structure, namely, the organic compound blast characteristic can be expressed by the function of chemical construction parameters. The method firstly calculates structure parameters used for reflecting various molecular structure information according to the molecular structure of the organic compound, thus realizing parametric description of the molecular structure information; secondly, the genetic algorithm is applied to carry out characteristic variables screening, a group of parameters, which are taken as the descriptors of the molecular structure and closely related to the related blast characteristics and contain abundant structure information, are screened from a great amount of calculated structure parameters. Based on the above steps, an appropriate statistical modeling method is adopted to carry out statistical learning for the intrinsic quantitative relation between the selected descriptors and the related blast characteristics, thus obtaining a quantitative function model between the molecular structure and the related blast characteristics. The method has the advantages of simpleness, high prediction accuracy and providing a rapid, novel and accurate prediction method for the organic compound blast characteristics.
Owner:NANJING UNIV OF TECH

Variable selection method for modeling organic pollutant quantitative structure and activity relationship

The invention discloses a variable selection method for modeling an organic pollutant quantitative structure and activity relationship. The method comprises the following steps of: calculating linear models combined with all single variables and different bivariables, and retaining a certain number of optimal models for the single variables and the bivariables; then sequentially taking out a model from the retained bivariable linear models, and combining two of the variables and each of the rest variables to form a tri-variable model until all the retained bivariable models are processed; comparing the quality of the tri-variable models, and retaining a certain number of optimal tri-variable models; and repeating, and stopping calculation until the number of variables forming the models meets the requirement, wherein the quality of the models is based on an end standard represented by q2 or a root-mean-square deviation (RMSEV) which is calculated by leave-one-out cross validation (LOOCV) or leave-multiple-out cross validation (LMOCV). The theory is simple and can be understood easily and programmed easily; and the method is quick and effective, so that the rationality of variable selection and the stability of the forecast capacity of the models are guaranteed.
Owner:GUILIN UNIVERSITY OF TECHNOLOGY

Optical microwave cooperative inversion method and system for urban above-ground biomass

The invention discloses an optical microwave cooperative inversion method and system for an urban above-ground biomass. The method comprises the steps of obtaining an AGB observation value of a samplebiomass by use of a sample area single-plant parameter data set acquired by a ground surface observation test; performing pretreatment to obtain a canopy height value CHM, surface reflectance data and a backscattering coefficient; extracting various LiDAR variables based on the CHM data, and extracting multiple optical characteristic vegetation indices based on the optical surface reflectance data, and simultaneously extracting multiple microwave characteristic variables based on the microwave backscattering coefficient data; extracting a biomass estimation value of an LiDAR data coverage area; selecting a sample for subsequent modeling and verification by use of a stratified random sampling method with the biomass value of the LiDAR data coverage area as a training and verification sample set; screening out an optimal optical and microwave characteristic variable by use of a variable screening method; and constructing an AGB-inverted optical model, a microwave model and an optical and microwave cooperative model, and selecting an optimal model to realize biomass inversion.
Owner:WUHAN UNIV

Large chemical process distributed modeling method based on CCA-PLS

The invention discloses a large chemical process distributed modeling method based on CCA-PLS. All process variables are collected in the large-scale chemical process, output variables are determined according to a process quality index, feature components of the process variables are extracted by the adoption of a canonical correlation analysis method, the largest association coefficients between each output variable and all input variables and corresponding main axis vectors are calculated according to the feature components, independent input variables and interaction input variables of each subsystem are selected according to the absolute values of components of the main axis vectors, and accordingly a large system decomposition is achieved. Modeling is carried out on the subsystems by means of a PLS algorithm after subsystem division, components which enable the covariance between the input variables and the output variables to be the largest are extracted, and a subsystem model is obtained through a regression modeling technology. The large chemical process distributed modeling method has the advantages that only input and output data in the process are used, input variable screening of the subsystems is carried out by means of the canonical correlation analysis principle, the model dimension number is lowered, the model structure is simplified, subsystem modeling is carried out through the PLS modeling algorithm, difficulties in calculation caused by a large number of collinear variables in actual application are eliminated, and modeling precision is high.
Owner:NANJING UNIV OF TECH

Method for detecting content of trace elements in milk powder on basis of laser-induced breakdown spectroscopy

InactiveCN108333171AImprove accuracyOvercome Quantitative Analysis ImpactAnalysis by thermal excitationTime domainHigh density
The invention discloses a method for detecting the content of trace elements in milk powder on the basis of laser-induced breakdown spectroscopy. The method comprises the following steps of making a sample to be tested; building original spectrum data of the sample; testing the content of the trace elements of the sample; amplifying the original spectrum data into a wavelet coefficient carrying more time domain or frequency domain information through high density wavelet transform; performing variable screening by using an improved random leapfrog algorithm; thus screening out variables closely relevant to the tested elements to complete the spectrum pretreatment; combing the correction set spectrum data after the treatment with the measured trace element content; building a correction model by using a partial least squares regression method; obtaining the optimum prediction model of the trace element content in the milk powder. The method has the advantages that the defects of requirement of a great number of samples, long detection pretreatment time, complicated experiments and the like of the exiting method in trace element detection can be avoided. The goal of detecting the content of the trace elements in the milk powder in modes of high speed, great quantity and simple operation is achieved.
Owner:TIANJIN UNIV

Prediction method of aircraft scene taxiing time based on multiple regression analysis

The invention discloses a prediction method of aircraft scene taxiing time based on multiple regression analysis. Various factors affecting the aircraft scene taxiing time are comprehensively analyzed, according to different scene taxiing time data transformation ways and independent variable screening strategies, fundamental factors which affect the scene taxiing time are extracted, aircraft scene taxiing time prediction models based on multiple linear regression analysis and multiple curve regression analysis are established, and prediction errors of the established different models are comparatively analyzed, so as to achieve important significance for enhancing the prediction ability of the aircraft scene taxiing process in an airport flight area. The prediction method of the aircraft scene taxiing time based on the multiple regression analysis disclosed by the invention can accurately and fast predict the taxiing process of any approach and departure aircraft in pre-tactical and tactical levels, and can further provide a data reference basis for normal statistical standard formulation of strategic level flights, so that the problem of lower macro prediction ability in the airport aircraft taxiing process is effectively solved.
Owner:NANJING UNIV OF AERONAUTICS & ASTRONAUTICS

Combination optimization-based near-infrared non-invasive blood glucose detection wavelength variable screening method

The invention provides a combination optimization-based near-infrared non-invasive blood glucose detection wavelength variable screening method. The method comprises the following steps of firstly obtaining transmissivity spectral data of human blood glucose detection by adopting LED near-infrared light sources with different wavelengths; secondly selecting an optimal variable group in groups by using a continuous projection algorithm, a genetic algorithm and a gradual selection algorithm for multiple pieces of wavelength variable data by taking a root-mean-square error as an index in combination with linear regression modeling; thirdly performing scoring on each group of variables through a weighted scoring method; and finally recombining three groups of variables, sorting comprehensive scores in sequence from high to low, and selecting first few variables with highest scores as final auxiliary variables. According to an algorithm, a most effective wavelength combination is extracted through weighted combination optimization for the problem of an over-fitting phenomenon of a model due to excessive wavelength variables in regression modeling; and the variables are accurately selected, so that the model can be greatly simplified, the calculation complexity of the model is lowered, and the prediction capability and the real-timelines and robustness of non-invasive blood glucose detection are improved.
Owner:北京光巨力信息技术有限公司

Method for predicting temperature of furnace core dead stock column of ironmaking blast furnace based on multi-element linear regression algorithm

The invention discloses a method for predicting the temperature of a furnace core dead stock column of an ironmaking blast furnace based on a multi-element linear regression algorithm, and belongs tothe technical field of metallurgy information processing. According to the method, the temperature target value DMTgoal of the furnace core dead stock column is calculated, data is processed, Pearsoncorrelation analysis is carried out on the processed data sample, and condition variables are preliminarily selected according to the result of correlation analysis. Pearson correlation analysis is carried out on each condition variable, and mutually independent condition variables are selected as possible according to the correlation analysis result to establish a model. Condition variable is then screened by a least square method and an AIC-based variable screening criterion, the fitting degree and the regression coefficient of the primary multi-element linear regression equation are checked, and a multi-element linear regression model is obtained. According to the method, the multi-element linear regression algorithm is provided for the first time to predict the temperature of the furnace core dead stock column, the temperature of the furnace core dead stock column in the next five days can be predicted in a high-precision mode, and the early warning function of the temperature of the furnace core dead stock column can be realized.
Owner:ANHUI UNIVERSITY OF TECHNOLOGY

A wavelength selection method based on PCA modeling feedback load weighting

The invention discloses a wavelength selection method based on PCA modeling feedback load weighting. Based on PCA algorithm, according to the spectral detection data of different frequencies, Establishing and optimizing the metrology analysis model, weighting and combining the PCA load vector with modeling coefficient feedback, measuring the information contribution for each wavelength variable, and then choosing the information wavelength set with higher signal-to-noise ratio, can effectively reduce the number of wavelengths involved in modeling and reduce the complexity of the model; the wavelength combinations selected can be combined with various simple statistical algorithms such as linear discrimination or multiple linear regression to complete the qualitative or quantitative analysis. This method can improve the work efficiency of spectral information variable screening, and can be applied to the near infrared, infrared, ultraviolet and other frequency bands of spectral dimension reduction and rapid detection. It provides theoretical basis and technical support for the development and application of small special spectral instrument, and is expected to be widely used in thefield of hyperspectral image analysis.
Owner:GUILIN UNIVERSITY OF TECHNOLOGY

Method and device for predicting deformation of proximity structure caused by shield through random forest fused with SVM

InactiveCN111382472AImprove forecast accuracyPrediction of settlement deformation is accurateGeometric CADKernel methodsStructural deformationFeature set
The invention discloses a method for predicting deformation of a proximity structure caused by a shield through a random forest fused with an SVM. The method comprises the steps of: collecting data corresponding to main factors according to the main factors affecting the deformation and settlement of a building; establishing a random forest model, training the data according to the random forest model, and performing importance measurement to obtain an optimal feature set; and performing dimensionality reduction processing on the optimal feature set, and inputting the optimal feature set intoa support vector machine model for training to obtain a building settlement prediction result. The invention further discloses a device for predicting the deformation of the proximity structure causedby the shield through the random forest fused with the SVM. According to the method, random forest feature selection is utilized, variables having small correlation with a predicted value can be eliminated from excessive influence factors, key variables for modeling are screened out to obtain an optimal variable combination, the dimensionality of the support vector machine training model is reduced, the prediction precision is improved, a prediction result closer to the reality is obtained, and more stable and more accurate prediction of building deformation settlement is realized.
Owner:HUAZHONG UNIV OF SCI & TECH

Agricultural product quality analysis method and analyzer

The invention provides an agricultural product quality analysis method and an analyzer. The agricultural product quality analysis method comprises the following steps: acquiring spectral data and measuring reference value data; wherein the data is divided into a correction set and an external verification set; arranging the spectral data and the reference value data into a data matrix, and sampling the data matrix by using a Monte Carlo sampling method; carrying out key variable selection on a data matrix sampling result by adopting a variable selection algorithm, carrying out statistics on the selected frequency of each variable, and sorting; carrying out statistical stability on the high relative frequency variables, screening out stable key variables and establishing a mathematical model; substituting the acquired spectral data of the to-be-detected agricultural product into the mathematical model, and analyzing the quality of the agricultural product according to an operation result. In order to realize the application of the method, an agricultural product quality analyzer is developed, and a result is predicted and output according to stable key variables. According to the method, Monte Carlo sampling parameters are optimized, the operation cost is reduced to the maximum extent while a stable key variable screening result is obtained, and the working efficiency is improved.
Owner:BEIJING ACADEMY OF AGRICULTURE & FORESTRY SCIENCES

Non-native permanent resident population identification method based on spectral clustering

The invention discloses a non-native permanent resident population identification method based on spectral clustering. The method comprises steps that step (1), the residence data information of handset users is collected; step (2), for the step (1), multiple user residence behavior characteristic variables are constructed; step (3), variable screening for the variables of the step (2) is carried out, and dimensionless processing is further carried out; and step (4), a spectral clustering method is employed to carry out clustering analysis, and a model is established; and the residence data information of the step (1) comprises the residence day number, residence duration and ECI switching data. Statistics of the residence day number, residence duration and the ECI switching data of the users in a present city can be realized according to the collected residence data information, permanent residents and non-permanent residents are different in residence characteristics, so the residence characteristic variables are formed through processing the residence information, the two types of users can be distinguished through employing the clustering algorithm, and the identification result is more scientific.
Owner:NANJING HOWSO TECH

Competitive adaptive reweighting key data extraction method for Raman spectrum analysis of insulating oil

A competitive adaptive reweighting key data extraction method for Raman spectrum analysis of insulating oil is characterized by comprising the following steps: 1, obtaining spectral data by actual measurement; 2, processing all original data, and extracting a PLS subset from actually measured spectral data through a partial least square method; 3, screening out singular samples by using Monte Carlo interaction verification; and 4, screening variables of the Raman spectrum of the insulating oil for multiple times by adopting a CARS method. According to the Raman spectrum key variables screenedby the method, unnecessary wavelength points in the Raman spectrum can be removed, and the colinearity of Raman spectrum data is removed. Due to limit by the Raman signal detection limit of an aging characteristic substance in the oil, characteristic peaks of all aging characteristic substances are difficult to observe by directly detecting the insulating oil, redundant information is removed by utilizing a variable screening means, key variables are obtained, and the relationship between the key variables and the characteristic peaks is analyzed, and thus, effective support can be provided for extraction and discrimination of spectral aging characteristic quantity.
Owner:CHONGQING UNIV +1

Method for measuring whey protein in dairy product on basis of data-driven Raman spectrum

The invention relates to a method for measuring whey protein in a dairy product on the basis of data-driven Raman spectrum. According to the method, the optimal variable combination of a to-be-measured substance is accurately extracted from complicated and fluctuant translation invariant wavelet coefficients with a variable screening method, the variable combination is subjected to spectral reconstruction with the adoption of translation invariant wavelet transform, the best time domain/frequency domain resolution ratio is obtained while spectral interference such as matrixes and the like areeffectively stripped, and following modeling analysis is facilitated. Besides, according to the method, by analyzing reconstructed spectrum data features and inherent law of unknown dairy product samples, separate modeling is only performed on a single unknown dairy product sample, so that a target ideal model is dynamically approached, and the uncertainty of actual dairy product sample products is effectively overcome. Finally, a data-driven model of alpha-lactalbumin and beta-lactoglobulin in the dairy product sample is finally constructed and converted into corresponding whey protein content, nondestructive testing of the whey protein in the dairy product is further realized, and field detection of related dairy products can be conveniently performed.
Owner:陈达

Logistics node layout optimization method and system based on agricultural product cold-chain logistics requirements

The invention discloses a logistics node layout optimization method and system based on agricultural product cold-chain logistics demands. The method comprises steps of carrying out the variable screening of the obtained agricultural product cold-chain logistics basic data in a historical time range of a to-be-predicted region, and screening a plurality of variables; predicting each variable to obtain predicted values of all the variables; obtaining a first agricultural product cold-chain logistics demand prediction value based on all variable prediction values and the multiple linear regression model; obtaining a second agricultural product cold-chain logistics demand prediction value based on all variable prediction values and a pre-trained BP neural network; performing weighted summation on the first agricultural product cold-chain logistics demand prediction value and the second agricultural product cold-chain logistics demand prediction value to obtain a final agricultural productcold-chain logistics demand prediction value; and based on the final agricultural product cold-chain logistics demand prediction value of the to-be-predicted area, the scale of the existing logisticsnode and the distance between the existing logistics node and the to-be-predicted area, obtaining a scale optimization scheme of the logistics node.
Owner:SHANDONG UNIV OF FINANCE & ECONOMICS

Long-term rainfall prediction model construction method, long-term rainfall prediction method and long-term rainfall prediction device

The invention discloses a long-term rainfall prediction model construction method, a long-term rainfall prediction method and a long-term rainfall prediction device. The construction method comprises the following steps: acquiring a sample set; screening explanatory variables in the sample set based on an error discovery rate control method of multi-hypothesis testing and a random forest model to obtain predictive factors influencing precipitation in corresponding months of the next year; and carrying out random forest modeling according to the prediction factors influencing the precipitation in the corresponding month of the next year and the precipitation in the corresponding month of the next year, and training to obtain a long-term precipitation prediction model of the precipitation in the corresponding month. According to the long-term rainfall prediction model construction method, the long-term rainfall prediction method and the long-term rainfall prediction device provided by the invention, variable screening can be optimized from an experience-dependent method to a data-dependent method, the problem of false positive error rate of a random forest method during empirical variable screening is improved, and the accuracy and reliability of model prediction can be effectively improved.
Owner:CHINA THREE GORGES CORPORATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products