Near-infrared spectral variable selection method based on Monte Carlo variable combination cluster

A technology of near-infrared spectroscopy and variable selection, which is applied in the field of near-infrared spectroscopy variable selection based on Monte Carlo variable combination clusters, which can solve the problems of less samples and more variables, the inclusion of non-informative variables and interference variables, and the impact of model prediction performance. , to achieve the effect of improving stability and reliability, reducing dependence and avoiding influence

Active Publication Date: 2017-09-08
CHANGCHUN UNIV OF SCI & TECH
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Although the above methods are widely used in the near-infrared field, in scientific research practice, because the number of collected samples is generally not too large, there will be a situation where there are fewer samples and more variables, and there will be a large number of uninformative variables and variables. Interference variables are sandwiched, so the above variable selection methods are not only difficult to achieve all variable combinations, but also affected by a large number of uninformative variables and interference variables
It is also because the number of samples collected for modeling is not enough to fully express the overall information, so the parameters such as variable importance obtained by the above-mentioned several variable selection methods that only sample the variable space will have great uncertainty The fluctuation of the sample will inevitably have a certain impact on the importance distribution of the variables, which will affect the predictive performance of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Near-infrared spectral variable selection method based on Monte Carlo variable combination cluster
  • Near-infrared spectral variable selection method based on Monte Carlo variable combination cluster
  • Near-infrared spectral variable selection method based on Monte Carlo variable combination cluster

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] Embodiment 1: In order to prove the applicability of the present invention, a detailed description will be given in conjunction with examples. However, the present invention can also be applied to spectral data other than the example used here.

[0027] figure 1 Be the flow chart of the near-infrared spectrum variable selection method (MC-VCPA) algorithm based on the Monte Carlo variable combination cluster provided by the invention, as seen, the present invention specifically comprises the following steps:

[0028] (1) The chemical data of 93 wheat and wheat proteins used in this research come from the Beijing Fangfude Research Center of the State Grain Administration, using the near-infrared spectrum of each wheat sample with the MCS611NIR fiber optic spectrometer from Carl Zeiss, Germany. The spectral range 950 ~ 1690nm, each experimental sample collected 3 light, take the average absorbance. Use wavelet packet (WTP) to eliminate the noise signal in the spectrum. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a near-infrared spectral variable selection method based on a Monte Carlo variable combination cluster, and belongs to the field of analytical chemistry and spectroscopy. The method comprises the following steps that firstly, calibration set samples are sampled randomly by the Monte Carlo sampling method; secondly, feature variables of each sample subset are selected by the variable combination cluster analysis method, and the feature variables of all the sample subsets are preserved to obtain a new variable space; then feature variables of the new variable space are further selected by using the variable combination cluster analysis method. The method not only samples the variable space by the binary matrix sampling method, but also realizes the sampling of a sample space by the Monte Carlo sampling method, and avoids the influence of the change of sample sets on the variable selection.

Description

technical field [0001] The invention belongs to the field of analytical chemistry and spectroscopy, in particular to a near-infrared spectrum variable selection method based on Monte Carlo variable combination clusters [0002] technical background [0003] The spectral frequency band of near-infrared is 780nm~2500nm. The source of spectral information comes from the frequency multiplication and combined frequency absorption of hydrogen-containing groups in organic matter. Near-infrared spectral analysis technology can be widely used in the field of qualitative and quantitative analysis of substances. Therefore, this project The technology has been hailed as "a technology with the potential to improve global agricultural analysis capabilities". Since the near-infrared spectrum has hundreds of spectral bands, when the instrument collects these bands, in addition to the information of the sample itself, it also contains a lot of external information, such as noise, sample backg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G01N21/359
CPCG01N21/359
Inventor 宦克为韩雪艳刘小溪赵环石晓光
Owner CHANGCHUN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products