Files in this item



application/pdfXIA-THESIS-2016.pdf (13MB)
(no description provided)PDF


Title:Developing near- and mid-infrared spectroscopy analysis methods for rapid assessment of soil quality in Illinois
Author(s):Xia, Yushu
Department / Program:Natural Res & Env Sci
Discipline:Natural Res & Env Sciences
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):soil quality
near infrared
mid infrared
Partial Least Squares Regression
Random Forest
Monte Carlo feature selection
Abstract:Interest in use of near- (NIR) and mid-infrared (MIR) spectroscopic methods for soil quality assessment has grown rapidly. Ideally robust calibration of models can be developed to rapidly and affordably predict soil quality (SQ) indicators that are more costly to measure. Refinement of data libraries and standardization of data processing steps are needed to improve prospects for reuse of predictive models. To explore the potential for a regional library to predict SQ, this study used 468 topsoil samples collected from Illinois grain farms with loess-derived soils where, fields were managed using conventional-tillage, conservation-tillage or organic practices and so were expected to vary in SQ. Partial Least Squares Regression (PLSR) and Random Forest (RF) algorithms were used to predict SQ indicators, including soil organic carbon (SOC), total N (TN), soil C and N ratio (C: N), soil pH, particulate organic matter (POM), potentially mineralizable nitrogen (PMN), fluorescein diacetate (FDA) hydrolysis, soil nutrient abundance, and productivity proxy Normalized Difference Vegetation Index (NDVI), using the whole NIR or MIR spectra or reduced data sets comprised of spectral features associated with organic functional groups. Monte Carlo feature selection (MCFS) was used as a variable selection tool for PLSR spectra refinement. Overall, NIR models slightly outperformed MIR models and, both NIR and MIR methods were better able to predict SOC, Ca, TN, Mg, and PMN than other SQ indicators. While RF models slightly outperformed PLSR models when estimating a range of SQ indicators in NIR regions that fell within the midrange of the data set; PLSR performed better than RF for most SQ indicators using MIR spectra and had a better estimation on high or low soil property values. Neither NIR nor MIR model performance was improved when spectral ranges primarily associated with organic functional groups were used, but variable selection did significantly improve MIR model performance (p < 0.01). Unfortunately, soil region, not management, explained most differences among samples, suggesting that within IL croplands, spectral features associated with mineralogy overwhelm information about SQ obtained from this technique. Our evaluation suggests that development of robust prediction models should rely not only on careful interpretation of statistical techniques used to select peaks retained, but also careful consideration of their physical meaning.
Issue Date:2016-04-28
Rights Information:Copyright 2016 Yushu Xia
Date Available in IDEALS:2016-07-07
Date Deposited:2016-05

This item appears in the following Collection(s)

Item Statistics