Tutorial for didactic application of chemometrics in free software – Part II: Partial Least Squares (PLS) regression on mid- and near-infrared data to determine adulterant content and physicochemical properties.

Authors

  • Pedro Henrique Pereira da Cunha Department of Chemistry, Center of Exact Sciences, Federal University of Espírito Santo –UFES, Goiabeiras, Vitória –ES, Brazil, Zip Code: 29075-910.
  • Gabriely Silveira Folli Department of Chemistry, Center of Exact Sciences, Federal University of Espírito Santo –UFES, Goiabeiras, Vitória –ES, Brazil, Zip Code: 29075-910.
  • Sara Joaquina Inocencio Dionisio Department of Chemistry, Center of Exact Sciences, Federal University of Espírito Santo –UFES, Goiabeiras, Vitória –ES, Brazil, Zip Code: 29075-910.
  • Amanda Guedes Caldeira Department of Chemistry, Center of Exact Sciences, Federal University of Espírito Santo –UFES, Goiabeiras, Vitória –ES, Brazil, Zip Code: 29075-910.
  • Paulo Roberto Filgueiras Department of Chemistry, Center of Exact Sciences, Federal University of Espírito Santo –UFES, Goiabeiras, Vitória –ES, Brazil, Zip Code: 29075-910.

DOI:

https://doi.org/10.36524/ric.v10i4.2535

Keywords:

Chemometrics, biodiesel, smartphone, photometrix, imagem, PLS, tutorial, Quimiometria; PLS; tutorial; Matlab; Octave., Octave

Abstract

The constant advance in technology, instrumentation, and the increase in data generation capacity has resulted in a new challenge for chemists: how to deal with complex, extensive, and multidimensional data sets. In this scenario, chemometrics emerged as a branch of chemistry specialized in the application of statistical and mathematical techniques for the analysis of multivariate analytical data, resulting in more precise and reliable scientific conclusions. As it is a relatively new area, there is still a lack of educational resources on the subject, and chemometric approaches are mostly confined to postgraduate courses. Therefore, this article provides a tutorial that can be used in various academic instances, with a focus on undergraduate education, to teach partial least squares regression (PLS). PLS is considered one of the most essential and common methods in chemometrics, and stands out for its ability to efficiently handle complex and highly correlated data sets, which is particularly important in regression analysis when there is significant multicollinearity among independent variables. In addition, PLS is known for its ease of use, making it a valuable tool for professionals seeking an effective approach to complex statistical modeling. We detail all the steps to create a complete PLS model, from installing the free GNU Octave software to producing the final figures. We also provide all the algorithms developed for data reading and processing. Finally, we interpret the obtained results so that the discussions and conclusions are easily understood by everyone.

Analyzing olive oil and petroleum using infrared spectroscopy, a computer with Octave achieves multivariate calibration. We also have a PLS tutorial book for assistance.

Published

20-06-2024

Issue

Section

Ciências Exatas e da Terra

How to Cite

Tutorial for didactic application of chemometrics in free software – Part II: Partial Least Squares (PLS) regression on mid- and near-infrared data to determine adulterant content and physicochemical properties. (2024). Revista Ifes Ciência , 10(4), 01-16. https://doi.org/10.36524/ric.v10i4.2535

Similar Articles

1-10 of 17

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)