Skip to main content
SLU publication database (SLUpub)

Abstract

In the analysis of metabolomics data, selecting the appropriate statistical approach is crucial for maximizing model interpre-tation, predictivity and reliability. This study evaluates the effectiveness of Orthogonal Partial Least Squares (OPLS) models,specifically comparing OPLS-DA (assuming sample independence) and OPLS-EP (assuming sample dependency) in datasets ofbacterial samples under different experimental conditions. OPLS-EP consistently demonstrates superior predictive performance,evidenced by higher predictive ability by means of cross-validation (Q2) compared to OPLS-DA, indicating greater model signif-icance. Our findings prove the advantages of the paired statistical approach. This approach ensures that treatment effects areaccurately measured by minimizing inter-sample variation and enhancing signal detection. Previous research in metabolomicshas demonstrated the benefits of this method for biomarker sensitivity, particularly in matched case–control studies. The presentstudy extends this understanding by applying paired statistical approaches to bacterial isolate treatments, offering novel insightsinto their utility. Overall, the findings emphasize the importance of OPLS-EP in enhancing biomarker sensitivity and modelreliability in metabolomics research.

Keywords

cross-validation; metabolomics; OPLS-DA; OPLS-EP; paired statistics; predictive performance (Q2); unpaired statistics

Published in

Journal of Chemometrics
2025, volume: 39, number: 11, article number: e70086

SLU Authors

UKÄ Subject classification

Probability Theory and Statistics

Publication identifier

  • DOI: https://doi.org/10.1002/cem.70086

Permanent link to this page (URI)

https://res.slu.se/id/publ/145079