Ilchenko, Oleksandr
- Umeå Universitet
In the analysis of metabolomics data, selecting the appropriate statistical approach is crucial for maximizing model interpre-tation, predictivity and reliability. This study evaluates the effectiveness of Orthogonal Partial Least Squares (OPLS) models,specifically comparing OPLS-DA (assuming sample independence) and OPLS-EP (assuming sample dependency) in datasets ofbacterial samples under different experimental conditions. OPLS-EP consistently demonstrates superior predictive performance,evidenced by higher predictive ability by means of cross-validation (Q2) compared to OPLS-DA, indicating greater model signif-icance. Our findings prove the advantages of the paired statistical approach. This approach ensures that treatment effects areaccurately measured by minimizing inter-sample variation and enhancing signal detection. Previous research in metabolomicshas demonstrated the benefits of this method for biomarker sensitivity, particularly in matched case–control studies. The presentstudy extends this understanding by applying paired statistical approaches to bacterial isolate treatments, offering novel insightsinto their utility. Overall, the findings emphasize the importance of OPLS-EP in enhancing biomarker sensitivity and modelreliability in metabolomics research.
cross-validation; metabolomics; OPLS-DA; OPLS-EP; paired statistics; predictive performance (Q2); unpaired statistics
Journal of Chemometrics
2025, volym: 39, nummer: 11, artikelnummer: e70086
Sannolikhetsteori och statistik
https://res.slu.se/id/publ/145079