Using machine learning models to predict the effects of seasonal fluxes on Plesiomonas shigelloides population density

dc.contributor.authorEkundayo, CT
dc.contributor.authorOluwatosin, AI.
dc.contributor.authorIgbinosa, EO.
dc.contributor.authorOkoh, AI.
dc.date.accessioned2026-03-31T11:44:43Z
dc.date.available2026-03-31T11:44:43Z
dc.date.issued2022-10-28
dc.description.abstractSeasonal variations (SVs) affect the population density (PD), fate, and fitness of pathogens in environmental water resources and the public health impacts. Therefore, this study is aimed at applying machine learning intelligence (MLI) to predict the impacts of SVs on P. shigelloides population density (PDP) in the aquatic milieu. Physicochemical events (PEs) and PDP from three rivers acquired via standard microbiological and instrumental techniques across seasons were fitted to MLI algorithms (linear regression (LR), multiple linear regression (MR), random forest (RF), gradient boosted machine (GBM), neural network (NN), K-nearest neighbour (KNN), boosted regression tree (BRT), extreme gradient boosting (XGB) regression, support vector regression (SVR), decision tree regression (DTR), M5 pruned regression (M5P), artificial neural network (ANN) regression (with one 10-node hidden layer (ANN10), two 6- and 4-node hidden layers (ANN64), and two 5- and 5-node hidden layers (ANN55)), and elastic net regression (ENR)) to assess the implications of the SVs of PEs on aquatic PDP. The results showed that SVs significantly influenced PDP and PEs in the water (p < 0.0001), exhibiting a site-specific pattern. While MLI algorithms predicted PDP with differing absolute flux magnitudes for the contributing variables, DTR predicted the highest PDP value of 1.707 log unit, followed by XGB (1.637 log unit), but XGB (meansquared-error (MSE) = 0.0025; root-mean-squared-error (RMSE) = 0.0501; R2 =0.998; medium absolute deviation (MAD) = 0.0275) outperformed other models in terms of regression metrics. Temperature and total suspended solids (TSS) ranked first and second as significant factors in predicting PDP in 53.3% (8/15) and 40% (6/15), respectively, of the models, based on the RMSE loss after permutations. Additionally, season ranked third among the 7 models, and turbidity (TBS) ranked fourth at 26.7% (4/15), as the primary significant factor for predicting PDP in the aquatic milieu. The results of this investigation demonstrated that MLI predictive modelling techniques can promisingly be exploited to complement the repetitive laboratory-based monitoring of PDP and other pathogens, especially in low-resource settings, in response to seasonal fluxes and can provide insights into the potential public health risks of emerging pathogens and TSS pollution (e.g., nanoparticles and micro- and nanoplastics) in the aquatic milieu. The model outputs provide low-cost and effective early warning information to assist watershed managers and fish farmers in making appropriate decisions about water resource protection, aquaculture management, and sustainable public health protection.
dc.identifier.issn0269-7491
dc.identifier.urihttp://hdl.handle.net/20.500.11837/3934
dc.language.isoen
dc.publisherElsevier
dc.subjectPathogen
dc.subjectPublic health
dc.subjectMachine intelligence
dc.subjectPrediction
dc.subjectFeature importance
dc.subjectPredictive microbiology
dc.subjectMultiple linear regression
dc.subjectRandom forest
dc.subjectGradient boosted machine
dc.subjectNeural networks
dc.subjectK-nearest neighbours
dc.subjectBoosted regression tree
dc.subjectExtreme gradient boosted regression
dc.subjectSupport vector regression
dc.subjectDecision tree regression
dc.subjectM5 pruned regression
dc.subjectArtificial neural network regression
dc.titleUsing machine learning models to predict the effects of seasonal fluxes on Plesiomonas shigelloides population density
dc.typeArticle
person.identifier.orcidEkundayo, CT 0000-0002-7781-3507
person.identifier.orcidOluwatosin, AI. 0000-0002-6283-8517
person.identifier.orcidIgbinosa, EO. 0000-0001-7441-2145
person.identifier.orcidOkoh, AI. 0000-0002-9770-085X

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
1-s2.0-S0269749122019480-main.pdf
Size:
7.99 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections