Vis enkel innførsel

dc.contributor.authorAli, Zain
dc.contributor.authorHayat, Muhammad Faisal
dc.contributor.authorShaukat, Kamran
dc.contributor.authorAlam, Talha Mahboob
dc.contributor.authorHameed, Ibrahim A.
dc.contributor.authorLuo, Suhuai
dc.contributor.authorBasheer, Shakila
dc.contributor.authorAyadi, Manel
dc.contributor.authorKsibi, Amel
dc.date.accessioned2023-02-09T12:14:41Z
dc.date.available2023-02-09T12:14:41Z
dc.date.created2023-01-05T13:17:26Z
dc.date.issued2022
dc.identifier.citationDiagnostics (Basel). 2022, 12 (12), .en_US
dc.identifier.issn2075-4418
dc.identifier.urihttps://hdl.handle.net/11250/3049691
dc.description.abstractSchistosomiasis is a neglected tropical disease that continues to be a leading cause of illness and mortality around the globe. The causing parasites are affixed to the skin through defiled water and enter the human body. Failure to diagnose Schistosomiasis can result in various medical complications, such as ascites, portal hypertension, esophageal varices, splenomegaly, and growth retardation. Early prediction and identification of risk factors may aid in treating disease before it becomes incurable. We aimed to create a framework by incorporating the most significant features to predict Schistosomiasis using machine learning techniques. A dataset of advanced Schistosomiasis has been employed containing recovery and death cases. A total data of 4316 individuals containing recovery and death cases were included in this research. The dataset contains demographics, socioeconomic, and clinical factors with lab reports. Data preprocessing techniques (missing values imputation, outlier removal, data normalisation, and data transformation) have also been employed for better results. Feature selection techniques, including correlation-based feature selection, Information gain, gain ratio, ReliefF, and OneR, have been utilised to minimise a large number of features. Data resampling algorithms, including Random undersampling, Random oversampling, Cluster Centroid, Near miss, and SMOTE, are applied to address the data imbalance problem. We applied four machine learning algorithms to construct the model: Gradient Boosting, Light Gradient Boosting, Extreme Gradient Boosting and CatBoost. The performance of the proposed framework has been evaluated based on Accuracy, Precision, Recall and F1-Score. The results of our proposed framework stated that the CatBoost model showed the best performance with the highest accuracy of (87.1%) compared with Gradient Boosting (86%), Light Gradient Boosting (86.7%) and Extreme Gradient Boosting (86.9%). Our proposed framework will assist doctors and healthcare professionals in the early diagnosis of Schistosomiasis.en_US
dc.language.isoengen_US
dc.publisherMDPIen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleA Proposed Framework for Early Prediction of Schistosomiasisen_US
dc.title.alternativeA Proposed Framework for Early Prediction of Schistosomiasisen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber0en_US
dc.source.volume12en_US
dc.source.journalDiagnostics (Basel)en_US
dc.source.issue12en_US
dc.identifier.doi10.3390/diagnostics12123138
dc.identifier.cristin2101320
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal