CLASSIFICATION OF SICKLE CELL ANAEMIA SEVERITY USING HAEMATOLOGICAL PARAMETERS: A MACHINE LEARNING APPROACH

[featured_image]
  • Version
  • Download 26
  • File Size 828.33 KB
  • File Count 1
  • Create Date January 12, 2026
  • Last Updated January 12, 2026

CLASSIFICATION OF SICKLE CELL ANAEMIA SEVERITY USING HAEMATOLOGICAL PARAMETERS: A MACHINE LEARNING APPROACH

ABSTRACT

Sickle Cell Anaemia (SCA) is a genetic blood disorder that caused abnormal haemoglobin and severe health complications. In Nigeria, nearly 150,000 infants, 2% of newborns, were diagnosed annually, highlighting its public health impact. SCA severity varied and was linked to complications like vaso-occlusive and other critical case. Traditional severity assessments were subjective, but machine learning (ML) provided an objective alternative. This study developed an ML-based severity classification model using haematological parameters from 364 Complete Blood Count (CBC) patient records. Features included age, sex, RBC, PCV, MCV, MCH, MCHC, RDW, TLC, PLT/mm³, and HGB. ML models including Logistic Regression, Support Vector Machine, Random Forest, and XGBoost were tested with Chi-Square, ANOVA, and all features. Logistic Regression achieved the highest accuracy at 96%, Random Forest reached 95%, XGBoost ranged from 89% to 92%, and Support Vector Machine remained at 90%. ANOVA proved the best feature selection method. Future work should aim at using ensemble learning for enhanced SCA severity prediction.

Keywords: Feature Selection, Machine Learning Models, Model Performance, Severity Classification, Sickle Cell Anaemia.

SHARE