

Comparative Analysis of ML Models for Symptom- Based Disease Prediction
Abstract
This study puts emphasis on early diagnosis in healthcare under the motto "prevention is better than cure." It presents a machine framework for learning to anticipate diseases using patient-reported symptoms. The dataset used comprised 132 symptoms and 41 diseases from 4,921 patient records, and rigorous preprocessing and feature engineering ensured data quality.
We implemented and optimized 6 machine learning models, including SVM, Logistic Regression, Random Forest, Gaussian Naive Bayes, K- Nearest Neighbors, and Decision Tree, achieving 100% classification accuracy and verified with precision, recall, and F1-score.
The proposed system can easily be applied in real-world environments and even in those that have minimal resources due to scalability and computational effectiveness. The future work would extend the dataset, integrate the use of real-time monitoring, and validate among other demographics.
References
Zhou, S.-M., Fernandez-Gutierrez, F., Kennedy, J., Cooksey, R., Atkinson, M., Denaxas, S., Siebert, S., Dixon, W.G., O’Neill, T.W. and Choy, E., "Defining disease phenotypes in primary care Electronic health records by a ML approach: A case study in identifying rheumatoid arthritis", PloS One, Vol. 11, No. 5, (2016), e0154515. https://doi.org/10.1371/journal.pone.0154515
Littell, C.L., "Innovation in medical technology: Reading the indicators", Health Affairs, Vol. 13, No. 3, (1994),
a. 226-235.
b. https://doi.org/10.1377/hlthaff.13.3.226
Rathi, M. and Pareek, V., "Disease prediction tool: An integrated hybrid data mining approach for healthcare", IRACSTInternational Journal of Computer Science and Information Technology & Security (IJCSITS), ISSN, (2016), 2249-9555.
Kelly, C.J. and Young, A.J., "Promoting innovation in healthcare", Future Healthcare Journal, Vol. 4, No. 2, (2017), 121.
https://doi: 10.7861/futurehosp.4-2-121.
Mobeen, A., Shafiq, M., Aziz, M.H. and Mohsin, M.J., "Impact of workflow interruptions on baseline activities of the doctors working in the emergency department", BMJ Open Quality, Vol. 11, No. 3, (2022), e001813. https:// doi: 10.1136/bmjoq-2022-001813.
Ahmed, S., Szabo, S. and Nilsen, K., "Catastrophic healthcare expenditure and impoverishment in tropical deltas: Evidence from the mekong delta region", International Journal for Equity in Health, Vol. 17, No. 1, (2018), 1-13. https://doi: 10.1186/s12939-018-0757-5.
Roberts, M.A. and Abery, B.H., "A person-centered approach to home and community-based services outcome measurement", Frontiers in rehabilitation Sciences, Vol. 4, (2023). doi:10.3389/fresc.2023.1056530
Grampurohit, S. and Sagarnal, C., 2020, June. Disease prediction using machine learning algorithms. In 2020 International Conference for Emerging Technology (INCET) (pp. 1-7). IEEE
Ferjani, M.F., 2020. Disease Prediction Using Machine Learning. Bournemouth, England: Bournemouth University.
Refbacks
- There are currently no refbacks.