Privacy preserving disease risk assessment model using machine learning classifiers
Keywords:
random forest, SMOTE, RSA, encryption, decryption, prognosis, risk assessmentAbstract
In the pre-digital era, medical diagnosis used to consume a lot of time and human resources. But in this digital age, the entire process can be efficiently done with the help of machines. So a new term called prognosis is introduced in this modern era where scientific prediction of the likely development of a disease and its outcome can be done. Machine learning techniques are utilized for prognosis. Though machine learning algorithms solve many problems in the healthcare field, they cannot flourish further without privacy and security assurances as the healthcare field consists of more sensitive data. Our application addresses this issue and provides a fully secured disease risk assessment using an encryption algorithm called RSA. The main objective here is to build a system with the following aspects, Functional Aspect: Disease prediction using machine learning technique decision trees. Non-Functional Aspect: Providing privacy and confidentiality of data using encryption techniques like RSA. Data privacy and confidentiality is provided with the help of the RSA algorithm. RSA is the first successful public key cryptographic algorithm.
Downloads
References
J. Qiu, X. Liang, S. Shetty, and D. Bowden, “Towards secure and smart healthcare in smart cities using blockchain,” in IEEE International Smart Cities Conference. IEEE, 2018, pp. 1–4.
J. S. Lin, C. V. Evans, E. Johnson, N. Redmond, E. L. Coppola, and N. Smith, “Nontraditional risk factors in cardiovascular disease risk assessment: Updated evidence report and systematic review for the us preventive services task force,” Journal of the American Medical Association, vol. 320, no. 3, pp. 281–297, 2018.
A. Abbas, M. Ali, M. U. S. Khan, and S. U. Khan, “Personalized healthcare cloud services for disease risk assessment and wellness management using social media,” Pervasive and Mobile Computing, vol. 28, pp. 81–99, 2016.
S. Perveen, M. Shahbaz, K. Keshavjee, and A. Guergachi, “A systematic machine learning based approach for the diagnosis of non-alcoholic fatty liver disease risk and progression,” Scientific reports, vol. 8, no. 1, pp. 1–12, 2018.
L. Jena, S. Nayak, and R. Swain, “Chronic disease risk (cdr) prediction in biomedical data using machine learning approach,” in Advances in Intelligent Computing and Communication, 2020, pp. 232–239.
C. Xu, N. Wang, L. Zhu, K. Sharif, and C. Zhang, “Achieving searchable and privacy-preserving data sharing for cloud-assisted e-healthcare system,” IEEE Internet of Things Journal, vol. 6, no. 5, pp. 8345–8356, 2019.
W. Tang, J. Ren, K. Deng, and Y. Zhang, “Secure data aggregation of lightweight e-healthcare iot devices with fair incentives,” IEEE Internet of Things Journal, vol. 6, no. 5, pp. 8714–8726, 2019.
L. Yang, Q. Zheng, and X. Fan, “RSPP: A reliable, searchable and privacy-preserving e-healthcare system for cloud-assisted body area networks,” in 2017 IEEE Conference on Computer Communications. IEEE, 2017, pp. 1–9.
R. Bocu and C. Costache, “A homomorphic encryption- based system for securely managing personal health metrics data,” IBM Journal of Research and Development, vol. 62, no. 1, pp. 1:1–1:10, 2018.
M. Martinez-Arroyo and L. E. Sucar, “Learning an optimal naive bayes classifier,” in 18th International Conference on Pattern Recognition (ICPR 2006). IEEE Computer Society, 2006, pp. 1236–1239.
J. Hua, H. Zhu, F. Wang, X. Liu, R. Lu, H. Li, and Y. Zhang, “CINEMA: efficient and privacy-preserving online medical primary diagnosis with skyline query,” IEEE Internet of Things Journal, vol. 6, no. 2, pp. 1450–1461, 2019.
X. Liu, H. Zhu, R. Lu, and H. Li, “Efficient privacy- preserving online medical primary diagnosis scheme on naive bayesian classification,” Peer-to-Peer Networking and Applications, vol. 11, no. 2, pp. 334–347, 2018.
R. Lu, H. Zhu, X. Liu, J. K. Liu, and J. Shao, “Toward efficient and privacy-preserving computing in big data era,” IEEE Network, vol. 28, no. 4, pp. 46–50, 2014.
A. Asuncion and D. Newman, “Uci machine learning repository,” 2007. [Online]. Available: https://doi.org/10.1109/MNET.2014. 6863131
P. Li, J. Li, Z. Huang, C. Gao, W. Chen, and K. Chen, “Privacy preserving outsourced classification in cloud computing,” Cluster Computing, vol. 21, no. 1, pp. 277–286, 2018.
Abhishek Guru and Asha Ambhaikar, “Development of “RSA” Encryption Algorithm for Secure Data Transmission,” in 2020 Research Journal of Computer and Information Technology Sciences, Vol. 8(1), 9-12.
Dindayal Mahto and Dilip Kumar Yadav, “Performance Analysis of RSA and Elliptic Curve Cryptography,” in 2018 International Journal of Network Security, Vol.20, No.4, PP.625-635.
Laiphrakpam Dolendro Singh and Khumanthem Manglem Singh, “Implementation of Text Encryption using Elliptic Curve Cryptography,” in 2015 Procedia Computer Science 54 PP.73-82.
Dindayal Mahto, Dilip Kumar Yadav, “RSA and ECC: A Comparative Analysis” in International Journal of Applied Engineering Research ISSN 0973-4562 Volume 12, Number 19 (2017) pp. 9053-9061.
Mohit D. Singanjude and Prof. R. Dalvi, “Literature Survey: Secure transmitting of data using RSA public key implemented with Vedic method” in 2016 International Journal of Computer Applications Technology and Research, Volume 5–Issue 10, 675- 677.
Neal Koblitz, “Elliptic Curve Cryptosystems”, in 1987 Mathematics of Computation Volume 4x. number 177. PP 203-209.
Nirjharini Mohanty, Soumen Nayak, Monarch Saha,
Vishal Baral and Imlee Rout, “Classification of diabetes disease using machine learning algorithms,” in 2021 ICTACT Journal On Data Science And Machine Learning, Volume: 02.
Padma Bh, D.Chandravathi, P.Prapoorna Roja, “Encoding And Decoding of a Message in the Implementation of Elliptic Curve Cryptography using Koblitz’s Method,” in 2010 (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 05, 1904-1907.
Ximeng Liu, Student Member, IEEE, Rongxing Lu, Member, IEEE, Jianfeng Ma, Le Chen, and Baodong Qin, “A Privacy-Preserving Patient-Centric Clinical Decision Support System on Naive Bayesian Classification,” in 2016 IEEE Journal of Biomedical and Health Informatics, Volume: 20, Issue: 2.
Xue Yang, Rongxing Lu, Senior Member, IEEE, Jun Shao, Xiaohu Tang, Member, IEEE, and Haomiao Yang, Member, IEEE, “An Efficient and Privacy- Preserving Disease Risk Prediction Scheme for E- Healthcare,” in 2019 IEEE Internet Of Things Journal, Vol. 6, No. 2, April 2019.
D. Zhu, H. Zhu, X. Liu, H. Li, F. Wang, H. Li, and D. Feng, “CREDO: efficient and privacy-preserving multi- level medical pre-diagnosis based on ml-kNN,” Information Sciences, vol. 514, pp. 244–262, 2020.
P. Paillier, “Public-key cryptosystems based on composite degree residuosity classes,” in Advances in Cryptology - EUROCRYPT ’99, J. Stern, Ed., vol. 1592. Springer, 1999, pp. 223–238.
M. Shen, X. Tang, L. Zhu, X. Du, and M. Guizani, “Privacy-preserving support vector machine training over blockchain-based encrypted iot data in smart cities,” IEEE Internet of Things Journal, vol. 6, no. 5, pp. 7702–7712, 2019.
F. Wang, H. Zhu, X. Liu, R. Lu, J. Hua, H. Li, and H. Li, “Privacy-preserving collaborative model learning scheme for e-healthcare,” IEEE Access, vol. 7, pp. 166 054–166 065, 2019.
H. Zhu, X. Liu, R. Lu, and H. Li, “Efficient and privacy-preserving online medical prediagnosis framework using nonlinear SVM,”IEEE Journal of Biomedical and Health Informatics, vol. 21, no. 3, pp. 838–850, 2017.
Z. Ma, J. Ma, Y. Miao, and X. Liu, “Privacy-preserving and high-accurate outsourced disease predictor on random forest,” Information Science, vol. 496, pp. 225– 241, 2019.
Evelyn P Whitlock, Jennifer S Lin, Elizabeth Liles, Tracy L Beil, Rongwei Fu, "Screening for Colorectal Cancer: A Targeted, Updated Systematic Review for the U.S. Preventive Services Task Force," in 2008 Annals of internal medicine 149 (9), PP 638-658.
Published
How to Cite
Issue
Section
Copyright (c) 2022 International journal of health sciences

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Articles published in the International Journal of Health Sciences (IJHS) are available under Creative Commons Attribution Non-Commercial No Derivatives Licence (CC BY-NC-ND 4.0). Authors retain copyright in their work and grant IJHS right of first publication under CC BY-NC-ND 4.0. Users have the right to read, download, copy, distribute, print, search, or link to the full texts of articles in this journal, and to use them for any other lawful purpose.
Articles published in IJHS can be copied, communicated and shared in their published form for non-commercial purposes provided full attribution is given to the author and the journal. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
This copyright notice applies to articles published in IJHS volumes 4 onwards. Please read about the copyright notices for previous volumes under Journal History.








