### Comparison of Fuzzy C-Means, Fuzzy Kernel C-Means, and Fuzzy Kernel Robust C-Means to Classify Thalassemia Data

#### Abstract

Thalassemia is the most prevalent inherited blood disorder in Southeast Asia. Thalassemias are pathologies that derive from genetic defects of the globin genes, and they are considered a health burden for the world's population. Thalassemia cannot be cured, but its occurrence can be prevented through early detection by screening. Screening aims to identify suspected unrecognised disease in a population that appears healthy and asymptomatic, using tests, examinations, or other procedures that can be applied quickly and easily to the target population. Thalassemia has been studied extensively; for example, earlier work tested classification accuracy on β-thalassemia data from Thailand using a Bayesian Network and Multinomial Logistic Regression. In this study, we compare the classification performance of Fuzzy C-Means, Fuzzy Kernel C-Means, and Fuzzy Kernel Robust C-Means on thalassemia data. The data come from Indonesia, acquired from Harapan Kita Children and Women's Hospital, Jakarta, and consist of 82 samples from thalassemia patients and 68 non-thalassemia samples, each with 11 features, for a total of 150 patient records. The results are reported as classification accuracy. FCM reaches 100% accuracy when 90% of the data is used for training, FRCM reaches 100% with 90% training data, and FKRCM, the modified fuzzy method, reaches 100% with both 80% and 90% training data. These results indicate that Fuzzy C-Means, Fuzzy Robust C-Means, and Fuzzy Kernel Robust C-Means classify the Indonesian thalassemia data perfectly at these training proportions.
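For readers unfamiliar with the baseline method, the following is a minimal sketch of standard Fuzzy C-Means (alternating center and membership updates), written in plain Python. It is not the authors' implementation: the function name `fuzzy_c_means`, the fuzzifier `m = 2.0`, the stopping tolerance, and the synthetic two-group data are all assumptions made for illustration, and the kernel and robust variants compared in the study are not shown. The abstract does not describe how the training split is used, so the sketch only clusters the points and assigns each to its highest-membership cluster.

```python
import math
import random


def fuzzy_c_means(points, n_clusters=2, m=2.0, max_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means. Returns (centers, memberships).

    m is the fuzzifier (m > 1); 2.0 is a common default, assumed here.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Random initial memberships; each row is normalized to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([value / total for value in row])

    centers = []
    for _ in range(max_iter):
        # Center update: mean of the points weighted by u_ij ** m.
        centers = []
        for j in range(n_clusters):
            weights = [u[i][j] ** m for i in range(n)]
            total = sum(weights)
            centers.append([
                sum(w * p[d] for w, p in zip(weights, points)) / total
                for d in range(dim)
            ])

        # Membership update: u_ij = 1 / sum_k (d_ij / d_ik) ** (2 / (m - 1)).
        new_u = []
        for p in points:
            dists = [max(math.dist(p, c), 1e-12) for c in centers]
            new_u.append([
                1.0 / sum((dists[j] / dists[k]) ** (2.0 / (m - 1.0))
                          for k in range(n_clusters))
                for j in range(n_clusters)
            ])

        # Stop once no membership changes by more than tol.
        shift = max(abs(a - b) for ra, rb in zip(u, new_u) for a, b in zip(ra, rb))
        u = new_u
        if shift < tol:
            break

    return centers, u


# Synthetic stand-in data (NOT the thalassemia data): two separated 2-D groups.
data_rng = random.Random(42)
group_a = [[data_rng.gauss(0.0, 0.5), data_rng.gauss(0.0, 0.5)] for _ in range(30)]
group_b = [[data_rng.gauss(5.0, 0.5), data_rng.gauss(5.0, 0.5)] for _ in range(30)]
data = group_a + group_b

centers, memberships = fuzzy_c_means(data, n_clusters=2)

# Harden the fuzzy memberships: each point goes to its highest-membership cluster.
labels = [max(range(2), key=lambda j: row[j]) for row in memberships]
```

Kernel-based variants typically replace the Euclidean distance in the membership update with a distance induced by a kernel function; the rest of the alternating scheme keeps the same shape.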



DOI: http://dx.doi.org/10.18517/ijaseit.9.4.9580


Published by INSIGHT - Indonesian Society for Knowledge and Human Development