Predicting Class Label Using Clustering-Classification Technique: A Comparative Study




Cluster-Classifier, Clustering, Classification, Data Mining


Among different techniques, algorithms and applications of Data Mining, predicting the class label of unlabeled objects(undefined class label) is a crucial term in the field. The most common approaches in this area is the use of classification technique (DT, Bayes, SVM, KNN and others) that represent what is known as supervised learning. However, in many cases no target class labels and the boundaries are available to perform the prediction, so the new approach Clustering-classification technique is used.

The work in this paper presents a survey of the most common researches conducted in this field and discuss their experiments, the algorithms they used, the types of data they utilized, the data sizes used, and the results they discovered.

According to the results, applying the clustering techniques before classification improved classification accuracy and reduced experiment execution time. The Cluster Classifier was proven to be a suitable approach to summarize data by some of the researchers. It achieves a summarization rate of over 50%, which represents a considerable reduction in the size of the test datasets..

The findings of the researches indicated that, in addition to feature selection and feature extraction, data preprocessing (handled missing data and effective outlier detection techniques) enhanced the classifier performance and accuracy while reducing the classification error.



Download data is not yet available.


J. Han, M. Kamber, And Jian Pei, “Data Mining: Concepts And Techniques,” The Morgan Kaufmann Series In Data Management Systems, 2012.

Shrishrimal, P & Deshmukh, Ratnadeep & Waghmare, Dr. Vishal, “Multimedia Data Mining: A Review,” International Conference On Recent Trends And Challenges In Science And Technology, 2014.

Ryan S.J.D. Baker, “Data Mining For Education,” International Encyclopedia Of Education, 2010.

Rabia Saleem, Sania Shaukat, “Denormalization To Enhance Effciency In Data Mining,” International Journal Of Scientific & Engineering Research, Vol. 7, Issue 9, September 2016.

Mirela Danubianu, Et Al, “Model Of A Data Mining System For Personalized Therapy Of Speech Disorders,” Journal Of Applied Computer Science And Mathematics, 2018.

Manish Kumar Aery And Chet Ram, “A Review On Machine Learning: Trends And Future Prospects,” An International Journal Of Engineering Sciences, Vol. 25, November 2017.

Vladimir Nasteski, “An Overview Of The Supervised Machine Learning Methods,” Horizons, Vol. 4, 2017, Pp. 51-62. DOI:

Sankar Rajagopal, ”Customer Data Clustering Using Data Mining Technique,” International Journal Of Database Management Systems

( Ijdms ), Vol.3, No.4, November 2011.

Absalom Ezugwu, Et Al ,”Automatic Clustering Algorithms: A Systematic

Review And Bibliometric Analysis Of Relevant Literature,” Neural Computing And Applications, 2021.

Syed Muhammad Raza Abidi, Et Al, “Popularity Prediction Of Movies: From Statistical Modeling To Machine Learning Techniques,” Springer Science & Business Media, January 2020.

G. Kesavaraj And S. Sukumaran, "A Study On Classification Techniques In Data Mining," In 2013 Fourth International Conference On Computing, Communications And Networking Technologies (ICCCNT), Tiruchengode, India, 2013, Pp. 1-7. Doi:10.1109/ICCCNT.2013.6726842 DOI:

Md Mustafa Md-Muziman-Syah, Et Al, “Machine Learning Cases In Clinical And Biomedical Domains,” International Medical Journal Malaysia, July 2018.

S. S. Khan, S. Ahamed, M. Jannat, S. Shatabda, And D. Md. Farid, “Classification By Clustering (Cbc): An Approach Of Classifying Big Data Based On Similarities,” Springer Nature Singapore, 2020, Pp. 593-605. Https://Doi.Org/10.1007/978-981-13-7564-4_50 . DOI:

Reuben Evans, Bernhard Pfahringer, And Geoffrey Holmes, “Clustering For Classification,” IEEE 7th International Conference On IT In Asia (CITA), 2011. DOI:

Rasoul Kiani, Siamak Mahdavi, And Amin Keshavarzi, “Analysis And Prediction Of Crimes By Clustering And Classification,” (IJARAI) International Journal Of Advanced Research In Artificial Intelligence, Vol. 4, No.8, 2015. DOI:

Asha Gowda Karegowda , M.A. Jayaram, And A.S. Manjunath, “Cascading K-Means Clustering And K-Nearest Neighbor Classifier For Categorization Of Diabetic Patients,” International Journal Of Engineering And Advanced

Technology (IJEAT), Vol. 1, Issue. 3, February 2012.

Yaswanth Kumar Alapati And Korrapati Sindhu, “Combining Clustering With Classification: A Technique To Improve Classification Accuracy,” International Journal Of Computer Science Engineering (IJCSE), Vol. 5, No.06, Nov 2016, Pp. 336-338.

Norsyela Muhammad Noor Mathivanan, Nor Azura Md.Ghani, And Roziah Mohd Janor, “Improving Classification Accuracy Using Clustering Technique,” Bulletin Of Electrical Engineering And Informatics, Vol. 7, No. 3, September 2018, Pp. 465-470 . DOI:

Mundhe, Ms Raksha K., and Ankush Maind. "Automatic labelling and document clustering for forensic analysis." Int J Recent Innovation Trends Comput Commun 2.9 (2014): 2934-41.




How to Cite

alshaibanee, A. faysal, & AlJanabi, K. B. S. (2023). Predicting Class Label Using Clustering-Classification Technique: A Comparative Study. Journal of Kufa for Mathematics and Computer, 10(1), 1–12.

Similar Articles

1 2 3 4 5 6 > >> 

You may also start an advanced similarity search for this article.