<?xml version="1.0" encoding="UTF-8"?>
<article>

    <title>Analysis of an Educational dataset using Classification Algorithm in Conjunction with Wrapper Feature Selection Methods</title>

    <slug>analysis-of-an-educational-dataset-using-classification-algorithm-in-conjunction-with-wrapper-feature-selection-methods</slug>

    
            <parent>
            <title>Volume 5, Issue 2</title>
        </parent>
    
    
            <post_type>
            <title>ARTICLES</title>
        </post_type>
    
    	
	
	<year>2025</year>

    
	<volume>5</volume>
	
    
    <content><![CDATA[<p><span class="fontstyle0">Feature selection plays a critical role in improving the efficiency, accuracy, and interpretability of machine learning models, particularly when dealing with high-dimensional datasets. Among various approaches, wrapper-based feature selection methods are known for their ability to capture feature interactions by directly optimizing model performance. This study presents a comprehensive comparative analysis of six wrapper feature selection techniques—Recursive Feature Elimination (RFE), Sequential Forward Selection (SFS), Sequential Backward Selection (SBS), Genetic Algorithm (GA), Particle Swarm Optimization (PSO), and Differential Evolution (DE)—in conjunction with five widely used classification algorithms: Decision Tree, K-Nearest Neighbour, Random Forest, Logistic Regression, and Support Vector Machine. Experiments are conducted on an educational dataset comprising 395 student records with 30 attributes obtained from the UCI repository, using different feature subset sizes (all features, top 20, top 15, and top 10). Model performance is evaluated using accuracy, precision, recall, F1-score, and AUC. The results demonstrate that wrapper methods significantly enhance classification performance while reducing dimensionality, with GA and RFE consistently emerging as the most effective techniques across multiple classifiers. DE also shows strong performance, particularly with Logistic Regression and Random Forest, whereas PSO generally underperforms in terms of AUC. Furthermore, reducing the feature set does not adversely affect predictive accuracy and, in several cases, leads to improved generalization. The findings confirm the effectiveness of wrapper methods for educational data mining and provide practical insights for selecting optimal feature–classifier combinations.</span></p>]]></content>

    
            <references><![CDATA[<p><span class="fontstyle0">Asif, R., Merceron, A., &amp; Pathan, M. K. (2017). Predicting student academic performance at degree level: A case study. </span><span class="fontstyle2">International Journal of Intelligent Systems and Applications, 9</span><span class="fontstyle0">(12), 49–61. https://doi.org/10.5815/ijisa.2017.12.06</span></p>
<p> </p>
<p><span class="fontstyle0">Bin Mat, U., Buniyamin, N., Arsad, P. M., &amp; Kassim, R. A. (2014). An overview of using academic analytics to predict and improve students’ achievement: A proposed proactive intelligent intervention. In </span><span class="fontstyle2">Proceedings of the 2013 IEEE 5th International Conference on Engineering Education (ICEED) </span><span class="fontstyle0">(pp. 126–130). IEEE.</span></p>
<p> </p>
<p><span class="fontstyle0">Chandrashekar, G., &amp; Sahin, F. (2014). A survey on feature selection methods. </span><span class="fontstyle2">Computers &amp; Electrical Engineering, 40</span><span class="fontstyle0">(1), 16–28.</span></p>
<p> </p>
<p><span class="fontstyle0">Das, S., &amp; Suganthan, P. N. (2011). Differential evolution: A survey of the state-of-the-art. </span><span class="fontstyle2">IEEE Transactions on Evolutionary Computation, 15</span><span class="fontstyle0">(1), 4–31. https://doi.org/10.1109/ TEVC.2010.2059031</span></p>
<p> </p>
<p><span class="fontstyle0">Dash, M., &amp; Liu, H. (1997). Feature selection for classification. </span><span class="fontstyle2">Intelligent Data Analysis, 1</span><span class="fontstyle0">(3), 131–156.</span></p>
<p> </p>
<p><span class="fontstyle0">Dubois, P. F. (2007). Python: Batteries included. </span><span class="fontstyle2">Computing in Science &amp; Engineering, 9</span><span class="fontstyle0">(3), 7–9. https://doi.org/10.1109/MCSE.2007.58</span></p>
<p> </p>
<p><span class="fontstyle0">Feng, C., &amp; Xu, X. (2019). A hybrid feature selection method based on RFE and Relief for educational data mining. </span><span class="fontstyle2">Computers in Human Behavior, 101</span><span class="fontstyle0">, 228–237. https://doi.org/10.1016/j. chb.2019.07.002</span></p>
<p> </p>
<p><span class="fontstyle0">Goldberg, D. E. (1989). </span><span class="fontstyle2">Genetic algorithms in search, optimization, and machine learning</span><span class="fontstyle0">. Addison-Wesley.</span></p>
<p> </p>
<p><span class="fontstyle0">Guyon, I., &amp; Elisseeff, A. (2003). An introduction to variable and feature selection. </span><span class="fontstyle2">Journal of Machine Learning Research, 3</span><span class="fontstyle0">, 1157–1182.</span></p>
<p> </p>
<p> </p>
<p> </p>
<p><span class="fontstyle4">Mamta Saxena, Netra Pal Singh</span></p>
<p> </p>
<p><span class="fontstyle5">Analysis of an Educational dataset using Classification Algorithm in Conjunction with Wrapper Feature Selection Methods</span></p>
<p> </p>
<p><span class="fontstyle0">Guyon, I., Weston, J., Barnhill, S., &amp; Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. </span><span class="fontstyle2">Machine Learning, 46</span><span class="fontstyle0">(1–3), 389–422. https://doi.org/10.1023/A:1012487302797</span></p>
<p> </p>
<p><span class="fontstyle0">Holland, J. H. (1992). </span><span class="fontstyle2">Adaptation in natural and artificial systems: An introductory analysis with applications to biology, control, and artificial intelligence</span><span class="fontstyle0">. MIT Press.</span></p>
<p> </p>
<p><span class="fontstyle0">Islam, M. M., &amp; Murase, K. (2018). A hybrid wrapper–filter approach for feature selection using mutual information and differential evolution. </span><span class="fontstyle2">Knowledge-Based Systems, 147</span><span class="fontstyle0">, 125–134. https://doi.org/10.1016/j.knosys.2018.01.002</span></p>
<p> </p>
<p><span class="fontstyle0">Jain, A. K., &amp; Zongker, D. (1997). Feature selection: Evaluation, application, and small sample performance. </span><span class="fontstyle2">IEEE Transactions on Pattern Analysis and Machine Intelligence, 19</span><span class="fontstyle0">(2), 153–158. https://doi.org/10.1109/34.574797</span></p>
<p> </p>
<p><span class="fontstyle0">Kennedy, J., &amp; Eberhart, R. (1995). Particle swarm optimization. In </span><span class="fontstyle2">Proceedings of the IEEE International Conference on Neural Networks </span><span class="fontstyle0">(pp. 1942–1948). IEEE.</span></p>
<p> </p>
<p><span class="fontstyle0">Khan, M. J., &amp; Jawaid, A. (2020). Prediction of at-risk students using genetic algorithm and random forest. </span><span class="fontstyle2">Education and Information Technologies, 25</span><span class="fontstyle0">(4), 2979–2997.</span></p>
<p> </p>
<p><span class="fontstyle0">Kohavi, R., &amp; John, G. H. (1997). Wrappers for feature subset selection. </span><span class="fontstyle2">Artificial Intelligence, 97</span><span class="fontstyle0">(1–2), 273–324.</span></p>
<p> </p>
<p><span class="fontstyle0">Li, J., Cheng, K., Wang, S., Morstatter, F., Trevino, R. P., Tang, J., &amp; Liu, H. (2017). Feature selection: A data perspective. </span><span class="fontstyle2">ACM Computing Surveys, 50</span><span class="fontstyle0">(6), 1–45.</span></p>
<p> </p>
<p><span class="fontstyle0">Liu, H., &amp; Yu, L. (2005). Toward integrating feature selection algorithms for classification and clustering. </span><span class="fontstyle2">IEEE Transactions on Knowledge and Data Engineering, 17</span><span class="fontstyle0">(4), 491–502. https:// doi.org/10.1109/TKDE.2005.66</span></p>
<p> </p>
<p><span class="fontstyle0">Maldonado, S., &amp; Weber, R. (2009). A wrapper method for feature selection using support vector machines. </span><span class="fontstyle2">Information Sciences, 179</span><span class="fontstyle0">(13), 2208–2217.</span></p>
<p> </p>
<p><span class="fontstyle0">Millman, K. J., &amp; Aivazis, M. A. (2011). Python for scientists and engineers. </span><span class="fontstyle2">Computing in Science &amp; Engineering, 13</span><span class="fontstyle0">(2), 9–12. https://doi.org/10.1109/MCSE.2011.36</span></p>
<p> </p>
<p><span class="fontstyle0">Moslehi, F., &amp; Haeri, A. (2020). A novel hybrid wrapper–filter approach based on genetic algorithm and particle swarm optimization for feature subset selection. </span><span class="fontstyle2">Journal of Ambient Intelligence and Humanized Computing, 11</span><span class="fontstyle0">, 1105–1127.</span></p>
<p> </p>
<p><span class="fontstyle0">Patel, D., Saxena, A., &amp; Wang, J. (2024). A machine-learning-based wrapper method for feature selection. </span><span class="fontstyle2">International Journal of Data Warehousing and Mining</span><span class="fontstyle0">. https://doi.org/10.4018/ IJDWM.352041</span></p>
<p> </p>
<p><span class="fontstyle0">Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., … Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. </span><span class="fontstyle2">Journal of Machine Learning Research, 12</span><span class="fontstyle0">, 2825–2830.</span></p>
<p> </p>
<p><span class="fontstyle0">Pudil, P., Novovicová, J., &amp; Kittler, J. (1994). Floating search methods in feature selection. </span><span class="fontstyle2">Pattern Recognition Letters, 15</span><span class="fontstyle0">(11), 1119–1125. https://doi.org/10.1016/0167-8655(94)90127-9</span></p>
<p> </p>
<p><span class="fontstyle0">Saeys, Y., Inza, I., &amp; Larrañaga, P. (2007). A review of feature selection techniques in bioinformatics. </span><span class="fontstyle2">Bioinformatics, 23</span><span class="fontstyle0">(19), 2507–2517.</span></p>
<p> </p>
<p> </p>
<p><span class="fontstyle0">Saxena, M., &amp; Singh, N. P. (2025). Role of Hybrid Feature Selection Algorithms in Foreseeing Stu dent Performance. International Journal of Natural and Technical Sciences, 5(1), 43-75. https://doi.org/10.69648/NVAX8158</span></p>
<p> </p>
<p><span class="fontstyle0">Sharma, M., &amp; Kaur, P. (2020). An efficient differential evolution approach for feature selection in educational data mining. </span><span class="fontstyle2">International Journal of Cognitive Computing in Engineering, 1</span><span class="fontstyle0">, 33–43.</span></p>
<p> </p>
<p><span class="fontstyle0">Siedlecki, W., &amp; Sklansky, J. (1989). A note on genetic algorithms for large-scale feature selection. </span><span class="fontstyle2">Pattern Recognition Letters, 10</span><span class="fontstyle0">(5), 335–347.</span></p>
<p> </p>
<p><span class="fontstyle0">Singh, A. K., &amp; Karthikeyan, S. (2024). A wrapper feature selection method for predicting student dropout in higher education. </span><span class="fontstyle2">SSRN Electronic Journal</span><span class="fontstyle0">. https://doi.org/10.2139/ ssrn.5002077</span></p>
<p> </p>
<p><span class="fontstyle0">Sivanandam, S. N., &amp; Deepa, S. N. (2007). </span><span class="fontstyle2">Introduction to genetic algorithms</span><span class="fontstyle0">. Springer. https://doi. org/10.1007/978-3-540-73190-0</span></p>
<p> </p>
<p><span class="fontstyle0">Storn, R., &amp; Price, K. (1997). Differential evolution – A simple and efficient heuristic for global optimization over continuous spaces. </span><span class="fontstyle2">Journal of Global Optimization, 11</span><span class="fontstyle0">(4), 341–359.</span></p>
<p> </p>
<p><span class="fontstyle0">Tang, J., Alelyani, S., &amp; Liu, H. (2014). Feature selection for classification: A review. In H. Liu &amp; L. Yu (Eds.), </span><span class="fontstyle2">Data classification: Algorithms and applications </span><span class="fontstyle0">(pp. 37–64). CRC Press.</span></p>
<p> </p>
<p><span class="fontstyle0">Tomasevic, N., Gvozdenovic, N., &amp; Vranes, S. (2019). An overview and comparison of supervised data mining techniques for student exam performance prediction. </span><span class="fontstyle2">Computers &amp; Education, 143</span><span class="fontstyle0">, 103676. https://doi.org/10.1016/j.compedu.2019.103676</span></p>
<p> </p>
<p><span class="fontstyle0">Waheed, H., Goyal, M., &amp; Khanum, A. (2020). Machine learning-based early prediction of students’ performance using PSO. </span><span class="fontstyle2">Education and Information Technologies, 25</span><span class="fontstyle0">(6), 4693–4711.</span></p>
<p> </p>
<p><span class="fontstyle0">Xue, B., Zhang, M., &amp; Browne, W. N. (2015). A comprehensive comparison on evolutionary feature selection approaches to classification. </span><span class="fontstyle2">International Journal of Computational Intelligence and Applications, 14</span><span class="fontstyle0">(2), 1550008.</span></p>
<p> </p>
<p><span class="fontstyle0">Xue, B., Zhang, M., &amp; Browne, W. N. (2016). A survey on evolutionary computation appr</span></p>]]></references>
    
            <keywords>Classification Algorithms, Feature Selection, Performance
Metrics, Wrapper Methods</keywords>
    
    <date></date>

    <url>https://ijtns.ibupress.com/articles/analysis-of-an-educational-dataset-using-classification-algorithm-in-conjunction-with-wrapper-feature-selection-methods</url>

</article>