Review of Feature Selection Techniques for Credit Card Fraud Detection

Review of Feature Selection Techniques for Credit Card Fraud Detection
Author	Huanjing Wang
Co-Author(s)	Taghi M. Khoshgoftaar
Abstract	Detecting credit card fraud is essential for financial institutions to prevent financial losses due to fraudulent transactions. Our research focuses on the Credit Card Fraud Detection Dataset, which is a widely used real-world transaction dataset, to identify the most crucial features that contribute to predicting fraudulent activities. Selecting the most relevant features helps reduce the amount of data that needs processing, saving time and computational resources. To overcome the common problems of high-dimensionality and class-imbalance in fraud detection datasets, we review a range of feature selection methods that are used either on their own or in combination with data sampling. Feature selection and data sampling have been successfully applied in credit card fraud detection, with tree-based feature selection methods such as Random Forest, Boruta, and Extra Tree classifier, as well as Synthetic Minority Oversampling Technique (SMOTE), being the most commonly used preprocessing techniques for this dataset.
Keywords	Feature Selection, Credit Card Fraud, Class Imbalance, Data Sampling, Machine Learning

		Article #: RQD28-396

Proceedings of 28th ISSAT International Conference on Reliability & Quality in Design
August 3-5, 2023

	International Society of Science and Applied Technologies