International Society of Science and Applied Technologies |
|
Review of Feature Selection Techniques for Credit Card Fraud Detection | ||||
Author | Huanjing Wang
|
|||
Co-Author(s) | Taghi M. Khoshgoftaar
|
|||
Abstract | Detecting credit card fraud is essential for financial institutions to prevent financial losses due to fraudulent transactions. Our research focuses on the Credit Card Fraud Detection Dataset, which is a widely used real-world transaction dataset, to identify the most crucial features that contribute to predicting fraudulent activities. Selecting the most relevant features helps reduce the amount of data that needs processing, saving time and computational resources. To overcome the common problems of high-dimensionality and class-imbalance in fraud detection datasets, we review a range of feature selection methods that are used either on their own or in combination with data sampling. Feature selection and data sampling have been successfully applied in credit card fraud detection, with tree-based feature selection methods such as Random Forest, Boruta, and Extra Tree classifier, as well as Synthetic Minority Oversampling Technique (SMOTE), being the most commonly used preprocessing techniques for this dataset.
|
|||
Keywords | Feature Selection, Credit Card Fraud, Class Imbalance, Data Sampling, Machine Learning | |||
Article #: RQD28-396 |
Proceedings of 28th ISSAT International Conference on Reliability & Quality in Design |