This repository contains a machine learning project aimed at tracking and predicting illegal funds. By analyzing past transactions data, we develop models that can track the illicit funds for IBM, helping IBM optimize and eradicate fraudulent transactions.
- Update config.yaml
- Update schema.yaml
- Update params.yaml
- Update the config_entity.py
- Update the configuration manager in src/config
- Update the components
- Update the pipeline
- Update main.py
Project Overview Dataset Installation Usage Model Building Evaluation Contributing License
The detection and prevention of money laundering activities is a critical challenge for financial institutions. This IBM anti-laundering transaction project leverages machine learning techniques to predict the likelihood of suspicious transactions based on historical data.
The dataset can be downloaded from [This Link]https://www.kaggle.com/datasets/ealtman2019/ibm-transactions-for-anti-money-laundering-aml/data.
git clone https://github.com/Somtochukwu-Achikanu/IBM-anti-laundering-transaction.git
bash init_setup.sh
python main.py
dvc.init
dvc.repro
We perform feature engineering to enhance the predictive power of our models:
- Encoding categorical variables
- Scaling Numerical features
The project uses several machine learning algorithms to build predictive models, including:
- Logistic Regression
- K Nearest neighbors
- Gradient Boosting
- Support Vector Machines (SVM)
- Decision Trees
The performance of the models is evaluated using the following metrics:
- Accuracy
- Precision
- Recall
- F1 Score
- ROC AUC
We welcome contributions to improve the project. Here are some ways you can contribute:
- Fix bugs and issues
- Add new features or models
- Improve documentation
To contribute, please fork the repository, create a new branch, and submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.
Feel free to customize this README.md file as per your specific project requirements. This template should give you a solid starting point for documenting your marketing campaign prediction use case using machine learning.