Adversarial Attacks on Deep Learning Models

This project explores adversarial attacks on deep learning models, focusing on image classification with PyTorch. It implements and demonstrates the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) attacks against a Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN) trained on the MNIST dataset.

Overview

Deep learning models, despite their high accuracy, are known to be vulnerable to adversarial examples – subtly perturbed inputs designed to cause misclassification. This project aims to:

  • Train basic CNN and RNN models for MNIST digit classification.
  • Implement common gradient-based adversarial attacks (FGSM and PGD); a minimal FGSM sketch follows this list.
  • Visualize the original images, the adversarial perturbations, and the model's predictions to understand the impact of these attacks.
  • Develop defense mechanisms such as adversarial training and gradient masking to improve model robustness against FGSM and PGD attacks.
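
For concreteness, here is a minimal FGSM sketch in PyTorch. It is an illustrative sketch rather than this repository's exact code: model, images, and labels stand for a trained MNIST classifier and a labelled batch from the data loader, and the default epsilon is an assumed example value.

    import torch
    import torch.nn.functional as F

    def fgsm_attack(model, images, labels, epsilon=0.25):
        # Track gradients with respect to the input pixels, not the weights
        images = images.clone().detach().requires_grad_(True)
        loss = F.cross_entropy(model(images), labels)
        loss.backward()
        # Step once by epsilon in the direction of the gradient's sign,
        # then clamp back to the valid pixel range [0, 1]
        adv_images = images + epsilon * images.grad.sign()
        return torch.clamp(adv_images, 0, 1).detach()

Feeding the returned batch back through the model and comparing its predictions against those on the clean inputs is the basic experiment demonstrated in this project.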

Features

  • CNN model definition and training on MNIST using PyTorch.
  • Implementation of the FGSM attack.
  • Implementation of the PGD attack (a minimal sketch follows this list).
  • Visualization script to compare clean images, adversarial images, and model predictions side-by-side.
  • Implementation of the Adversarial Training defense (also sketched below).
  • Implementation of the Gradient Masking Training defense.
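
As a point of reference, a minimal PGD sketch is shown below. It is an assumed, standard formulation (iterated FGSM-style steps with projection back into the epsilon-ball around the clean images), not necessarily the repository's exact implementation; the epsilon, alpha, and step-count defaults are example values.

    import torch
    import torch.nn.functional as F

    def pgd_attack(model, images, labels, epsilon=0.3, alpha=0.01, steps=40):
        adv = images.clone().detach()
        for _ in range(steps):
            adv.requires_grad_(True)
            loss = F.cross_entropy(model(adv), labels)
            grad = torch.autograd.grad(loss, adv)[0]
            # Gradient-sign step, then project back into the epsilon-ball
            # around the clean images and the valid pixel range
            adv = adv.detach() + alpha * grad.sign()
            adv = torch.clamp(adv, images - epsilon, images + epsilon)
            adv = torch.clamp(adv, 0, 1)
        return adv.detach()

The adversarial training defense can be sketched in a similarly hedged way: each batch is augmented with adversarial examples generated on the fly, and the model is trained on both. The equal clean/adversarial loss weighting below is an assumed choice, and the sketch reuses fgsm_attack (and the imports) from the blocks above.

    def adversarial_train_epoch(model, loader, optimizer, epsilon=0.25):
        # Reuses torch.nn.functional as F and fgsm_attack from the sketches above
        model.train()
        for images, labels in loader:
            # Generate adversarial counterparts of the current batch on the fly
            adv_images = fgsm_attack(model, images, labels, epsilon)
            optimizer.zero_grad()
            # Train on an equal mix of clean and adversarial examples
            loss = (F.cross_entropy(model(images), labels)
                    + F.cross_entropy(model(adv_images), labels)) / 2
            loss.backward()
            optimizer.step()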

Example Visualization

Example output showing clean digits vs. FGSM/PGD perturbed digits and the model's predictions:

[Figure: clean vs. adversarial MNIST digits with the model's predictions, one panel for the CNN and one for the RNN]
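
A figure of this kind can be produced with a small matplotlib helper along the lines of the sketch below. It assumes detached, CPU-resident image batches and precomputed prediction lists, and is not necessarily the script that generated the figure above.

    import matplotlib.pyplot as plt

    def show_examples(clean, adv, clean_preds, adv_preds, n=5):
        # Top row: clean digits with predictions; bottom row: adversarial counterparts
        fig, axes = plt.subplots(2, n, figsize=(2 * n, 4))
        for i in range(n):
            axes[0, i].imshow(clean[i].squeeze(), cmap="gray")
            axes[0, i].set_title(f"clean: {clean_preds[i]}")
            axes[1, i].imshow(adv[i].squeeze(), cmap="gray")
            axes[1, i].set_title(f"adv: {adv_preds[i]}")
            axes[0, i].axis("off")
            axes[1, i].axis("off")
        plt.tight_layout()
        plt.show()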

Requirements

  • Python (3.13)
  • uv (for environment and package management)
  • Git

Installation

  1. Clone the repository:

    git clone https://github.com/Vamsi-Dath/Adversarial-Attacks-on-Deep-Learning-Models.git
    cd Adversarial-Attacks-on-Deep-Learning-Models
  2. Create a virtual environment:

    uv venv
  3. Activate the virtual environment:

    • Bash/Zsh (Linux/macOS): source .venv/bin/activate
    • Fish (Linux/macOS): source .venv/bin/activate.fish
    • Cmd (Windows): .venv\Scripts\activate.bat
    • PowerShell (Windows): .venv\Scripts\Activate.ps1
  4. Install dependencies:

    uv sync

Run main.py

uv run main.py

to print the results to the command line,

(or)

PYTHONUNBUFFERED=1 uv run main.py 2>&1 | tee output.txt

to save the results to output.txt (while still printing them to the terminal).

Sample run on macOS

[Screenshot of a sample run on macOS]

Output

Refer to the command-line output or to output.txt.
