CNN from Scratch on MNIST

This project implements a simple Convolutional Neural Network (CNN) using NumPy only, without any deep learning frameworks like TensorFlow or PyTorch.
It trains on the MNIST handwritten digit dataset (28×28 grayscale images) and includes forward and backward propagation, convolution, pooling, dense layers, softmax + cross-entropy loss, and SGD optimizer with momentum + decay.

Achieves ~98% accuracy on MNIST test set after 5 epochs.

🚀 Features

✅ Convolutional layer (Conv2D) with Xavier/He initialization
✅ Max Pooling layer (MaxPool2D)
✅ Fully-connected (dense) layers
✅ ReLU and Softmax activations
✅ Categorical cross-entropy loss
✅ SGD optimizer with momentum and learning rate decay
✅ Custom forward + backward pass
✅ Model saving and loading
✅ Support for custom image predictions

🗂 Dataset

MNIST digits (via keras.datasets.mnist)
60,000 train / 10,000 test samples
Images normalized to [0,1]

🧱 Model Architecture

Input: 28x28x1 Conv2D: 3x3 kernel, 32 filters, padding=1 → ReLU → MaxPool2D: 2x2 Flatten Dense: 128 units → ReLU Dense: 10 units → Softmax

📈 Training

Optimizer: SGD with momentum = 0.9, decay = 1e-3
Batch size: 64
Epochs: 5

Example output:

Epoch 1: Loss = 0.5501, Accuracy = 0.8502 Epoch 2: Loss = 0.4003, Accuracy = 0.8904 ...

💻 Running the code

pip install keras nnfs matplotlib opencv-python pillow

👉 Run the notebook or script:

# Example forward + training loss = forward_pass(X_batch, y_batch) backward_pass(loss_func.output, y_batch) optimizer.pre_update_params() optimizer.update_params(layer2) optimizer.update_params(layer1) optimizer.update_params(conv1) optimizer.post_update_params()

🔍 Custom image prediction

You can load your own image:

test_image = cv2.imread("pred1.png", cv2.IMREAD_GRAYSCALE) test_image = cv2.resize(test_image, (28,28)) test_image = cv2.bitwise_not(test_image) / 255.0

And predict:

conv1.forward(img) ... print("Predicted digit:", predicted_class[0])

Example output:

0: 0.000001 % 1: 0.000002 % ... 8: 99.998234 % 9: 0.000100 % Predicted digit: 8

💾 Saving & Loading model

# Save model saver = ModelSaver() saver.save_model(conv1, layer1, layer2, optimizer) # Load model saver.load_model(conv1, layer1, layer2, optimizer)

Weights, biases, optimizer state saved in .npz file.

📊 Visualizing wrong predictions

Plots incorrect guesses with predicted + true labels:

for idx in first_10_incorrect: plt.imshow(X_test[idx], cmap="gray") print(f"Predicted: {test_predictions[idx]}, True: {test_true[idx]}")

📌 File structure

├── cnn_mnist.py / notebook.ipynb ├── model.npz # Saved model ├── pred1.png # Example custom test image └── README.md

🙌 Credits

Inspired by NNFS book and built using NumPy, OpenCV, Matplotlib.

⭐️ Future work

Add more conv layers
Add dropout / batchnorm
Add support for other datasets (e.g. CIFAR-10)

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE		LICENSE
MNIST_Detection.ipynb		MNIST_Detection.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CNN from Scratch on MNIST

🚀 Features

🗂 Dataset

🧱 Model Architecture

📈 Training

💻 Running the code

🔍 Custom image prediction

💾 Saving & Loading model

📊 Visualizing wrong predictions

📌 File structure

🙌 Credits

⭐️ Future work

About

Uh oh!

Releases 1

Packages

Languages

License

Anonymous390/NNFS-MNIST-with-Convolution

Folders and files

Latest commit

History

Repository files navigation

CNN from Scratch on MNIST

🚀 Features

🗂 Dataset

🧱 Model Architecture

📈 Training

💻 Running the code

🔍 Custom image prediction

💾 Saving & Loading model

📊 Visualizing wrong predictions

📌 File structure

🙌 Credits

⭐️ Future work

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages