Implemented as a Multi-Layer Perceptron to classify handwritten Digits (0-9)

[Annotated Notebook]

Model Architecture and Results

The model comprises a flattening layer and three linear layers ((256, 64) hidden dimensions) with relus to approximate non-linearity. It achieves 95.6% accuracy after 15 training epochs and batch size = 64. Taining and Test MNIST datasets are loaded with PyTorch dataloaders.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.