Upload folder using huggingface_hub
- .DS_Store +0 -0
- checkpoints/complete_diffusion_model.pth +3 -0
- checkpoints/diffusion_model_final.pth +3 -0
- checkpoints/inference_example.py +29 -0
- checkpoints/model_info.json +36 -0
- cifar10-diffusion-model.zip +3 -0
- implementation.ipynb +0 -0
- readme.md +92 -0
.DS_Store
ADDED
Binary file (6.15 kB).
checkpoints/complete_diffusion_model.pth
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4266de3549124b61530bf87d88d108d0dca3602161f47a9a0979af9fd0d76c71
size 67281530
checkpoints/diffusion_model_final.pth
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e3766a051e2774e04e0d07f33b86faf4e14581077660e8882851a3016c23f2c8
size 201861354
checkpoints/inference_example.py
ADDED
@@ -0,0 +1,29 @@
# Inference script for the trained diffusion model
import torch
import torch.nn as nn
import torch.nn.functional as F
import matplotlib.pyplot as plt
from tqdm import tqdm
import math

# [Copy all the model architecture classes here - TimeEmbedding, ResidualBlock, etc.]

def load_model(checkpoint_path, device='cuda'):
    """Load the trained diffusion model"""
    checkpoint = torch.load(checkpoint_path, map_location=device)

    # Initialize model with saved config
    model = SimpleUNet(**checkpoint['model_config'])
    model.load_state_dict(checkpoint['model_state_dict'])
    model.to(device)
    model.eval()

    # Initialize scheduler
    scheduler = DDPMScheduler(**checkpoint['diffusion_config'], device=device)

    return model, scheduler, checkpoint['model_info']

# Usage example:
# model, scheduler, info = load_model('complete_diffusion_model.pth')
# generated_images = generate_images(model, scheduler, num_images=4)
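The usage comment above calls `generate_images`, which is defined in `implementation.ipynb` rather than in this script. As a rough guide, a minimal DDPM ancestral sampler could look like the sketch below; the attribute names on `scheduler` (`num_timesteps`, `betas`, `alphas`, `alphas_cumprod`) and the `model(x, t)` call signature are assumptions, not the repository's confirmed API.

```python
# Hypothetical sketch of a DDPM ancestral sampler -- the real generate_images
# lives in implementation.ipynb. Scheduler attribute names are assumptions.
import torch

@torch.no_grad()
def generate_images(model, scheduler, num_images=4, image_size=32, device='cuda'):
    # Start from pure Gaussian noise x_T
    x = torch.randn(num_images, 3, image_size, image_size, device=device)
    for t in reversed(range(scheduler.num_timesteps)):
        t_batch = torch.full((num_images,), t, device=device, dtype=torch.long)
        eps = model(x, t_batch)  # predicted noise eps_theta(x_t, t)
        beta_t = scheduler.betas[t]
        alpha_t = scheduler.alphas[t]
        alpha_bar_t = scheduler.alphas_cumprod[t]
        # DDPM posterior mean: (x_t - beta_t / sqrt(1 - alpha_bar_t) * eps) / sqrt(alpha_t)
        mean = (x - beta_t / torch.sqrt(1.0 - alpha_bar_t) * eps) / torch.sqrt(alpha_t)
        if t > 0:
            # Add noise with variance beta_t at every step except the last
            x = mean + torch.sqrt(beta_t) * torch.randn_like(x)
        else:
            x = mean
    return x.clamp(-1, 1)
```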
checkpoints/model_info.json
ADDED
@@ -0,0 +1,36 @@
{
  "model_name": "CIFAR-10 Diffusion Model",
  "architecture": "SimpleUNet",
  "dataset": "CIFAR-10",
  "training_details": {
    "epochs": 20,
    "batch_size": 128,
    "learning_rate": 0.0001,
    "optimizer": "AdamW",
    "scheduler": "CosineAnnealingLR",
    "parameters": 16808835,
    "training_time_minutes": 14.54,
    "final_loss": 0.0363,
    "best_loss": 0.0358
  },
  "model_config": {
    "in_channels": 3,
    "out_channels": 3,
    "time_emb_dim": 128,
    "image_size": 32
  },
  "diffusion_config": {
    "num_timesteps": 1000,
    "beta_start": 0.0001,
    "beta_end": 0.02,
    "schedule": "linear"
  },
  "hardware": {
    "gpu": "NVIDIA GeForce RTX 3060",
    "vram_used": "0.43 GB",
    "total_vram": "11.66 GB"
  },
  "created_date": "2025-07-19T17:59:48.665409",
  "framework": "PyTorch",
  "python_version": "3.12"
}
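The `diffusion_config` block above fully determines the noise schedule under the standard DDPM formulation. As an illustration (not the repository's `DDPMScheduler` implementation), the quantities a sampler needs can be derived from those three numbers:

```python
# Noise schedule implied by diffusion_config (standard DDPM linear schedule;
# illustrative, not copied from the repository's DDPMScheduler).
import torch

num_timesteps, beta_start, beta_end = 1000, 0.0001, 0.02

betas = torch.linspace(beta_start, beta_end, num_timesteps)  # beta_t
alphas = 1.0 - betas                                         # alpha_t
alphas_cumprod = torch.cumprod(alphas, dim=0)                # alpha_bar_t

# Closed-form forward process: x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps.
# By the last step alpha_bar is ~4e-5, so x_T is effectively pure Gaussian noise.
print(alphas_cumprod[0].item(), alphas_cumprod[-1].item())
```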
cifar10-diffusion-model.zip
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:514be590d30a8d5a8207e3f18cb7fd46d4aebbc8fb2130df30645e20dff9b412
size 246459190
implementation.ipynb
ADDED
The diff for this file is too large to render.
readme.md
ADDED
@@ -0,0 +1,92 @@
# CIFAR-10 Diffusion Model

🎨 **A diffusion model trained from scratch on the CIFAR-10 dataset**

## Model Details
- **Architecture**: SimpleUNet with 16.8M parameters
- **Dataset**: CIFAR-10 (50,000 training images)
- **Training Time**: 14.54 minutes on RTX 3060
- **Final Loss**: 0.0363
- **Image Size**: 32x32 RGB
- **Framework**: PyTorch

## Quick Start

```python
import torch
from model import SimpleUNet, DDPMScheduler, generate_images

# Load the trained model
checkpoint = torch.load('complete_diffusion_model.pth')
model = SimpleUNet(**checkpoint['model_config'])
model.load_state_dict(checkpoint['model_state_dict'])
model.eval()

# Initialize scheduler
scheduler = DDPMScheduler(**checkpoint['diffusion_config'])

# Generate images
generated_images = generate_images(model, scheduler, num_images=8)
```

## Installation

```bash
pip install "torch>=2.0.0" "torchvision>=0.15.0" matplotlib tqdm pillow numpy
```

## Files Included
- `complete_diffusion_model.pth` - Complete model with config (64MB)
- `model_info.json` - Training details and metadata
- `diffusion_model_final.pth` - Final training checkpoint (193MB)
- `inference_example.py` - Ready-to-use inference script

## Training Details
- **Epochs**: 20
- **Batch Size**: 128
- **Learning Rate**: 1e-4 (CosineAnnealingLR)
- **Optimizer**: AdamW
- **GPU**: NVIDIA RTX 3060 (0.43GB VRAM used)
- **Loss Reduction**: 73% (from 0.1349 to 0.0363)

## Hardware Requirements
- **Minimum**: 1GB VRAM for inference
- **Recommended**: 2GB+ VRAM for training extensions
- **CPU**: Works, but noticeably slower

## Results
The model generates colorful abstract patterns that capture CIFAR-10's color distributions. With more training epochs (50-100), it should produce more recognizable objects.

## Improvements
To get better results:
1. **Train longer**: 50-100 epochs instead of 20
2. **Larger model**: Increase channels/layers
3. **Advanced sampling**: DDIM or DPM-Solver (see the sketch after this list)
4. **Richer datasets**: CelebA, ImageNet
5. **Learning rate**: Experiment with schedules

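As a sketch of improvement 3: deterministic DDIM sampling reuses the same trained noise predictor but steps through a strided subset of timesteps, cutting 1000 denoising steps down to around 50. This is an illustrative implementation assuming the `model(x, t)` noise-prediction interface and the `alphas_cumprod` tensor from the linear schedule, not code shipped with this model:

```python
# Deterministic DDIM sampler sketch (eta = 0); assumes the model predicts
# noise and alphas_cumprod comes from the same linear schedule. Illustrative only.
import torch

@torch.no_grad()
def ddim_sample(model, alphas_cumprod, num_images=4, steps=50, image_size=32, device='cuda'):
    alphas_cumprod = alphas_cumprod.to(device)
    # Stride the 1000 training timesteps down to `steps` inference timesteps
    timesteps = torch.linspace(len(alphas_cumprod) - 1, 0, steps).long()
    x = torch.randn(num_images, 3, image_size, image_size, device=device)
    for i, t in enumerate(timesteps):
        t_batch = torch.full((num_images,), int(t), device=device, dtype=torch.long)
        eps = model(x, t_batch)
        a_t = alphas_cumprod[int(t)]
        a_prev = alphas_cumprod[int(timesteps[i + 1])] if i + 1 < steps else torch.tensor(1.0, device=device)
        # Predict x_0 from (x_t, eps), then jump deterministically to the previous timestep
        x0_pred = (x - torch.sqrt(1 - a_t) * eps) / torch.sqrt(a_t)
        x = torch.sqrt(a_prev) * x0_pred + torch.sqrt(1 - a_prev) * eps
    return x.clamp(-1, 1)
```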
## Model Architecture
- **U-Net based** with ResNet blocks
- **Time embedding** for diffusion timesteps (see the sketch below)
- **Attention layers** at multiple resolutions
- **Skip connections** for better gradient flow

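For reference, the standard sinusoidal timestep embedding from the Transformer/DDPM literature is sketched below, using the `time_emb_dim: 128` from `model_config`; the repository's actual `TimeEmbedding` module may differ in details.

```python
# Standard sinusoidal timestep embedding (Vaswani et al. / DDPM); the
# repository's TimeEmbedding module may differ in details.
import math
import torch

def sinusoidal_time_embedding(t: torch.Tensor, dim: int = 128) -> torch.Tensor:
    """Map integer timesteps of shape (batch,) to embeddings of shape (batch, dim)."""
    half = dim // 2
    freqs = torch.exp(-math.log(10000.0) * torch.arange(half, device=t.device) / half)
    args = t.float()[:, None] * freqs[None, :]                    # (batch, half)
    return torch.cat([torch.sin(args), torch.cos(args)], dim=-1)  # (batch, dim)

emb = sinusoidal_time_embedding(torch.tensor([0, 10, 999]))
print(emb.shape)  # torch.Size([3, 128])
```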
## Citation
```bibtex
@misc{cifar10-diffusion-2025,
  title={CIFAR-10 Diffusion Model},
  author={Your Name},
  year={2025},
  url={https://github.com/your-username/cifar10-diffusion}
}
```

## License
MIT License - Feel free to use and modify!

---
**Created**: July 19, 2025
**Training Time**: 14.54 minutes
**GPU**: NVIDIA RTX 3060
**Framework**: PyTorch