Deep Learning & Computer Vision

// Curriculum Highlights

What You'll Learn

🧠

Neural Networks from ScratchPerceptrons, activation functions, forward & backprop — built in NumPy first

🔥

PyTorch FundamentalsTensors, autograd, nn.Module, DataLoader — the framework used in industry & research

🏗️

Convolutional Neural NetworksConv layers, pooling, stride, receptive field, batch normalisation

🏆

Famous ArchitecturesLeNet → AlexNet → VGG → ResNet — why each innovation mattered

🔄

Transfer LearningFine-tune pretrained ResNet50/EfficientNet on custom datasets in hours, not weeks

🎯

Object DetectionYOLO architecture, bounding boxes, anchor-based detection on real images

🖼️

Image SegmentationU-Net, semantic vs instance segmentation, medical imaging applications

⚡

Training at ScaleGPU training on Colab, learning rate schedulers, mixed precision, early stopping

🔍

Explainability (Grad-CAM)Visualise what your CNN actually looks at — essential for medical & safety-critical use

📦

Model Export & DeploymentONNX export, TorchScript, serve predictions via FastAPI

// Architecture Deep Dives

4 Architectures You'll Build

You don't just use pretrained weights — you understand why each architecture was designed. Click each to explore.

🏗️

Simple CNN — Image Classification

Your first convolutional network built from scratch in PyTorch. Every layer placed intentionally: why 3×3 kernels, why max-pool, why ReLU over sigmoid in hidden layers.

3Conv Blocks

~95%CIFAR-10 Acc

Week 2Built In

Image Classification CIFAR-10 Custom Datasets PyTorch nn.Module

🔄

ResNet-50 with Transfer Learning

Fine-tune a pretrained ResNet-50 on a custom image dataset (plant disease, product defects, or your own choice). Understand skip connections, why they solve vanishing gradients, and how to replace the classifier head.

50Layers

~97%Fine-tuned Acc

Week 4Built In

Transfer Learning Custom Dataset Skip Connections torchvision

🎯

YOLOv5 — Real-Time Object Detection

Run and fine-tune YOLOv5 on a custom object detection dataset. Understand anchor boxes, IoU, NMS, and how to label your own data with Roboflow. Build a live webcam demo.

~45 FPSInference Speed

[email protected]Metric Used

Week 5Built In

Object Detection Custom Labels Roboflow Real-time Inference

🖼️

U-Net — Image Segmentation

Build U-Net from scratch for semantic segmentation. Applied to a medical imaging dataset (chest X-ray lung segmentation). Understand encoder-decoder architecture and skip connections for pixel-wise prediction.

Dice ~0.89Metric

Pixel-wisePrediction

Week 6Capstone

Segmentation Medical Imaging Encoder-Decoder Dice Loss

// Why PyTorch?

PyTorch-First — Here's Why

We teach PyTorch exclusively. This is a deliberate choice based on where the industry and research community has moved. Here's the full picture:

Dimension	PyTorch ✓ (taught here)	TensorFlow / Keras
Research adoption	~80% of ML papers use PyTorch Winner	Declining in research
Industry (AI startups)	Default choice at most AI startups Winner	Used in legacy systems
Debugging experience	Python-native, easy to step through Winner	Graph mode harder to debug
Dynamic computation	Dynamic by default — intuitive Winner	Static graph (Keras helps)
HuggingFace / LLMs	Near-universal PyTorch backend Winner	Limited support
Interview signal	Expected at most AI interviews Winner	Acceptable but secondary

// After This Course

Career Outcomes

👁️

Computer Vision Engineer

CV engineers are in high demand at autonomous vehicle, surveillance, medtech, and manufacturing companies.

₹12–24 LPA fresher range

🤖

Deep Learning Engineer

Roles at AI labs, product startups, and R&D divisions of large tech firms building neural-network-powered products.

₹14–28 LPA fresher range

🏥

AI in Healthcare / MedTech

Medical imaging AI is one of the fastest-growing CV application areas. U-Net and segmentation skills are directly applicable.

High growth sector

🔬

Research Engineer / Intern

Top research internships (IISc, IITB, CMU remote) require PyTorch fluency and an understanding of CNN architectures.

₹60–150k/month stipend

// This course is for

🤖 Those who completed ML Mastery or have strong Scikit-learn experience

🎓 B.Tech/M.Tech students interested in computer vision research or industry

💼 ML engineers looking to add DL & CV specialisation to their skill set

❌ Not for: beginners — this requires Python fluency and ML fundamentals

// Week by Week

Full Curriculum — 6 Weeks

Perceptron → MLP: forward pass, weights, biases
Activation functions: ReLU, sigmoid, tanh, GELU — when to use each
Backpropagation: chain rule, gradient flow, vanishing gradients
NumPy MLP from scratch on MNIST
PyTorch: tensors, autograd, nn.Module, optim
Training loop: forward → loss → backward → step
GPU setup on Google Colab — move tensors to CUDA

Convolution: kernels, stride, padding, receptive field
Pooling: max vs average, global average pooling
Batch Normalisation: why it works, where to place it
Dropout: regularisation for deep networks
Build LeNet → AlexNet → VGG in PyTorch step by step
DataLoader, transforms, augmentation with torchvision
Train on CIFAR-10: reach 93%+ accuracy

ResNet: skip connections, residual blocks, why depth works now
EfficientNet: compound scaling, MobileNet-style efficiency
Learning rate schedulers: StepLR, CosineAnnealing, OneCycleLR
Mixed precision training (torch.cuda.amp) — 2x speed on Colab
Data augmentation: RandomCrop, Cutout, MixUp
Grad-CAM: visualise what the network attends to
TensorBoard logging for training diagnostics

Transfer learning: feature extraction vs fine-tuning
Load pretrained ResNet-50 via torchvision.models
Replace classifier head for custom number of classes
Freeze/unfreeze layers — progressive unfreezing strategy
Custom ImageFolder dataset: organise, load, augment
Project: plant disease classification (38 classes, 87k images)
Confusion matrix, per-class accuracy, misclassification analysis

Object detection fundamentals: bounding boxes, IoU, NMS
Two-stage (R-CNN) vs one-stage (YOLO) detectors
Anchor boxes: intuition and design
YOLOv5 architecture walkthrough and configuration
Label custom data with Roboflow (free tier)
Fine-tune YOLOv5 on custom dataset
[email protected] evaluation, precision-recall curves
Live inference: webcam demo in OpenCV

Semantic vs instance vs panoptic segmentation
U-Net architecture: encoder-decoder, skip connections
Dice loss, IoU loss for segmentation tasks
Project: chest X-ray lung segmentation
ONNX export: framework-agnostic model format
TorchScript: serialise for production
FastAPI endpoint for image inference
Capstone: end-to-end CV system of your choice

// Hands-on Work

4 GPU-Powered Projects

PROJECT 01

CIFAR-10 CNN Classifier

Build a custom CNN from scratch in PyTorch. Beat 90% accuracy, visualise filters, and interpret predictions with Grad-CAM heatmaps.

PyTorchtorchvisionGrad-CAMTensorBoard

PROJECT 02

Plant Disease Detection

Fine-tune ResNet-50 on 87k plant disease images across 38 categories. Deploy as a FastAPI endpoint that accepts image uploads.

ResNet-50Transfer LearningFastAPIONNX

PROJECT 03

Custom Object Detector

Label your own dataset (any object: vehicles, products, defects), train YOLOv5, and run real-time inference on a video stream. Fully deployable demo.

YOLOv5RoboflowOpenCVmAP

PROJECT 04 — CAPSTONE

End-to-End CV System

Pick your domain: medical imaging, retail, agriculture, or satellite imagery. Build a full pipeline from data collection → model training → Grad-CAM interpretability → FastAPI serving.

PyTorchU-Net or YOLOFastAPIGrad-CAMONNX

💻

Pre-Configured Colab Notebooks Included

Every session comes with a ready-to-run GPU-enabled Colab notebook. No CUDA setup headaches. Open, run, and follow along from minute one. All notebooks stay yours permanently.

// Your Credential

Silver Certificate Awarded

🥈

Newton JEE Silver Badge

ML Practitioner — Deep Learning & Computer Vision

Appears on your LinkedIn profile

Silver Badge — The Deep Learning Signal

The Silver badge in Deep Learning & CV tells recruiters something specific: you can build, train, and deploy vision models end-to-end with PyTorch. It's a verifiable credential that sits alongside your ML Mastery Silver badge as a two-badge cluster that opens CV/DL interview doors.

1

Complete all 6 weeks and 4 projects

2

Present capstone CV system to mentor

3

Mentor approves and signs off on the project

4

LinkedIn credential link issued within 48hrs

5

One-click publish to LinkedIn Certifications

// Your Mentor

Meet Your Instructor

DM

Divya Menon

Computer Vision Researcher · Ex-NVIDIA India & Ola Electric

6 years in CV research and production — building lane detection systems at Ola Electric's autonomous division and optimising inference pipelines at NVIDIA India. Divya holds an M.Tech from IIT Madras in Signal Processing and has co-authored 4 papers on efficient neural architectures. She designed this course specifically for students who want to go from "I understand ML" to "I can build real vision systems" — with zero padding and maximum signal.

PyTorchObject DetectionAutonomous VehiclesMedical ImagingIIT Madras M.Tech

// Upcoming Batches

Pick Your Batch

Batch #11

Apr 5, 2025

Sat & Sun · 10:00–12:00 PM IST

5 seats left

Batch #12

Apr 26, 2025

Sat & Sun · 2:00–4:00 PM IST

13 seats open

Batch #13

May 17, 2025

Sat & Sun · 10:00–12:00 PM IST

15 seats open

// Ready to Start?

Enrol in This Course

^₹7,999 ₹14,000

Save ₹6,001 · 43% OFF

or ₹667/month · 12-month no-cost EMI

🔒 Secured by Razorpay · 100% refund after 2 sessions if unsatisfied

Everything included

✓ 18 live sessions · 36 hrs total

✓ Lifetime recording access

✓ 4 GPU-accelerated projects

✓ Pre-configured Colab notebooks

✓ LinkedIn-verified Silver badge

✓ Architecture cheat-sheets

✓ Mentor office hours (1hr/week)

✓ Resume & LinkedIn review

✓ Placement referral support

// Alumni Feedback

What Students Say

★★★★★

The YOLOv5 session was insane. We labelled our own data using Roboflow and had a working object detector running on a webcam by end of session. I showed the demo in my interview at a robotics startup and got the offer on the spot.

AP

Aryan Patel

B.Tech ECE → CV Engineer · Ati Motors

★★★★★

Divya is the best instructor I've had — in any medium. She explains skip connections in ResNet using a metaphor that made me understand in 30 seconds what three YouTube videos couldn't explain in three hours. The Grad-CAM session changed how I think about model debugging.

SR

Sneha Reddy

M.Tech CSE → Research Intern · IISc

★★★★★

I was worried about GPU access but the pre-configured Colab notebooks removed every barrier. From session one I was training on GPUs. The plant disease project alone took my portfolio from zero to something I'm actually proud to show.

KG

Karthik Gowda

B.Tech Agri Engg → CV Engineer · AgNext

★★★★★

The medical imaging capstone was genuinely hard — and I mean that as a compliment. Building a chest X-ray segmentation model and presenting it to Divya for review pushed me to a level I wouldn't have reached alone. It's now my top portfolio piece.

TN

Tanvi Nambiar

MBBS + B.Tech → Medical AI Researcher

// What's Next

Students Also Take

Deep Learning &
Computer Vision

What's Included

Simple CNN — Image Classification

ResNet-50 with Transfer Learning

YOLOv5 — Real-Time Object Detection

U-Net — Image Segmentation

Computer Vision Engineer

Deep Learning Engineer

AI in Healthcare / MedTech

Research Engineer / Intern

// This course is for

CIFAR-10 CNN Classifier

Plant Disease Detection

Custom Object Detector

End-to-End CV System

Newton JEE Silver Badge

Silver Badge — The Deep Learning Signal

What's Included

Deep Learning &Computer Vision

What's Included

Simple CNN — Image Classification

ResNet-50 with Transfer Learning

YOLOv5 — Real-Time Object Detection

U-Net — Image Segmentation

Computer Vision Engineer

Deep Learning Engineer

AI in Healthcare / MedTech

Research Engineer / Intern

// This course is for

CIFAR-10 CNN Classifier

Plant Disease Detection

Custom Object Detector

End-to-End CV System

Newton JEE Silver Badge

Silver Badge — The Deep Learning Signal

NLP & Text Mining

Generative AI & LLMs

Full Stack AI Engineer

What's Included

Deep Learning &
Computer Vision