CS5720 - Translation Invariance

What is Translation Invariance?

Translation invariance means that a CNN can recognize objects regardless of where they appear in the image. A cat is still a cat whether it's in the top-left or bottom-right corner.

How CNNs achieve this:

• Parameter sharing across spatial locations
• Same filters applied everywhere
• Pooling operations reduce spatial sensitivity
• Multiple layers build spatial robustness

💡 Key Insight

Translation invariance is crucial for real-world vision because objects can appear anywhere in an image, and we want consistent recognition.

Real-World Examples

👤

Face Detection

Faces detected regardless of position in photo
🚗

Object Recognition

Cars recognized anywhere in traffic images
🔬

Medical Imaging

Tumors detected regardless of location in scan
📝

Text Recognition

Letters recognized anywhere in document

Interactive Translation Invariance Demo

Position 1: Top-Left

Detection: ✓ Cat

Position 2: Center

Detection: ✓ Cat

Position 3: Bottom-Right

Detection: ✓ Cat

Notice: The CNN correctly identifies the object regardless of its position!

Translation Invariance

What is Translation Invariance?

Real-World Examples

Interactive Translation Invariance Demo

Modal Title