[Figure: Tiled Convolution with Halo. An input tile with its halo is convolved (pre-crop); only the green region is kept. Legend: Core Tile (kept), Halo (discarded).]
[Figure: Parallel Tiled Convolution. Tiles are processed in parallel. Legend: Core Tile (kept), Halo (discarded).]
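
The halo scheme shown in the figures above can be sketched in a few lines. This is a minimal, dependency-free illustration under stated assumptions, not the author's implementation: the names `conv2d_same` and `conv2d_tiled`, the `tile` parameter, zero padding at the image border, and the use of cross-correlation (no kernel flip) are all choices made for the example.

```python
def conv2d_same(image, kernel):
    """Zero-padded 'same' 2D cross-correlation on nested lists."""
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0
            for i in range(kh):
                for j in range(kw):
                    yy, xx = y + i - ph, x + j - pw
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += image[yy][xx] * kernel[i][j]
            out[y][x] = acc
    return out


def conv2d_tiled(image, kernel, tile=4):
    """Convolve tile by tile; each tile carries a halo that is cropped away."""
    h, w = len(image), len(image[0])
    halo_y, halo_x = len(kernel) // 2, len(kernel[0]) // 2
    out = [[0] * w for _ in range(h)]
    for y0 in range(0, h, tile):
        for x0 in range(0, w, tile):
            # Core tile bounds (the region whose outputs are kept).
            y1, x1 = min(y0 + tile, h), min(x0 + tile, w)
            # Grow the core by the halo, clamped to the image bounds.
            ey0, ex0 = max(y0 - halo_y, 0), max(x0 - halo_x, 0)
            ey1, ex1 = min(y1 + halo_y, h), min(x1 + halo_x, w)
            padded = [row[ex0:ex1] for row in image[ey0:ey1]]
            conv = conv2d_same(padded, kernel)  # pre-crop result
            # Keep only the core outputs; halo outputs are discarded.
            for y in range(y0, y1):
                for x in range(x0, x1):
                    out[y][x] = conv[y - ey0][x - ex0]
    return out


# Small check on hypothetical data: tiling must not change the result.
image = [[(3 * y + 5 * x) % 7 for x in range(9)] for y in range(7)]
kernel = [[1, 2, 1], [0, 1, 0], [-1, 0, 2]]
assert conv2d_tiled(image, kernel, tile=4) == conv2d_same(image, kernel)
```

Each iteration of the two tile loops reads only its own padded slice and writes only its own core region, so in principle the tiles could be handed to separate workers. The halo width here is half the kernel size, which is what makes the cropped core match the untiled result.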
[Figure: Multi-Channel Convolution (Conv4D). The R, G, and B channels are convolved (⊛) with a 3×3×3 kernel; the panels after the arrow are labeled Red Channel, Green Channel, and Blue Channel.]
[Figure: Channel-Parallel + Tiled Conv4D. Input Chunks (1×1×H×W tiles) ⊛ Per-Chunk Kernels (channel slice applied to each spatial tile) → Partial Sums → Reduction (Σ channel partials). Legend: Parallel Chunks, Halo Region, Core Output.]
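
One way to read the channel-parallel figure is that every (channel, spatial tile) pair becomes an independent chunk, each chunk produces a partial sum for its core region, and a final reduction adds the partials across channels. The sketch below illustrates that reading in plain Python. It assumes a single output channel, square odd-sized kernels, and zero padding at the border; `conv2d_same`, `conv_reference`, and `conv_chunked` are hypothetical names introduced for the example.

```python
def conv2d_same(image, kernel):
    """Zero-padded 'same' 2D cross-correlation on nested lists."""
    h, w = len(image), len(image[0])
    k, p = len(kernel), len(kernel) // 2
    return [[sum(image[y + i - p][x + j - p] * kernel[i][j]
                 for i in range(k) for j in range(k)
                 if 0 <= y + i - p < h and 0 <= x + j - p < w)
             for x in range(w)] for y in range(h)]


def conv_reference(channels, kernels):
    """Untiled baseline: sum of per-channel convolutions."""
    h, w = len(channels[0]), len(channels[0][0])
    out = [[0] * w for _ in range(h)]
    for img, ker in zip(channels, kernels):
        part = conv2d_same(img, ker)
        for y in range(h):
            for x in range(w):
                out[y][x] += part[y][x]
    return out


def conv_chunked(channels, kernels, tile=4):
    """Split work into (channel, tile) chunks, then reduce the partial sums."""
    h, w = len(channels[0]), len(channels[0][0])
    halo = len(kernels[0]) // 2

    def run_chunk(c, y0, x0):
        # Core bounds, then the halo-extended bounds clamped to the image.
        y1, x1 = min(y0 + tile, h), min(x0 + tile, w)
        ey0, ex0 = max(y0 - halo, 0), max(x0 - halo, 0)
        ey1, ex1 = min(y1 + halo, h), min(x1 + halo, w)
        padded = [row[ex0:ex1] for row in channels[c][ey0:ey1]]
        conv = conv2d_same(padded, kernels[c])  # channel slice on one tile
        # Crop away the halo and return only the core partial sum.
        core = [r[x0 - ex0:x1 - ex0] for r in conv[y0 - ey0:y1 - ey0]]
        return y0, x0, core

    chunks = [(c, y0, x0) for c in range(len(channels))
              for y0 in range(0, h, tile) for x0 in range(0, w, tile)]
    # Chunks are independent; a worker pool could replace this comprehension.
    partials = [run_chunk(*chunk) for chunk in chunks]

    # Reduction: accumulate channel partials into the output.
    out = [[0] * w for _ in range(h)]
    for y0, x0, core in partials:
        for dy, row in enumerate(core):
            for dx, value in enumerate(row):
                out[y0 + dy][x0 + dx] += value
    return out


# Small check on hypothetical data: three channels, one 3x3 slice per channel.
channels = [[[(c + 2 * y + 3 * x) % 5 for x in range(6)] for y in range(5)]
            for c in range(3)]
kernels = [[[1, 0, -1], [2, c, 0], [0, -1, 1]] for c in range(3)]
assert conv_chunked(channels, kernels, tile=2) == conv_reference(channels, kernels)
```

The reduction is the only step that touches shared output, so it is the natural synchronization point if the chunks are run concurrently.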