Gulumseme 2 «Tested»
Input: 32-frame grayscale sequence (112×112)
→ 3D-CNN (3 layers, 64–128–256 filters, kernel 3×3×3)
→ Temporal Transformer Encoder (4 heads, 2 layers)
→ Two heads:
  - Intensity: MSE loss (regression)
  - Authenticity: BCE loss (binary)
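
The pipeline above could be sketched in PyTorch roughly as follows. This is only an illustration under stated assumptions, not the actual implementation: the names SmileNet, multitask_loss and auth_weight are hypothetical, and the note does not specify padding, normalization, activation, pooling, positional encoding, feed-forward width, or how the two losses are combined. The sketch assumes padding 1, BatchNorm + ReLU, spatial-only max pooling (so all 32 frames reach the Transformer), a learned positional embedding, mean pooling over time, and a plain weighted sum of the losses. The BCE term is computed on logits (BCE-with-logits) for numerical stability.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SmileNet(nn.Module):  # hypothetical name, not from the note
    def __init__(self, n_frames=32, d_model=256, n_heads=4, n_layers=2):
        super().__init__()
        chans = [1, 64, 128, d_model]  # grayscale in, then 64-128-256 filters
        blocks = []
        for c_in, c_out in zip(chans[:-1], chans[1:]):
            blocks += [
                nn.Conv3d(c_in, c_out, kernel_size=3, padding=1),  # 3x3x3 kernel
                nn.BatchNorm3d(c_out),   # assumption: not stated in the note
                nn.ReLU(inplace=True),   # assumption: not stated in the note
                # Assumption: pool H and W only, keeping the time axis intact.
                nn.MaxPool3d(kernel_size=(1, 2, 2)),
            ]
        self.cnn = nn.Sequential(*blocks)
        # Assumption: learned positional embedding, one vector per frame.
        self.pos = nn.Parameter(torch.zeros(1, n_frames, d_model))
        layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=n_heads,
            dim_feedforward=512,  # assumption: width not given in the note
            batch_first=True,
        )
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.intensity_head = nn.Linear(d_model, 1)     # regression output
        self.authenticity_head = nn.Linear(d_model, 1)  # binary logit

    def forward(self, x):
        # x: (batch, 1, frames, height, width)
        feats = self.cnn(x)                 # (B, 256, T, H/8, W/8)
        tokens = feats.mean(dim=(3, 4))     # average over space -> (B, 256, T)
        tokens = tokens.transpose(1, 2)     # (B, T, 256), one token per frame
        encoded = self.encoder(tokens + self.pos)
        clip = encoded.mean(dim=1)          # assumption: mean pool over time
        intensity = self.intensity_head(clip).squeeze(-1)
        auth_logit = self.authenticity_head(clip).squeeze(-1)
        return intensity, auth_logit


def multitask_loss(intensity_pred, auth_logit, intensity_true, auth_true,
                   auth_weight=1.0):
    """MSE on intensity plus weighted BCE (on logits) on authenticity."""
    mse = F.mse_loss(intensity_pred, intensity_true)
    bce = F.binary_cross_entropy_with_logits(auth_logit, auth_true)
    return mse + auth_weight * bce
```

For the input size given above, the model would be called on a tensor of shape (batch, 1, 32, 112, 112); it returns one intensity value and one authenticity logit per clip. Applying torch.sigmoid to the logit gives a probability.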