Tensor Shapes

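Follow a tensor through one transformer layer and see how its dimensions change at each stage. The feed-forward network (FFN) projects every token up to a wider hidden dimension (commonly 4x the model dimension) and back down, which is why it dominates FLOPs at short to moderate sequence lengths.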

Model

Tensor Flow Through One Layer
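
A minimal sketch of the shape trace, assuming a standard layer (multi-head self-attention followed by a two-matrix FFN). The function name `layer_shapes`, its parameters, and the example sizes are illustrative assumptions, not values taken from this page.

```python
def layer_shapes(batch, seq_len, d_model, n_heads, d_ff):
    """Return (stage, shape) pairs for one forward pass through a layer.

    Assumes standard multi-head self-attention plus a two-matrix FFN;
    layer norms and residual additions leave the shape unchanged and are omitted.
    """
    if d_model % n_heads != 0:
        raise ValueError("d_model must be divisible by n_heads")
    d_head = d_model // n_heads
    return [
        ("input", (batch, seq_len, d_model)),
        ("Q, K, V projections (each)", (batch, seq_len, d_model)),
        ("split into heads", (batch, n_heads, seq_len, d_head)),
        ("attention scores Q @ K^T", (batch, n_heads, seq_len, seq_len)),
        ("scores @ V", (batch, n_heads, seq_len, d_head)),
        ("merge heads + output projection", (batch, seq_len, d_model)),
        ("FFN up-projection", (batch, seq_len, d_ff)),
        ("FFN down-projection", (batch, seq_len, d_model)),
    ]


if __name__ == "__main__":
    # Illustrative sizes only (hypothetical configuration).
    for stage, shape in layer_shapes(batch=1, seq_len=512, d_model=768,
                                     n_heads=12, d_ff=3072):
        print(f"{stage:34s} {shape}")
```

The only stages whose shapes depart from `(batch, seq_len, d_model)` are the per-head attention tensors and the FFN up-projection, which widens the last dimension to `d_ff`.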

FLOPs Summary
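
A rough sketch of how such a summary can be computed, under stated assumptions: forward pass only, one multiply-add counted as 2 FLOPs, and only matrix multiplications counted (softmax, layer norm, biases, and residuals are ignored). The names `layer_flops` and `ffn_share` and the example sizes are hypothetical.

```python
def layer_flops(batch, seq_len, d_model, d_ff):
    """Approximate forward matmul FLOPs for one layer, by component.

    A (m x k) @ (k x n) matmul is counted as 2 * m * k * n FLOPs.
    The head count does not appear: n_heads * d_head == d_model, so the
    per-head attention matmuls sum to the same total for any head count.
    """
    tokens = batch * seq_len
    proj = 2 * tokens * d_model * d_model                    # one d_model x d_model projection
    attn_matmul = 2 * batch * seq_len * seq_len * d_model    # Q @ K^T, or scores @ V
    ffn_half = 2 * tokens * d_model * d_ff                   # up- or down-projection
    return {
        "qkv_projections": 3 * proj,
        "attention_scores": attn_matmul,
        "attention_values": attn_matmul,
        "output_projection": proj,
        "ffn_up": ffn_half,
        "ffn_down": ffn_half,
    }


def ffn_share(batch, seq_len, d_model, d_ff):
    """Fraction of the layer's matmul FLOPs spent in the FFN."""
    flops = layer_flops(batch, seq_len, d_model, d_ff)
    return (flops["ffn_up"] + flops["ffn_down"]) / sum(flops.values())


if __name__ == "__main__":
    # Illustrative sizes only (hypothetical configuration).
    for name, value in layer_flops(1, 512, 768, 3072).items():
        print(f"{name:18s} {value / 1e9:6.2f} GFLOPs")
    print(f"FFN share: {ffn_share(1, 512, 768, 3072):.0%}")
```

With `d_ff = 4 * d_model`, the FFN costs 16 units of `tokens * d_model^2`, the four attention projections cost 8, and the two attention matmuls cost `4 * seq_len / d_model`. Under these counting assumptions the FFN therefore takes more than half of the layer's FLOPs while `seq_len` stays below about `2 * d_model`, and its share shrinks as the sequence grows.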
