Application MFU

Cross-source consensus on Application MFU from 1 sources and 4 claims.

1 sources · 4 claims

Risks & contraindications

Framework-level MFU estimation is fragmented and brittle because FLOPs formulas must be updated for new training modalities. — Instant GPU Efficiency Visibility at Fleet Scale
A DeepSeek-style MoE job reported application MFU far above OFU because the framework FLOPs counter assumed experts operated at the full hidden dimension. — Instant GPU Efficiency Visibility at Fleet Scale
A hybrid Mamba-Transformer MoE job inflated reported FLOPs because its Megatron-LM branch lacked per-layer-type FLOPs accounting. — Instant GPU Efficiency Visibility at Fleet Scale
In production validation, OFU and application MFU had moderate correlation across all 608 jobs and stronger correlation after excluding jobs affected by identified FLOPs miscalculations. — Instant GPU Efficiency Visibility at Fleet Scale