Perplexity
Cross-source consensus on Perplexity from 1 source and 4 claims.
Highlighted claims
- The reported mixture perplexity was much lower than any individual exit perplexity in a 24-layer Quadrivium analysis. — N-vium: Mixture-of-Exits Transformer for Accelerated Exact Generation
- Depth-scaling experiments showed N-vium perplexity equal to or better than matched dense baselines. — N-vium: Mixture-of-Exits Transformer for Accelerated Exact Generation
- After supervised fine-tuning, N-vium improved perplexity over dense baselines across tested depths. — N-vium: Mixture-of-Exits Transformer for Accelerated Exact Generation
- IsoFLOP and isoSpeed comparisons favored Quadrivium over dense baselines trained for more steps. — N-vium: Mixture-of-Exits Transformer for Accelerated Exact Generation
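
The first claim above, that a mixture can reach lower perplexity than every individual exit, can be illustrated with a small sketch. It assumes the mixture is a per-token convex combination of the exits' probabilities for the true token. The source summary does not state how N-vium actually weights its exits, so the `weights` router, the function names, and the toy probabilities below are all hypothetical.

```python
import math


def perplexity(true_token_probs):
    """Perplexity = exp(mean negative log-likelihood of the observed tokens)."""
    nll = -sum(math.log(p) for p in true_token_probs) / len(true_token_probs)
    return math.exp(nll)


def mixture_probs(exit_probs, weights):
    """Mix per-exit probabilities token by token.

    exit_probs[k][t]: probability exit k gives the true token at position t.
    weights[t][k]:    hypothetical mixing weight for exit k at position t
                      (each row is non-negative and sums to 1).
    """
    n_tokens = len(weights)
    return [
        sum(w * probs[t] for w, probs in zip(weights[t], exit_probs))
        for t in range(n_tokens)
    ]


# Toy data (made up, not from the paper): two exits that are confident
# on different tokens.
exit_a = [0.9, 0.1, 0.9, 0.1]
exit_b = [0.1, 0.9, 0.1, 0.9]
# Hypothetical router that leans toward the better exit at each position.
weights = [[0.8, 0.2], [0.2, 0.8], [0.8, 0.2], [0.2, 0.8]]

ppl_a = perplexity(exit_a)
ppl_b = perplexity(exit_b)
ppl_mix = perplexity(mixture_probs([exit_a, exit_b], weights))

print(f"exit A: {ppl_a:.3f}  exit B: {ppl_b:.3f}  mixture: {ppl_mix:.3f}")
```

In this toy setup each exit is penalized heavily on the tokens it gets wrong, while the mixture never assigns a very low probability to the true token, so its perplexity falls below both. Whether the same mechanism explains the reported 24-layer Quadrivium result is not something this summary confirms.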