Switch Transformer: Scaling Neural Networks with Sparsity

research
advanced
Author

Krishnatheja Vanka

Published

July 15, 2025