Machine Learning (Stats)
13 JAN, 2026
Rethinking Attention: Polynomial Alternatives to Softmax in Transformers
By Hemanth Saratch