Geometric Interpretation of Transformers for NLP - arxiv.org

Clear