A Formal Framework for Understanding Length Generalization in Transformers (bibtex)
by Xinting Huang, Andy Yang, Satwik Bhattamishra, Yash Sarrof, Andreas Krebs, Hattie Zhou, Preetum Nakkiran, Michael Hahn
Reference:
A Formal Framework for Understanding Length Generalization in TransformersXinting Huang, Andy Yang, Satwik Bhattamishra, Yash Sarrof, Andreas Krebs, Hattie Zhou, Preetum Nakkiran, Michael HahnICLR, 2025.
Bibtex Entry:
@inproceedings{huang2024formalframeworkunderstandinglength,
      title={A Formal Framework for Understanding Length Generalization in Transformers}, 
      author={Xinting Huang and Andy Yang and Satwik Bhattamishra and Yash Sarrof and Andreas Krebs and Hattie Zhou and Preetum Nakkiran and Michael Hahn},
	month={June},
	year={2025},
      booktitle={ICLR},
      eprint={2410.02140},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2410.02140}, 
}
Powered by bibtexbrowser