hzy00
/

ALiBi-151M

Text Generation

text-generation-inference

Model card Files Files and versions

References

For more information, please refer to our paper and GitHub repository.

Paper: Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

GitHub: BiPE

Authors: Zhenyu He*, Guhao Feng*, Shengjie Luo*, Kai Yang, Di He, Jingjing Xu, Zhi Zhang, Hongxia Yang, Liwei Wang

Downloads last month: 7

Paper for hzy00/ALiBi-151M

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

Paper • 2401.16421 • Published Jan 29, 2024