Liger-GLA-8B

We introduce Liger-GSA-8B, a gated linear recurrent model linearized from Transformer-based LLM.

Our Liger framework is compatible with various linear recurrent models with gating structures:

Model Name	Base Model	Linear Structure	HF Link
Liger-GLA-8B	Llama-3-8B	GLA	🤗 link
Liger-GSA-8B	Llama-3-8B	GSA	🤗 link

Citation

If you find this repo useful, please cite and star our work:

@article{lan2025liger,
  title={Liger: Linearizing Large Language Models to Gated Recurrent Structures},
  author={Lan, Disen and Sun, Weigao and Hu, Jiaxi and Du, Jusen and Cheng, Yu},
  journal={arXiv preprint arXiv:2503.01496},
  year={2025}
}

Downloads last month: 3

Safetensors

Model size

8B params

Tensor type

BF16

Collection including linear-moe-hub/Liger-GSA-8B

Liger

Collection

6 items • Updated Mar 20, 2025 • 3

Papers for linear-moe-hub/Liger-GSA-8B