LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding
Paper • 2410.03355 • Published
How to use jadohu/llamagen_drafter with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("jadohu/llamagen_drafter")
model = AutoModelForCausalLM.from_pretrained("jadohu/llamagen_drafter")

This repository contains the model described in LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding.
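
For readers unfamiliar with the role a drafter model plays, the following is a minimal sketch of the standard speculative-decoding acceptance step, in which a token proposed by a draft model is kept with probability min(1, p/q). It is not the relaxed rule from the LANTERN paper, which this card does not describe, and it does not use this repository's weights. The function names `accept_draft_token` and `residual_distribution` and the toy probability lists are assumptions made purely for illustration.

```python
import random


def accept_draft_token(p_target, q_draft, token, rng=random):
    """Standard (non-relaxed) acceptance test for one drafted token.

    p_target and q_draft are probability lists over the same vocabulary.
    The token is assumed to have been sampled from q_draft, so q_draft[token] > 0.
    """
    ratio = p_target[token] / q_draft[token]
    # Accept with probability min(1, p/q).
    return rng.random() < min(1.0, ratio)


def residual_distribution(p_target, q_draft):
    """Distribution to resample from after a rejection: normalized max(p - q, 0)."""
    residual = [max(p - q, 0.0) for p, q in zip(p_target, q_draft)]
    total = sum(residual)
    if total == 0.0:
        # p and q are identical, so fall back to the target distribution.
        return list(p_target)
    return [r / total for r in residual]


# Toy two-token vocabulary (hypothetical numbers, not model outputs).
p = [0.5, 0.5]    # target model probabilities
q = [0.25, 0.75]  # draft model probabilities
resample_from = residual_distribution(p, q)
```

In this toy case token 0 is always accepted because the target model assigns it more probability than the drafter does, while token 1 is accepted only part of the time. After a rejection the replacement token is drawn from `resample_from`.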