performance gain

#2
by almanah - opened

Hi mudler, can you show some comparision of performance you gained due to speculative decoding on this model?

Sign up or log in to comment