I did some 4096-token tests on base Llama2-70B using wikitext-train. I don't have a 5.0bpw model handy to test, and I don't have the VRAM to test higher than that; I'd have to fire up a RunPod instance, and that would take hours to set up.
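For anyone reproducing these numbers: the perplexity being compared here is just the exponential of the mean per-token negative log-likelihood over the evaluation windows. A minimal, model-agnostic sketch (the `nlls` input is assumed to be the per-token cross-entropy losses your inference stack reports over a 4096-token context):

```python
import math

def perplexity(nlls):
    """Perplexity = exp(mean negative log-likelihood per token).

    nlls: per-token negative log-likelihoods in nats, e.g. the
    cross-entropy losses collected over a 4096-token window.
    """
    return math.exp(sum(nlls) / len(nlls))

# Sanity check: a model that guesses uniformly over a 4-symbol
# vocabulary has NLL = ln(4) per token, so its perplexity is ~4.
uniform_nll = math.log(4)
print(perplexity([uniform_nll] * 4096))
```

Note that quant comparisons are only meaningful when the context length, dataset, and tokenizer are held fixed, which is why the request above pins everything to 4096 tokens on the same corpus.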
As in the title, I'd like to see Llama 70B perplexity tested at 4096 tokens for every quant size. 70B-chat would also be fine. I can't do it on my end because of VRAM limits.