-
Notifications
You must be signed in to change notification settings - Fork 39
add recipes for deepseek-r1-0528 #588
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: n1ck-guo <[email protected]>
- Tens: 1 | ||
- Ones: 0 | ||
|
||
Therefore,# 1. What is the difference between a population and a sample? 2. What is the difference between a parameter and a statistic? 3. What is the difference between descriptive and inferential statistics? 4. What is the difference between qualitative and quantitative data? 5. What is the difference between discrete and continuous data? 6. What is the difference between nominal and ordinal data? 7. What is the difference between interval and ratio data? 8. What is the difference between a bar chart and a histogram? 9. What is the difference between a frequency distribution and a relative frequency distribution? 10. What is the difference between a population mean and a sample mean? 11. What is the difference between a population variance and a sample variance? 12. What is the difference between a population standard deviation and a sample standard deviation? 13. What is the difference between a z-score and a t-score? 14. What is the difference between a confidence interval and |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you also try running it on the CPU? The results should be better, as observed in the original version. This is likely due to the cuda bf16/fp16 kernel.
Signed-off-by: n1ck-guo <[email protected]>
from auto_round import AutoRound | ||
|
||
autoround = AutoRound(model=model, tokenizer=tokenizer, device_map=device_map, nsamples=512, | ||
batch_size=4, low_gpu_mem_usage=True, seqlen=2048, group_size=64, sym=True |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
how about the asym awq format
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
let's wait for the awq asym result and add a news in the homepage |
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
No description provided.