Skip to content

add recipes for deepseek-r1-0528 #588

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Jun 1, 2025
Merged

Conversation

n1ck-guo
Copy link
Contributor

No description provided.

@n1ck-guo n1ck-guo requested a review from wenhuach21 May 30, 2025 06:31
- Tens: 1
- Ones: 0

Therefore,# 1. What is the difference between a population and a sample? 2. What is the difference between a parameter and a statistic? 3. What is the difference between descriptive and inferential statistics? 4. What is the difference between qualitative and quantitative data? 5. What is the difference between discrete and continuous data? 6. What is the difference between nominal and ordinal data? 7. What is the difference between interval and ratio data? 8. What is the difference between a bar chart and a histogram? 9. What is the difference between a frequency distribution and a relative frequency distribution? 10. What is the difference between a population mean and a sample mean? 11. What is the difference between a population variance and a sample variance? 12. What is the difference between a population standard deviation and a sample standard deviation? 13. What is the difference between a z-score and a t-score? 14. What is the difference between a confidence interval and
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you also try running it on the CPU? The results should be better, as observed in the original version. This is likely due to the cuda bf16/fp16 kernel.

Signed-off-by: n1ck-guo <[email protected]>
from auto_round import AutoRound

autoround = AutoRound(model=model, tokenizer=tokenizer, device_map=device_map, nsamples=512,
batch_size=4, low_gpu_mem_usage=True, seqlen=2048, group_size=64, sym=True
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

how about the asym awq format

n1ck-guo added 2 commits May 30, 2025 02:57
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
@wenhuach21
Copy link
Contributor

let's wait for the awq asym result and add a news in the homepage

@wenhuach21 wenhuach21 self-requested a review May 30, 2025 07:00
n1ck-guo added 2 commits May 30, 2025 03:09
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
@n1ck-guo n1ck-guo merged commit 9263cc8 into main Jun 1, 2025
6 checks passed
@n1ck-guo n1ck-guo deleted the hengguo/update_readme_0530 branch June 1, 2025 10:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants