Skip to content

【PaddleNLP No.1】 add pretrain.md #10506

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 10 commits into from
May 21, 2025
Merged

【PaddleNLP No.1】 add pretrain.md #10506

merged 10 commits into from
May 21, 2025

Conversation

Echo-Nie
Copy link
Contributor

@Echo-Nie Echo-Nie commented Apr 26, 2025

PR types

Others

PR changes

Docs

Description

1. 针对小白

大体框架与 .rst 一致,从新手小白用户的角度对文档进行润色。润色了一下文档的内容,适合小白入手大模型的预训练。

2. 克隆问题

由于少部分用户可能会碰到网络问题,加入了 gitee克隆和国内镜像 (个人测试过使用百度镜像和gitee均能克隆成功) 帮助小白能更快配好环境。 gitee具有滞后性

3. 模型支持列表

模型权重支持列表按照根目录的 README.md 进行更新。

4. 其他

添加 预训练成功后的打印信息 (以Qwen为例)。

修复部分失效链接:

5. docs/zh/llm/docs/pretrain.md

添加上面文档的相对路径

#9763

Copy link

paddle-bot bot commented Apr 26, 2025

Thanks for your contribution!

Copy link

codecov bot commented Apr 26, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 48.66%. Comparing base (759ae99) to head (892abbb).
Report is 20 commits behind head on develop.

❌ Your project status has failed because the head coverage (48.66%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop   #10506      +/-   ##
===========================================
- Coverage    48.66%   48.66%   -0.01%     
===========================================
  Files          768      768              
  Lines       127103   127103              
===========================================
- Hits         61860    61859       -1     
- Misses       65243    65244       +1     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@Echo-Nie
Copy link
Contributor Author

@DrownFish19 pls review

@Echo-Nie
Copy link
Contributor Author

Echo-Nie commented Apr 27, 2025

image

@DrownFish19 该目录下只有pretrain是rst格式,是否需要删除?(同步docs/zh/llm/docs/pretrain.rst 也删除)

@luotao1 luotao1 added the HappyOpenSource 快乐开源活动issue与PR label Apr 27, 2025
@luotao1 luotao1 assigned luotao1 and DrownFish19 and unassigned lugimzzz Apr 27, 2025
@Echo-Nie Echo-Nie requested a review from DrownFish19 May 7, 2025 14:52
@Echo-Nie
Copy link
Contributor Author

@DrownFish19 pls review

@ZHUI ZHUI merged commit 7ea6253 into PaddlePaddle:develop May 21, 2025
9 of 12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
contributor HappyOpenSource 快乐开源活动issue与PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants