[Model Runner] Prepare token count and move FA3 initialization into the graph #6170
Thanks for your contribution!
    output_cum_offsets,
    output_padding_offset,
) = pre_process(
    token_num_cpu,
The pre_process( calls in the other files all need to be updated as well.
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## develop #6170 +/- ##
==========================================
Coverage ? 66.35%
==========================================
Files ? 383
Lines ? 50519
Branches ? 7894
==========================================
Hits ? 33523
Misses ? 14543
Partials ? 2453
    {token_num_data}, paddle::DataType::INT64, input_ids.place());
auto batch_id_per_token = paddle::empty(
    {token_num_data}, paddle::DataType::INT32, input_ids.place());
auto x_remove_padding = paddle::full(
What is the reason for changing this to 2 here?
> What is the reason for changing this to 2 here?
It is just a valid placeholder value for now.
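For readers unfamiliar with the buffers in the snippet above, here is a minimal sketch of the kind of host-side bookkeeping a padding-free token layout needs. It is an illustration only, not the PR's implementation: `prepare_token_layout` and `seq_lens` are hypothetical names, and the assumption is that the total token count (analogous to `token_num_cpu` / `token_num_data`) is the sum of the per-request sequence lengths, with `batch_id_per_token` mapping each packed token back to its request.

```python
from itertools import accumulate


def prepare_token_layout(seq_lens):
    """Illustrative host-side bookkeeping for a padding-free token layout.

    `seq_lens` is a hypothetical list of per-request token counts.
    """
    # Total number of valid tokens; knowing this on the host lets buffers
    # be sized without reading a value back from the device.
    token_num = sum(seq_lens)

    # Cumulative sequence lengths: request i owns the packed positions
    # [cu_seqlens[i], cu_seqlens[i + 1]).
    cu_seqlens = [0] + list(accumulate(seq_lens))

    # For each packed token position, record which request it belongs to.
    batch_id_per_token = [0] * token_num
    for batch_id, (start, end) in enumerate(zip(cu_seqlens, cu_seqlens[1:])):
        for pos in range(start, end):
            batch_id_per_token[pos] = batch_id

    return token_num, cu_seqlens, batch_id_per_token


token_num, cu_seqlens, batch_id_per_token = prepare_token_layout([3, 0, 2])
```

In this sketch a request with zero tokens simply contributes an empty range, so it never appears in `batch_id_per_token`.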
EmmonsCurse left a comment
LGTM for skip-coverage~
Motivation
Modifications
Usage or Command
Accuracy Tests
Checklist
[FDConfig], [APIServer], [Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]
pre-commit before commit.
release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.