-
Notifications
You must be signed in to change notification settings - Fork 1.9k
[TRTLLM-6780][fix] Add multimodal data to dummy requests during memory profiling #7539
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
24 commits
Select commit
Hold shift + click to select a range
8ad3a36
add multimodal dummy request to profiling - drafting stage
johncalesp 7f8e169
guard against multimodal_input being None
johncalesp d59fb38
rebase and change logic
johncalesp 22b8d37
check attribute original_arch is present
johncalesp 1101862
rebase from main and add fix for only-text request for qwen2.5 and ad…
johncalesp 5449a63
fix max_seq_len is less than max_num_tokens during profiling
johncalesp f20c8fb
address yechang comments - p1
johncalesp 0a1fd4a
fix rebase to 80dd8fe1973323eb8f01060788c0d5485a0ce0f8
johncalesp 39e78ff
check for new function name and fix TextPrompt attribute for qwen
johncalesp 38bb1d2
address comments - change design to use default_multimodal_input_loader
johncalesp 62679a3
add additional arguments for mm data
johncalesp 8b80b32
address comments: change unit test, and add more asserts
johncalesp 1512044
fix rebase to commit b1738c3f189560a857ea1adcfdfb8e68c571c81d
johncalesp 3658a44
address code rabbit comments && remove mrope_config.mrope_position_id…
johncalesp b7f0f7e
integrate latest feedback
johncalesp 52afab2
add unit test to test-db
johncalesp 187b80f
fix unit test by adding chunked prefill parameter
johncalesp 0d470ec
fix test L40S-PyTorch-2.test_e2e.test_ptp_quickstart_multimodal[NVILA…
johncalesp bcc9004
fix tests A10-PyTorch-1.test_e2e.test_openai_chat_multimodal_example …
johncalesp 345954c
include flag to check chunked prefill flag during profiling
johncalesp a546ae9
change logic to get initial input_seq_len
johncalesp f011861
fix latest rebase from 7291cdc42287297bf72015e7201fede7985edeae
johncalesp 5410711
Fix rebase to 1cdb0b6
johncalesp 9919b4c
fix format file _util.py
johncalesp File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.