-
Notifications
You must be signed in to change notification settings - Fork 703
test: add large size image transfer over tcp #4483
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -177,6 +177,36 @@ class VLLMConfig(EngineConfig): | |
| ) | ||
| ], | ||
| ), | ||
| "multimodal_agg_qwen_epd_30mb_img": VLLMConfig( | ||
| name="multimodal_agg_qwen_epd_30mb_img", | ||
| directory=vllm_dir, | ||
| script_name="agg_multimodal_epd.sh", | ||
| marks=[pytest.mark.gpu_2], | ||
| model="Qwen/Qwen2.5-VL-7B-Instruct", | ||
| delayed_start=0, | ||
| script_args=[ | ||
| "--model", | ||
| "Qwen/Qwen2.5-VL-7B-Instruct", | ||
| "--request-plane", | ||
| "tcp", | ||
| ], | ||
| timeout=360, | ||
| request_payloads=[ | ||
| chat_payload( | ||
| [ | ||
| {"type": "text", "text": "What is in this image?"}, | ||
| { | ||
| "type": "image_url", | ||
| "image_url": { | ||
| "url": "https://floatingworld.com/wp-content/uploads/2023/02/Sample-jpg-image-30mb-16.jpg" | ||
| }, | ||
| }, | ||
| ], | ||
| repeat_count=1, | ||
| expected_response=["bus"], | ||
| ) | ||
| ], | ||
| ), | ||
|
Comment on lines
+180
to
+209
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 🧩 Analysis chainVerify the expected response matches the actual image content. The test expects a response containing "bus", but the 30MB image URL ( Please verify the image content and update the expected response accordingly. You can check the image content with: Additionally, please search the web to confirm whether the floatingworld.com sample image contains a bus: 🏁 Script executed: #!/bin/bash
# Description: Verify the image URL is accessible and examine its metadata
# Expected: HTTP 200 response and image metadata
echo "Checking image URL accessibility..."
curl -I "https://floatingworld.com/wp-content/uploads/2023/02/Sample-jpg-image-30mb-16.jpg" 2>&1 | head -n 10
echo -e "\n\nChecking reference COCO image URL..."
curl -I "http://images.cocodataset.org/test2017/000000155781.jpg" 2>&1 | head -n 10Length of output: 1218 🌐 Web query: 💡 Result: The image is a colorful nebula/space-style test picture (a bright, cloud‑like “nebula” texture used as a large sample JPG). [1][2] Sources: Update the expected response to match the actual nebula image content or replace the image with one containing a bus. The verification confirms the major issue: the image at the provided URL is a colorful nebula/space-style test texture, not a bus image. The test expects 🤖 Prompt for AI Agents |
||
| "multimodal_agg_qwen": VLLMConfig( | ||
| name="multimodal_agg_qwen", | ||
| directory=vllm_dir, | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@krishung5 - want to double check - will this be decoded in the frontend and sent over nixl or over the request plane?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In EPD, the image will be loaded in the encode worker: https://github.com/ai-dynamo/dynamo/blob/main/components/src/dynamo/vllm/multimodal_handlers/encode_worker_handler.py#L50
it's not decoded in the frontend. I think this PR is going to add that: #3988