Open
Description
Describe the bug
Image recognition always returns the wrong coordinates.
Configuration (please complete the following information):
- Using Claude Code (I also found that Claude Desktop cannot find the available devices)
- OS: macOS 15.5
- Device used: Android 16.0 arm64, API 36.0, in Android Studio
To Reproduce
Steps to reproduce the behavior:
- Open the device
- Write any prompt. Mine was "go to the settings, and remove the animation in the accessibility setting"
- Image recognition is always tried first to capture the screen information, and it always returns the wrong coordinates. After two or three failed attempts, it switches to the GUI hierarchy-based method and finds the correct coordinates.
Expected behavior
Image recognition should return the correct coordinates.
I suspect this may be related to my environment.
My conversation in Claude Code:
> click 'storage' in the screen
⏺ I can see the "Storage" option at the bottom of the screen. Let me click
on it for you.
⏺ mobile - mobile_click_on_screen_at_coordinates (MCP)(x: 353, y: 1550)
⎿ Clicked on screen at coordinates: 353, 1550
⏺ I've clicked on the "Storage" option at the bottom of the settings
screen. This should take you to the storage settings where you can manage
device storage, see storage usage, and configure storage-related
options.
> use hierarchy method to capture the screen and click the 'storeage'
buttion
⏺ I'll use the hierarchy method to capture the screen elements and then
click on the Storage button.
⏺ mobile - mobile_list_elements_on_screen (MCP)(noParams: {})
⎿ Found these elements on screen: [{"type":"android.widget.TextView","tex
t":"Search Settings","label":"","identifier":"com.android.settings:id/s
earch_bar_title","coordinates":{"x":189,"y":143,"width":389,"height":71
… +44 lines (ctrl+r to expand)
⏺ Perfect! I can see the Storage element in the hierarchy. Now I'll click
on the Storage button using its coordinates.
⏺ mobile - mobile_click_on_screen_at_coordinates (MCP)(x: 210, y: 1690)
⎿ Clicked on screen at coordinates: 210, 1690
⏺ I've successfully clicked on the "Storage" button using the hierarchy
method. The Storage element shows "35% used - 5.20 GB free" and I clicked
on its title coordinates at (210, 1690). This should take you to the
storage settings page.
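To check whether an image-based click actually lands on the intended element, the point can be compared against the rectangle reported by `mobile_list_elements_on_screen`. Below is a minimal sketch, not the tool's own logic. It assumes `x` and `y` in the `coordinates` object are the top-left corner of the element, which the log does not confirm. The `Rect` class is hypothetical, and the only rectangle used is the "Search Settings" one, because the Storage rectangle is cut off in the output above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Element rectangle in the shape printed by mobile_list_elements_on_screen.

    Assumption: (x, y) is the top-left corner, in device pixels.
    """
    x: int
    y: int
    width: int
    height: int

    def contains(self, px: int, py: int) -> bool:
        # Left/top edges inclusive, right/bottom edges exclusive.
        return (self.x <= px < self.x + self.width
                and self.y <= py < self.y + self.height)

    def center(self) -> tuple[int, int]:
        # Integer midpoint, a reasonable tap target for the element.
        return (self.x + self.width // 2, self.y + self.height // 2)


# The only rectangle fully visible in the log ("Search Settings").
search_bar = Rect(x=189, y=143, width=389, height=71)

print(search_bar.center())             # (383, 178)
print(search_bar.contains(353, 1550))  # False: the image-based tap is far below it
```

Running the same check with the full Storage rectangle from the expanded output would show whether (353, 1550) misses the row entirely or just hits a different part of it.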
I tried two screen sizes (1080x2400 and 1080x1920); neither works with the image-based approach.
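One possible cause worth ruling out is a size mismatch between the screenshot the model sees and the real screen. Nothing in this report confirms that the screenshot is resized, so this is only a guess. If it is, coordinates read off the image would need to be scaled back to device pixels before tapping. A small sketch of that mapping follows; the function name `to_device_pixels` and the 540x1200 image size are made up for illustration.

```python
def to_device_pixels(x: int, y: int,
                     image_size: tuple[int, int],
                     device_size: tuple[int, int]) -> tuple[int, int]:
    """Map a point from screenshot space to device pixel space.

    image_size and device_size are (width, height) pairs.
    """
    img_w, img_h = image_size
    dev_w, dev_h = device_size
    # Scale each axis independently in case the aspect ratio differs.
    return (round(x * dev_w / img_w), round(y * dev_h / img_h))


# Hypothetical case: a 1080x2400 screen whose screenshot was halved to 540x1200.
print(to_device_pixels(100, 800, (540, 1200), (1080, 2400)))  # (200, 1600)
```

Comparing the size of the captured screenshot with the device resolution would show quickly whether this applies here.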