Screenshot recognition failed to capture the right coordinates #163

@leo6211

Describe the bug
Image recognition always returns the wrong coordinates.

Configuration (please complete the following information):

  • Client: Claude Code (I also found that Claude Desktop cannot find the available devices)
  • OS: macOS 15.5
  • Device used: Android 16.0 arm64 API 36.0 in Android Studio

To Reproduce
Steps to reproduce the behavior:

  1. Open the device
  2. Enter any prompt. Mine was "go to the settings, and remove the animation in the accessibility setting".
  3. Image recognition is always tried first to read the screen, and it always returns the wrong coordinates. After two or three failed attempts, it falls back to the GUI-hierarchy-based method and finds the correct coordinates.

Expected behavior
Image recognition should return the correct coordinates.

I guess this may be related to my environment.

My conversation in Claude Code:

click 'storage' in the screen

⏺ I can see the "Storage" option at the bottom of the screen. Let me click
  on it for you.

⏺ mobile - mobile_click_on_screen_at_coordinates (MCP)(x: 353, y: 1550)
  ⎿  Clicked on screen at coordinates: 353, 1550

⏺ I've clicked on the "Storage" option at the bottom of the settings
  screen. This should take you to the storage settings where you can manage
   device storage, see storage usage, and configure storage-related
  options.

> use hierarchy method to capture the screen and click the 'storeage' 
  buttion

⏺ I'll use the hierarchy method to capture the screen elements and then
  click on the Storage button.

⏺ mobile - mobile_list_elements_on_screen (MCP)(noParams: {})
  ⎿ Found these elements on screen: [{"type":"android.widget.TextView","tex
    t":"Search Settings","label":"","identifier":"com.android.settings:id/s
    earch_bar_title","coordinates":{"x":189,"y":143,"width":389,"height":71
    … +44 lines (ctrl+r to expand)




⏺ Perfect! I can see the Storage element in the hierarchy. Now I'll click
  on the Storage button using its coordinates.

⏺ mobile - mobile_click_on_screen_at_coordinates (MCP)(x: 210, y: 1690)
  ⎿  Clicked on screen at coordinates: 210, 1690

⏺ I've successfully clicked on the "Storage" button using the hierarchy
  method. The Storage element shows "35% used - 5.20 GB free" and I clicked
   on its title coordinates at (210, 1690). This should take you to the
  storage settings page.
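
For comparing the two methods, a tap point can be derived from a hierarchy element by taking the center of its bounding box. This is only a minimal sketch: it assumes the `coordinates` field holds `x`, `y`, `width` and `height` in device pixels, as the truncated output above suggests, and `element_center` is a hypothetical helper, not part of the MCP server.

```python
def element_center(element):
    """Return the (x, y) center of an element's bounding box.

    Assumes element["coordinates"] has x, y, width, height in device pixels.
    """
    box = element["coordinates"]
    return (box["x"] + box["width"] // 2, box["y"] + box["height"] // 2)


# Values copied from the "Search Settings" element in the log above.
search_bar = {
    "text": "Search Settings",
    "coordinates": {"x": 189, "y": 143, "width": 389, "height": 71},
}

print(element_center(search_bar))
```

Running the same helper on the "Storage" element and comparing the result with the image-based tap at (353, 1550) would show how far off the screenshot-derived coordinates are.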

I tried two screen sizes (1080×2400 and 1080×1920); neither works with the image-based method.
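
One possible cause, which is an assumption and not confirmed anywhere in this issue, is that the screenshot the model sees has a different pixel size than the device screen, so coordinates read off the image would need to be scaled back to device pixels. A minimal sketch of that mapping follows; `image_to_device` and the 540×1200 image size are hypothetical, chosen only for illustration.

```python
def image_to_device(x, y, image_size, device_size):
    """Map a point in screenshot-image pixels to device-screen pixels.

    image_size and device_size are (width, height) tuples.
    """
    img_w, img_h = image_size
    dev_w, dev_h = device_size
    return (round(x * dev_w / img_w), round(y * dev_h / img_h))


# Hypothetical case: a 1080x2400 screen captured as a 540x1200 image.
print(image_to_device(270, 600, image_size=(540, 1200), device_size=(1080, 2400)))
```

If the image and the device have the same size, the function returns the point unchanged, which makes it easy to check whether a size mismatch is involved at all.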
