3spky5u-oss/HandsOn

30 stars · Last commit 2026-03-07

Give Claude eyes and hands — screen capture and desktop automation plugin for Claude Code


# HandsOn

[![GitHub release](https://img.shields.io/github/v/release/3spky5u-oss/HandsOn?style=flat-square)](https://github.com/3spky5u-oss/HandsOn/releases)
[![License: MIT](https://img.shields.io/badge/license-MIT-blue?style=flat-square)](LICENSE)
[![Platform](https://img.shields.io/badge/platform-Windows%20%7C%20macOS-lightgrey?style=flat-square)]()
[![Claude Code Plugin](https://img.shields.io/badge/Claude%20Code-plugin-blueviolet?style=flat-square)](https://docs.anthropic.com/en/docs/claude-code)
[![MCP](https://img.shields.io/badge/MCP-server-orange?style=flat-square)](https://modelcontextprotocol.io)
[![Tools](https://img.shields.io/badge/tools-34-green?style=flat-square)]()

Give Claude eyes and hands. A Claude Code plugin that lets Claude see your screen and interact with any application on your desktop.

<!-- TODO: ![Demo GIF](docs/demo.gif) -->

> **Alpha software.** HandsOn works and is genuinely useful, but Claude will sometimes stumble through tasks — misclicking sidebar links, scrolling the wrong container, needing multiple attempts to find the right element. Complex multi-step workflows (like filling out a Reddit post) may take several retries to get right. It's getting better with each release, but set your expectations accordingly.

## What's New in v0.3.0

- **Window-cropped visual detection** — When a target window is set, visual analysis is cropped to that window's bounds, eliminating noise from the terminal and other background apps
- **30-50% token reduction** — Compact output across all tools: shorter class names, removed redundant formatting, tighter coordinate notation
- **Unnamed element filtering** — `list_elements` now skips unnamed Pane/Group/Custom elements that add noise without value (common in Steam, Electron apps, etc.)
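
To make the window cropping and unnamed-element filtering above concrete, here is a minimal sketch of the two ideas. It is not HandsOn's actual code: `Element`, `crop_region`, `filter_elements`, and the `(left, top, right, bottom)` rectangle layout are hypothetical names and conventions chosen for illustration, and falling back to the full screen when the window is off-screen is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical rectangle layout: (left, top, right, bottom) in screen pixels.
Rect = Tuple[int, int, int, int]

# Container types the release notes describe as noisy when they have no name.
NOISY_TYPES = {"Pane", "Group", "Custom"}


@dataclass
class Element:
    """Illustrative stand-in for a UI element; not HandsOn's data model."""
    control_type: str
    name: str
    bounds: Rect


def intersect(a: Rect, b: Rect) -> Optional[Rect]:
    """Return the overlap of two rectangles, or None if they do not overlap."""
    left, top = max(a[0], b[0]), max(a[1], b[1])
    right, bottom = min(a[2], b[2]), min(a[3], b[3])
    if right <= left or bottom <= top:
        return None
    return (left, top, right, bottom)


def crop_region(screen: Rect, window: Optional[Rect]) -> Rect:
    """Pick the region to analyze: the target window clipped to the screen.

    Assumption: with no target window, or a window entirely off-screen,
    fall back to the full screen.
    """
    if window is None:
        return screen
    clipped = intersect(screen, window)
    return clipped if clipped is not None else screen


def filter_elements(elements: List[Element]) -> List[Element]:
    """Drop Pane/Group/Custom elements whose name is empty or whitespace."""
    return [
        e for e in elements
        if e.name.strip() or e.control_type not in NOISY_TYPES
    ]


if __name__ == "__main__":
    screen = (0, 0, 1920, 1080)
    # A window hanging off the right edge is clipped to the visible part.
    print(crop_region(screen, (100, 50, 2000, 700)))

    elements = [
        Element("Pane", "", (0, 0, 10, 10)),
        Element("Button", "", (0, 0, 10, 10)),
        Element("Group", "Sidebar", (0, 0, 10, 10)),
    ]
    print([e.control_type for e in filter_elements(elements)])
```

In this sketch an unnamed `Button` is kept, because only the three container types are treated as noise; whether HandsOn draws the line in the same place is not stated in the release notes.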
