tauri-agent-tools
Agent-driven inspection toolkit for Tauri desktop apps.
11 read-only commands to screenshot, inspect, and monitor Tauri apps from the CLI.
The Problem
Debugging frontend issues in Tauri desktop apps requires manually screenshotting, cropping, and describing what you see. Existing tools either hijack your cursor (xcap-based), render DOM to canvas (html2canvas — can't capture WebGL/video/canvas elements), or have no authentication.
The Solution
Combine a bridge's knowledge of element positions (getBoundingClientRect) with real pixel screenshots (import -window + ImageMagick crop). No other tool does this.
Features
- DOM-targeted capture — screenshot any CSS-selectable element with real screen pixels
- 11 read-only commands — screenshot, dom, eval, wait, info, list-windows, ipc-monitor, console-monitor, storage, page-state
- Cross-platform — Linux X11, Linux Wayland/Sway, macOS
- Token authenticated — random 32-char token, localhost-only bridge
- Agent Skills — ships agentskills.io skills for AI coding agents
- JSON output — all commands support
--jsonfor structured output
Quick Install
See Installation for platform-specific requirements.
Quick Example
# Screenshot a specific DOM element
tauri-agent-tools screenshot --selector ".toolbar" -o /tmp/toolbar.png
# Explore the DOM tree
tauri-agent-tools dom --depth 3
# Monitor IPC calls in real-time
tauri-agent-tools ipc-monitor --duration 10000
Next Steps
- Installation — set up system requirements
- Quick Start — get running in 5 minutes
- Bridge Setup — integrate the Rust bridge
- Command Reference — all 11 commands with examples
- Architecture Overview — how it works under the hood