Skip to content

tauri-agent-tools

Agent-driven inspection toolkit for Tauri desktop apps.

11 read-only commands to screenshot, inspect, and monitor Tauri apps from the CLI.

License: MIT Node >= 20

The Problem

Debugging frontend issues in Tauri desktop apps requires manually screenshotting, cropping, and describing what you see. Existing tools either hijack your cursor (xcap-based), render DOM to canvas (html2canvas — can't capture WebGL/video/canvas elements), or have no authentication.

The Solution

Combine a bridge's knowledge of element positions (getBoundingClientRect) with real pixel screenshots (import -window + ImageMagick crop). No other tool does this.

Features

  • DOM-targeted capture — screenshot any CSS-selectable element with real screen pixels
  • 11 read-only commands — screenshot, dom, eval, wait, info, list-windows, ipc-monitor, console-monitor, storage, page-state
  • Cross-platform — Linux X11, Linux Wayland/Sway, macOS
  • Token authenticated — random 32-char token, localhost-only bridge
  • Agent Skills — ships agentskills.io skills for AI coding agents
  • JSON output — all commands support --json for structured output

Quick Install

npm install -g tauri-agent-tools

See Installation for platform-specific requirements.

Quick Example

# Screenshot a specific DOM element
tauri-agent-tools screenshot --selector ".toolbar" -o /tmp/toolbar.png

# Explore the DOM tree
tauri-agent-tools dom --depth 3

# Monitor IPC calls in real-time
tauri-agent-tools ipc-monitor --duration 10000

Next Steps