Built a dataset of GUI-based human-computer interaction tasks for pretraining and aligning a general-purpose computer-use agent. The dataset spans applications, operating systems, and task intents and combines timestamped screen recordings, step-by-step action logs, and action metadata.

Training a general-purpose GUI agent requires grounded, executable demonstrations of how people use real applications, including subtle decisions, inter-application handoffs, and UI interaction reasoning. The client needed:
The client required a dataset that could support both pretraining and fine-tuning of agents able to complete open-ended, real-world tasks across a wide GUI surface.
Turing deployed a global network of expert trainers across Asia and Latin America, all vetted for analytical ability and tool fluency. Tasks were designed, executed, and reviewed using Turing’s internal labeling platform with custom QA automation and structured logging.
Dataset components
Each task included:
Tasks were distributed across:
QA process
Turing implemented a multi-layered QA and compliance framework:
The resulting dataset powers agent training workflows and instruction-following evaluations. It is used to:
Request a sample with a realistic user prompt, action log, metadata, timestamps, screenshots, and full task metadata for operating system, application, complexity, and trajectory.
Request SampleOffice tools, browsers, system settings, design tools, spreadsheets, and more across Windows, macOS, and Linux.
Each task includes a prompt, an action log, screenshot timeline, and coordinates.
The dataset supports GUI agent pretraining, simulation, reward modeling, and general task grounding.
A standard mutual NDA. Turing provides the countersigned agreement within one business day.
Within three business days after NDA execution.
Request a sample with step-level data including screenshots and timestamps.