Careerflow.ai

Data Annotation Specialist - Computer Use Agents (CUA) Trajectory Evaluator

Posted 5 Days Ago
Remote
Hiring Remotely in India
Mid level
Create, validate, and document step-by-step Computer-Use Agent (CUA) trajectories for technical developer workflows. Break down natural language instructions into reproducible actions, execute and test workflows in Linux using Python/Bash, interact with APIs and browser automation, and collaborate to improve annotation quality and guidelines.
The summary above was generated by AI

Role Overview:

We are looking for skilled professionals to contribute as S2 Annotators, responsible for producing and validating high-quality Computer-Use Agent (CUA) trajectories for developer-adjacent workflows. This includes tasks such as file operations, light scripting, API interactions, and browser automation. This role requires a strong understanding of technical workflows, attention to detail, and the ability to translate natural language instructions into precise, step-by-step executable actions that can be used to train advanced AI systems.

What the day-to-day looks like

  • Create detailed, step-by-step positive CUA trajectories for technical tasks (e.g., file manipulation, scripting, API calls, browser-based workflows)

  • Break down natural language instructions into clear, verifiable actions

  • Validate and review trajectories for correctness, completeness, and reproducibility

  • Work within Linux desktop environments to execute and document workflows

  • Use scripting (Python/Bash) to simulate or validate task execution where required

  • Interact with tools and environments involving APIs, terminals, and browser automation

  • Collaborate with internal teams to refine task quality and annotation guidelines

  • Ensure consistency, accuracy, and high-quality standards across all annotations

Requirements

  • 2–5 years of experience in software development, technical support, or similar technical roles

  • Strong familiarity with Linux environments and command-line operations

  • Proficiency in at least one scripting language: Python or Bash

  • Ability to decompose complex instructions into structured, step-by-step workflows

  • Strong attention to detail in documenting technical processes

  • Exposure to LLM-based tools, AI systems, or agentic workflows

  • Basic understanding of APIs, file systems, and developer tooling

  • Familiarity with OpenClaw or similar environments/tools

Nice to have

  • Prior experience in data annotation, RLHF, or SFT labeling workflows

  • Exposure to CI/CD pipelines, REST APIs, or terminal-based automation

  • Experience working with browser automation tools or developer productivity tools

  • Background in evaluating or improving AI-generated outputs

Offer Details:

  • Engagement type: Contractor assignment/freelancer (no medical/paid leave)

  • Duration: 5 weeks

Evaluation Process:

  • Resume screening

  • Take-home assessment (60 minutes)
