AI Agent Evaluation Analyst

2 weeks ago


Philippines Mindrift Full time $24,000 - $47,000 per year

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. 

What we do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for:

We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate. 

Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

  • Analysts, researchers, or consultants with strong critical thinking skills.
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig.
  • People open to a part-time and non-permanent opportunity.


About the project:

We're on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you've ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.

What you'll be doing:

  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
  • Identifying inconsistencies, missing assumptions, or unclear decision points.
  • Helping define clear expected behaviors (gold standards) for AI agents.
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly.
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage.

How to get started:

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements
  • Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.
  • Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.
  • Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.
  • Ability to assess scenarios holistically: What's missing, what's unrealistic, what might break?
  • Good communication and clear writing (in English) to document your findings.

We also value applicants who have:

  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.
  • Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.
  • Exposure to LLMs, prompt engineering, or AI-generated content.
  • Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong").
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).
Benefits
  • Get paid for your expertise, with rates that can go up to $47/hour depending on your skills, experience, and project needs.
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.

  • AI Agent Developer

    1 week ago


    Philippines Appfront Inc Full time ₱24,000 - ₱60,000 per year

    About the RoleWe are seeking a skilled AI Agent / Copilot Developer experienced in building and extending Microsoft Copilots, integrating LLMs (e.g., ChatGPT, LLaMA), and automating workflows through the Power Platform.The ideal candidate has a strong background in .NET development, with practical experience in Power Automate, Dataverse, and SharePoint, and...

  • AI Engineer

    3 weeks ago


    , Metro Manila, Philippines Ibex Staffing Solutions Full time

    We are seeking a highly skilled AI Engineer to design and develop intelligent, adaptive systems that learn from real-world customer interactions. This role focuses on building AI‑driven agents, reinforcement learning pipelines, and continuous feedback loops that enhance user experience and expand product intelligence over time. Key Responsibilities Design,...


  • Philippines Mindrift Full time $47,000 per year

    This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What...

  • AI Developer

    4 weeks ago


    , Metro Manila, Philippines JPMorganChase Full time

    Join our innovative team to shape the future of AI applications. Leverage your expertise in Amazon Bedrock and AWS to build intelligent systems that solve complex business problems. Be at the forefront of AI‑driven innovation. Job Summary As an Applied AI Developer within our innovative team, you will design, develop, and deploy generative and agentic AI...

  • AI Developer

    4 weeks ago


    , Metro Manila, Philippines JPMorgan Chase Full time

    Join our innovative team to shape the future of AI applications. Leverage your expertise in Amazon Bedrock and AWS to build intelligent systems that solve complex business problems. Be at the forefront of AI-driven innovation. Job Summary As an Applied AI Developer within our innovative team, you will design, develop, and deploy generative and agentic AI...

  • AI Workflow Engineer

    2 weeks ago


    , , Philippines XO Cyber Full time

    Executive Operations is seeking an AI Automation Specialist to help build and scale our next‑generation AI‑enabled digital workforce. In this role, you will design, integrate, and optimize automation solutions across business units, enabling seamless collaboration between human teams and AI agents. This is a key position for driving operational...


  • , , Philippines Solugenix Full time

    Quality Assurance Analyst Clark, Pampanga (Onsite) Full-Time Job ID The Quality Assurance Analyst is responsible for monitoring and evaluating customer interactions, ensuring compliance with the company standards, policies, and procedures, identifying areas for improvement, and providing actionable feedback to improve overall performance and customer...

  • Quality Analyst

    4 weeks ago


    , , Philippines RippedBoxStation Full time

    Quality Analyst (QA) Position: Quality Analyst (QA)Number of hours: 40 Hrs/weekSchedule: TBD Location: Manila, National Capital Region, Philippines Responsibilities Monitor and evaluate customer interactions (calls, emails, chats) to ensure compliance with quality standards and company policies. Conduct regular quality audits and provide detailed reports on...


  • , , Philippines Luxoft Full time

    Project description Our client, one of the leading Agriculture Companies, is modernising their landscape and adopting AI and innovations in their process.We are seeking a highly skilled and innovative Python/AI Engineer to join our team. The ideal candidate will be responsible for developing cutting‑edge AI solutions, creating Proof of Concepts (PoCs), and...

  • D2C - Product Analyst

    2 weeks ago


    , , Philippines HomeLight Full time

    Join to apply for the BI-Product Analyst role at HomeLight Join to apply for the BI-Product Analyst role at HomeLight This role is based in the Philippines. WFH setup, US Pacific hours ***** This role is based in the Philippines. WFH setup, US Pacific hours ***** Who We AreWe’re building the future of real estate — today.HomeLight is the essential...