AI Agent Evaluation Analyst

4 weeks ago


Capital District, Philippines | Mindrift | Full time

AI Agent Evaluation Analyst - AI Trainer

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for: We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate.

Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

  • Analysts, researchers, or consultants with strong critical thinking skills
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig
  • People open to a part-time and non-permanent opportunity

About the project: We're looking for QA analysts to work on autonomous AI agents in a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks.

Throughout the project, you'll balance quality assurance, research, and logical problem-solving. The work is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups.

What you'll be doing:

  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism
  • Identifying inconsistencies, missing assumptions, or unclear decision points
  • Helping define clear expected behaviors (gold standards) for AI agents
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage

Requirements:

  • Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications
  • Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements
  • Familiarity with structured data formats: Can read, though not necessarily write, JSON/YAML
  • Can assess scenarios holistically: What's missing, what's unrealistic, what might break?
  • Good communication and clear writing (in English) to document your findings

We also value applicants who have:

  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
  • Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
  • Exposure to LLMs, prompt engineering, or AI-generated content
  • Familiarity with QA or test-case thinking (edge cases, failure modes, 'what could go wrong')
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
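For applicants unfamiliar with the scoring terms in the last point, here is a minimal sketch of how precision and coverage might be computed for an agent's actions against a gold standard. The action names and both helper functions are hypothetical, made up for illustration; they are not taken from the project's actual evaluation framework, which may define these metrics differently.

```python
def precision(agent_actions, gold_actions):
    """Share of the agent's actions that also appear in the gold standard."""
    agent, gold = set(agent_actions), set(gold_actions)
    if not agent:
        return 0.0  # assumption: an agent that did nothing scores zero
    return len(agent & gold) / len(agent)


def coverage(agent_actions, gold_actions):
    """Share of gold-standard actions that the agent actually performed."""
    agent, gold = set(agent_actions), set(gold_actions)
    if not gold:
        return 1.0  # assumption: nothing was expected, so nothing was missed
    return len(agent & gold) / len(gold)


# Hypothetical scenario: expected behavior for a refund request
gold_actions = [
    "verify_identity",
    "check_refund_policy",
    "issue_refund",
    "send_confirmation",
]

# Hypothetical agent run: skipped two expected steps, added an unexpected one
agent_actions = ["verify_identity", "issue_refund", "close_ticket"]

p = precision(agent_actions, gold_actions)  # 2 of 3 agent actions were expected
c = coverage(agent_actions, gold_actions)   # 2 of 4 expected actions were done
print(f"precision={p:.2f} coverage={c:.2f}")  # prints: precision=0.67 coverage=0.50
```

In this sketch, low precision points to actions the agent should not have taken, while low coverage points to expected steps it skipped. A reviewer would look at both, since either number alone can hide a failure.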

Benefits:

  • Get paid for your expertise, with rates that can go up to $38/hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise
