AI Agent Evaluation Analyst

9 hours ago


Biñan, Philippines Mindrift Full time

Get AI‑powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What We Do The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We're Looking For We’re looking for curious and intellectually proactive contributors—individuals who double‑check assumptions and play devil’s advocate. Comfortable with ambiguity and complexity? Prefer an async, remote, flexible opportunity? Interested in learning how modern AI systems are tested and evaluated? This role is ideal. Flexible Project‑Based Opportunity Analysts, researchers, or consultants with strong critical thinking skills Students (senior undergrads / grad students) looking for an intellectually interesting gig People open to a part‑time and non‑permanent opportunity About the Project We are hiring QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you will balance quality assurance, research, and logical problem‑solving. This opportunity is ideal for those who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases. What You'll Be Doing Review evaluation tasks and scenarios for logic, completeness, and realism Identify inconsistencies, missing assumptions, or unclear decision points Help define clear expected behaviors (gold standards) for AI agents Annotate cause‑effect relationships, reasoning paths, and plausible alternatives Think through complex systems and policies as a human would to ensure agents are tested properly Work closely with QA, writers, or developers to suggest refinements or edge‑case coverage How to Get Started Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills on your own schedule. Shape the future of AI while building tools that benefit everyone. Requirements Excellent analytical thinking: reason about complex systems, scenarios, and logical implications Strong attention to detail: spot contradictions, ambiguities, and vague requirements Familiarity with structured data formats: read JSON/YAML Ability to assess scenarios holistically: identify missing or unrealistic elements that might break Good communication and clear writing in English to document findings We also value applicants who have: Experience with policy evaluation, logic puzzles, case studies, or structured scenario design Background in consulting, academia, olympiads (logic/math/informatics), or research Exposure to LLMs, prompt engineering, or AI‑generated content Familiarity with QA or test‑case thinking (edge cases, failure modes) Understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.) Benefits Get paid for your expertise, with rates up to $47/hour depending on skills, experience, and project needs Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments Participate in an advanced AI project and gain valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise #J-18808-Ljbffr



  • Biñan, Philippines Sigma AI Full time

    Ilocano/Iloko Linguistic Projects (Remote) Join the Ilocano/Iloko Linguistic Projects (Remote) role at Sigma AI. Sigma is a leading global technology company specializing in data collection and annotation for Artificial Intelligence. With over 30 years of experience, offices in Spain, the US, and the UK, and operations in more than 200 languages, we support...


  • Biñan, Philippines Moxie Full time

    This range is provided by Moxie. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $11.00/hr - $15.00/hr At Moxie, we empower ambitious aesthetic entrepreneurs to build profitable, independent practices—without burnout, overwhelm, or guesswork. In just a few years, we’ve grown from an...

  • AI Data Quality

    4 weeks ago


    Biñan, Philippines TaskUs Full time

    AI Data Quality & Engineering Lead at TaskUs About TaskUs: TaskUs is a provider of outsourced digital services and next-generation customer experience to fast-growing technology companies, helping its clients represent, protect and grow their brands. Leveraging a cloud-based infrastructure, TaskUs serves clients in fast-growing sectors, including social...


  • Biñan, Philippines Remotasks Full time

    Video Content Description Specialist Expertise Sought for AI Training Join to apply for the Video Content Description Specialist Expertise Sought for AI Training role at Remotasks Video Content Description Specialist Expertise Sought for AI Training Join to apply for the Video Content Description Specialist Expertise Sought for AI Training role at Remotasks...


  • Biñan, Philippines Mindrift Full time

    Freelance Civil Engineering Expert - AI Trainer 4 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English....


  • Biñan, Philippines TaskUs Full time

    Business Development Representative, AI Safety Join or sign in to find your next job Join to apply for the Business Development Representative, AI Safety role at TaskUs Business Development Representative, AI Safety 1 day ago Be among the first 25 applicants Join to apply for the Business Development Representative, AI Safety role at TaskUs Get...

  • Financial Analyst

    1 week ago


    Biñan, Philippines Crown Worldwide Group Full time

    Overview Join to apply for the Financial Analyst role at Crown Worldwide Group . Crown Worldwide Group is a privately owned, global logistics company founded in 1965 and headquartered in Hong Kong. We are an extraordinary and purposeful business committed to making it simpler to live, work and do business anywhere in the world, delivered through our broad...


  • Biñan, Calabarzon, Philippines Western Digital Full time ₱360,000 - ₱720,000 per year

    Company Description At Western Digital, our vision is to power global innovation and push the boundaries of technology to make what you thought was once impossible, possible. At our core, Western Digital is a company of problem solvers. People achieve extraordinary things given the right technology. For decades, we've been doing just that—our technology...


  • Biñan, Calabarzon, Philippines Western Digital Full time ₱900,000 - ₱1,200,000 per year

    Company Description At Western Digital, our vision is to power global innovation and push the boundaries of technology to make what you thought was once impossible, possible.At our core, Western Digital is a company of problem solvers. People achieve extraordinary things given the right technology. For decades, we've been doing just that—our technology...


  • Biñan, Philippines Emperador Distillers, Inc. Full time

    Quality Assurance Laboratory Analyst Job Description a. Directly reports to QA Lab/Line Supervisor b. Performs routine laboratory sampling, analysis and evaluation of raw materials, in-process and finished goods based on the current and standard laboratory procedure. c. Analyzes and interprets all laboratory tests results. d. Supports by ensuring the...