
AI Agent Evaluation Analyst
1 week ago
AI Agent Evaluation Analyst - AI Trainer
At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.
The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.
Who we're looking for: We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate.
Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?
This is a flexible, project-based opportunity well-suited for:
- Analysts, researchers, or consultants with strong critical thinking skills
- Students (senior undergrads / grad students) looking for an intellectually interesting gig
- People open to a part-time and non-permanent opportunity
About the project: We're on the hunt for QAs to test autonomous AI agents in a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks.
Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving. This opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.
You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups.
What you'll be doing:
- Reviewing evaluation tasks and scenarios for logic, completeness, and realism
- Identifying inconsistencies, missing assumptions, or unclear decision points
- Helping define clear expected behaviors (gold standards) for AI agents
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives
- Thinking through complex systems and policies as a human would to ensure agents are tested properly
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage
Requirements:
- Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications
- Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements
- Familiarity with structured data formats: Can read (but not necessarily write) JSON/YAML
- Can assess scenarios holistically: What's missing, what's unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings
We also value applicants who have:
- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
- Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
- Exposure to LLMs, prompt engineering, or AI-generated content
- Familiarity with QA or test-case thinking (edge cases, failure modes, 'what could go wrong')
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
Benefits:
- Get paid for your expertise, with rates that can go up to $38/hour depending on your skills, experience, and project needs
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio
- Influence how future AI models understand and communicate in your field of expertise
-
Prompt Engineer – Conversational AI
3 weeks ago
Capital District, Philippines Outsourced Staff Full time
About Us: At Voice AI Solutions, we specialise in building advanced conversational AI voice agents for businesses. Our products include AI Receptionists, AI Customer Service Agents, and AI Sales Agents, with custom solutions available across industries such as IT, finance, real estate, and insurance. Designed with data sovereignty, security, and seamless...
-
QA Engineer
3 weeks ago
Capital District, Philippines Bamboo Works Full time
We are looking for a motivated Quality Assurance (QA) Engineer with a passion for delivering high-quality Voice AI Agent Software. In this role, you will support testing efforts across Voice AI components such as Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), and...
-
Account and Project Manager
3 weeks ago
Capital District, Philippines Creatify AI Full time
Location: Philippines. Employment Type: Full time. Location Type: Remote. Department: Creative.
About Creatify: Creatify is building the world’s first end-to-end AI advertising agent, a platform that automates the entire video ad lifecycle, from scripting and avatar-led generation to testing, optimization, and publishing across Meta, TikTok, YouTube, and...
-
Capital District, Philippines e2f Inc. Full time
Job Description: We are currently building our team of freelance annotators for a Response Correctness Annotation (IDEAS) Project –...
-
AI Content Writer
1 week ago
Capital District, Philippines DataAnnotation Full time
We are looking for a Bilingual AI Content Writer (Tagalog/English) to join our team to train AI models. You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of each model. In this role you will need to hold an expert level of linguistics. You will have conversations in both Tagalog and English...
-
Freelance AI Trainer
3 weeks ago
Capital District, Philippines Mindrift Full time
Freelance AI Trainer - Mathematics & Python, 6 days ago
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English. At...
-
AI Data Quality Analyst
3 weeks ago
Eastern Manila District, Philippines TaskUs Full time
About TaskUs: TaskUs is a provider of outsourced digital services and next-generation customer experience to fast-growing technology companies, helping its clients represent, protect and grow their brands. Leveraging a cloud-based infrastructure, TaskUs serves clients in sectors including social media, e-commerce, gaming, streaming media, food delivery,...
-
Data Operations Analyst
3 weeks ago
Capital District, Philippines Carbon6 Full time
Who We Are: Carbon6, now proudly part of SPS Commerce, is transforming the future of ecommerce. Our mission is to simplify success for online sellers by removing...
-
Mortgage Credit Analyst
2 weeks ago
Capital District, Philippines Wingman Group PTY LTD Full time
Job Summary: We are seeking a professional and reliable Mortgage Credit Analyst to join our team! You will be responsible for assessing the creditworthiness of loan applicants, ensuring compliance with lending regulations, and working with real estate agents to ensure a successful and timely close.
Responsibilities: Assess each borrower’s financial...
-
Sr. Website Data Analyst
1 week ago
Capital District, Philippines Cloudflare Full time
About Us: At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without...