Professional Evaluator - Fully Remote
About the role
About The Job
Mercor connects elite creative and technical talent with leading AI research labs. The company is headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: AI Model Evaluator
Type: Contract
Compensation: $60–$80/hour
Commitment: 20 hours/week
Role Responsibilities
- Write realistic prompts reflecting how professionals and consumers seek domain-specific guidance.
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
- Score and rank multiple model responses using structured rubrics across multiple dimensions.
- Provide written justifications with specific evidence for each evaluation.
Qualifications
Must-Have
- Master’s degree or higher in a relevant professional field (e.g., Finance, Accounting, Law, Medicine, Healthcare, Engineering).
- Professional experience applying domain expertise in a practitioner or advisory capacity.
- Familiarity with industry-specific standards, regulations, or clinical guidelines.
- Strong written communication and critical reasoning skills.
Application Process (Takes 20–30 mins to complete)
- Submit your resume to begin.
- Complete a Training Assessment.
Resources & Support
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
- For any help or support, reach out to: support@mercor.com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.