About the Role
Mercor is seeking a highly skilled Research and STEM Expert to join our AI evaluation and technical quality assurance team. In this role, you will analyze, evaluate, and fact-check AI-generated outputs across scientific, mathematical, and technical domains — ensuring the highest standards of factual accuracy, logical reasoning, and clarity.
You will help improve the reasoning and reliability of cutting-edge Large Language Models (LLMs) by providing structured feedback and expert judgment across diverse STEM fields. This position is ideal for individuals with strong academic training, analytical precision, and a passion for advancing AI alignment in research and science.
Key Responsibilities
-
Evaluate and critique AI-generated responses in STEM-related subjects (e.g., computer science, mathematics, physics, biology, and engineering).
-
Conduct fact-checking and research validation using reputable public and academic sources.
-
Assess scientific explanations, calculations, and reasoning for correctness and clarity.
-
Provide structured written feedback to improve the model’s understanding and communication of technical topics.
-
Collaborate with the AI quality team to improve annotation guidelines and maintain consistency across evaluations.
Minimum Requirements
-
BS, MS, or PhD in a STEM domain (e.g., Computer Science, Mathematics, Biology, Physics, Engineering, etc.)
-
English expert with excellent comprehension and communication skills
-
Excellent at high school–level math
-
Experts at fact-checking information across multiple domains (medical, legal, financial, technical, etc.) using trusted public sources
-
Excellent writing skills and attention to detail
-
Significant experience using Large Language Models
Please mention the word COMMENDABLY and tag RMTM4LjE5Ny4yMy4yNTU= when applying to show you read the job post completely (#RMTM4LjE5Ny4yMy4yNTU=). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.