About Turing:
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, and top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.
Role Overview
Turing is seeking highly skilled and motivated Applied AI Research Scientists in Computer Science and Computer Engineering, with an MS or Ph.D. in a relevant technical field, to join our team. In this role, you will contribute to the design, validation, and execution of expert-level evaluation tasks that probe the limits of state-of-the-art AI systems. Your work will focus on creating headroom-level, rigorously verifiable questions across hardware, systems, and computing domains to assess and stress-test advanced multimodal and reasoning-capable AI models.
This position requires deep domain expertise, strong analytical rigor, and the ability to translate complex technical concepts into precise, evaluable challenges that expose model limitations beyond surface-level reasoning. You will work closely with a collaborative, cross-functional team and are expected to be a reliable team player who is highly detail-oriented and committed to accuracy and quality.
Roles & Responsibilities
- Design headroom-level evaluation questions requiring advanced reasoning and graduate-level domain expertise in CS/CE.
- Ensure all tasks are objectively verifiable with clear, definitive ground-truth answers.
- Develop high-quality multimodal prompts, including accurate technical diagrams or visuals when appropriate.
- Identify and document model headroom, focusing on SOTA models like Gemini and ChatGPT, and conduct structured side-by-side evaluations.
- Document model failures and reasoning gaps, provide correct solutions, and maintain accurate records of prompts, answers, and evaluation results in shared tracking systems.
- Overlap of 6 hours with PST time zone (12pm PST to 6pm PST) is mandatory.
Required Skills & Qualifications
- MS or Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, Information Technology, Data Science, or a closely related field
- Strong expertise in two or more Computer Engineering or Computer Science domains, such as hardware, computer architecture, systems, VLSI design, embedded systems and IoT, operating systems, compilers, systems security, or AI/ML
- Experience with applied AI research, technical evaluation, or research-driven problem formulation in real-world or production-oriented settings is a plus
- Strong programming proficiency, with experience in Python for analysis, verification, and evaluation workflows
- Strong written communication skills and the ability to collaborate effectively as a detail-oriented team player
Perks of Freelancing With Turing:
- Work in a fully remote environment
- Opportunity to work on cutting-edge AI projects with leading LLM companies
Offer Details:
- Commitment Required: 8 hours per day, with 6 hours of mandatory overlap with PST (12pm PST to 6pm PST)
- Employment Type: Contractor assignment (no medical/paid leave).
- Contract Duration: 3 months (expected start date: next week).
- Eligible Locations: US, Canada, LATAM, Europe, Africa.
Evaluation Process:
- Round 1: Take-home assessment
Offline assessment to be completed and submitted for review.
- Round 2: Delivery Interview (60 minutes)
A combined technical and cultural discussion with the Delivery Team.