Turing.com review by former tech entrepreneur

"I can work in the heart of Silicon Valley from the comfort of my home"

- Dhyey, Software Engineer from India

Dhyey__.webp

Dhyey, an AI/ML & Data Science enthusiast, shared his Turing.com review and said that he loves the company's close-knit culture and the opportunity to work with talented people from diverse backgrounds from across the world. He also added that now he can spend his free time on the things he enjoys such as binge-watching movies/TV shows and playing chess.

Life before Turing jobs

A software engineer based out of Rajkot, India, Dhyey specializes in various areas of Artificial Intelligence such as ML, NLP, and CV. Being an AI, data science, cloud computing, and chess enthusiast, Dhyey spends his time building cross-platform apps in React Native.

After working at Amazon Canada for a couple of years on warehouse control and demand forecasting, Dhyey decided to return home. "I wanted to work on my own startup, so I moved back to India. Unfortunately, after making some progress, the pandemic hit, and the project got derailed," he recalls.

But hardship was never going to get the better of him. "I decided to utilize my time productively and work with a firm that is making an impact in the world," he exclaims.

How did he learn about Turing US software jobs?

Dhyey came across Turing's ad during one of his determined job hunts. "The idea of working for top US companies from anywhere in the world without needing a VISA felt like a dream! I just had to sign up and check it out!" he says.

"I took their tests which were quite challenging. But that's how I knew that the work would be just as fulfilling! After 4-6 hours of tests and interviews, I got hired. Today I am working with top tech leaders from all over the world," he remarks.

How has his journey with Turing.com been so far?

"Life is slow and peaceful back home, Rajkot being a small city. I don't have to travel to big cities for decent tech opportunities. I'm not too fond of the commute and pollution, which drain my energy. With Turing, I can spend my free time on the things I enjoy, like playing chess and binge-watching movies and TV shows," he says.

What's his take on Turing developers?

The international exposure that comes from working at Turing is naturally appealing to talented professionals like Dhyey. "I love the company's close-knit culture and the opportunity to work with talented people from diverse backgrounds. Every day, I work and interact with the best engineering minds not just in my city or country, but from around the world," he shares.

What's the final verdict?

Enjoying his new life as a remote developer, Dhyey concludes: "I can work in the heart of Silicon Valley from the comfort of my home. With great pay and solid growth opportunities, I feel like I have the best of both worlds."

If you are a talented software developer looking to take your career to the next level, join Turing's boundaryless team today!

Interested in U.S. software jobs?

Apply to Turing today.

Apply now

Explore remote developer jobs

briefcase
AI Quality Analyst - Portuguese (Portugal)

About Turing:
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

Role Overview:

As an AI Quality Analyst, you will evaluate a new personalization feature for Gemini. You will assess how well the model uses information from your past Gemini conversations, Gmail, Google Search, and YouTube activity to make responses more relevant and helpful. This role requires a unique blend of creativity and analytical rigor. You will actively design prompts from the perspective of your own personal experiences. You will then use your analytical skills to assess the quality of the model's personalized responses, evaluating dimensions like Grounding, Integration, and Helpfulness.


Key Qualifications

  • Portugueese Proficiency: Ability to read and write in Portuguese with a high degree of comp, as Portuguese is the focus language for this project.
  • Personal Account Usage: Willingness to use your primary personal Google account (not a testing account) and enable personal data sources for a genuine assessment.
  • Schedule Flexibility: Full-time availability in your local time zone is required.  We are staffing a global, 24-hour operations team.
  • Exceptional Analytical Thinking: Demonstrate ability to evaluate nuanced and ambiguous AI responses, specifically assessing personalization quality.
  • Creative Prompt Engineering: Experience in designing creative, multi-turn starting prompts based on personal context to thoroughly test the model's capabilities.
  • Strong Evaluation Acumen: Understanding of personalization concepts, including the ability to identify incorrect personalization, poor inferences, and forced connections.
  • Meticulous Attention to Detail: The ability to review Side-by-Side (SxS) model responses and spot subtle differences in naturalness and overnarrating.
  • Excellent Written Communication: Superior ability to write clear, concise, and structured rationales for model rankings, explicitly referencing specific turn numbers.
  • Feedback: Ability to provide constructive feedback and detailed annotations.
  • Communication: Excellent communication and collaboration skills.
  • Independence: Self-motivated and able to work independently in a remote setting.
  • Technical Setup: Desktop/Laptop set up with a good internet connection.


Description:

  • In this role, you will be part of a dynamic team focused on evaluating the quality of personalized AI interactions. Your day-to-day work will involve:
  • Designing and executing multi-turn conversational prompts (typically 1-5 turns) that require the AI to utilize your personal information and experiences.
  • Evaluating model responses based on your intent from the starting prompt, checking if the personalization was appropriately applied.
  • Analyzing responses for Grounding issues, ensuring claims about you are supported by evidence and not flawed inferences or hallucinations.
  • Assessing Integration quality to ensure personal data is woven naturally into the response without robotic "overnarrating".
  • Rigorously evaluating and stack-ranking two model responses side-by-side (SxS) to determine which is overall more helpful, easy to use, and enjoyable.
  • Writing clear, defensible rationales for your comparisons, explicitly referencing where issues or positive aspects occurred in the conversation.
  • Extracting and verifying "Debug Info" from the model to confirm that chat summaries and data sources were properly utilized.
  • Maintaining strict data hygiene by deleting evaluation conversations to prevent them from polluting your future chat history.


Education & Experience

  • BS/BA degree or equivalent experience in a relevant field (e.g., Policy, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field).
  • Experience in data annotation, AI quality evaluation, content moderation, or a related role is strongly preferred.

Offer Details:

  • Commitments Required: at least 4 hours per day and upto 40 hours per week with 4 hours of overlap with PST.
  • Engagement type: Contractor
  • Engagement Length: 3 months
  • Our offered rate for this project is $15 per hour.

Evaluation Process -

  • Shortlisted candidates will be sent a Job Interest Form.
  • After the profile review, an assessment will be shared, which must be completed within 24 hours.
  • Based on the assessment outcomes, shortlisted candidates will be contacted to discuss the pre‑onboarding requirements.
Software
10K+ employees
Domain-Specific Languages
briefcase
AI Engineer

About Turing


Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L


Role Overview


We are looking for an AI/ML Engineer specializing in LLM post-training and reinforcement learning workflows. The role focuses on fine-tuning open-weight models, building reward systems, and improving model performance through scalable training, evaluation, and data curation


What does day-to-day life look like?

  • Design and execute fine-tuning pipelines for open-weight models (Qwen, Llama, Mistral families) using SFT → DPO → GRPO progressions on tool-use and agentic data.
  • Implement and tune LoRA / QLoRA adapters for parameter-efficient fine-tuning; understand when full fine-tuning vs PEFT is the right call.
  • Build reward functions and verifiers for RL training  including programmatic verifiers, LLM-as-judge rubrics, and state-transition checks against gym environments.
  • Generate, curate, and filter RL tool-use training data: golden trajectories, preference pairs, on-policy rollouts, and rejection-sampled completions.
  • Run distributed training on multi-GPU setups; manage inference at scale with vLLM (including extended-context configurations via YaRN / RoPE scaling).
  • Diagnose failure modes: reward hacking, distribution collapse, KL blow-up, tool-selection errors vs state-transition errors, format drift.
  • Define and track evaluation metrics  pass@k, pass^k, trajectory-level scoring, rubric-based vs binary scoring  and own model-quality reporting against benchmarks.
  • Partner with annotation, eval, and client teams to translate data-quality signals into training improvements.

Requirements

  • 3+ years of hands-on ML engineering experience, with at least 1+ year specifically on LLM post-training.
  • Demonstrated production or research experience with at least three of: SFT, LoRA/QLoRA, DPO, PPO, GRPO, RLHF.
  • Strong PyTorch fundamentals; working familiarity with Hugging Face TRL, Accelerate, DeepSpeed or FSDP, and vLLM.
  • Experience designing reward signals or verifiers for RL training  not just running training scripts.
  • Solid grasp of tokenization, attention, chat templates, tool-calling formats (OpenAI/Anthropic-style), and common failure modes in agent training.
  • Comfort with Python, distributed training, GPU profiling, and reading research papers and turning them into working code.

Strongly Preferred:


  • Experience training tool-use or agentic models (function calling, multi-step tool selection, planner-executor patterns).
  • Experience with synthetic data generation pipelines and rejection sampling.
  • Familiarity with MCP, LangChain/LangGraph, or similar agent frameworks.
  • Exposure to evals at scale: building harnesses, designing rubrics, dealing with judge variance and reward hacking.
  • Cloud/infra: RunPod, AWS, GCP; container workflows; long-context inference tuning.


Perks of Freelancing With Turing

  • Work in a fully remote environment.
  • Opportunity to work on cutting-edge AI projects with leading LLM companies.

Offer Details

  • Commitments Required: 40 hours per week with overlap of 4 hours with PST. 
  • Engagement Type: Contractor assignment (no medical/paid leave)
  • Duration of contract : 2 months; [expected start date is next week]
  • Location: India, Pakistan, Bangladesh, Brazil

Evaluation Process

  • 2 rounds of Technical Interview (90 mins)
-
1-10 employees
PythonMachine Learning
sample card

Apply for the best jobs

View more openings

Work full-time at top U.S. companies

Create your profile, pass Turing Tests and get job offers as early as 2 weeks.