Turing.com review by full-stack developer from Colombia

"I feel like I’m Cristiano Ronaldo and Turing is my manager."

- Hernan, Fullstack Developer from Colombia

Hernan_2.webp

In his Turing.com review, Hernan, a highly-skilled full-stack developer from Bogota, Colombia, said he learnt the importance of spending quality time with friends and family. He also mentioned his love for football, electric guitars and spending time with his son.

Life before Turing jobs

With over 11 years of experience, Hernan has worked with several developer teams and product owners as a full-stack developer as well as a database administrator.

How did he learn about Turing US software jobs?

A loving father, Hernan was set on finding a job where he could spend more time with his son without any economic or professional compromise. Scanning the internet, he came across Turing. “I found [Turing] online. I gave the test, went through their vetting process, and was hired for one of their customers,” he says.

“The process was very short and convenient. The tests are designed to let only experienced and highly skilled developers in. I chose seven other tech stacks apart from my expertise. I would suggest everyone applying to Turing to try that too. It’s a lot of fun. Haha,” he exclaims.

How has his journey with Turing.com been so far?

Remote work offers an unmatched opportunity to optimize and manage time the way you see fit. For exceptional developers like Hernan, these benefits were holistic.

“I noticed that I spend more of my free time in ways I’d consider quality time: time spent with my family, with friends, working on things that are important to me, hobbies like playing electric guitar and football,” he says.

“Once I became a Turing developer, I saw a radical change in terms of growth, financial health, and personal life too. The payments are on time, and I always felt very secure and happy with my salary,” the Colombian developer remarks.

What's his take on Turing developers?

Turing is not just a work-space, it's an international family, always looking out for one another. Talking more about his experience, Hernan recalls, ”My Turing manager is very sweet. He always makes sure that I am happy with the work and lifestyle. I have the luxury of planning my own routine. I can take care of my baby son who is always just one door away. I get to be a part of his every moment.”

“I love what Turing is doing: hiring world-class developers and giving them the opportunity to work with amazing tech leaders on huge projects from their homes. I’ve got many meaningful interactions in my daily life, within and outside of work. That too while working on interesting projects from home, collaborating with people from all over the globe,” he mentions.

What's the final verdict?

Summing up his experience at Turing, Hernan observes: “There is complete transparency in their whole process. Without Turing, finding an opportunity to work for such US companies would have been really tough.”

“I feel like I am Cristiano Ronaldo and Turing is my manager. I don’t have to send CVs to companies, Turing finds the best opportunities for me. I have someone whom I can trust, who will never let me down, and will always try to give me the best career options,” he concludes.

If you’re a highly skilled developer, join Hernan today and enjoy a better quality of life while working with a world-class company. Go Boundaryless with Turing.

Interested in U.S. software jobs?

Apply to Turing today.

Apply now

Explore remote developer jobs

briefcase
AI Quality Analyst - Portuguese (Portugal)

About Turing:
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

Role Overview:

As an AI Quality Analyst, you will evaluate a new personalization feature for Gemini. You will assess how well the model uses information from your past Gemini conversations, Gmail, Google Search, and YouTube activity to make responses more relevant and helpful. This role requires a unique blend of creativity and analytical rigor. You will actively design prompts from the perspective of your own personal experiences. You will then use your analytical skills to assess the quality of the model's personalized responses, evaluating dimensions like Grounding, Integration, and Helpfulness.


Key Qualifications

  • Portugueese Proficiency: Ability to read and write in Portuguese with a high degree of comp, as Portuguese is the focus language for this project.
  • Personal Account Usage: Willingness to use your primary personal Google account (not a testing account) and enable personal data sources for a genuine assessment.
  • Schedule Flexibility: Full-time availability in your local time zone is required.  We are staffing a global, 24-hour operations team.
  • Exceptional Analytical Thinking: Demonstrate ability to evaluate nuanced and ambiguous AI responses, specifically assessing personalization quality.
  • Creative Prompt Engineering: Experience in designing creative, multi-turn starting prompts based on personal context to thoroughly test the model's capabilities.
  • Strong Evaluation Acumen: Understanding of personalization concepts, including the ability to identify incorrect personalization, poor inferences, and forced connections.
  • Meticulous Attention to Detail: The ability to review Side-by-Side (SxS) model responses and spot subtle differences in naturalness and overnarrating.
  • Excellent Written Communication: Superior ability to write clear, concise, and structured rationales for model rankings, explicitly referencing specific turn numbers.
  • Feedback: Ability to provide constructive feedback and detailed annotations.
  • Communication: Excellent communication and collaboration skills.
  • Independence: Self-motivated and able to work independently in a remote setting.
  • Technical Setup: Desktop/Laptop set up with a good internet connection.


Description:

  • In this role, you will be part of a dynamic team focused on evaluating the quality of personalized AI interactions. Your day-to-day work will involve:
  • Designing and executing multi-turn conversational prompts (typically 1-5 turns) that require the AI to utilize your personal information and experiences.
  • Evaluating model responses based on your intent from the starting prompt, checking if the personalization was appropriately applied.
  • Analyzing responses for Grounding issues, ensuring claims about you are supported by evidence and not flawed inferences or hallucinations.
  • Assessing Integration quality to ensure personal data is woven naturally into the response without robotic "overnarrating".
  • Rigorously evaluating and stack-ranking two model responses side-by-side (SxS) to determine which is overall more helpful, easy to use, and enjoyable.
  • Writing clear, defensible rationales for your comparisons, explicitly referencing where issues or positive aspects occurred in the conversation.
  • Extracting and verifying "Debug Info" from the model to confirm that chat summaries and data sources were properly utilized.
  • Maintaining strict data hygiene by deleting evaluation conversations to prevent them from polluting your future chat history.


Education & Experience

  • BS/BA degree or equivalent experience in a relevant field (e.g., Policy, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field).
  • Experience in data annotation, AI quality evaluation, content moderation, or a related role is strongly preferred.

Offer Details:

  • Commitments Required: at least 4 hours per day and upto 40 hours per week with 4 hours of overlap with PST.
  • Engagement type: Contractor
  • Engagement Length: 3 months
  • Our offered rate for this project is $15 per hour.

Evaluation Process -

  • Shortlisted candidates will be sent a Job Interest Form.
  • After the profile review, an assessment will be shared, which must be completed within 24 hours.
  • Based on the assessment outcomes, shortlisted candidates will be contacted to discuss the pre‑onboarding requirements.
Software
10K+ employees
Domain-Specific Languages
briefcase
AI Engineer

About Turing


Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L


Role Overview


We are looking for an AI/ML Engineer specializing in LLM post-training and reinforcement learning workflows. The role focuses on fine-tuning open-weight models, building reward systems, and improving model performance through scalable training, evaluation, and data curation


What does day-to-day life look like?

  • Design and execute fine-tuning pipelines for open-weight models (Qwen, Llama, Mistral families) using SFT → DPO → GRPO progressions on tool-use and agentic data.
  • Implement and tune LoRA / QLoRA adapters for parameter-efficient fine-tuning; understand when full fine-tuning vs PEFT is the right call.
  • Build reward functions and verifiers for RL training  including programmatic verifiers, LLM-as-judge rubrics, and state-transition checks against gym environments.
  • Generate, curate, and filter RL tool-use training data: golden trajectories, preference pairs, on-policy rollouts, and rejection-sampled completions.
  • Run distributed training on multi-GPU setups; manage inference at scale with vLLM (including extended-context configurations via YaRN / RoPE scaling).
  • Diagnose failure modes: reward hacking, distribution collapse, KL blow-up, tool-selection errors vs state-transition errors, format drift.
  • Define and track evaluation metrics  pass@k, pass^k, trajectory-level scoring, rubric-based vs binary scoring  and own model-quality reporting against benchmarks.
  • Partner with annotation, eval, and client teams to translate data-quality signals into training improvements.

Requirements

  • 3+ years of hands-on ML engineering experience, with at least 1+ year specifically on LLM post-training.
  • Demonstrated production or research experience with at least three of: SFT, LoRA/QLoRA, DPO, PPO, GRPO, RLHF.
  • Strong PyTorch fundamentals; working familiarity with Hugging Face TRL, Accelerate, DeepSpeed or FSDP, and vLLM.
  • Experience designing reward signals or verifiers for RL training  not just running training scripts.
  • Solid grasp of tokenization, attention, chat templates, tool-calling formats (OpenAI/Anthropic-style), and common failure modes in agent training.
  • Comfort with Python, distributed training, GPU profiling, and reading research papers and turning them into working code.

Strongly Preferred:


  • Experience training tool-use or agentic models (function calling, multi-step tool selection, planner-executor patterns).
  • Experience with synthetic data generation pipelines and rejection sampling.
  • Familiarity with MCP, LangChain/LangGraph, or similar agent frameworks.
  • Exposure to evals at scale: building harnesses, designing rubrics, dealing with judge variance and reward hacking.
  • Cloud/infra: RunPod, AWS, GCP; container workflows; long-context inference tuning.


Perks of Freelancing With Turing

  • Work in a fully remote environment.
  • Opportunity to work on cutting-edge AI projects with leading LLM companies.

Offer Details

  • Commitments Required: 40 hours per week with overlap of 4 hours with PST. 
  • Engagement Type: Contractor assignment (no medical/paid leave)
  • Duration of contract : 2 months; [expected start date is next week]
  • Location: India, Pakistan, Bangladesh, Brazil

Evaluation Process

  • 2 rounds of Technical Interview (90 mins)
-
1-10 employees
PythonMachine Learning
sample card

Apply for the best jobs

View more openings

Work full-time at top U.S. companies

Create your profile, pass Turing Tests and get job offers as early as 2 weeks.