Remote site reliability engineer jobs

We, at Turing, are looking for site reliability engineers who will be responsible for automating solutions including capacity and performance planning, managing risks, disaster response, and on-call monitoring. Here’s your chance to work with elite U.S. companies and collaborate with top professionals across the globe.

Find remote software jobs with hundreds of Turing clients

Job description

Job responsibilities

  • Build software applications to help operations and support teams
  • Gather and analyze metrics to help in performance tuning and troubleshooting errors
  • Contribute to system design consulting, platform management, and capacity planning
  • Develop sustainable systems and services with automation and uplifts
  • Improve feature development speed and system reliability through optimization of on-call processes
  • Prepare documentation of historical knowledge concerning software development, support, IT operations, and on-call duties
  • Monitor application performance and keep the sites up and running

Minimum requirements

  • Bachelor’s/Master’s degree in Engineering, Computer Science, or IT (or equivalent experience)
  • At least 3+ years of experience as a site reliability engineer (rare exceptions for highly skilled engineers)
  • Proficient understanding of operating systems (Linux/Windows)
  • Expert knowledge of DevOps concepts and best practices
  • Expertise in CI/CD implementation
  • Hands-on experience in troubleshooting issues
  • Knowledge of one or more high-level programming languages like Python, Java, JavaScript, C/C++, Ruby, etc.
  • Experience with distributed storage technologies and dynamic resource management frameworks
  • Fluent in English to communicate effectively
  • Ability to work full-time (40 hours/week) with a 4 hour overlap with US time zones

Preferred skills

  • Working knowledge of code versioning tools such as Git
  • Proactivity in finding issues, performance bottlenecks, and areas for improvement
  • Passion for automation, coding skills, and software-centric mindset
  • Understanding of distributed computing, cloud-native applications, application monitoring, and database management
  • Excellent organizational and interpersonal skills

Interested in this job?

Apply to Turing today.

Apply now

Why join Turing?

Elite US Jobs

1Elite US Jobs

Turing’s developers earn better than market pay in most countries, working with top US companies.
Career Growth

2Career Growth

Grow rapidly by working on challenging technical and business problems on the latest technologies.
Developer success support

3Developer success support

While matched, enjoy 24/7 developer success support.

Developers Turing

Read Turing.com reviews from developers across the world and learn what it’s like working with top U.S. companies.
4.65OUT OF 5
based on developer reviews as of June 2024
View all reviews

How to become a Turing developer?

Work with the best software companies in just 4 easy steps
  1. Create your profile

    Fill in your basic details - Name, location, skills, salary, & experience.

  2. Take our tests and interviews

    Solve questions and appear for technical interview.

  3. Receive job offers

    Get matched with the best US and Silicon Valley companies.

  4. Start working on your dream job

    Once you join Turing, you’ll never have to apply for another job.

cover

How to become a Site Reliability engineer ?

As software development became faster and more complex, traditional software teams had trouble keeping up. To help with the transition of workflows from development to production applications, they introduced DevOps.

However, it became increasingly apparent that this system needed greater reliability and performance in order to stay competitive. This is where the field of site reliability engineering comes into play.

Site reliability engineering blends software engineering practices with information technology (IT) engineering practices to create highly reliable systems. Site reliability engineers are responsible for ensuring the reliability of all aspects of the full stack, from the front-end, customer-facing applications all the way through to the database and hardware infrastructure.

What is the scope in Site Reliability engineering?

The role of SRE (Systems and Release Engineer) is ideal for assessing the newest development in the DevOps world, expanding your knowledge and skills in high-demand areas such as infrastructure automation, release engineering, and continuous delivery. As an SRE, you’ll be highly creative, stimulated, and technically challenged every day.

Site reliability engineers are crucial to most organizations. These professionals are in high demand at successful tech companies that have large data centers and complex technical challenges. They can also be inspirational from both a financial and workplace culture perspective. Google considers them scarce resources.

What are the roles and responsibilities of a Site Reliability engineer?

Site reliability engineering (SRE) refers to software engineering approaches used by organizations to manage their IT operations. SRE teams use software tools as a way to automate operations and solve problems in a timely manner.
Software reliability engineers (SREs) are software engineers who have Unix systems administration, networking, and software engineering experience. SREs also have polished programming skills because they regularly use automation to reduce human labor and increase reliability.
Software Release Engineering (SRE) transfers the tedious work traditionally done by DevOps and operations teams to software engineers who can use automation and software to optimize processes.
Site reliability engineers spend half their time doing development work, and the other half doing operations duties, such as responding to outages and incidents and being on call.

The roles and responsibilities of a site reliability engineer include

  • Building software to help Operations and Support Teams
  • Conducting Post-Incident Reviews
  • Documenting the knowledge to ensure a seamless flow of information between teams
  • Implementing strategies to increase system reliability and performance through on-call rotation
  • Fix cases related to support escalation
  • Incorporate various software engineering aspects to develop and implement services that improve IT and support teams
  • Optimize the Software Development Life Cycle (SDLC) to boost service reliability

How to become a Site Reliability engineer?

You can become a site reliability engineer in the following ways:

  1. Bachelor's degree: It is mandatory for the developer to have a Bachelor’s degree or Master’s degree. This helps with growth in the software field and also aids in easy understanding of technical aspects of the job.
  2. 2+ years experience in operations or software engineering role: It helps if you have some previous experience working as a software engineer. This will give you an advantage over other candidates while trying for SRE positions.
  3. Required skills: You must have the following technical skills.
  • Experience with cloud-continuous deployment based software development lifecycles
  • Expertise in infrastructure automation technologies

Along with technical skills you must have a strong foundation of non-technical skills as well. What you need:

  • Excellent verbal and written communication skills
  • Strong problem-solving skills
  • Passion and curiosity for technology
  • Keenness to provide support for teams or customers.

Now let us discuss the skills and methods you will need to learn to become a successful site reliability engineer:

Interested in remote Site Reliability jobs?

Become a Turing developer!

Apply now

Skills required to become a Site Reliability engineer

Fundamental skills are important in helping you land high-paying site reliability engineer jobs. Here is what you need to know!

1. DevOps

DevOps refers to a set of practices that promote better collaboration and widespread automation of the processes happening between operational and development teams. It can be extended to other business units as well.

DevOps is a new cultural movement combining software development, operations, and engineering. It stimulates the adoption of agile practices that are continuous in nature and enable continuous delivery of small batches to customers.

2. Python

Python is easy to learn. It is a high-level, dynamic language with an interpreted structure to make debugging errors relatively painless. Which helps programmers rapidly develop working application prototypes. This feature has earned Python a reputation as a language well-suited for coding. Because Python supports cross-platform operating systems, it is a good choice for programmers. Especially those who do not want to spend time writing separate programs for different operating systems.

3. Go

Go was created for applications relating to network infrastructure and was intended to replace Java and C++. It is used in cloud-based or server-side (web) applications. With DevOps, site reliability automation, micro-controller programming, robotics, and games also common users of Go. Go is also used in the world of artificial intelligence and data science.

4. CI/CD

Continuous integration/continuous delivery (CI/CD) is a software development process in which code is automatically built and tested as new code is added. CI/CD can improve the effectiveness of a software team by reducing the risk of errors or defects and enabling automated deployments, freeing up time spent manually building, testing, or releasing software.

CI/CD introduces automated processes to integrate code and test in a continuous manner with delivery and deployment, replacing error-prone manual processes. CI/CD is supported by teams working together in an agile way, either with DevOps or SRE practices.

5. Version control

Version control or revision control systems help software developers keep track of changes to application code and manage the development of a single program by more than one person. Version control systems such as Git have the ability to create branches, where a developer can make a copy of an existing project and modify one or more files.

6. NoSQL databases

NoSQL databases are a class of database management systems (DBMSs) that do not rely on the traditional relational database management system (RDBMS) structure. NoSQL databases are purpose-built for specific data models, have flexible schemas for building modern applications, and are widely recognized for their ease of development and performance at scale. These databases use various data models for accessing and managing data, which makes them optimized specifically for applications that require large data volume, low latency, and flexible data models.

Interested in remote Site Reliability jobs?

Become a Turing developer!

Apply now

How to get remote Site Reliability engineer jobs?

Developers are a lot like athletes. In order to excel at their craft, they have to practice effectively and consistently. They also need to work hard enough so that their skills grow gradually over time. In that regard, there are two major factors that developers must focus on in order for that progress to happen: the support of someone who is more experienced and effective in practice techniques while you're practicing. As a developer, it's vital for you to know how much to practice - so make sure there is someone on hand who will help you out and keep an eye out for any signs of burnout!

Turing offers the best remote site reliability engineer jobs that suit your career trajectories as a site reliability engineer. Grow rapidly by working on challenging technical and business problems on the latest technologies. Join a network of the world's best developers & get full-time, long-term remote site reliability engineer jobs with better compensation and career growth.

Why become a Site Reliability engineer at Turing?

Elite US jobs
Career growth
Exclusive developer community
Once you join Turing, you’ll never have to apply for another job.
Work from the comfort of your home
Great compensation

How much does Turing pay their Site Reliability engineers?

At Turing, every site reliability engineer is allowed to set their rate. However, Turing will recommend a salary at which we know we can find a fruitful and long-term opportunity for you. Our recommendations are based on our assessment of market conditions and the demand that we see from our customers.

Frequently Asked Questions

Turing is an AGI infrastructure company specializing in post-training large language models (LLMs) to enhance advanced reasoning, problem-solving, and cognitive tasks. Founded in 2018, Turing leverages the expertise of its globally distributed technical, business, and research experts to help Fortune 500 companies deploy customized AI solutions that transform operations and accelerate growth. As a leader in the AGI ecosystem, Turing partners with top AI labs and enterprises to deliver cutting-edge innovations in generative AI, making it a critical player in shaping the future of artificial intelligence.

After uploading your resume, you will have to go through the three tests -- seniority assessment, tech stack test, and live coding challenge. Once you clear these tests, you are eligible to apply to a wide range of jobs available based on your skills.

No, you don't need to pay any taxes in the U.S. However, you might need to pay taxes according to your country’s tax laws. Also, your bank might charge you a small amount as a transaction fee.

We, at Turing, hire remote developers for over 100 skills like React/Node, Python, Angular, Swift, React Native, Android, Java, Rails, Golang, PHP, Vue, among several others. We also hire engineers based on tech roles and seniority.

Communication is crucial for success while working with American clients. We prefer candidates with a B1 level of English i.e. those who have the necessary fluency to communicate without effort with our clients and native speakers.

Currently, we have openings only for the developers because of the volume of job demands from our clients. But in the future, we might expand to other roles too. Do check out our careers page periodically to see if we could offer a position that suits your skills and experience.

Our unique differentiation lies in the combination of our core business model and values. To advance AGI, Turing offers temporary contract opportunities. Most AI Consultant contracts last up to 3 months, with the possibility of monthly extensions—subject to your interest, availability, and client demand—up to a maximum of 10 continuous months. For our Turing Intelligence business, we provide full-time, long-term project engagements.

No, the service is absolutely free for software developers who sign up.

Ideally, a remote developer needs to have at least 3 years of relevant experience to get hired by Turing, but at the same time, we don't say no to exceptional developers. Take our test to find out if we could offer something exciting for you.

View more FAQs

Latest posts from Turing

Software-developer-jobs-in-Silicon-Valley-tech-companies

Looking for Software Developer Jobs? Learn How to Write a Clean Code First

Are you a software developer looking for remote jobs in Silicon Valley tech companies? If yes, these clean code t...

Read more

Turing.com Review by Nigeria’s Joy: Flexibility in Work Allows Me to Enjoy Life More

Flexibility in work at Turing allows me to enjoy life more, says Nigeria’s Joy in her Turing.com review...

Read more
Software-Development-Life-Cycle-scaled

The Nine Steps of Software Product Development Life Cycle

A product development process depends on the nature of the business. But these steps can turn your ordinary softw...

Read more

Ten Tips to Crack a Software Developer Job Interview

Cracking a software developer job interview is no cakewalk. Here are a few tips to help level up your...

Read more

Turing Blog: Articles, Insights, Company News and Updates

Explore insights on AI and AGI at Turing's blog. Get expert insights on leveraging AI-powered solutions to drive ...

Read more

Leadership

In a nutshell, Turing aims to make the world flat for opportunity. Turing is the brainchild of serial A.I. entrepreneurs Jonathan and Vijay, whose previous successfully-acquired AI firm was powered by exceptional remote talent. Also part of Turing’s band of innovators are high-profile investors, such as Facebook's first CTO (Adam D'Angelo), executives from Google, Amazon, Twitter, and Foundation Capital.

Equal Opportunity Policy

Turing is an equal opportunity employer. Turing prohibits discrimination and harassment of any type and affords equal employment opportunities to employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, age, disability status, protected veteran status, or any other characteristic protected by law.

Explore remote developer jobs

briefcase
Senior Full-Stack LLM Engineer (Onshore)

Senior Full-Stack LLM Engineer (Onshore)
Dallas Texas - Work from Office

We are seeking an experienced Full Stack Engineer who possesses extensive hands-on expertise across the contemporary technology stack and a collaborative mindset. The ideal candidate will effectively collaborate with cross-functional teams, introduce and advocate for the most recent tools and frameworks, and drive engineering excellence throughout the organization.

As part of this role, the candidate will be responsible for designing and developing high-quality, production-grade code while adhering to industry best practices, coding standards, and architectural principles. Beyond individual contributions, the candidate will play a pivotal role in mentoring and training Jr developers—guiding them through code reviews, knowledge-sharing workshops to accelerate their growth, strengthen their technical foundation, and foster a culture of continuous learning within the team.

Must-Have Technologies: ReactJS, Java 17+, Microservices, AWS/Azure, RDBMS, NoSQL

Core Technical Skills

· 10+ years of hands-on software development experience, with deep expertise in modern Java Full Stack development.

· Strong command of Java frameworks and libraries, including Spring, Spring Boot, and Hibernate.

· Proven experience building single-page web applications using ReactJS, Bootstrap, and Nodejs.

· Hands-on experience designing, developing, and deploying cloud-based architectures on AWS, Azure, or equivalent platforms.

· Expertise developing Java and Spring Boot applications backed by both relational (SQL Server / PostgreSQL) and NoSQL (MongoDB / Elasticsearch) databases.

· Practical experience integrating Large Language Models (LLMs) into applications via APIs (e.g., OpenAI, Anthropic Claude, Azure OpenAI, AWS Bedrock).

· Familiarity with prompt engineering, Retrieval-Augmented Generation (RAG), embeddings, and vector databases.

· Understanding of responsible AI practices, including data privacy, hallucination mitigation, evaluation, and guardrails.

· Solid hands-on experience developing production applications, data processing scripts, automation tooling, or backend services in Python.

· Working knowledge of modern security frameworks and standards such as OAuth 2.0, OpenID Connect, and JWT.

· Proficient in designing and building RESTful APIs and web services following industry best practices.

· Strong SQL development skills, including query optimization and performance tuning.

· Experience with modern testing frameworks (Jest, Mocha, Chai) and a commitment to test-driven development.

· Solid understanding of front-end architecture principles and data-driven UI development.

· Demonstrated ability to write clean, efficient, and maintainable code that adheres to coding standards and best practices.

· Excellent problem-solving and troubleshooting skills for diagnosing complex technical issues.

Leadership & Soft Skills

· Proven ability to lead, mentor, and grow engineering teams, fostering a culture of technical excellence and continuous learning.

· Experience driving architectural decisions and setting technical direction across cross-functional teams.

· Strong stakeholder management skills, with the ability to translate business requirements into scalable technical solutions.

· Track record of leading end-to-end delivery of complex projects, from design through deployment.

· Effective communicator who can articulate technical concepts clearly to both technical and non-technical audiences.

· Skilled at conducting code reviews, establishing engineering best practices, and raising the bar for code quality across the team.

· Ability to collaborate with product managers, designers, and business stakeholders to align engineering efforts with organizational goals.

· Experience coaching junior engineers, conducting performance reviews, and contributing to hiring and talent development.

· Strategic thinker who can balance short-term delivery pressures with long-term technical investments and technical debt management.

· Ownership mindset — takes accountability for outcomes and proactively drives initiatives forward.

Good to Have (Palantir)

· Hands-on experience with the Palantir foundry, including building pipelines, ontologies, and data-driven applications.

· Familiarity with Palantir's development tooling such as PySpark transforms, TypeScript-based Functions, Workshop.

· Exposure to enterprise-scale data integration and operational analytics workflows built on Palantir.

· Exposure to AI/ML frameworks and tooling such as LangChain, LlamaIndex, Hugging Face, or equivalent.

· Ability to identify and apply AI-driven solutions to real-world business problems and integrate them into enterprise applications.

Real Estate
10K+ employees
ReactJavaMicroservices+ 4
briefcase
Engineering Manager (Java, React – Playwright)

Job Title: Engineering Manager (Java, React, AI Tools)

Location: Dallas, Texas
Experience: 10+ years (with 3+ years in leadership roles)

About the Role

We are looking for an experienced Engineering Manager to lead a high-performing, full-stack engineering team working on scalable, enterprise-grade applications. This role requires a strong technical background across backend, frontend, data, and QA automation, along with proven leadership capabilities.

You will be responsible for driving technical excellence, mentoring engineers, and ensuring high-quality product delivery in a fast-paced environment.

Key Responsibilities

  • Lead and manage a cross-functional engineering team (backend, frontend, QA)
  • Drive design, development, and delivery of scalable applications using Java and React
  • Oversee end-to-end software development lifecycle, including architecture, coding, testing, and deployment
  • Champion automation-first QA practices using tools like Playwright (TypeScript)
  • Collaborate with product managers, designers, and stakeholders to define technical roadmaps
  • Ensure best practices in code quality, performance, security, and scalability
  • Mentor engineers, conduct code reviews, and support career growth
  • Improve team productivity through agile processes and engineering excellence
  • Work with data using SQL and Python for analytics, integrations, or backend processing
  • Leverage modern AI-assisted development tools (e.g., Cursor, Windsurf, Claude Code, OpenAI Codex, or similar) to enhance engineering productivity and efficiency

Required Skills & Qualifications

  • 10+ years of software engineering experience
  • 3+ years of experience in an Engineering Manager / Tech Lead role
  • Mandatory experience in the Retail domain
  • Demonstrated ability to operate at a strategic level, including defining team structures, ownership models, and long-term technical direction
  • Proven experience driving engineering outcomes beyond execution, including stakeholder alignment and measurable business impact
  • Strong hands-on experience with:Java (Spring Boot or similar frameworks)
    React / JavaScript (modern frontend frameworks)
    SQL and/or Python
  • Experience leading full-stack development teams
  • Solid understanding of system design, microservices, and APIs
  • Strong ability to think at system, product, and organizational level, including making architecture and business trade-offs
  • Experience with QA automation frameworks, especially:Playwright with TypeScript
  • Familiarity with CI/CD pipelines and modern DevOps practices
  • Hands-on experience leveraging AI tools (e.g., Cursor, Windsurf, Claude Code, OpenAI Codex, or similar) to improve developer productivity, code quality, or delivery workflows
  • Strong people management skills, including experience with performance management, team scaling, and handling team challenges
  • Ability to operate effectively in ambiguous environments and manage delivery risks
  • Excellent communication and stakeholder management abilities
Manufacturing
10K+ employees
Core JavaSpring BootReact+ 5
sample card

Apply for the best jobs

View more openings
Turing books $87M at a $1.1B valuation to help source, hire and manage engineers remotely
Turing named one of America's Best Startup Employers for 2022 by Forbes
Ranked no. 1 in The Information’s "50 Most Promising Startups of 2021" in the B2B category
Turing named to Fast Company's World's Most Innovative Companies 2021 for placing remote devs at top firms via AI-powered vetting
Turing helps entrepreneurs tap into the global talent pool to hire elite, pre-vetted remote engineers at the push of a button

Work with the world's top companies

Create your profile, pass Turing Tests and get job offers as early as 2 weeks.