Site Reliability Engineer

Industry: Technology
Remote
Company size: 51-250
Full-time




Job description

A US-based company that provides businesses with in-depth analysis and data-backed insights on their manual assembly lines is looking for a Site Reliability Engineer. The selected candidate will make critical decisions about the technology being built and help define the engineering values and culture of the company. The company's patented recognition technology uses AI to evaluate videos, giving customers visibility into their production lines and ideas on how to optimize them. The company secured more than $35 million in its Series B round of funding. This is a full-time, long-term position requiring some overlap with the IST/EST time zones.

Job Responsibilities:

  • Contribute to the entire development cycle of services: inception, design, deployment, operation, and refinement
  • Maintain services once they are live by monitoring availability, latency, and overall system health
  • Implement practices like sustainable incident response and blameless postmortems
  • Build sustainable systems and services utilizing automation and uplifts
  • Scale feature development speed and system reliability by optimizing on-call processes
  • Develop documentation of historical knowledge concerning software development, support, IT operations, and on-call duties
  • Help to monitor app performance and keep websites up and running

Job Requirements:

  • Must possess a Bachelor’s/Master’s degree in engineering, computer science, or related fields
  • Minimum of 5 years of experience working as a Site Reliability Engineer
  • Deep understanding of distributed systems, web-based services, databases, or related monitoring systems
  • Must have worked as an SRE on a rapidly growing production system, taking charge of system uptime, on-call duty, and incident handling
  • The ability to work with monitoring tools like Grafana, Prometheus, Graphite, and standard cloud monitoring tools is essential
  • Familiarity with on-call tools like PagerDuty or Opsgenie is required
  • Expertise in developing, designing, and scaling monitoring toolsets and software is necessary
  • Ability to contribute to early-stage systems by defining the metrics to measure and the alerts to configure will be helpful
  • Experience in configuring monitoring systems for Kubernetes, databases, VMs, and cloud platforms in one place is preferred
  • Ability to manage or work with the first-level Monitoring/Networking SRE operations team will be beneficial
  • Well-versed in systems software and networking principles with knowledge of security software development
  • Familiarity with Linux-based development environments and cloud stacks like GCP, Azure, or AWS
  • Experience in automating recurring processes and building tight full-stack integrations
  • Well-versed in programming languages like C++, Python, or Go (Python is preferred)
  • Strong grasp and proven experience in operational automation
  • Deep understanding of both SQL and NoSQL database systems
  • Should possess expertise in DevOps, monitoring, and automation tools
  • Must be well-versed in change control processes and change-controlled environments

Interested in this job?

Apply to Turing today.

Apply now

How to become a Turing developer?

Work with the best software companies in just 4 easy steps
  1. Create your profile

    Fill in your basic details: name, location, skills, salary, and experience.

  2. Take our tests and interviews

    Solve questions and appear for a technical interview.

  3. Receive job offers

    Get matched with the best US and Silicon Valley companies.

  4. Start working on your dream job

    Once you join Turing, you’ll never have to apply for another job.

Leadership

In a nutshell, Turing aims to make the world flat for opportunity. Turing is the brainchild of serial AI entrepreneurs Jonathan and Vijay, whose previous, successfully acquired AI firm was powered by exceptional remote talent. Turing's band of innovators also includes high-profile investors, such as Facebook's first CTO (Adam D'Angelo), executives from Google, Amazon, and Twitter, and Foundation Capital.

Equal Opportunity Policy

Turing is an equal opportunity employer. Turing prohibits discrimination and harassment of any type and affords equal employment opportunities to employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, age, disability status, protected veteran status, or any other characteristic protected by law.

Work full-time at top U.S. companies

Create your profile, pass Turing Tests, and get job offers in as little as 2 weeks.
