Site Reliability Engineer / DevOp's Engineer in Atlanta, GA at RaceTrac

Date Posted: 3/25/2021

Job Snapshot

  • Employee Type:
    Full-Time
  • Location:
    200 Galleria Parkway Southeast
    Atlanta, GA
  • Job Type:
  • Experience:
    Not Specified
  • Date Posted:
    3/25/2021

Job Description

The Site Reliability Engineer (SRE) is a results-driven engineer with a DevOp's Mindset. He/She will have asolid technology acumen supporting the RaceTrac teams cloud compute activities.  The SRE combines software, systems and infrastructure engineering to build and run large-scale, massively distributed, fault-tolerant systems. The SRE ensures that RT services, both our internally critical and our externally visible systems have reliability, uptime and performance commensurate with application requirements. Additionally, SRE’s will keep a watchful eye on our systems capacity and performance.  At RaceTrac we are striving to build resilient and highly scalable infrastructure and eliminate manual work through automation.

Job Requirements

Responsibilities
  • Engages in and improve the whole lifecycle of services—from inception and design, deployment, operation, and refinement.
  • Develops software and provide hands-on technical knowledge to design, deploy, and optimize large-scale, massively distributed, fault-tolerant systems. 
  • Supports services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, automation, pipelining and launch reviews.
  • Maintains services once they are live by measuring/monitoring availability, latency, and overall system health.
  • Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity. Reduces manual intervention and turn-around time to solve for repetitive problems while automating and monitoring the health of our sites and services.
  • Practices sustainable incident response and blameless postmortems.
  • Improves, tunes and performs operational efficiency within the Windows based infrastructure and production environment. 
  • Actively participates in deploying and supporting applications on our private and public cloud environment.
  • Collaborates with development teams to support the current environment as we transform into a cloud architecture and provides resources “as a service” to developers.


Qualifications
  • Bachelor’s degree from an accredited college or university in Computer Science or related field preferred. Equivalent practical experience will be considered.
  • Experience programming in at least one of the following languages: C, C++, Java, Python, or Go.
  • Minimum 4 years of working experience in Azure. Experience with Jenkins or similar application.
  • General knowledge of Infrastructure as Code tools and Config management tools such as (Terraform/Ansible/Chef/Puppet/SCCM). 
  • Comfort with large-scale production systems and technologies (load balancing, monitoring, distributed system and configuration management. Expertise in designing, analyzing, and troubleshooting.
  • Ability to debug, optimize code, and automate routine tasks.
  • Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management and launch reviews. 
  • Demonstrated history of living the values that are important to RaceTrac:  Honesty, Efficiency, Attitude, Respect, Teamwork.

Not ready to apply?

Joining our Talent Network will allow us to contact you when open jobs in your area are available and to keep you updated on all things happening at RaceTrac. Whether you choose to apply or just leave your information, we look forward to staying connected with you.