Senior Site Reliability Engineer
Overview
Placement Type: Temporary
Salary: $54 to $60 an hour
Start Date: 04.07.2025
As a Senior Site Reliability Engineer, you’ll be an integral part of a mission-critical team, driving automation, enhancing resilience, and ensuring seamless operation of large-scale systems. Your expertise will directly impact the stability and performance of critical applications, contributing to the company’s continued success and pushing the boundaries of what’s possible. If you’re a passionate problem-solver with a drive for automation and a knack for optimizing complex environments, this is your opportunity to make a real difference.
Responsibilities:
- Develop and implement automation solutions, integrate diverse systems, and ensure the reliability and scalability of critical infrastructure.
- Play a key role in enhancing operational excellence, optimizing performance, and driving innovation within a dynamic and fast-paced environment.
- Develop Python-based automation solutions for on-premises and cloud infrastructure.
- Continuously identify and implement opportunities to enhance operational excellence.
- Build proactive and scalable solutions.
- Integrate tools and services via APIs and client libraries.
- Enhance deployment reliability through automated chaos strategies and failover mechanisms.
- Develop proactive monitoring and alerting solutions.
- Perform root cause analysis and incident management for complex system failures.
- Work on system resilience and performance tuning.
- Apply AI/ML techniques to automation workflows.
- Identify and develop AIOps opportunities.
- Experiment with machine learning models for optimized log analysis and failure predictions.
Qualifications:
- Strong background in Systems Engineering with a focus on automation and reliability.
- Proficiency in Python.
- Hands-on expertise with Kubernetes and cloud platforms (GCP or any major cloud).
- Experience integrating tools and platforms via APIs and client libraries.
- Deep understanding of monitoring and alerting using industry-standard tools.
- Ability to work in fast-paced, high-stakes environments.
- Strong problem-solving skills.
Nice-to-Have Qualifications:
- Experience with Ansible for infrastructure automation.
- Prior experience working in mission-critical teams handling large-scale, high-availability systems.
- Enthusiasm for AI/ML and AIOps.
The target hiring compensation range for this role is $54 to $60 an hour. Compensation is based on several factors, including, but not limited to, education, relevant work experience, relevant certifications, and location.
About Aquent Talent:
Aquent Talent connects the best talent in marketing, creative, and design with the world’s biggest brands.
Our eligible talent get access to amazing benefits like subsidized health, vision, and dental plans, paid sick leave, and retirement plans with a match. We also offer free online training through Aquent Gymnasium.
Aquent is an equal-opportunity employer. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, and other legally protected characteristics. We’re about creating an inclusive environment — one where different backgrounds, experiences, and perspectives are valued, and everyone can contribute, grow their careers, and thrive.