Sr. Site Reliability Engineer
5 days ago
We Are:
At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content. Join us to transform the future through continuous technological innovation.
You Are:
You are a forward-thinking technologist with a passion for reliability, automation, and scalable infrastructure. With a deep foundation in cloud services and modern DevOps practices, you thrive in environments where collaboration and continuous improvement are valued. You understand that true reliability comes from proactive monitoring, robust automation, and a data-driven approach to problem-solving. Your experience with AWS, Terraform, and container orchestration enables you to architect and maintain resilient systems. You naturally seek out opportunities to streamline processes, reduce manual interventions, and elevate the standards of operational excellence.
You bring a consultant's mindset, listening deeply to customers and internal teams, and leading by example in implementing Site Reliability Engineering (SRE) best practices. You are skilled at incident response, able to rapidly diagnose and resolve issues under pressure, and always eager to refine your processes based on lessons learned. Your programming and scripting abilities allow you to automate workflows and build reusable solutions that benefit the entire organization. You are a strong communicator, able to translate complex technical concepts into actionable insights, and you foster a culture of transparency, accountability, and continuous learning.
Whether working with development teams to create automated pipelines or developing solutions for cloud and on-premise deployments, you are motivated by the impact your work has on customer experience and business outcomes. You are excited to join a high-performing team at Synopsys, where your expertise will help shape the future of cloud-native solutions deployed across Ansys Cloud, Customer Cloud, and on-prem environments.
What You'll Be Doing:
- Collaborating with engineers and stakeholders to resolve complex infrastructure, build, and packaging challenges across cloud and on-prem environments.
- Partnering with development teams to design and implement automated pipelines for continuous delivery and deployment.
- Leading and advocating for SRE best practices, fostering a culture of reliability and operational excellence.
- Improving the predictability and reliability of software releases, workflows, and operational systems.
- Reducing complexity and streamlining delivery by developing and promoting reusable code, tooling, and solution patterns.
- Serving as an expert in incident response, quickly identifying and resolving system incidents to maintain high availability.
- Measuring, monitoring, and analyzing system metrics and alarms to ensure performance and reliability, using a data-driven approach for decision-making.
The Impact You Will Have:
- Enhance system reliability and uptime across multiple software products and supporting infrastructure.
- Accelerate software delivery cycles through advanced automation and streamlined CI/CD pipelines.
- Elevate the organization's SRE maturity, influencing both technical strategy and team culture.
- Reduce operational complexity, enabling scalable deployments to Ansys Cloud, Customer Cloud, and on-premises.
- Ensure rapid and effective incident resolution, minimizing downtime and customer impact.
- Empower teams with reusable tools and patterns, driving efficiency and consistency in operations.
- Champion a data-driven approach to reliability, informing decisions with robust metrics and monitoring.
What You'll Need:
- Bachelor's in Engineering, Computer Science, or a related field with 5+ years' experience, or Master's with 3+ years, or PhD with 1+ year.
- Minimum 3 years of hands-on experience with git and AWS cloud platforms.
- Proficiency in Terraform, including module creation and best practices for infrastructure as code.
- Expertise in at least one scripting language (e.g., Bash, Python, Perl, Ruby).
- Strong experience with EKS, including cluster configuration and troubleshooting.
- Thorough knowledge of Linux operating systems and related tooling.
- Deep understanding of software development tools, compilers, and packaging systems.
Who You Are:
- Collaborative and communicative, able to interface effectively with technical and non-technical stakeholders.
- Analytical and data-driven, using metrics to guide reliability and performance improvements.
- Proactive and solution-oriented, anticipating challenges and driving continuous improvement.
- Adaptable, thriving in fast-paced, dynamic environments with evolving technologies.
- Passionate about automation, efficiency, and robust system design.
- Consultative, with the ability to listen, advise, and lead teams towards best practices.
The Team You'll Be A Part Of:
You'll join a dedicated SRE and DevOps engineering team focused on delivering reliable, scalable, and secure infrastructure solutions for Synopsys deployments on Ansys Cloud, Customer Cloud, and on-prem environments. The team is committed to innovation, operational excellence, and cross-functional collaboration, working closely with development, product, and customer success teams to deliver impactful solutions that drive customer satisfaction and business growth.
Rewards and Benefits:
We offer a comprehensive range of health, wellness, and financial benefits to cater to your needs. Our total rewards include both monetary and non-monetary offerings. Your recruiter will provide more details about the salary range and benefits during the hiring process.
-
Site Reliability Engineer
2 weeks ago
Ottawa, Ontario, Canada TECSYS Inc. Full time $120,000 - $140,000 per yearHaving recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...
-
Sr Advanced Hardware Engineer
3 days ago
Ottawa, Ontario, Canada AI Jobs Full time $120,000 - $180,000 per yearRole: Sr Advanced Hardware EngineerLocation: Kanata, Ontario, CanadaAbout the RoleThis position supports the development of advanced space systems by guiding the selection, evaluation, and integration of high-reliability electronic, electrical, and electromechanical components. The role contributes directly to mission-critical space instruments used for...
-
Sr. Systems Engineer
2 weeks ago
Ottawa, Ontario, Canada 293fea1f-cc61-42fd-a359-00b7443872e0 Full time $110,000 - $140,000 per yearMarshall Canada is seeking a Sr. Systems Engineer to join our team in Ottawa, ON. Reporting to the Sr. Manager, Systems Engineering, this role is responsible to support and lead systems engineering activities critical to the performance, reliability, and scalability of Marshall products and solutions. This role will also act as a technical leader and mentor,...
-
Sr. Structural Engineer
2 weeks ago
Ottawa, Ontario, Canada Insight Global Full time $90,000 - $120,000 per yearRequired Skills & ExperienceBachelor's or Master's degree in Structural/Civil EngineeringMore than 5 years' experience in structural design, preferably in renewable energy projectsProfessional Engineer (P.Eng.)Proficiency in structural analysis software (STAAD Pro, Risa3D, LPile).Familiarity with AutoCAD/Revit for design documentationJob SummaryInsight...
-
Sr. Systems Engineer
2 weeks ago
Ottawa, Ontario, Canada Marshall Full time $1,100,000 - $1,450,000 per yearMarshall Canada is seeking a Sr. Systems Engineer to join our team in Ottawa, ON. Reporting to the Sr. Manager, Systems Engineering, this role is responsible to support and lead systems engineering activities critical to the performance, reliability, and scalability of Marshall products and solutions. This role will also act as a technical leader and...
-
Senior Cloud Site Reliability Engineer
5 days ago
Ottawa, Ontario, Canada Barracuda Networks Inc. Full timeReq ID: 26-321 Come join our passionate team Barracuda is a leading cybersecurity company providing complete protection against complex threats. Our platform protects email, data, applications, and networks with innovative solutions, and a managed XDR service, to strengthen cyber resilience. Hundreds of thousands of IT professionals and managed service...
-
Sr Mechanical Engineer
2 weeks ago
Ottawa, Ontario, Canada Sotera Health Full time $120,000 - $180,000 per yearDescriptionIn this role as an Engineer Specialist, you will be key in design and analysis of mechanical systems and structures, ensuring they meet high standards for performance, safety, and reliability. Your expertise will help us come up with practical solutions, tackle complex engineering challenges, and contribute to the success of projects of all sizes,...
-
Reliability Test Automation Engineer
5 days ago
Ottawa, Ontario, Canada Lumentum Full timeIt's fun to work in a company where people truly BELIEVE in what they're doingWe're committed to bringing passion and customer focus to the business.If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with usLumentum Canada was awarded the 2022 National Capital Region's Top Employers for the 6th consecutive...
-
Reliability Test Automation Engineer
5 days ago
Ottawa, Ontario, Canada Lumentum Full timeIt's fun to work in a company where people truly BELIEVE in what they're doingWe're committed to bringing passion and customer focus to the business.If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with usLumentum Canada was awarded the 2022 National Capital Region's Top Employersfor the 6th consecutive...
-
Reliability Test Automation Engineer
5 days ago
Ottawa, Ontario, Canada Lumentum Operations Full timelocationsCanada - Ottawa (Bill Leathem)time typeFull timeposted onPosted Todayjob requisition id It's fun to work in a company where people truly BELIEVE in what they're doingWe're committed to bringing passion and customer focus to the business.If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with...