Senior Site Reliability Engineer

4 weeks ago


Toronto, Canada Autodesk Full time

Join to apply for the Senior Site Reliability Engineer role at Autodesk Position Overview We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering Manager, you will be leading design and development of resilient and scalable architecture and innovative solutions for the platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners. Job Requisition ID 25WD92369 Responsibilities Lead architecture, solution design, development and maintenance of cloud infrastructure for microservices architecture Independently manage requirement analysis, solution design, implementation, and release planning Ensure high adherence to trust and security compliance, guidelines and standards Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices Implement and maintain configuration management and infrastructure as code (IaC) using Terraform Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and period maintenance activities Contribute to critical vulnerability (CVEs) remediation efforts Promote and document security and best practices across all pillars of DevOps/SRE throughout system design Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues Participate in on-call rotations, providing critical 24x7 support for production systems Minimum Qualifications Bachelor’s degree or higher in Computer Science, Engineering, or a related field 5+ years of progressive experience in Site Reliability Engineering, DevOps, or a similar field Proficiency with managing AWS resources and understanding of networking and security protocols Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory Experience with container-based technologies like Docker and AWS ECS Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment Proficiency in programming languages such as UNIX, Python, Go, Bash, Groovy, and Node.js Technology Stack: Java/SpringBoot, AWS (ECS Fargate, Elastic Cache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, Open Search), Kafka, GoLang, Node.js, Groovy, Python, Jenkins, GitHub, Jira, ServiceNow, and Splunk. Preferred Qualifications Knowledge in applying AI and ML solutions for engineering processes and/or DevOps automation Knowledge of standardized observability frameworks such as OpenTelemetry Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer) Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures Broad knowledge with data streaming pipelines like Kinesis, Firehose, and Kafka Knowledge on core Java and SpringBoot concepts in JVM optimization Knowledge on build tools, e.g. Gradle Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership Learn More About Autodesk: Welcome to Autodesk Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made. Salary Transparency Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package. Diversity & Belonging We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging #J-18808-Ljbffr



  • Toronto, Canada Tubi Full time

    Join to apply for the Senior Site Reliability Engineer role at Tubi . About Tubi Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most...


  • Toronto, Canada Tubi Full time

    Join to apply for the Senior Site Reliability Engineer role at Tubi. About Tubi Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 245210Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.The TeamGlobal Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank's Corporate, Investment Banking and Capital Markets businesses.The RoleGBME is searching for a Site...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Toronto, Canada Autodesk Full time

    Join to apply for the Senior Site Reliability Engineer role at Autodesk Position Overview We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability,...


  • Toronto, Canada Kyndryl Full time

    Join to apply for the Site Reliability Engineer role at Kyndryl. Direct message the job poster from Kyndryl. Recruitment & Strategic Staffing @Kyndryl | Partnering with IT Consultants in Financial Services & Technology Position: Site Reliability Engineer Client: Financial Services - Capital Markets Technology Duration: 12-month contract with potential...


  • Toronto, Canada Tubi Full time

    OverviewSenior Manager, Site Reliability Engineering at Tubi. Join to apply for the Senior Manager, Site Reliability Engineering role at Tubi. About Tubi: Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands...


  • Toronto, Ontario, Canada Fivetran Full time

    About the RoleFivetran is looking for a high-performance engineer to be a part of a team of Site Reliability Engineers. You will be working closely with engineering teams, product managers, as well as support and sales engineers to build the future of the Fivetran Data Platform Reliability. As a member of the Site  Reliability Engineering team, you will...


  • Toronto, Canada Royal Bank of Canada Full time

    Job Description What is the opportunity?Join our Commercial, Core Banking and Payments Technology (CCBPT) team as a Senior Site Reliability Engineer, where you'll play a key role in supporting our cloud and distributed environments for the Personal Commercial Credit SRE & Ops team. This exciting opportunity will challenge you to work with cutting-edge...


  • Toronto, Canada Autodesk, Inc. Full time

    **Job Requisition ID #**25WD92369**Position Overview**We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloudinfrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuringthe highest reliability, availability, and performance of our...