Senior Site Reliability Engineer

3 weeks ago


Halifax, Canada ISTITUTO MARANGONI Full time

ResMed is seeking a Sr. Site Reliability Engineer – SRE to help define and execute against a Site Reliability
Engineering strategy for its rapidly expanding Digital Health Technology group. You will use your software engineering
expertise to constantly automate processes and innovate in a push to improve the reliability of the system. You will plan,
design, build and maintain large scale engineering solutions. Whether a bug fix or an awesome feature, you will own
your work and deliver the most elegant and scalable solutions.

Let’s talk about Responsibilities.

• Monitoring and metrics — establishing desired service behavior, measuring how the service is actually behaving (availability, latency, and overall system health), and correcting discrepancies
• Emergency response — noticing and responding effectively to service failures in order to preserve the service's conformance to its SLA (service-level agreement)
• Work to simplify and automate deployment processes, run-time operations, and provide non-disruptive releases
• Provide technical advisory for other engineers to help them grow and deliver high quality work faster.
• Capacity planning — projecting future demand and ensuring that a service has enough computing resources in appropriate locations to satisfy that demand
• Service turn-up and turn-down — deploying and removing computing resources for a service in a data center in a predictable fashion, often as a consequence of capacity planning
• Scaling systems sustainably through mechanisms such as automation
• Participate in planning discussions with Product Development and other IT teams
• Maintain expertise in the area of architecture, including industry trends, strategies, and products to ensure that our assets are effectively and efficiently utilized
• Evolving systems by pushing for changes that improve reliability and velocity
• Conducting incident responses and blameless postmortems


Let’s talk about Qualifications and Experience

Required:
• Bachelor's degree in Computer Science, Information Systems, or an equivalent technical discipline, and a minimum of 8 years of working experience in an enterprise 24/7 production environment supporting critical, real-time applications
• Minimum 4 years of experience focused on site reliability for high-traffic applications
• Systematic problem-solving approach, combined with strong communication skills and a sense of ownership
• Cloud programming experience and comfort with working in multiple languages as required (please note we mainly use Python and Java)
• Expert full-stack debugging and performance optimization ability, including hands-on knowledge of AWS
• Extensive experience with monitoring tools such as Datadog and AWS native monitoring
• Track record of monitoring and analyzing system performance, and of isolating issues or bottlenecks that could impact reliability, performance, and scalability (we mainly use Datadog and CloudWatch)
• Performance engineering mindset — design, development, and engineering related to scalability, isolation, latency, throughput, and efficiency
• Good verbal and written communication skills, and the ability to work effectively with geographically remote teams


Good to have:

• Ability to write and maintain Terraform and Lambda code in the AWS environment
• Experience supporting CI/CD pipelines with GitHub
• Strong exposure to and use of AWS EKS

• Experience using Atlassian tools such as Confluence and JIRA

• Understanding of Product Development Life Cycle, including Agile SCRUM, TDD, BDD
• Experience with Machine Learning

Joining us is more than saying “yes” to making the world a healthier place. It’s discovering a career that’s challenging, supportive and inspiring, where a culture driven by excellence helps you not only meet your goals, but also create new ones. We focus on creating a diverse and inclusive culture, encouraging individual expression in the workplace, and we thrive on the innovative ideas this generates. If this sounds like the workplace for you, apply now. We commit to responding to every applicant.


  • Halifax, Canada ResMed Inc Full time

    Senior Site Reliability Engineer. Locations: Halifax, Canada; San Diego, CA, United States. Time type: Full time. Posted 4 days ago. Job requisition ID: JR_033768. ResMed is seeking a Sr. Site Reliability Engineer – SRE to help define and execute against a Site...


  • Halifax, Nova Scotia, Canada Lightci Full time

    As a Site Reliability Engineer (SRE) serving clients across multiple industries (including edtech, telecommunications, and more), you will work with cutting-edge technologies like AWS, ECS/EKS, and event-based systems, ensuring the reliability, scalability, and performance of our services. Support various databases (RDBMS, NoSQL) ensuring optimal performance...


  • Halifax, Nova Scotia, Canada Lightci Full time

    Role mission: As a Site Reliability Engineer (SRE) serving clients across multiple industries (including edtech, telecommunications, and more), you will work with cutting-edge technologies like AWS, ECS/EKS, and event-based systems, ensuring the reliability, scalability, and performance of our services. If you are passionate about solving complex challenges...


  • Halifax, Nova Scotia, Canada CGI Full time

    Position Description: As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, performance, and availability of our systems. Your expertise in managing infrastructure, automating processes, and implementing best practices will contribute to seamless operations. You'll collaborate with cross-functional teams,...


  • Halifax Regional Municipality, Canada CGI Full time

    Position Description: As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, performance, and availability of our systems. Your expertise in managing infrastructure, automating processes, and implementing best practices will contribute to seamless operations. You’ll collaborate with cross-functional teams,...


  • Halifax, Canada Compest Solutions Inc Full time

    Implement SRE practices. Identify, craft, and maintain SLIs and SLOs for teams, as well as metrics such as MTTR, lead time for change, deployment frequency, and change failure rate. Work with application teams to set up observability and telemetry. Define what it means for a service to be available and develop, monitor, and alert on SLIs/SLOs. Define, track, and...

  • Reliability Engineer

    2 weeks ago


    Halifax, Nova Scotia, Canada IMP Group Full time

    Reliability Engineer. Requisition ID: 587. Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our...

  • Reliability Engineer

    2 months ago


    Halifax, Canada IMP Group International Inc. Full time

    Requisition ID: 587 Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our ever-growing team. IMP is able...

  • Reliability Engineer

    2 weeks ago


    Halifax, Nova Scotia, Canada IMP Group International Inc. Full time

    Requisition ID: 587. Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our ever-growing team. IMP is able to...

  • Reliability Engineer

    4 weeks ago


    Halifax, Canada IMP Group International Inc. Full time

    Requisition ID: 587 Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our ever-growing team. IMP is able...

  • Reliability Engineer

    1 month ago


    Halifax, Nova Scotia, Canada Irving Shipbuilding Full time

    Reliability Engineer Located at 3099 Barrington Street in Halifax, Nova Scotia, Canada, B3K 5M7, and 35 Micmac Boulevard, Dartmouth, Nova Scotia, Canada, B3A 4Y8, Irving Shipbuilding is proud to be Canada's National Shipbuilder. Over the next 30 years, our shipbuilders will construct 20+ modern patrol ships and surface combatants for the Royal Canadian Navy...

  • Reliability Engineer

    4 weeks ago


    Halifax, Nova Scotia, Canada Irving Shipbuilding Full time

    Reliability Engineer Located at 3099 Barrington Street in Halifax, Nova Scotia, Canada, B3K 5M7, and 35 Micmac Boulevard, Dartmouth, Nova Scotia, Canada, B3A 4Y8, Irving Shipbuilding is proud to be Canada's National Shipbuilder. Over the next 30 years, our shipbuilders will construct 20+ modern patrol ships and surface combatants for the Royal Canadian Navy...


  • Halifax, Canada IMP Group International Inc. Full time

    Requisition ID: 802. Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our ever-growing team. IMP is able...


  • Reliability Engineer

    4 weeks ago


    Halifax Regional Municipality, Canada IMP Group Full time

    Reliability Engineer. Requisition ID: 587. Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our...

  • Site Engineer

    3 days ago


    Halifax, Canada Eco Careers LTD. Full time

    Site Agent – Halifax. My client is a premier construction and engineering company specialising in infrastructure projects, including highways and civil works. We are committed to delivering innovative and sustainable solutions that improve communities and drive economic growth. Now looking for a talented and driven Site Engineer to join the team, working on...

  • Reliability Engineer

    2 months ago


    Halifax Regional Municipality, Canada IMP Group Full time

    Requisition ID: 587  Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our ever-growing team. IMP...

  • Reliability Engineer

    4 weeks ago


    Halifax Regional Municipality, Canada IMP Group Full time

    Requisition ID: 587  Why Choose IMP Aerospace & Defence: IMP Aerospace & Defence is not only one of the leading companies in the Aerospace field, but we are also one of the most engaging and equality advocating companies to work for. We believe that regardless of where you work at IMP Aerospace & Defence, you are a part of our ever-growing team. IMP...