Site Reliability Engineer

1 week ago


Toronto, Ontario, Canada Themesoft Inc. Full time
Role: SRE - Monitoring Specialist

Location: Toronto (Onsite)

Job Summary:

We are looking for a Dynatrace Deployment Specialist to oversee the end-to-end implementation of Dynatrace on our server stack. This role requires a mix of technical expertise, financial domain knowledge, and strategic thinking to ensure successful deployment, adoption, and optimization of the Dynatrace platform. The ideal candidate will collaborate with cross-functional teams to deliver a seamless rollout, enabling advanced monitoring, analytics, and alerting capabilities for our application infrastructure.

Key Responsibilities:

1. Planning and Scope Assessment

- Evaluate the current IT environment and application stack to identify monitoring and observability needs.
- Define the scope of the deployment in alignment with organizational goals and technical requirements.
- Engage stakeholders to prioritize applications, services, and systems to be included in the deployment.

2. Deployment Strategy

- Develop a detailed deployment roadmap, considering technical dependencies, timelines, and resource availability.
- Ensure compatibility with the existing infrastructure and identify any gaps requiring resolution before deployment.

3. Phased Rollout

- Implement Dynatrace in a phased manner to minimize disruption, focusing on high-priority areas first.
- Test and validate functionality at each stage of the rollout to ensure alignment with performance objectives.
- Address issues promptly during the rollout to maintain project momentum.

4. Enabling Full Stack Monitoring

- Deploy and configure full-stack monitoring features, including infrastructure, applications, and user experience monitoring where needed.
- Ensure that all key performance indicators (KPIs) are tracked and that integrations with relevant tools and systems are in place.

5. Tool Adoption & Upskilling

- Provide training sessions and resources to IT teams to enhance understanding and usage of Dynatrace.
- Act as a subject matter expert, offering ongoing support and guidance for tool adoption.
- Develop documentation and best practices for future use and troubleshooting.

6. Analytics and Alerting

- Configure Dynatrace analytics to deliver actionable insights into system performance and health.
- Establish custom alerts to ensure timely responses to incidents and anomalies.
- Monitor and refine alerting thresholds to reduce noise and improve system reliability.

Qualifications:

- Proven experience in deploying Dynatrace or similar observability tools across complex IT environments.
- Strong understanding of server stacks, networking, databases, cloud services, and application architectures.
- Proficient in scripting and automation to streamline deployment and configuration processes.
- Strong problem-solving skills and the ability to address technical issues effectively.
- Excellent communication skills to collaborate with stakeholders and train end-users.

Preferred Skills:

- Dynatrace certification.
- Familiarity with DevOps tools and practices.
- Experience with ITIL processes for incident and problem management.

Regards

Praveen Kumar

Talent Acquisition Group – Strategic Recruitment Manager

praveen.r@themesoft.com

  • Toronto, Ontario, Canada Moneris Solutions Corp. Full time

    Site Reliability Engineer. Locations: Toronto. Time type: Full time. Posted 2 days ago. Job requisition ID: JR104859. Your Moneris Career - The Opportunity: We are looking for a Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will help ensure the reliability, performance, and scalability of...


  • Toronto, Ontario, Canada Randstad Canada Full time

    Are you a Site Reliability Engineer looking for a new opportunity? Are you looking for a new contract opportunity? We are pleased to offer you a new contract opportunity for you to consider: Site Reliability Engineer. Start: ASAP. Estimated length: 6 months. Location: Toronto. Can be remote but will be required to be in Toronto office roughly 2 days per...


  • Toronto, Ontario, Canada Autodesk, Inc. Full time

    About the Job: We are seeking a highly motivated and experienced Site Reliability Engineer Lead to manage critical cloud infrastructure and site reliability operations at Autodesk. Responsibilities: Lead disaster recovery strategies, failover exercises, gamedays, and periodic maintenance activities. Contribute to critical vulnerability remediation efforts. Promote...


  • Toronto, Ontario, Canada OMERS Oxford Properties Group Full time

    The Site Reliability Engineering Lead will be responsible for ensuring the reliability, scalability, and performance of our investment systems. This role requires a deep understanding of distributed systems, networking, and cloud computing. About the Job: Design and implement reliable and scalable systems to handle high traffic and large datasets. Collaborate...


  • Toronto, Ontario, Canada Interac Corp. Full time

    Job Description: At Interac Corp., we are a leading provider of payment and value exchange services in Canada. Our team is passionate about delivering innovative solutions that empower Canadians to transact digitally with confidence. This role is part of our Site Reliability Engineering team, responsible for designing and delivering high-performance payment...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer (SRE). Location: Toronto, CA. Duration: Long term. A Bachelor's degree in Computer Science or related technical field (for example, Mathematics, Engineering, or Physics), or equivalent practical experience. Advanced knowledge of the following SRE practices and technologies. In-depth hands-on experience in a...


  • Toronto, Ontario, Canada Autodesk, Inc. Full time

    The ideal candidate for this role will have extensive experience in Site Reliability Engineering, DevOps, and cloud infrastructure architecture. As a DevOps Site Reliability Engineer at Autodesk, you will be responsible for ensuring the reliability, availability, and performance of our cloud infrastructure. You will lead the design and development of cloud...


  • Toronto, Ontario, Canada Wisedocs Full time

    Wisedocs is on a mission to make it easy and accessible for any company in the insurance, legal and medical space to understand medical documents quickly using AI (Artificial Intelligence). Every week, we process hundreds of thousands of pages of documents, saving our customers hours and hours of manual processing time, and helping them process medical...


  • Toronto, Ontario, Canada Northbridge Financial Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial? The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity, and is responsible for mentoring and leading less experienced SREs. We...


  • Toronto, Ontario, Canada Randstad Digital Full time

    Senior Site Reliability Engineer - Establish an SRE Practice (Contract Position). Number of Positions: 1. Filled: 0. Duration: 6 months. Location: Toronto, ON, CA. Must be eligible to work in Canada. Hybrid position, 2-3 days/month onsite in Toronto mandatory. Roles and responsibilities: The consultant will be building an SRE practice from the ground up. He/she would have...


  • Toronto, Ontario, Canada The Home Depot Full time

    Manager, Site Reliability Engineering (SRE) - eCommerce job at The Home Depot in Toronto, ON M3C 4H9. Req125256. Full Time. Corporate. Remote. With a career at The Home Depot, you can be yourself and also be part of something bigger. Position Overview: The Manager, SRE will lead a team of Site Reliability Engineers to ensure the reliability, performance, and...


  • Toronto, Ontario, Canada Gotvantage Full time

    Are you passionate about ensuring the seamless operation of large-scale, distributed, and robust systems? Do you thrive on optimizing performance, increasing reliability, and automating tasks to create more efficient processes? Are you hungry for learning? If so, we would want to chat with you. As a Senior Site Reliability Engineer (SRE) / DevOps Engineer at...


  • Toronto, Ontario, Canada Randstad Digital Full time

    Senior Site Reliability Engineer - Establish an SRE Practice (Contract Position). Number of Positions: 1. Filled: 0. Duration: 6 months. Location: Toronto, ON, CA. Must be eligible to work in Canada. Hybrid position, 2-3 days/month onsite in Toronto mandatory. Roles and responsibilities: The consultant will be building an SRE practice from the ground up....


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial? The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity, and is responsible for mentoring and leading less experienced...


  • Toronto, Ontario, Canada Scotiabank Full time

    About the Role: The Director of Site Reliability Engineering will be responsible for ensuring the reliability, scalability, and performance of critical applications and infrastructure by driving automation, monitoring, incident response, and continuous improvement initiatives. This role requires a high degree of cross-team collaboration and ability to influence...


  • Toronto, Ontario, Canada Scotiabank Full time

    Job Description: We're seeking a highly skilled leader to join our Technology organization as Director of Site Reliability Engineering. This role requires strong technical expertise, excellent communication skills, and the ability to collaborate effectively with cross-functional teams. The successful candidate will lead a team responsible for ensuring the...


  • Toronto, Ontario, Canada Vantage Full time

    Senior Site Reliability Engineer / DevOps Engineer. Are you passionate about ensuring the seamless operation of large-scale, distributed, and robust systems? Do you thrive on optimizing performance, increasing reliability, and automating tasks to create more efficient processes? Are you hungry for learning? If so, we would want to chat to you. As a...


  • Toronto, Ontario, Canada Myticas Consulting Full time

    Cloud Site Reliability Engineer (Azure): Lead strategic initiatives in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This advanced role requires mastery in cloud technologies, strategic planning, and incident management to drive innovative solutions and operational excellence. As a Cloud Site Reliability...