Site Reliability Engineer Toronto

1 month ago


Old Toronto, Ontario, CA Ascend Fundraising Solutions Full time

Founded in 2010, Ascend Fundraising Solutions provides online and in-venue fundraising platforms and solutions. Our innovative approach has been embraced by renowned non-profit organizations worldwide, including United Way, Vancouver Canucks Foundation, Canadian Olympic Foundation, Canadian National Institute for the Blind, Kansas City Chiefs Foundation, Boston Red Sox Foundation, Big Brothers Big Sisters, Thunder Bay Regional Health Sciences Foundation, Arizona Humane Society, and many others. We are transforming the fundraising landscape by assisting charitable organizations in raising funds through electronic raffle solutions, recurring donations, donor dataset enhancement, deeper donor engagement, and achieving unprecedented donor revenues.

As a leader in strategy and technology for 50/50 raffles, sweepstakes, and Catch the Ace raffles, we've empowered over 500 charitable organizations to raise over $1 billion on our platform to date, and we're just getting started.

We are currently seeking a full-time Site Reliability Engineer to join our IT team. In this role, you will collaborate closely with the client services team to diagnose, troubleshoot, and resolve issues related to system reliability.

RESPONSIBILITIES:

  • Take ownership of customer-reported issues and see problems through to resolution.
  • Develop preventive measures to avoid recurring issues.
  • Follow standard procedures for escalating unresolved issues to the appropriate internal teams.

Infrastructure Management:

  • Design, configure, deploy, and maintain AWS infrastructure using best practices.
  • Implement Infrastructure as Code (IaC) using Terraform for scalability, repeatability, and maintainability.
  • Collaborate with the development team to optimize .NET applications for peak performance in a cloud environment.

Monitoring and Alerting:

  • Design and implement advanced system monitoring solutions for high performance, availability, and security.
  • Use monitoring tools proactively to identify and diagnose infrastructure and application-level issues.
  • Collaborate on defining Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets.

Reliability and Availability:

  • Optimize cloud resource availability, performance, and cost using best practices.
  • Plan and execute disaster recovery drills and ensure high availability of critical systems.
  • Respond promptly to system alerts, lead incident resolution, and contribute to post-mortem analyses.

Automation and Optimization:

  • Automate repetitive tasks related to infrastructure provisioning, configuration, and deployment.
  • Ensure continuous deployment and continuous integration best practices are implemented and maintained.

Collaboration and Knowledge Sharing:

  • Collaborate with developers, product managers, and other teams to ensure seamless and stable application deployment.
  • Document processes, architectures, and best practices to facilitate knowledge sharing.

WHAT WE SEEK IN OUR IDEAL CANDIDATE:

  • AWS certifications such as AWS Certified Solutions Architect or AWS Certified DevOps Engineer.
  • Experience with monitoring and alerting tools in the AWS ecosystem.
  • Familiarity with Site Reliability Engineering (SRE) philosophy, SLOs, SLIs, and Error Budgets.
  • Strong analytical and troubleshooting skills.
  • Excellent communication and collaboration skills.

YOUR EXPERIENCE & SKILLS:

  • Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
  • 5+ years of experience managing and operating AWS environments.
  • Familiarity with best practices in monitoring, logging, and alerting.

WHY WORK AT ASCEND?

  • Market leader in the fundraising solutions sector with a highly regarded senior management team.
  • Intellectual curiosity, dedication, and a team willing to get the job done.
  • Opportunity to make a significant impact on the business in the short and long term.
  • Contribute to a company that supports charities and NPOs in funding their causes.
  • Beautiful downtown Toronto office with lake views and proximity to transit.
  • Competitive compensation package, including preferred hardware.
  • Hybrid work environment.

TO APPLY:

AscendFS is committed to building and preserving an open, inclusive, and healthy work environment. We welcome all applicants and accommodate people with disabilities throughout the recruitment and selection process. Applicants are encouraged to advise Human Resources in advance if an accommodation is required. We appreciate your interest in working at AscendFS, and we will contact you for further steps.

Ascend Fundraising Solutions works with foundations and non-profit organizations by offering best-in-class fundraising solutions. Our raffle, private lottery, and sweepstakes programs grow donor prospects and engage existing donors.


  • Old Toronto, Ontario, CA United Software Group Inc. - Canada Full time

    Position: Site Reliability Engineer Location: Toronto, Canada Duration: Contract Job Description: 3+ years of experience Advanced knowledge of the following SRE practices and technologies Python, YAML, Shell scripting Azure, Linux Dynatrace, Prometheus, PagerDuty, Moog, Splunk, Elastic, Azure monitor Chaos Engineering MQ, Kafka Perform production support...


  • Old Toronto, Ontario, CA CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and Confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Ontario, CA Rogers Part time

    Site Reliability Engineer Are you ready to take your career to new heights and be a part of a dynamic team at Rogers Sports & Media? We believe in creativity, innovation, and collaboration in everything we do, and we are looking for people who share this mindset to join us. With a monthly reach of 30 million Canadians, you can help shape the future of...


  • Old Toronto, Ontario, CA Rogers Communications, Inc. Part time

    Site Reliability Engineer Are you ready to take your career to new heights and be a part of a dynamic team at Rogers Sports & Media? We believe in creativity, innovation, and collaboration in everything we do, and we are looking for people who share this mindset to join us. With a monthly reach of 30 million Canadians, you can help shape the future of sports,...


  • Old Toronto, Ontario, CA Reperio Human Capital Full time

    Site Reliability Engineer 100421 Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and automation tools. Responsibilities: Ensure the reliability,...


  • Old Toronto, Ontario, CA Rogers Communications Full time

    Are you ready to take your career to new heights and be a part of a dynamic team at Rogers Sports & Media? We believe in creativity, innovation, and collaboration in everything we do, and we are looking for people who share this mindset to join us. With a monthly reach of 30 million Canadians, you can help shape the future of sports, news, e-commerce, and...


  • Old Toronto, Ontario, CA Rogers Communications Full time

    Are you ready to take your career to new heights and be a part of a dynamic team at Rogers Sports & Media? We believe in creativity, innovation, and collaboration in everything we do, and we are looking for people who share this mindset to join us. With a monthly reach of 30 million Canadians, you can help shape the future of sports, news,...


  • Old Toronto, Ontario, CA Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will analyze...


  • Old Toronto, Ontario, CA TD Bank Full time

    Site Reliability Engineer Work Location: Canada Hours: 37.5 Line of Business: Technology Solutions Pay Details: We’re committed to providing fair and equitable compensation to all our colleagues. As a candidate, we encourage you to have an open dialogue with a member of our HR Team and ask compensation related questions, including pay...


  • Old Toronto, Ontario, CA eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Old Toronto, Ontario, CA Lightspeed Full time

    Job Opportunity: Principal Site Reliability Engineer Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuOrder by Lightspeed...


  • Old Toronto, Ontario, CA Vaco Full time

    About the Company Our client operates global markets and builds digital communities and analytic solutions and is looking to hire a Site Reliability Engineer About the Opportunity Stephen manages the infra group team, Windows, virtualization, IT infrastructure, etc. Works closely with Jeremy who is the hiring manager away for Pat leave. They are currently...


  • Old Toronto, Ontario, CA The Voleon Group Full time

    Voleon is a technology company that applies state-of-the-art machine learning techniques to real-world problems in finance. For more than 15 years, we have led our industry and worked at the frontier of applying machine learning to investment management. We have become a multi-billion-dollar asset manager, and we have ambitious goals for the future. Your...


  • Old Toronto, Ontario, CA Tecsys Inc. Full time

    Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...


  • Old Toronto, Ontario, CA Scotiabank Full time

    Requisition ID: 197089 Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Team We are looking for a developer to join our Digital Engineering Operations. The ideal candidate is passionate about designing and...


  • Old Toronto, Ontario, CA Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Old Toronto, Ontario, CA Nityo Infotech Full time

    Job Responsibilities: Objectives of this Role: Run the IKP clusters by monitoring availability and taking a holistic view of system health. Build tools and automation to manage platform infrastructure and services. Improve reliability, quality, and time to upgrade cluster and service versions. Measure and optimize system performance and resource...


  • Old Toronto, Ontario, CA PharmaLex Full time

    Your Job SRE at Pharmalex is the software engineering approach to production operations. 50% of your time will be spent building software to automate the manual work you do during the other 50% of your time, which will be spent providing operational support to the products you cover. SRE operates critical products 24/7/365 operating within agreed SLOs. Out-of-hours support via...


  • Old Toronto, Ontario, CA Guidewire Full time

    ESSENTIAL DUTIES AND RESPONSIBILITIES Take a purist SRE approach to shared multi-tenant infrastructure for a resilient SaaS microservice-based containerized systems in addition to customer-centric application environments. Oversee and automate the team’s growing presence in AWS. Contribute to core infrastructure systems development with...

  • Manager, Engineering

    4 weeks ago


    Old Toronto, Ontario, CA Toronto Hydro Full time

    WORK ILLUSTRATION: Reporting to the Director, Capacity Planning and Grid Innovation, the Manager, Engineering (Capacity Planning) is responsible for the effective management of processes and resources for the purpose of providing direction to a cross-functional team executing distribution substation capital project planning, and developing and leveraging...