Platform Reliability Engineer
7 days ago
Join to apply for the Platform Reliability Engineer role at J&M GroupContinue with Google Continue with GoogleJoin to apply for the Platform Reliability Engineer role at J&M GroupInfrastructure as Code (IaC): Terraform, ARM templates, CloudFormationScripting Languages: Python, PowerShell, BashSecurity & Compliance: Access control models, cloud security practicesPlatform Governance: Unity Catalog (nice to have)Operational Excellence: SRE principles, SLOs, SLIsJob DescriptionTechnical SkillsCloud Platforms: Azure, AWSInfrastructure as Code (IaC): Terraform, ARM templates, CloudFormationScripting Languages: Python, PowerShell, BashMonitoring & Observability: Azure Monitor, Log Analytics, PrometheusCI/CD Tools: Azure DevOps, GitHub ActionsPlatform Services: Compute, Storage, Networking, Data Plane InfrastructureSecurity & Compliance: Access control models, cloud security practicesPlatform Governance: Unity Catalog (nice to have)Operational Excellence: SRE principles, SLOs, SLIsAutomation & Cost Optimization: Platform automation, cost reduction strategiesSoft SkillsEffective communication and cross-team collaborationStrong problem-solving and analytical mindsetProactive, independent, and team-oriented work styleAttention to detailExperience & Qualifications3-6 years in platform engineering, SRE, or infrastructure rolesBachelor's degree in Computer Science, IT, or related fieldExperience in agile or iterative development environmentsCertifications (nice to have): Azure Administrator, Azure DevOps Engineer, AWS Solutions ArchitectJob SummaryWe are looking for a skilled and motivated Platform Reliability Engineer to support and optimize our platform services. This role bridges the gap between infrastructure services and the platform capabilities required by development and operations teams. The engineer will contribute to automation, reliability, cost optimization, and service excellence of core platform components hosted in the cloud (Azure/AWS). This is a hands-on technical role with a focus on enabling reliable, secure, and scalable platform foundations for enterprise-scale workloads.Key ResponsibilitiesSupport the design and implementation of core platform services that enable development teams to build, deploy, and operate applications reliably.Develop Infrastructure as Code (IaC) templates and scripts using tools like Terraform or ARM to automate provisioning and configuration.Monitor and maintain platform services including compute, storage, networking, and data plane infrastructure for scalability and performance.Collaborate with development, cloud engineering, and security teams to ensure platform alignment with architectural standards and security requirements.Implement observability practices using tools for monitoring, logging, and alerting to support performance tuning and incident detection.Troubleshoot platform-related incidents, perform root cause analysis, and document findings for continuous improvement.Participate in deployment activities, ensuring proper controls and validations are in place when promoting workloads to production.Support optimization initiatives to reduce costs across services such as compute, storage, Synapse, and platform integration tools.Contribute to ongoing platform modernization efforts, including migration from legacy configurations to unified governance models such as Unity Catalog.QualificationsBachelor's degree in Computer Science, Information Technology, or a related field.3-6 years of experience in platform engineering, SRE, or related infrastructure roles.Practical experience with Azure or AWS cloud services, particularly related to infrastructure and platform-level resource management.Proficiency in Infrastructure as Code (IaC) tools such as Terraform, ARM templates, or CloudFormation.Hands-on experience with monitoring and observability solutions (e.g., Azure Monitor, Log Analytics, Prometheus).Familiarity with CI/CD pipelines and release processes (e.g., Azure DevOps, GitHub Actions).Strong scripting skills (Python, PowerShell, or Bash) to automate tasks and workflows.Understanding of access control models, security practices, and compliance in cloud platforms.Familiarity with SRE principles and operational excellence metrics (SLOs, SLIs).Experience working in agile or iterative environmentsSoft SkillsEffective communicator with the ability to coordinate across platform, security, cloud, and development teams.Strong problem-solving mindset with attention to detail.Proactive and collaborative team player, able to work independently and drive issues to resolution.Nice to HaveExposure to Unity Catalog or similar data governance tooling in the context of platform services.Experience supporting platform migrations or re-architecture projects.Certification in Azure Administrator, Azure DevOps Engineer, or AWS Solutions Architect.This role is ideal for someone with a strong technical foundation who is ready to take on ownership of platform-level responsibilities, contribute to modernization efforts, and apply SRE practices to maintain high availability and performance of services.Seniority levelSeniority levelEntry levelEmployment typeEmployment typeContractJob functionIndustriesIT Services and IT ConsultingReferrals increase your chances of interviewing at J&M Group by 2xGet notified about new Reliability Engineer jobs in Toronto, Ontario, Canada.Applications Consultant 2 - Platform Reliability EngineerToronto, Ontario, Canada CA$90,000 - CA$130,000 3 weeks agoField Engineer - SPT Canada (Ontario/North Bay/Edmonton)Mississauga, Ontario, CanadaCA$109,000.00-CA$118,000.002 weeks agoSoftware Quality Assurance and Automation Test Engineer -Automotive InfotainmentMechanical Engineer - Thermal ManagementPerformance Engineer / Analyst (H/F) - SAFRAN LANDING SYSTEMSEngineer- Autonomy Test and Validation (Contract)Integration Reliability Engineer, Technical OperationsGreater Toronto Area, Canada 14 hours agoAssistant Engineer/Scientist/Technical OfficerToronto, Ontario, Canada CA$150,000 - CA$170,000 2 weeks agoWe’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr
-
Platform Reliability Engineer
3 days ago
Toronto, Canada J&M Group Full timeJoin to apply for the Platform Reliability Engineer role at J&M Group Continue with Google Continue with Google Join to apply for the Platform Reliability Engineer role at J&M Group Infrastructure as Code (IaC): Terraform, ARM templates, CloudFormation Scripting Languages: Python, PowerShell, Bash Security & Compliance: Access control models, cloud security...
-
Platform Reliability Engineer
5 days ago
Toronto, Canada J&M Group Full timeJoin to apply for the Platform Reliability Engineer role at J&M GroupContinue with Google Continue with GoogleJoin to apply for the Platform Reliability Engineer role at J&M GroupInfrastructure as Code (IaC): Terraform, ARM templates, CloudFormationScripting Languages: Python, PowerShell, BashSecurity & Compliance: Access control models, cloud security...
-
Platform Reliability Engineer
5 days ago
Toronto, Canada Capgemini Full timePlatform Reliability Engineer (contract)Join to apply for the Platform Reliability Engineer (contract) role at CapgeminiPlatform Reliability Engineer (contract)Join to apply for the Platform Reliability Engineer (contract) role at CapgeminiGet AI-powered advice on this job and more exclusive features.We are seeking a Platform Reliability Engineer to support...
-
Platform Reliability Engineer
3 days ago
Toronto, Canada Capgemini Full timePlatform Reliability Engineer (contract) Join to apply for the Platform Reliability Engineer (contract) role at Capgemini Platform Reliability Engineer (contract) Join to apply for the Platform Reliability Engineer (contract) role at Capgemini Get AI-powered advice on this job and more exclusive features. We are seeking a Platform Reliability Engineer to...
-
Platform Reliability Engineer
2 weeks ago
Toronto, Canada Manulife Financial Full timeDo you enjoy working in a fast-paced environment with highly skilled individuals and solving problems? Would you enjoy being a part of a cohesive, impactful team that makes work fun? Is continuous learning part of your DNA? We’re looking for a Platform Reliability Engineer who is passionate about working on a team with the purpose of enabling visibility of...
-
Platform Reliability Engineer
2 weeks ago
Toronto, Canada Manulife Financial Full timeDo you enjoy working in a fast-paced environment with highly skilled individuals and solving problems? Would you enjoy being a part of a cohesive, impactful team that makes work fun? Is continuous learning part of your DNA? We’re looking for a Platform Reliability Engineer who is passionate about working on a team with the purpose of enabling visibility of...
-
Platform Reliability Engineer
2 weeks ago
Toronto, Canada Manulife Full timeDo you enjoy working in a fast-paced environment with highly skilled individuals and solving problems? Would you enjoy being a part of a cohesive, impactful team that makes work fun? Is continuous learning part of your DNA? We’re looking for a Platform Reliability Engineer who is passionate about working on a team with the purpose of enabling visibility of...
-
Platform Reliability Engineer
2 weeks ago
Toronto, Canada Manulife Full timeDo you enjoy working in a fast-paced environment with highly skilled individuals and solving problems? Would you enjoy being a part of a cohesive, impactful team that makes work fun? Is continuous learning part of your DNA? We’re looking for a Platform Reliability Engineer who is passionate about working on a team with the purpose of enabling visibility of...
-
Platform Reliability Engineer
2 weeks ago
Toronto, Canada Manulife Financial Full timeDo you enjoy working in a fast-paced environment with highly skilled individuals and solving problems? Would you enjoy being a part of a cohesive, impactful team that makes work fun? Is continuous learning part of your DNA? We’re looking for a Platform Reliability Engineer who is passionate about working on a team with the purpose of enabling visibility...
-
Platform Reliability Engineer
2 weeks ago
Toronto, Canada Manulife Financial Full timeDo you enjoy working in a fast-paced environment with highly skilled individuals and solving problems? Would you enjoy being a part of a cohesive, impactful team that makes work fun? Is continuous learning part of your DNA? We’re looking for a Platform Reliability Engineer who is passionate about working on a team with the purpose of enabling visibility of...