See more Collapse

Site Reliability and Log Management Specialist

2 months ago


Ottawa, Canada Maplesoft Group Full time

Maplesoft Group is currently seeking a Site Reliability and Log Management Specialistfor our Federal Government client.

The following responsibilities are associated with the “Statement of Work” but are not limited to:
Primary Responsibilities

The Consultant will be responsible for providing the following Services to the Client:
Under the direction of the Assistant Director, Architecture & Site Reliability Services:

- Provide engineering, and administrative support in a Splunk environment (search heads, indexers,

deployers, deployment servers, heavy/universal forwarders, etc.)
- Recognize and onboard new data sources into Splunk
- Analyze data for anomalies and trends and building dashboards highlighting the key trends of the data.
- Perform the administration of monitoring platforms and related infrastructure working with IT specialists

in cyber, network, storage, security, virtual infrastructure, platform, and database to deliver their logging

requirements
- Assess and refine current logs and whenever possible consider fine-tuning or explore opportunities for

automation
- Support migration of data sources & log traffic
- Other related activities and deliverables as required

Required Qualifications & Skills

The Consultant should have the following qualifications and skills:

- University degree or college diploma in computer science, networking, engineering, or a related field
- A minimum of five (5) years of relevant work experience with Log Management / Monitoring
- Demonstrated experience with Splunk & Syslog-NG
- Demonstrated experience with Linux environment, editing and maintaining Splunk configuration files and

apps.
- Demonstrated knowledge of SCOM (2012 R2, 2016, 1807) and Vendor Management Packs
- Demonstrated experience with SCOM Report Customization and SQL Report Server administration
- Demonstrated experience with SolarWinds
- Demonstrated experience with (basic) admin level Windows / Linux Basic Operations around file system
- Demonstrated experience with Powershell scripting
- Demonstrated experience in using scripting languages such as Bash or Python, specifically for systems

automation

Additional Qualifications

The following will also be considered:

- Demonstrated knowledge and understanding of load testing, monitoring, and performance

management tools for every layer of the environment
- Demonstrated experience with testing high availability environments with performance of disaster

recovery tests
- Demonstrated knowledge and expertise in Site Reliability Engineering, creating SLOs, SLAs, enhancing

observability, incident management work.
- Demonstrated experience with infrastructure scripting and automation or familiarity with Infrastructure as

Code

Maplesoft Group prides itself on its distinct corporate culture and recognizes that success is a direct reflection of our most valuable asset - our people. Therefore, attitude and ambition are key personality traits we seek out, along with skill and aptitude, in potential employees.

All employment decisions are made based on business needs, job requirements, and individual qualifications.


We have other current jobs related to this field that you can find below


  • Ottawa, Canada Innovapost Full time

    **Requisition Number**:2196 **Location**:March Road **Province**: Ontario (CA-ON) **Country**:Canada (CA) **Employment Type**: Regular **Who is Innovapost?** Great question! We are the technology arm of the Canada Post Group of Companies. This includes Canada Post, Purolator, and SCI. By joining us you will be able to make a positive impact on how...


  • Ottawa, ON, Canada Lightspeed Commerce Full time

    Hi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...


  • Ottawa, Canada Lightspeed Restaurant Full time

    Hi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff, Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Ottawa, Canada Lightspeed Restaurant Full time

    Hi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff, Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Ottawa, Canada Reliability Screening Solutions Inc. Full time

    **Job Summary**: **Key Responsibilities**: 1. International and Retail Client Services Coordination: - Serve as a main point of contact for both local and international clients, addressing inquiries and providing support regarding background screening services. - Guide clients in selecting appropriate screening services based on their specific needs. 2....


  • Ottawa, Canada Themesoft Inc. Full time

    Position: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Job Description:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid...


  • Ottawa, Canada Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Ottawa, ON(Onsite)Responsibilities:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid understanding of...


  • Ottawa, Canada Lightspeed Full time

    Hi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size...


  • Ottawa, Canada Themesoft Inc. Full time

    Job Title: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Duration: 6 Months ContractJob Description:Should have 7+ years of experienceStrong experience in Ansible and TerraformStrong experience of CI/CD Pipelines with YAML/ARM Template Knowledge & experience on Live site maintenanceDemonstrated ability to debug, fix, and optimize...


  • Ottawa, Canada Themesoft Inc. Full time

    Job Title: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Duration: 6 Months ContractJob Description:Should have 7+ years of experienceStrong experience in Ansible and TerraformStrong experience of CI/CD Pipelines with YAML/ARM Template Knowledge & experience on Live site maintenanceDemonstrated ability to debug, fix, and optimize...


  • Ottawa, Canada Themesoft Inc. Full time

    Position: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Job Description:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid...


  • Ottawa, Canada Themesoft Inc. Full time

    Position : SRE (Site Reliability Engineer) Location : Ottawa, ON (Hybrid Onsite) Job Description: Proven experience as a Site Reliability Engineer or similar role. Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC). Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g.,...


  • Ottawa, Canada Themesoft Inc. Full time

    Position: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Job Description:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid...


  • Ottawa, Canada Reliability Screening Solutions Inc. Full time

    **Job Summary**: **Key Responsibilities**: 1. International and Retail Client Services Coordination: - Serve as a main point of contact for both local and international clients, addressing inquiries and providing support regarding background screening services. - Guide clients in selecting appropriate screening services based on their specific needs. 2....


  • Ottawa, Canada Lightspeed Restaurant Full time

    Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place!We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....


  • Ottawa, Ontario, Canada Lightspeed Restaurant Full time

    Are you actively looking for a new opportunity? Or just checking the market? Well... you might just be in the right place We're looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....


  • Ottawa, Canada Lightspeed Restaurant Full time

    Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place!We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....


  • Ottawa, Canada Lightspeed Restaurant Full time

    Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....


  • Ottawa, Canada Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Ottawa, ON(Onsite)Responsibilities:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid understanding of...


  • Ottawa, Canada Themesoft Inc. Full time

    Role: Site Reliability Engineer Location: Ottawa, ON(Onsite) Responsibilities: Proven experience as a Site Reliability Engineer or similar role. Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC). Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes). Solid...