Site Reliability and Log Management Specialist
2 months ago
Maplesoft Group is currently seeking a Site Reliability and Log Management Specialistfor our Federal Government client.
The following responsibilities are associated with the “Statement of Work” but are not limited to:
Primary Responsibilities
The Consultant will be responsible for providing the following Services to the Client:
Under the direction of the Assistant Director, Architecture & Site Reliability Services:
- Provide engineering, and administrative support in a Splunk environment (search heads, indexers,
deployers, deployment servers, heavy/universal forwarders, etc.)
- Recognize and onboard new data sources into Splunk
- Analyze data for anomalies and trends and building dashboards highlighting the key trends of the data.
- Perform the administration of monitoring platforms and related infrastructure working with IT specialists
in cyber, network, storage, security, virtual infrastructure, platform, and database to deliver their logging
requirements
- Assess and refine current logs and whenever possible consider fine-tuning or explore opportunities for
automation
- Support migration of data sources & log traffic
- Other related activities and deliverables as required
Required Qualifications & Skills
The Consultant should have the following qualifications and skills:
- University degree or college diploma in computer science, networking, engineering, or a related field
- A minimum of five (5) years of relevant work experience with Log Management / Monitoring
- Demonstrated experience with Splunk & Syslog-NG
- Demonstrated experience with Linux environment, editing and maintaining Splunk configuration files and
apps.
- Demonstrated knowledge of SCOM (2012 R2, 2016, 1807) and Vendor Management Packs
- Demonstrated experience with SCOM Report Customization and SQL Report Server administration
- Demonstrated experience with SolarWinds
- Demonstrated experience with (basic) admin level Windows / Linux Basic Operations around file system
- Demonstrated experience with Powershell scripting
- Demonstrated experience in using scripting languages such as Bash or Python, specifically for systems
automation
Additional Qualifications
The following will also be considered:
- Demonstrated knowledge and understanding of load testing, monitoring, and performance
management tools for every layer of the environment
- Demonstrated experience with testing high availability environments with performance of disaster
recovery tests
- Demonstrated knowledge and expertise in Site Reliability Engineering, creating SLOs, SLAs, enhancing
observability, incident management work.
- Demonstrated experience with infrastructure scripting and automation or familiarity with Infrastructure as
Code
Maplesoft Group prides itself on its distinct corporate culture and recognizes that success is a direct reflection of our most valuable asset - our people. Therefore, attitude and ambition are key personality traits we seek out, along with skill and aptitude, in potential employees.
All employment decisions are made based on business needs, job requirements, and individual qualifications.
We have other current jobs related to this field that you can find below
-
Site Reliability Specialist
2 weeks ago
Ottawa, Canada Innovapost Full time**Requisition Number**:2196 **Location**:March Road **Province**: Ontario (CA-ON) **Country**:Canada (CA) **Employment Type**: Regular **Who is Innovapost?** Great question! We are the technology arm of the Canada Post Group of Companies. This includes Canada Post, Purolator, and SCI. By joining us you will be able to make a positive impact on how...
-
Manager, Site Reliability Engineering and DevOps
2 months ago
Ottawa, ON, Canada Lightspeed Commerce Full timeHi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...
-
Staff Site Reliability Engineer
5 days ago
Ottawa, Canada Lightspeed Restaurant Full timeHi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff, Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...
-
Staff Site Reliability Engineer
4 days ago
Ottawa, Canada Lightspeed Restaurant Full timeHi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff, Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...
-
Ottawa, Canada Reliability Screening Solutions Inc. Full time**Job Summary**: **Key Responsibilities**: 1. International and Retail Client Services Coordination: - Serve as a main point of contact for both local and international clients, addressing inquiries and providing support regarding background screening services. - Guide clients in selecting appropriate screening services based on their specific needs. 2....
-
Site Reliability Engineer
2 days ago
Ottawa, Canada Themesoft Inc. Full timePosition: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Job Description:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid...
-
Site Reliability Engineer
2 days ago
Ottawa, Canada Themesoft Inc. Full timeRole: Site Reliability EngineerLocation: Ottawa, ON(Onsite)Responsibilities:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid understanding of...
-
Copy of Principal Site Reliability Engineer
2 months ago
Ottawa, Canada Lightspeed Full timeHi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size...
-
Site Reliability Engineer
3 days ago
Ottawa, Canada Themesoft Inc. Full timeJob Title: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Duration: 6 Months ContractJob Description:Should have 7+ years of experienceStrong experience in Ansible and TerraformStrong experience of CI/CD Pipelines with YAML/ARM Template Knowledge & experience on Live site maintenanceDemonstrated ability to debug, fix, and optimize...
-
Site Reliability Engineer
4 days ago
Ottawa, Canada Themesoft Inc. Full timeJob Title: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Duration: 6 Months ContractJob Description:Should have 7+ years of experienceStrong experience in Ansible and TerraformStrong experience of CI/CD Pipelines with YAML/ARM Template Knowledge & experience on Live site maintenanceDemonstrated ability to debug, fix, and optimize...
-
Site Reliability Engineer
3 days ago
Ottawa, Canada Themesoft Inc. Full timePosition: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Job Description:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid...
-
Site Reliability Engineer
2 days ago
Ottawa, Canada Themesoft Inc. Full timePosition : SRE (Site Reliability Engineer) Location : Ottawa, ON (Hybrid Onsite) Job Description: Proven experience as a Site Reliability Engineer or similar role. Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC). Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g.,...
-
Site Reliability Engineer
3 days ago
Ottawa, Canada Themesoft Inc. Full timePosition: SRE (Site Reliability Engineer)Location: Ottawa, ON (Hybrid Onsite)Job Description:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid...
-
Client Services Coordinator and Screening Specialist
2 months ago
Ottawa, Canada Reliability Screening Solutions Inc. Full time**Job Summary**: **Key Responsibilities**: 1. International and Retail Client Services Coordination: - Serve as a main point of contact for both local and international clients, addressing inquiries and providing support regarding background screening services. - Guide clients in selecting appropriate screening services based on their specific needs. 2....
-
Principal Site Reliability Engineer
1 month ago
Ottawa, Canada Lightspeed Restaurant Full timeAre you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place!We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....
-
Principal Site Reliability Engineer
6 days ago
Ottawa, Ontario, Canada Lightspeed Restaurant Full timeAre you actively looking for a new opportunity? Or just checking the market? Well... you might just be in the right place We're looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....
-
Principal Site Reliability Engineer
3 weeks ago
Ottawa, Canada Lightspeed Restaurant Full timeAre you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place!We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....
-
Principal Site Reliability Engineer
3 weeks ago
Ottawa, Canada Lightspeed Restaurant Full timeAre you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business....
-
Site Reliability Engineer
3 days ago
Ottawa, Canada Themesoft Inc. Full timeRole: Site Reliability EngineerLocation: Ottawa, ON(Onsite)Responsibilities:Proven experience as a Site Reliability Engineer or similar role.Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC).Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).Solid understanding of...
-
Site Reliability Engineer
2 days ago
Ottawa, Canada Themesoft Inc. Full timeRole: Site Reliability Engineer Location: Ottawa, ON(Onsite) Responsibilities: Proven experience as a Site Reliability Engineer or similar role. Strong proficiency with Ansible and Terraform for automation and infrastructure as code (IaC). Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes). Solid...