Current jobs related to Sre and System Administrator - Vancouver - Uptime.com

  • Director SRE

    4 weeks ago


    Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Job SummaryJob DescriptionWhat is the opportunity?The Director of SRE will be responsible for the vision, design, development, implementation, and support of Site Reliability Engineering (SRE) solutions for all applications across a line of business within CNB (City National Bank), an RBC company. This individual is expected to lead a team that drives...

  • Director SRE

    1 month ago


    Vancouver, Canada RBC Full time

    Job SummaryJob DescriptionWhat is the opportunity?The Director of SRE will be responsible for the vision, design, development, implementation, and support of Site Reliability Engineering (SRE) solutions for all applications across a line of business within CNB (City National Bank), an RBC company. This individual is expected to lead a team that drives...

  • Director SRE

    1 month ago


    Vancouver, Canada RBC Full time

    Job SummaryJob DescriptionWhat is the opportunity?The Director of SRE will be responsible for the vision, design, development, implementation, and support of Site Reliability Engineering (SRE) solutions for all applications across a line of business within CNB (City National Bank), an RBC company. This individual is expected to lead a team that drives...

  • Director SRE

    2 months ago


    VANCOUVER, Canada Royal Bank of Canada Full time

    Job SummaryJob DescriptionWhat is the opportunity?The Director of SRE will be responsible for the vision, design, development, implementation, and support of Site Reliability Engineering (SRE) solutions for all applications across a line of business within CNB (City National Bank), an RBC company. This individual is expected to lead a team that drives...

  • Director SRE

    2 months ago


    Vancouver, Canada Royal Bank of Canada> Full time

    Job SummaryJob DescriptionWhat is the opportunity?The Director of SRE will be responsible for the vision, design, development, implementation, and support of Site Reliability Engineering (SRE) solutions for all applications across a line of business within CNB (City National Bank), an RBC company. This individual is expected to lead a team that drives...


  • Vancouver, Canada Conexiom Full time

    About the Opportunity: Conexiom is seeking a dedicated and experienced Site Reliability Engineering (SRE) Senior Manager to lead our SRE team. The role involves leading the Cloud SRE team in day-to-day operations, which include monitoring, support activities, ensuring customer satisfaction through reliable service, and building and designing cloud...


  • Vancouver, Canada Compest Solutions Inc Full time

    **Job Title**: - **SRE/ Release Manager, Project Systems (Oracle Primavera Unifier)** **Location**:Vancouver, Canada - onsite role** **Position Type-Regular** Full-Time Salary **Please reply** with your **expected Salary range--** **Responsibilities & Required Skills** On a project assignment: Manage what is happening in each environment Maintain a...


  • North Vancouver, Canada Compest Solutions Inc Full time

    **Job Title**: - **SRE/ Release Manager, Project Systems (Oracle Primavera Unifier)** **Location**:Vancouver, Canada - onsite role** **Position Type-Regular** Full-Time Salary **Please reply** with your **expected Salary range--** **Responsibilities & Required Skills** On a project assignment: Manage what is happening in each environment Maintain a...


  • Vancouver, Canada Arista Networks Full time

    Job DescriptionWho You’ll Work WithSREs at Arista combine strong software and systems engineering with a passion for operating production systems at scale. As an SRE you’ll be part of the team responsible for our global service fleet.What You’ll DoAs an SRE you’ll be responsible for our global CloudVision service fleet. This includes:Building the...


  • Vancouver, Canada Arista Networks Full time

    Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in...


  • Vancouver, Canada Arista Networks Full time

    Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in...


  • Vancouver, Canada Arista Networks Full time

    Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in...


  • Vancouver, Canada Arista Networks Full time

    Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in...


  • Vancouver, British Columbia, British Columbia, Canada Arista Networks Full time

    Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in...


  • Vancouver, BC, Canada Arista Networks Full time

    Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in...

  • DevOps Engineer

    16 hours ago


    Vancouver, British Columbia, Canada Global Relay Full time

    About Global RelayGlobal Relay is a leading provider of enterprise information archiving solutions, with over 20 years of experience in the industry. We offer a range of cloud-based archiving, surveillance, eDiscovery, and analytics solutions that help our clients meet their regulatory requirements and improve their business operations.Your RoleWe are...

  • DevOps Engineer

    20 hours ago


    Vancouver, British Columbia, Canada Global Relay Full time

    About Global RelayGlobal Relay is a leading provider of enterprise information archiving solutions, with over 20 years of experience in the industry. We offer a range of cloud-based archiving, surveillance, eDiscovery, and analytics solutions that help our clients meet their regulatory requirements and improve their business operations.Your RoleWe are...


  • Vancouver, British Columbia, Canada Tyk Technologies Full time

    About Tyk TechnologiesTyk Technologies is a leading provider of API Management solutions, helping organizations connect systems and services across the globe. Our platform is used by thousands of users worldwide, including major brands like Lotte, Bell, and RBS.Job DescriptionWe are seeking an experienced Senior Site Reliability Engineer to join our team. As...


  • Vancouver, British Columbia, Canada Tyk Technologies Full time

    About Tyk TechnologiesTyk Technologies is a leading provider of API Management solutions, helping organizations connect systems and services across the globe. Our platform is used by thousands of users worldwide, including major brands like Lotte, Bell, and RBS.Job DescriptionWe are seeking an experienced Senior Site Reliability Engineer to join our team. As...

  • DevOps Engineer

    1 week ago


    Vancouver, British Columbia, Canada Global Relay Full time

    About Global RelayGlobal Relay is a leading provider of enterprise information archiving solutions, with over 20 years of experience in the industry. We offer a range of cloud-based services, including archiving, surveillance, eDiscovery, and analytics.Job SummaryWe are seeking a highly skilled DevOps/SRE professional to join our team. As a DevOps/SRE, you...

Sre and System Administrator

4 months ago


Vancouver, Canada Uptime.com Full time

You will play a critical role in ensuring the high availability, reliability, and performance of our systems. You will be responsible for designing and implementing scalable, resilient, and automated infrastructure solutions. Additionally, you will collaborate closely with cross-functional teams, such as developers and product managers, to achieve common goals and drive continuous improvement. Your expertise in monitoring, incident management, infrastructure as code, and cloud platforms like AWS or GCP will be invaluable in maintaining and enhancing our systems.

**Responsibilities**:
(As SRE)
- Design and implement highly-available, scalable SaaS systems including web servers, background processing, and databases.
- Develop and maintain infrastructure as code using tools like Terraform, Docker, and Kubernetes.
- Ensure high availability and reliability of systems through effective monitoring, incident management, and proactive troubleshooting.
- Collaborate with cross-functional teams to implement and manage centralized logging systems, ensuring comprehensive visibility into the environment.
- Identify performance bottlenecks and scalability issues, and implement optimizations to improve system performance.
- Design and implement disaster recovery plans to ensure business continuity in the event of a system failure or disaster.
- Stay up-to-date with the latest technologies and industry trends in SRE, and actively seek opportunities for improvement and innovation.
- Foster a culture of ownership and independence, taking responsibility for complex infrastructure implementations and driving them independently when required.

(As System Admin)
- Offer user support: Assist end-users by responding to inquiries, troubleshooting software or hardware issues, and providing guidance on system usage. Handle user requests, password resets, access permissions, and other user-related tasks to enhance user productivity and satisfaction.
- Monitor system health and performance: Utilize monitoring tools to proactively monitor the health and performance of systems, networks, and servers. Identify anomalies, potential issues, or performance bottlenecks and take necessary actions to maintain system stability and efficiency.
- Participate in incident response: Respond promptly to system failures, outages, security breaches, or other critical incidents. Take necessary measures to mitigate the impact, minimize downtime, and restore systems to normal operation.
- Contribute to on-call rotations: Participate in on-call rotations to provide 24/7 support for emergency situations, system alerts, and critical incidents outside of regular working hours. Respond promptly to on-call requests, resolve urgent issues, and ensure continuous system availability.
- Maintain documentation: Create and maintain detailed documentation, including configuration settings, troubleshooting steps, and standard operating procedures (SOPs). Contribute to knowledge bases and share expertise with the team. Ensure accurate and up-to-date documentation for future reference.

**Requirements**:

- Requirements:

- Bachelor's degree in computer science, engineering, or a related field (or equivalent experience).
- Strong experience in designing and implementing highly-available, scalable SaaS systems.
- Proficiency in infrastructure as code tools like Terraform, Docker, and Kubernetes.
- Familiarity with cloud platforms like AWS or GCP, with hands-on experience in designing and managing infrastructure on these platforms.
- Solid understanding of incident management processes and experience in resolving critical incidents.
- Strong problem-solving and troubleshooting skills, with the ability to work effectively under pressure.
- Excellent communication and collaboration skills, with the ability to work closely with cross-functional teams to achieve common objectives.

**Benefits**

How we will support your growth and success:

- Partner with executives, leadership and cross-functional organization including engineering, marketing and business operations.
- Professional development opportunities to further skills and knowledge
- Discover the exciting world of monitoring, observability, and SRE while becoming an advocate and drive innovation in the industry.
- A supportive team of passionate and dedicated individuals all focused on building the best monitoring service in the world.
- Unlimited Paid Time Off (Vacation, Sick & Public Holidays) - even for our global contractors
- Family Leave (Maternity, Paternity)
- Training & Development
- Work From Home