Site Reliability Engineering

1 month ago


Montreal, Canada Cisco Systems, Inc. Full time
Site Reliability Engineering - Technical Leader

Location:

Alternate Location

Area of Interest

Compensation Range

138300 CAD - 203700 CAD

Job Type

Professional

Cloud and Data Center, Software Development

Job Id

1421618

Who We Are

As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive multi" - multi-layer, multi-domain, and multi-vendor networks. Accedian's open and scalable platform removes roadblocks to innovation, enabling cloud-native analytics and empowering customers to launch new assured services based on 5G, SD-WAN, and edge technologies.

Who You Are You are an expert in deployment and network operations, skilled in using scripts and automation tools to enhance software processes. With a passion for scripting and automation, you contribute to effective software strategies, oversee maintenance, and optimize systems. Proficient with Kubernetes and Docker Swarm, you seek new ways to monitor deployment health and performance. Your proactive nature and dedication to tech excellence make you a valuable team member in operational efficiency and reliability. Who You'll Work With

Our team prioritizes your growth in technical, business, and soft skills within a culture that values team strength and investment. We adopt a "You build it, you run it" approach, empowering team members to actively manage and improve our software. Committed to continuous learning, we support mastering new technologies and champion a culture of ambition and innovation in cloud computing.

What You’ll Do

Our growing team is looking for dedicated Service Reliability Engineering professional (SRE) to work with a small, innovative team of industry experts to help perfect our platform by improving our automation processes around deployment and operations.

You will take charge of enhancing the product life cycle, manage configuration, assist in deployment and scripting for management purposes, and collaborate within a cross-functional team. Your responsibility will be to spearhead the initiatives and orchestrate the DevOps cycle. Your responsibilities will include:
  • Monitoring our cloud and Customer On-Premise infrastructure: Assessing its health to offer 24/7 service to our customers.
  • Detecting potential issues : Configure monitoring to intercept them before an outage occurs.
  • Participating in system troubleshooting: and recommend improvements to our platform and tools, regular and systematic code testing, and deployment.
  • Supporting our public cloud deployments : Research, propose and participate in the implementation of security best practices for public cloud deployments and data management.
  • Prioritizing and escalating: Raising problems to Development, collaborating with our Operations lead and on-call engineer to investigate operational issues impacting users and identify root causes.
  • Driving automation development: Build configuration management tools and scripts to address operational incidents.
  • Improving our Security posture: Enforce policies for environment security and their application to our DevOps tools.

This role includes periodic participation in an on-call rotation approximately once every six weeks.

Minimum Qualifications:
  • 12 years of related experience as a Software Engineer, DevOps Engineer, Site Reliability Engineer or a role in a related field.
  • Experience administering Cloud or Virtualized environments using UNIX/LINUX command line and scripting.
  • IT support experience focused on handling and troubleshooting system-wide solutions.
  • Demonstrated experience deploying multi-service applications on cloud platforms such as AWS, Google Cloud, or Azure using a modern toolset.
  • Experience in developing continuous monitoring and automated alerting systems to ensure the stability and reliability of IT systems.
Preferred Qualifications:
  • Experience with configuration management tools such as Ansible, Salt, Puppet, Chef, or similar.
  • Bachelors in a STEM related discipline.
  • A deep understanding of Docker containerization and orchestration, with Kubernetes experience.
  • Knowledge of IP networking, VPNs, DNS, load balancing, and firewall management.
  • Familiarity with infrastructure management solutions; experience with HashiCorp Terraform and HashiCorp Vault is.
  • Experience in setting up and maintaining continuous integration and deployment pipelines.
  • Ability to write and speak French.

Why Cisco?

#WeAreCisco. We are all unique, but collectively we bring our talents to work as a team, to develop innovative technology and power a more inclusive, digital future for everyone. How do we do it? Well, for starters – with people like you

Nearly every internet connection around the world touches Cisco. We’re the Internet’s optimists. Our technology makes sure the data traveling at light speed across connections does so securely, yet it’s not what we make but what we make happen which marks us out. We’re helping those who work in the health service to connect with patients and each other; schools, colleges, and universities to teach in even the most challenging of times. We’re helping businesses of all shapes and sizes to connect with their employees and customers in new ways, providing people with access to the digital skills they need and connecting the most remote parts of the world – whether through 5G, or otherwise.

We tackle whatever challenges come our way. We have each other’s backs, we recognize our accomplishments, and we grow together. We celebrate and support one another – from big and small things in life to big career moments. And giving back is in our DNA (we get 10 days off each year to do just that).

We know that powering an inclusive future starts with us. Because without diversity and a dedication to equality, there is no moving forward. Our 30 Inclusive Communities, that bring people together around commonalities or passions, are leading the way. Together we’re committed to learning, listening, caring for our communities, whilst supporting the most vulnerable with a collective effort to make this world a better place either with technology, or through our actions.

So, you have colorful hair? Don’t care. Tattoos? Show off your ink. Like polka dots? That’s cool. Pop culture geek? Many of us are. Passion for technology and world changing? Be you, with us #WeAreCisco

Message to applicants applying to work in the U.S. and/or Canada:

When available, the salary range posted for this position reflects the projected hiring range for new hire, full-time salaries in U.S. and/or Canada locations, not including equity or benefits. For non-sales roles the hiring ranges reflect base salary only; employees are also eligible to receive annual bonuses. Hiring ranges for sales positions include base and incentive compensation target. Individual pay is determined by the candidate's hiring location and additional factors, including but not limited to skillset, experience, and relevant education, certifications, or training. Applicants may not be eligible for the full salary range based on their U.S. or Canada hiring location. The recruiter can share more details about compensation for the role in your location during the hiring process.

U.S. employees have access to quality medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, short and long-term disability coverage, basic life insurance and numerous wellbeing offerings. Employees receive up to twelve paid holidays per calendar year, which includes one floating holiday, plus a day off for their birthday. Employees accrue up to 20 days of Paid Time Off (PTO) each year and have access to paid time away to deal with critical or emergency issues without tapping into their PTO. We offer additional paid time to volunteer and give back to the community. Employees are also able to purchase company stock through our Employee Stock Purchase Program.

Employees on sales plans earn performance-based incentive pay on top of their base salary, which is split between quota and non-quota components. For quota-based incentive pay, Cisco typically pays as follows:

.75% of incentive target for each 1% of revenue attainment up to 50% of quota;

1.5% of incentive target for each 1% of attainment between 50% and 75%;

1% of incentive target for each 1% of attainment between 75% and 100%; and once performance exceeds 100% attainment, incentive rates are at or above 1% for each 1% of attainment with no cap on incentive compensation.

For non-quota-based sales performance elements such as strategic sales objectives, Cisco may pay up to 125% of target. Cisco sales plans do not have a minimum threshold of performance for sales incentive compensation to be paid.

Sign up to receive notifications of similar jobs #J-18808-Ljbffr

  • Montreal, Canada Soho Square Solutions Full time

    Role: Site Reliability Engineer Duration: 12 Months Location: Montreal, QC Bilingual: French & English Hybrid Role As a Site Reliability Engineering Developer, you are a specialist in the development and management of resilient critical assets. You actively participate in achieving our DevOps visio


  • Montreal, Canada Soho Square Solutions Full time

    Role: Site Reliability Engineer Duration: 12 MonthsLocation: Montreal, QCBilingual: French & EnglishHybrid RoleAs a Site Reliability Engineering Developer, you are a specialist in the development and management of resilient critical assets. You actively participate in achieving our DevOps vision by integrating SRE best practices. Advanced knowledge in Java....


  • Montreal, Canada Soho Square Solutions Full time

    Role: Site Reliability Engineer Duration: 12 MonthsLocation: Montreal, QCBilingual: French & EnglishHybrid RoleAs a Site Reliability Engineering Developer, you are a specialist in the development and management of resilient critical assets. You actively participate in achieving our DevOps vision by integrating SRE best practices. Advanced knowledge in Java....


  • Montreal, Canada Soho Square Solutions Full time

    Role: Site Reliability Engineer Duration: 12 Months Location: Montreal, QC Bilingual: French & English Hybrid Role As a Site Reliability Engineering Developer, you are a specialist in the development and management of resilient critical assets. You actively participate in achieving our DevOps vision by integrating SRE best practices. Advanced knowledge in...


  • Montreal, Canada Soho Square Solutions Full time

    Role: Site Reliability Engineer Duration: 12 MonthsLocation: Montreal, QCBilingual: French & EnglishHybrid RoleAs a Site Reliability Engineering Developer, you are a specialist in the development and management of resilient critical assets. You actively participate in achieving our DevOps vision by integrating SRE best practices. Advanced knowledge in Java....


  • Montreal, Quebec, Canada Cisco Systems, Inc. Full time

    Site Reliability Engineering - Technical Leader Location: Alternate Location Area of Interest Compensation Range CAD CAD Job Type Professional Cloud and Data Center, Software Development Job Id Who We Are As a part of Cisco, Accedian is a leader in per


  • Montreal, Canada Lyft Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods.As a leader in micromobility, Lyft powers...


  • Montreal, Canada Socotra, Inc. Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods. As a leader in micromobility, Lyft powers...


  • Montreal, Canada Socotra, Inc. Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods. As a leader in micromobility, Lyft powers...


  • Montreal, Canada Socotra, Inc. Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods. As a leader in micromobility, Lyft powers...


  • Montreal, Canada Lyft Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods.As a leader in micromobility, Lyft powers...


  • Montreal, Canada Socotra, Inc. Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods. As a leader in micromobility, Lyft powers...


  • Montreal, Quebec, Canada Noverka Conseil Full time

    At Noverka, our values illustrate who we are and define our beliefs: Human, Transparent, Passionate. We are driven by innovation and success, both in our relationships and in our practices.Finding the right job for the right person is what we do bestOur client, an organization in the banking industry is looking for a Site Reliability Engineering (SRE)...


  • Montreal, Quebec, Canada Lyft Full time

    At Lyft, our mission is to enhance people's lives with top-notch transportation services. We strive to foster an inclusive and diverse environment in our community, valuing the unique contributions of each team member. Our goal is to revolutionize the way the world approaches transportation, envisioning a future where cities feel more connected and...


  • Montreal, Quebec, Canada SAP Full time

    We help the world run better Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly...


  • Montreal, Quebec, Canada Cisco Full time

    ```htmlWho We AreAs a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive multi" - multi-layer, multi-domain, and multi-vendor networks. Accedian's open...


  • Montreal, Quebec, Canada Socotra, Inc. Full time

    At Lyft, our mission is to improve people's lives with the world's best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods. As a leader in micromobility, Lyft powers...


  • Montreal, Canada Plexia Full time

    Job DescriptionAs a Junior Site Reliability Engineer (SRE) you will play a crucial role within the R&D and Innovation department. You will be called upon to collaborate with the Plexia product-aligned and core architecture team. The highly sensitive nature of health and medical systems expertise makes it so that the availability and reliability of our...


  • Montreal, Canada Plexia Full time

    Job DescriptionAs a Junior Site Reliability Engineer (SRE) you will play a crucial role within the R&D and Innovation department. You will be called upon to collaborate with the Plexia product-aligned and core architecture team. The highly sensitive nature of health and medical systems expertise makes it so that the availability and reliability of our...


  • Montreal, Canada Lightspeed Full time

    Welcome to NuOrder by Lightspeed! Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...