Site Reliability Engineer
3 weeks ago
Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity.
We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.
All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
About the Role
We're looking for an experienced site reliability engineer (SRE) who can thrive in a dynamic start-up environment. The main responsibilities for this role are:
- Improving our observability by adding/adjusting metrics
- Building easily parsable dashboards
- Designing and overseeing our on-call rotations
- Improving our deployment process to increase reliability
An ideal candidate meets at least the following requirements:
- Expert in at least one programming language that compiles to machine code, such as Rust, C++, or Go. Rust or C++ experience is preferred
- Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty
- Expert knowledge of deployment technologies such as Pulumi or Terraform
- Expert knowledge of Kubernetes
Location
The role is based in our London office close to Piccadilly Circus underground station. We usually work from the office 5 days a week but allow for work-from-home days when required. Candidates must be willing to attend late meetings at least twice a week to coordinate with the rest of our team, which is based in California. This role includes semi-regular business trips to California. We are also open to hiring in our HQ office in Palo Alto, CA.
Interview Process
After submitting your application, the team will review your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute interview ("phone interview") during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of two technical interviews.
Our goal is to finish the process within one week. All interviews will be conducted via Google Meet.
Benefits
- Competitive cash-based compensation
- xAI equity
- Private health and dental insurance
- Unlimited time off subject to prior approval
xAI is an equal opportunity employer and does not unlawfully discriminate based on race, color, religion, ethnicity, ancestry, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, disability, medical conditions, genetic information, marital status, military or veteran status, or any other applicable legally protected characteristics.