Automation Infrastructure System Admin

7 days ago


Markham, Ontario, Canada Advanced Micro Devices, Inc Full time

Overview:

WHAT YOU DO AT AMD CHANGES EVERYTHING
We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world.

Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.

Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. This is who we are at our best. One Company. One Team.

AMD together we advance_

Responsibilities:

Automation

Infrastructure System Admin

THE ROLE:

Our automation & tools code base runs from pre-silicon environments, prototype lab systems to the fastest supercomputers in existence. Join a team using modern industry best practices across the full stack spectrum. Make a difference by helping us accelerate AMD's pace of innovation.


As part of the Datacenter GPU/AP Infrastructure team, you will be involved in the development of automation tools, content, and infrastructure to validate datacenter GPU/CPU hardware and software.

Your work will enable validation teams to improve their processes through developing new automation features, or by helping debug or co-create their automated test content.


THE PERSON:

  • Linux Systems Administrator with background with modern best practices and stack understanding
  • Strong problemsolving and troubleshooting skills
  • Eagerness to learn, adapt to new technologies, and stay uptodate with industry trends
  • Customer service mindset for providing support to lab teams
  • Detail oriented close attention to the finer details of systems and processes to identify potential issues and areas for improvement
  • Excellent written and verbal communication skills

KEY RESPONSIBILITIES:

  • Support inhouse automation and infrastructure solutions that can scale across multiple sites and geographies
  • Respond to and troubleshoot incidences reported by internal users or infrastructure alerts
  • Perform postmortem analysis as well as improve processes or add solutions to prevent future outages
  • Help with capacity planning, performance tuning and optimization of solutions

PREFERRED EXPERIENCE:

  • Experience working in a technical support and/or operations role
  • Understanding of network, OSI model, and troubleshooting
  • Strong understanding of Linux, Virtualization, and proficiency in Windows
  • Understanding of incident management, including incident response, and postmortem analysis
  • Experience with Python programming and Ansible
  • Awareness of emerging trends and technologies in the reliability and infrastructure space, such AI/MLbased monitoring solutions
  • Basic understanding of Kubernetes and containers
  • Database knowledge of Postgres / MySQL
  • Experience with tools and techniques for collecting, analyzing, and monitoring log data, such as ELK Stack (Elasticsearch, Logstash, Kibana) or Splunk
  • Azure cloud knowledge for managing infrastructure and services
  • Familiarity with Agile to effectively participate in the team's work process
  • Proficiency in using version control systems like Git and Infrastructure as Code
  • Familiarity with CI/CD tools and processes, like Jenkins, GitLab CI, or Azure DevOps,
  • Familiarity with lab environments is an asset
  • Familiarity with SRE best practices, such as the Google SRE handbook and other industry standards
  • Nice to have: certifications in relevant technologies and/or methodologies (e.g., Azure/Cloud, Kubernetes, or SCRUM/Agile)

ACADEMIC CREDENTIALS:

  • A background in computer science, engineering, or a related field

LOCATION:

Markham, Ontario

Qualifications:

  • Benefits offered are described: _AMD benefits at a glance.


  • Markham, Ontario, Canada Manrkē Full time

    Are you passionate about success? We're scaling rapidly and we need best-in-class talent.At Manrkē, business and tax professionals from around the world are empowering clients to become financial champions. From top athletes to musicians and global influencers as well as entrepreneurs in business tech, digital media, e-commerce, virtual education, and more,...


  • Markham, Ontario, Canada Extendicare Full time

    Job Description:The role will be responsible for user administration and support of the Windows Active Directory, Azure Cloud and Data Centre environment within the Extendicare landscape. The individual will work closely with the Systems infrastructure team to help actively support all hardware, software and systems requirements across all environments and...


  • Markham, Ontario, Canada Green Infrastructure Partners Full time

    Reporting to the Director, Enterprise Applications, the Business Systems Analyst will be involved in performing detailed systems tests, developing new system architectures, translating strategic objectives into technology solutions, creating end-user-facing reports, developing process flows, building dashboards and will be responsible for developing and...


  • Markham, Ontario, Canada AMD Full time

    Job Description WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Markham, Ontario, Canada AMD Full time

    Job Description WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Markham, Ontario, Canada Qualcomm Full time

    Company: Qualcomm Canada ULC Job Area: Engineering Group, Engineering Group > Software Engineering General Summary: We are looking for an Automation and DevOps oriented software engineer to help deliver cutting edge AI software technology. Our team manages the cross-site infrastructure and tools that enable Qualcomm's global AISW teams to...


  • Markham, Ontario, Canada Pathway Communications Full time

    Pathway Communications is a leader in delivering high-quality Managed IT and Cybersecurity Solutions to midsized organizations across Canada. We count several of Canada's best-known and most prestigious brands amongst our clients.We are looking for an individual with a strong Data Centre, Storage, Hardware, System Management, Technology and Operations...


  • Markham, Ontario, Canada Pathway Communications Full time

    Pathway Communications is a leader in delivering high-quality Managed IT and Cybersecurity Solutions to midsized organizations across Canada. We count several of Canada's best-known and most prestigious brands amongst our clients.We are looking for an individual with a strong Data Centre, Storage, Hardware, System Management, Technology and Operations...


  • Markham, Ontario, Canada AMPHENOL CANADA CORP Full time

    LET'S CONNECTAmphenol Canada Corp.Reasons to Join Amphenol Canada Excellent benefits coverage, including health, dental, vision & travellers' insurance Health Care Spending Account Company pension plan Progressive employee incentive plans Talent development we invest in your growth Company Recreation Club offers fun draws with exciting prizes, summer BBQs,...


  • Markham, Ontario, Canada AMPHENOL CANADA CORP Full time

    LET'S CONNECTPosition title:Systems AdministratorReporting to:Director, Information TechnologyType of position: Full-time, permanentSummary of Business:The System Administrator is responsible for the maintenance, configuration, and reliable operation of the organization's computer systems and servers. You will install hardware and software and participate in...


  • Markham, Ontario, Canada Saint Elizabeth Health Care Full time

    JOB SUMMARY: The Junior DBA and Storage Admin will play a crucial role in managing and maintaining the organization's database and backup systems. This position involves ensuring data accuracy, performance, security, and smooth workflow. The ideal candidate will work closely with various departments to support and improve our database infrastructure. JOB...


  • Markham, Ontario, Canada Saint Elizabeth Health Care Full time

    JOB SUMMARY: The Junior DBA and Storage Admin will play a crucial role in managing and maintaining the organization's database and backup systems. This position involves ensuring data accuracy, performance, security, and smooth workflow. The ideal candidate will work closely with various departments to support and improve our database infrastructure. JOB...


  • Markham, Ontario, Canada Saint Elizabeth Full time

    JOB SUMMARY: The Junior DBA and Storage Admin will play a crucial role in managing and maintaining the organization's database and backup systems. This position involves ensuring data accuracy, performance, security, and smooth workflow. The ideal candidate will work closely with various departments to support and improve our database infrastructure. JOB...


  • Markham, Ontario, Canada Pathway Communications Full time

    Duties and ResponsibilitiesVirtualization: Oversee and maintain virtualization technologies like VMware/Hyper-V to support efficient resource allocation and scalability.Cybersecurity: Implement robust cybersecurity practices, including vulnerability assessments, threat detection, and incident response. Utilize tools like Nessus, Rapid7, Bitdefender, MS...


  • Markham, Ontario, Canada CB Canada Full time

    QA Automation AnalystOn behalf of our client, Procom is seeking an experienced QA Automation Analyst for a 12 month contract opportunity based in Markham, ONQA Automation Analyst Job DetailsYou will become an expert on our clients administration system and will participate in various aspects of test planning and executionyou will assist in developing...


  • Markham, Ontario, Canada Allstate Canada Full time

    Who is Allstate:Allstate Insurance Company of Canada is a leading home and auto insurer focused on providing its customers prevention and protection products and services for every stage of life. The company is proud to have been named a Best Employer in Canada for nine consecutive years and prioritizes supporting employees and fostering an inclusive,...


  • Markham, Ontario, Canada Green Infrastructure Partners Full time

    Reporting to the Director, Enterprise Applications, the Business Systems Analyst will be involved in performing detailed systems tests, developing new system architectures, translating strategic objectives into technology solutions, creating end-user-facing reports, developing process flows, buildin


  • Markham, Ontario, Canada SCI Lease Corp. Full time

    Seeking a Lease Services Admin who will help ensure optimal customer experience throughout the customer's lease journey. As a Lease Services Admin, you will support the lease administration teams initiate customers into their lease, walk them through SCI's leasing process as well as provide continuous administrative support. You will play a key role in...


  • Markham, Ontario, Canada AMD Full time

    Job Description WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Markham, Ontario, Canada AMD Full time

    Job Description WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....