Network Engineer, AI/ML Infrastructure

5 days ago


Toronto, Ontario, Canada Boson AI Full time US$150,000 - US$250,000

About The Role

We're seeking an experienced Network Engineer to design, build, and optimize the high-performance networking infrastructure powering our AI/ML operations in Toronto. You'll work at the cutting edge of network technology—managing InfiniBand and ultra-high-speed Ethernet fabrics that connect NVIDIA H100 and A100 GPUs, over 20PB of Ceph storage, and hundreds of servers.

You'll be hands-on with the full lifecycle of our network infrastructure: planning, building, testing, deploying, and keeping everything running at peak performance. That means troubleshooting issues as they arise, monitoring network performance and throughput, developing automation to streamline operations, and working closely with HPC and ML teams to ensure they have the bandwidth they need. You'll also help us plan for future capacity and evaluate emerging network technologies as we scale to meet increasingly demanding workloads.

Responsibilities
  • Configure and maintain InfiniBand and high-speed Ethernet fabrics
  • Optimize network performance for RDMA, and GPU-to-GPU communication
  • Manage network switches (Mellanox, NVIDIA, Micas Networks)
  • Troubleshoot network bottlenecks and latency issues
  • Plan and execute network upgrades and expansions
  • Network security implementation (firewalls, VLANs, ACLs)
  • Collaborate on storage network optimizationInfrastructure monitoring
Minimum Qualifications
  • 4+ years of network engineering experience in production environments
  • Strong understanding of L2/L3 networking protocols (TCP/IP, BGP, OSPF, VLANs)
  • Hands-on experience with high-speed networking (100Gb+ Ethernet and InfiniBand)
  • Hands-on experience with network security (firewalls, ACLs, network segmentation)
  • Knowledge of HPC network topologies
  • Experience with InfiniBand fabrics including RDMA, RoCE, IPoIB
  • Strong troubleshooting and problem-solving skills
Preferred Qualifications
  • Experience in data center environments or AI/ML infrastructure
  • Hands-on experience with high-performance Ethernet switches (e.g., Broadcom Tomahawk), and latest InfiniBand switches (e.g., Nvidia/Mellanox)
  • Experience optimizing networks for GPU-to-GPU communication
  • Experience with open-source firewall solutions (OPNsense, pfSense, or similar)
  • Experience with network automation tools
  • Understanding of distributed storage networking (Ceph cluster networks)
  • Familiarity with network monitoring and observability tools (Prometheus, Grafana)
  • Knowledge of multi-site network connectivity and WAN optimization
  • Familiarity with cloud networking in at least one platform (AWS, GCP, or Azure) including VPC design, site-to-site VPN configuration, Direct Connect/ExpressRoute/Cloud Interconnect, hybrid cloud connectivity, and cloud-to-datacenter network integration
$150,000 - $250,000 a year

If you're a natural problem-solver with a passion for continuous learning, we'd love to hear from you.



  • Toronto, Ontario, Canada Boson AI Full time $120,000 - $180,000 per year

    About The Role We're seeking an experienced Network Engineer to design, build, and optimize the high-performance networking infrastructure powering our AI/ML operations in Toronto. You'll work at the cutting edge of network technology—managing InfiniBand and ultra-high-speed Ethernet fabrics that connect NVIDIA H100 and A100 GPUs, over 20PB of Ceph...


  • Toronto, Ontario, Canada Boson AI Full time US$150,000 - US$250,000

    About The RoleWe're looking for a Senior High Performance Computing Engineer to help us run one of the most exciting GPU clusters around—our Toronto datacenter packed with NVIDIA H100 and A100 GPUs, over 20PB of Ceph storage, terabit networking, and hundreds of servers.You'll be hands-on with the full lifecycle of HPC infrastructure: planning, building,...


  • Toronto, Ontario, Canada Boson AI Full time $120,000 - $180,000 per year

    About The Role We're looking for a Senior High Performance Computing Engineer to help us run one of the most exciting GPU clusters around—our Toronto datacenter packed with NVIDIA H100 and A100 GPUs, over 20PB of Ceph storage, terabit networking, and hundreds of servers You'll be hands-on with the full lifecycle of HPC infrastructure: planning, building,...


  • Toronto, Ontario, Canada 43North Full time $120,000 - $180,000 per year

    About CoverRight CoverRight is on a mission to make Medicare, and retiree health-related-finances transparent and accessible. We're building technology to empower older adults to confidently navigate their healthcare options through personalized guidance and modern digital experiences. We're a fast-growing, venture-backed company blending human expertise...


  • Toronto, Ontario, Canada Amyantek Full time $100,000 - $150,000 per year

    JD-2025-AIML-1: Senior AI/ML Engineer – Agentic AILocation: Toronto, ON (Hybrid)Client: Applab/LoblawsType: Full-time Team: Machine Learning Platform / Digital & DataAbout the RoleLoblaws Digital is hiring a Senior AI/ML Engineer with a strong emphasis on Agentic AI systems. This role focuses on building production-grade multi-agent workflows, LLM-powered...


  • Toronto, Ontario, Canada BMO Full time $1,232,000 - $2,304,000 per year

    Role OverviewWe are seeking a highly analytical and technically proficient Senior ML/AI Engineer to join our ARC team. This role is ideal for someone with a strong foundation in mathematics, statistics, and programming, and a passion for applying AI to solve complex financial problems. You will work to develop AI/ML/DS features for enterprise-wide AI...


  • Toronto, Ontario, Canada BMO Full time US$103,200 - US$192,000

    Application Deadline:10/30/2025Address:100 King Street West Job Family Group:Data Analytics & ReportingRole Overview We are seeking a highly analytical and technically proficient Senior ML/AI Engineer to join our ARC team. This role is ideal for someone with a strong foundation in mathematics, statistics, and programming, and a passion for applying AI...

  • AI/ML Engineer

    2 weeks ago


    Toronto, Ontario, Canada Spait Infotech Private Limited Full time $86,119 - $176,760 per year

    Key ResponsibilitiesDesign and implement machine learning models and AI algorithms to solve business problems.Work with large datasets — cleaning, preprocessing, and feature engineering for model optimization.Deploy ML models into production environments using MLOps frameworks and CI/CD pipelines.Collaborate with cross-functional teams to integrate AI...


  • Toronto, Ontario, Canada MERCOR Full time $150,000 - $200,000 per year

    Mercor is hiring AI Agent Infrastructure Engineers on behalf of a leading AI Lab developing scalable systems to power the next generation of intelligent, autonomous agents. This is a unique opportunity to work with world-class AI researchers and engineers, building the infrastructure that enables advanced reasoning, multi-agent coordination, and real-world...


  • Toronto, Ontario, Canada Unity Technologies Full time $120,000 - $140,000 per year

    The opportunityAt Unity, we're shaping the future of real-time 3D by applying machine learning to revolutionize how games are created and experienced. From neural rendering to on-device inference optimization on high-end mobile devices, we're building the next generation of ML-powered graphics pipelines that enable faster, more immersive, and more efficient...