Member of Technical Staff, Training and Inference

1 day ago


Toronto, Ontario, Canada Boson AI Full time
Boson AI is an early-stage startup building large audio models for everyone to enjoy and use. Our founders (Alex Smola, Mu Li) and a team of deep learning, optimization, NLP, and statistics scientists and engineers are working on high-quality generative AI models for language and beyond.
We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. In this role, you will implement and improve distributed optimization algorithms, performance-tune architectures, improve inference, and help us make deep networks run efficiently on our cluster. The ideal candidate will have a strong background in CUDA, Triton, PyTorch, distributed optimization, and deep learning architectures.
We encourage you to apply even if you do not believe you meet every single qualification. As long as you are motivated to learn and join the development of foundation models, we'd love to chat.
Responsibilities
  • Optimize model architectures and loss objectives to handle combinations of images, video, text, speech, and audio data.
  • Implement and optimize kernels for efficient training on Hopper and Blackwell GPUs
  • Performance optimization (floating-point formats, sparsity, systems-level optimization)
  • Distributed optimization and training
You may be a good fit if you have:
  • Experience in writing clean and efficient code.
  • Master's or doctoral degree in computer science or equivalent.
  • Proficiency in at least one deep learning framework, such as PyTorch or JAX.
  • Participation in at least one research project related to distributed training or inference.
Strong candidates may also have:
  • Experience in implementing your own kernels in CUDA or another compiler/toolkit (Triton, ThunderKittens, PTX, etc.)
  • Experience in distributed optimization (e.g. using DeepSpeed, FSDP), ideally designing performance optimizations
  • Experience in computer networking (e.g. Infiniband, using SHARP)
  • Experience in handling data at billion-record scale
$150,000 - $600,000 a year
Total compensation includes base pay, equity, and benefits.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.