AI Research Engineer

12 hours ago


Canada EnCharge AI Full time

EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge’s robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today’s best-in-class solutions. The high-performance architecture is coupled with seamless software integration and will enable the immense potential of AI to be accessible in power-, energy-, and space-constrained applications. EnCharge AI launched in 2022 and is led by veteran technologists with backgrounds in semiconductor design and AI systems.

About the Role

EnCharge AI is looking for an experienced AI Research Engineer to optimize deep learning models for deployment on edge AI platforms. You will work on model compression, quantization strategies, and efficient inference techniques to improve the performance of AI workloads.

Responsibilities

• Research and develop quantization-aware training (QAT) and post-training quantization (PTQ) techniques for deep learning models.
• Implement low-bit precision optimizations (e.g., INT8, BF16).
• Design and optimize efficient inference algorithms for AI workloads, focusing on latency, memory footprint, and power efficiency.
• Work with frameworks such as PyTorch, ONNX Runtime, and TVM to deploy optimized models.
• Analyze accuracy trade-offs and develop calibration techniques to mitigate precision loss in quantized models.
• Collaborate with hardware engineers to optimize model execution for edge devices and NPUs.
• Contribute to research on knowledge distillation, sparsity, pruning, and model compression techniques.
• Benchmark performance across different hardware and software stacks.
• Stay updated with the latest advancements in AI efficiency, model compression, and hardware acceleration.

Qualifications

• Master’s or Ph.D. in Computer Science, Electrical Engineering, or a related field.
• Strong expertise in deep learning, model optimization, and numerical precision analysis.
• Hands-on experience with model quantization techniques (QAT, PTQ, mixed precision).
• Proficiency in Python, C++, CUDA, or OpenCL for performance optimization.
• Experience with AI frameworks: PyTorch, TensorFlow, ONNX Runtime, TVM, TensorRT, or OpenVINO.
• Understanding of low-level hardware acceleration (e.g., SIMD, AVX, Tensor Cores, VNNI).
• Familiarity with compiler optimizations for ML workloads (e.g., XLA, MLIR, LLVM).

EnCharge AI is an equal employment opportunity employer in the United States.



  • Canada, Germany, Norway, United States EnCharge AI Full time

    EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...


  • Canada EnCharge AI Full time

    A leading AI technology company in Canada is seeking an experienced AI Research Engineer to optimize deep learning models for edge AI platforms. Responsibilities include developing quantization strategies and efficient inference techniques. Candidates must possess a Master's or Ph.D. in Computer Science or Electrical Engineering, with expertise in deep...

  • Director, AI Research

    2 weeks ago


    Canada Info-Tech Research Full time

    The Director of AI Research & Advisory is responsible for delivering Info-Tech’s research projects and advisory services to members in the domains of AI Strategy, Agentic AI Prototyping and Development, Workflow Orchestration, Solution Implementation & Integration. The successful candidate will provide research and experience-based insight and guidance,...

  • AI Compiler Engineer

    2 weeks ago


    U.S., Canada, Germany, Norway EnCharge AI Full time

    EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...

  • Research Engineer

    1 day ago


    Canada Yotta Labs Full time

    Research Engineer - Decentralized AI Systems. About Yotta Labs: Yotta Labs is pioneering the development of a Decentralized Operating System (DeOS) for AI workload orchestration at planetary scale. Our mission is to democratize access to AI resources by aggregating...


  • Canada Yasp Full time

    Yasp is pioneering the future of software development with a compiler that leverages agentic AI for advanced optimization and code generation. We are looking for a visionary Research Engineer – AI for Code to join our team and drive the core innovation that will define the next generation of our technology. We don’t draw boundaries between research and...


  • BC, Canada Stellar AI Full time

    A global AI technology firm is looking for a Senior Software Engineer to work remotely. In this role, you will advance AI research, analyze large codebases, and ensure correctness in software behaviors. Successful candidates will have a bachelor's degree in Computer Science and at least 2 years of software engineering experience. This position offers...

  • STEM PhDs

    1 day ago


    Canada Weekday AI (YC W21) Full time

    We are seeking Engineering PhDs to contribute to a cutting-edge project with a leading AI research lab. This role offers competitive pay of $65-$75/hour. In this position, you will apply your expertise to craft high-quality, challenging problems with real-world applicability, helping to advance frontier large language models and shape the next generation of...


  • Silver Drive Vancouver, British Columbia, VH Y Canada Huawei Technologies Canada Co. Full time

    Job description: Huawei Canada has an immediate opening for a full-time permanent Principal AI Research Engineer. About the Team: Established in 2012, the Central Media Technology Institute (CMTI) is Huawei's center for media technology innovation and engineering, enhancing the technical competitiveness of media products. Sitting within CMTI, the...


  • Canada Yasp Full time

    A pioneering software development company in Canada is seeking a Research Engineer – AI for Code to spearhead innovations in AI-driven code generation and optimization. The ideal candidate will hold a Master’s or PhD in relevant fields and possess exceptional skills in machine learning and programming. You will collaborate in a dynamic team, engage in...