Chankyu Lee

Chankyu Lee

Senior Research Scientist

NVIDIA

Biography

Chankyu Lee is a Senior Research Scientist at NVIDIA's Applied Deep Learning Research (ADLR) team, where he focuses on advancing large language models, including post-training research for embedding, reasoning, and agentic coding models. He obtained Ph.D. degree from Electrical and Computer Engineering, Purdue University (Advisor: Prof. Kaushik Roy).

Interests

  • Large Language Models
  • Agentic AI
  • Algorithm-Hardware Co-design

Education

  • PhD in Electrical and Computer Engineering, Fall 2015 - Spring 2021

    Purdue University, West Lafayette, IN, USA

  • BS in Electrical and Electronics Engineering, Spring 2009 - Spring 2015

    Sungkyunkwan University (SKKU), Suwon, South Korea

  • Exchange Student Program, Electronic and Computer Engineering, Fall 2013

    Hong Kong University of Science and Technology (HKUST), Hong Kong

Experience

 
 
 
 
 

Senior Research Scientist

NVIDIA Corporation

Mar 2022 – Present Santa Clara, California, USA

Applied Deep Learning Research

  • Post-training LLM for reasoning capability (agentic coding, code generation, math, etc). Part of Ace-reason and nemotron-cascade effort, best-in-class 8B, 14B and 30B-A3B LLM. Extended to flagship nemotron model family (nano/super/ultra).
  • Information Retrieval: embedding model and Retrieval Augmented Generation (RAG). NV-Embed-v1 and v2: No. 1 ranking embedding models on the MTEB leaderboard and 2M huggingface model downloads. MM-Embed: multimodal extension of NV-Embed.

TAO-toolkit and autoML

  • Accelerating the model training process by abstracting away the AI/deep learning framework complexity [link].
 
 
 
 
 

Largescale Machine Learning Engineer

Intel Corporation

Apr 2021 – Mar 2022 Austin, Texas, USA
AI workload performance optimization (i.e., improving the runtime, efficiency, scalability) for the largescale machine learning training on Intel AI accelerators (Gaudi-Habana Labs) [link].
 
 
 
 
 

Graduate Internship

Bell Labs, Nokia

Jun 2018 – Aug 2018 Murray Hill, New Jersey, USA
Developed the functional modeling simulator for mapping and scheduling CNN (AlexNet, VGG, ResNet) algorithms on a MIMD (Multi-Instruction Multi-Data) processor toward accelerated and energy-efficient AI computing.
 
 
 
 
 

Graduate Research Assistant

Purdue University

Aug 2016 – Dec 2020 West Lafayette, Indiana, USA

Exploratory research on neuromorphic computing for energy-efficient and robust deep learning, overcoming limitations of current artificial intelligence through algorithm-hardware co-design.

  • Topic 1: Developed novel unsupervised/supervised/semi-supervised learnings for deep convolutional Spiking Neural Networks (SNNs) to efficiently harness machine learning algorithms.
  • Topic 2: Developed energy-efficient motion estimation algorithms for event-based camera in challenging scenes such as high-speed and wide-dynamic range.
  • Research Outputs: 13 publications including 6 first-authored papers (1 IEEE TCDS, 2 Frontiers in Neuroscience, 1 ECCV, 1 Neurocomputing, 1 IEEE ICRA).
 
 
 
 
 

Undergraduate Research Assistant

Graduate School of Convergence Science and Technology, Seoul National University

Jul 2014 – Aug 2014 Suwon, South Korea
Research on mid-field wireless powering for simulating neural signals in the brain-machine interface. Advised by Prof. Yoonkyu Song

Selected Publications

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Scaling cascaded reinforcement learning for general-purpose reasoning models, achieving best-in-class 8B, 14B and 30B-A3B LLMs.

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Improved techniques for training LLMs as generalist embedding models, achieving No. 1 ranking on the MTEB leaderboard.

Enabling Spike-Based Backpropagation for Training Deep Neural Network Architectures

Spiking Neural Networks (SNNs) have recently emerged as a prominent neural computing paradigm. However, the typical shallow SNN …

Contact