Vignesh Kothapalli

Hi, I’m a Senior ML Engineer on the Foundation Models and Data team at LinkedIn AI. My work centers on post-training techniques for enhancing multi-task learning in large language models. I earned my MSc in Computer Science at NYU Courant, where I worked with Prof. Joan Bruna on the Neural Collapse phenomenon. I have also contributed to TensorFlow and served as a maintainer of TensorFlow-IO during my time at IBM. I hold a B.Tech in Electronics and Communication Engineering from IIT Guwahati.
Research
I’m primarily interested in understanding the computational aspects of learning in neural networks, and using these insights to develop novel modeling techniques.
- Foundation Models: Chain-of-Thought reasoning, In-Context Learning, and Multi-Task Learning in Transformer-based LLMs.
- Learning Dynamics: Understanding feature evolution, weight-matrix spectra, and generalization in neural networks.
- Efficient Training/Inference: Developing low-level (randomized) algorithms for training and inference of large-scale LLMs/GNNs.
What’s New
- [2025.06] Our “Liger Kernel” paper has been accepted to the CODE ML Workshop at ICML 2025!
- [2025.05] The “CoT-ICL Lab” paper has been accepted to ACL Main 2025!
- [2025.04] “Can Kernel Methods Explain How the Data Affects Neural Collapse?” has been accepted to TMLR!
- [2025.02] Our tech report on training and deploying production-grade LLMs at LinkedIn is available on arXiv!
- [2025.01] The 360Brew foundation model tech report is available on arXiv!
- [2024.10] Our tech report on Liger Kernels is now available on arXiv. (GPU Mode / Lightning AI / AMD + Embedded LLM / Blog)
Academic Service
- Conference Reviewer: NeurIPS 2024, ICML 2025
- Journal Reviewer: IEEE Transactions on Cybernetics, IEEE Access, IEEE Transactions on Industrial Informatics, TMLR
Selected Publications
See the full list on Google Scholar.
- Neural Collapse: A Review on Modelling Principles and Generalization. Transactions on Machine Learning Research (TMLR), 2023.