November 26, 2024
Iterative Combinatorial Brain Surgeon: Scalable Pruning of Large Language and Vision Models (LLVMs)
By: Elton Zhu & Serdar Kadioglu
The Challenge
State-of-the-art large language and vision models (LLVMs) have seen tremendous success, but their massive scale comes with a hefty price in terms of computational resources. The need to balance performance and efficiency has led to a growing interest in model compression techniques. By using methods like pruning, quantization, or distillation, researchers aim to streamline these models without sacrificing their impressive accuracy.
The Impact
By combining advanced pruning methods, such as the one proposed below, with specialized hardware support for sparse models, we can significantly decrease the computational power and energy required to run AI models while largely maintaining their original performance. This can enable the deployment of smaller, more efficient models directly on devices rather than relying on server-side processing, which in turn can help enhance data privacy because the data does not need to leave the device.
The Outcomes
We proposed iterative Combinatorial Brain Surgeon (iCBS), a scalable iterative pruning algorithm that optimizes over small blocks of weights in neural networks using block coordinate descent. Because each step optimizes only one small block while the remaining weights are held fixed, this blockwise approach can allow iCBS to scale to very large models, including LLVMs with billions of parameters, while helping to achieve higher performance compared to existing one-shot pruning techniques.
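To make the blockwise idea concrete, here is a minimal sketch in Python. It is not the implementation from the paper; it only illustrates the general pattern of optimizing a pruning mask one small block at a time. Everything in it is an assumption made for illustration: the function names `loss_change` and `prune_blockwise`, the second-order (gradient plus Hessian) estimate of the loss change, the magnitude-pruning starting point, the random block selection, and the exhaustive search inside each block, which stands in for whatever combinatorial solver a real system would use.

```python
import itertools
import numpy as np


def loss_change(mask, w, g, H):
    """Second-order estimate of the loss change when weights with mask == 0 are zeroed."""
    delta = (mask - 1.0) * w  # a pruned weight moves from w_i to 0, a kept weight does not move
    return float(g @ delta + 0.5 * delta @ H @ delta)


def prune_blockwise(w, g, H, density, block_size=6, n_iters=20, seed=0):
    """Illustrative blockwise mask refinement (hypothetical helper, not the paper's code)."""
    rng = np.random.default_rng(seed)
    n = w.size
    n_keep = int(round(density * n))

    # Assumed starting point: magnitude pruning keeps the n_keep largest |w|.
    mask = np.zeros(n)
    mask[np.argsort(-np.abs(w))[:n_keep]] = 1.0

    for _ in range(n_iters):
        # Pick a small block of weights; all other mask entries stay fixed.
        block = rng.choice(n, size=block_size, replace=False)
        k = int(mask[block].sum())  # keeping k fixed preserves the overall density

        best_mask, best_val = mask, loss_change(mask, w, g, H)
        # Exhaustive search over which k weights in the block to keep.
        for kept in itertools.combinations(range(block_size), k):
            trial = mask.copy()
            trial[block] = 0.0
            trial[block[np.array(kept, dtype=int)]] = 1.0
            val = loss_change(trial, w, g, H)
            if val < best_val:
                best_mask, best_val = trial, val
        mask = best_mask
    return mask


# Toy usage with synthetic data (no real model involved).
rng = np.random.default_rng(1)
A = rng.normal(size=(16, 16))
H_toy = A @ A.T                # symmetric positive semidefinite stand-in for a Hessian
w_toy = rng.normal(size=16)
g_toy = np.zeros(16)           # assumes the model sits at a stationary point
mask_toy = prune_blockwise(w_toy, g_toy, H_toy, density=0.5)
```

Since the current mask is always one of the candidates in each block search, the estimated loss change never increases from one iteration to the next, and the number of kept weights stays constant. The cost of each step depends on the block size rather than on the total number of weights, which is the property that makes a blockwise scheme attractive for very large models.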
For further details on this project, read the full paper.