
Artificial Intelligence

November 26, 2024

Iterative Combinatorial Brain Surgeon: Scalable Pruning of Large Language and Vision Models (LLVMs)

By: Elton Zhu & Serdar Kadioglu

FCAT collaborated with Amazon Quantum Solutions Lab to propose a new scalable pruning algorithm for large language and vision models.

The Challenge

State-of-the-art large language and vision models (LLVMs) have seen tremendous success, but their massive scale comes at a high cost in computational resources. The need to balance performance and efficiency has led to growing interest in model compression techniques. With methods such as pruning, quantization, and distillation, researchers aim to shrink these models while giving up as little accuracy as possible.

The Impact

Combining advanced pruning methods, such as the one proposed below, with specialized hardware support for sparse models can significantly reduce the computational power and energy required to run AI models while largely preserving their original performance. This can enable smaller, more efficient models to be deployed directly on devices rather than relying on server-side processing, which in turn helps enhance data privacy.

The Outcomes

We proposed iterative Combinatorial Brain Surgeon (iCBS), a scalable iterative pruning algorithm that optimizes over small blocks of weights in a neural network using block coordinate descent: each step solves a small optimization problem over one block while the remaining weights are held fixed. This blockwise approach allows iCBS to scale to very large models, including LLVMs with billions of parameters, while helping to achieve higher performance than existing one-shot pruning techniques.
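
To make the blockwise idea concrete, here is a minimal toy sketch in Python. It is not the iCBS implementation from the paper. It assumes a second-order (Hessian-based) estimate of the loss increase with zero gradient at the trained weights, starts from a magnitude-pruning mask, and uses exhaustive search within each small block as a stand-in for whatever per-block solver the actual method uses. The names `quadratic_loss_increase`, `prune_blockwise`, and `toy_hessian` are hypothetical and chosen only for illustration.

```python
import itertools
import random


def quadratic_loss_increase(weights, mask, hessian):
    """Second-order estimate of the loss increase when masked-out weights are zeroed.

    Assumption: the gradient at the trained weights is zero, so only the
    Hessian term 0.5 * delta^T H delta remains.
    """
    # delta[i] is the change applied to weight i: -w_i if pruned, 0 if kept.
    delta = [-w if m == 0 else 0.0 for w, m in zip(weights, mask)]
    n = len(delta)
    return 0.5 * sum(delta[i] * hessian[i][j] * delta[j]
                     for i in range(n) for j in range(n))


def prune_blockwise(weights, hessian, sparsity, block_size=4, sweeps=3, seed=0):
    """Toy block coordinate descent over a binary pruning mask."""
    rng = random.Random(seed)
    n = len(weights)
    n_pruned = round(sparsity * n)

    # Initial mask: one-shot magnitude pruning (drop the smallest weights).
    order = sorted(range(n), key=lambda i: abs(weights[i]))
    mask = [1] * n
    for i in order[:n_pruned]:
        mask[i] = 0
    best = quadratic_loss_increase(weights, mask, hessian)

    for _ in range(sweeps):
        indices = list(range(n))
        rng.shuffle(indices)
        for start in range(0, n, block_size):
            block = indices[start:start + block_size]
            kept_in_block = sum(mask[i] for i in block)
            # Re-decide which weights in this block are kept, holding every
            # weight outside the block fixed. Keeping the per-block count
            # constant preserves the overall sparsity.
            for kept in itertools.combinations(block, kept_in_block):
                candidate = mask[:]
                for i in block:
                    candidate[i] = 1 if i in kept else 0
                loss = quadratic_loss_increase(weights, candidate, hessian)
                if loss < best:
                    best, mask = loss, candidate
    return mask, best


def toy_hessian(n, seed=1):
    """Random positive semidefinite matrix A^T A, standing in for a real Hessian."""
    rng = random.Random(seed)
    a = [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]
    return [[sum(a[k][i] * a[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


weights = [0.9, -0.1, 0.4, 0.05, -0.7, 0.3, 0.2, -0.6]
hessian = toy_hessian(len(weights))
mask, estimated_increase = prune_blockwise(weights, hessian, sparsity=0.5)
print(mask, estimated_increase)
```

Because a candidate mask is accepted only when it lowers the estimated loss increase, the result in this toy setting is never worse than the one-shot magnitude-pruning starting point. Each subproblem involves only `block_size` weights, which is the property that lets a blockwise scheme scale where a single combinatorial problem over all weights would not.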

For further details on this project, read the full paper.
