← All companies

Exla

Active

An SDK to run transformer models anywhere

W25·Winter 2025·B2B·San Francisco, CA, USA·Team of 2·Founded 2025

About

Exla aggressively quantizes AI models to minimize memory usage and maximize inference speed. Whether you're deploying LLMs, VLMs, VLAs, or custom models, Exla reduces memory footprint by up to 80% and accelerates inference by 3–20x - all with just a few lines of code. https://cal.com/exla-ai/schedule

Founders

  • Pranav Nair· Co-Founder

    CTO at Exla. Previously an OS engineer at Apple leading sleep/hibernation for all Apple devices. B.S. Computer Science from Purdue.

  • Viraat Das· Founder

    CEO @ Exla. Previously machine learning engineer @ Amazon.

Product launches · 1 launch

Change history · none recorded

No changes recorded yet. Subsequent syncs will populate this timeline as fields drift.