/ 00 ABOUT · EST. 2023 · SLC

We’re building the sound layer of software.

CassetteAI is a small research team turning decades of audio DSP and the latest diffusion research into a single engine for music, sound effects, and speech — fast enough to run inside your app, honest enough to ship in production.

MISSION

Sound should feel like
a first-class primitive.

Software has had text generation, image generation, video generation. Audio has waited its turn — gated by latency, by licensing, by tooling that treats sound as an afterthought.

We built CassetteAI to change that. One SDK, three modalities, streaming responses under 50 milliseconds. Prompt it like you would prompt anything else, but ship it where it actually belongs: inside the game, inside the call, inside the browser tab.

Our north star: make generative audio so cheap, so fast, and so controllable that every product has a soundtrack — and no one has to think about how it got there.

/ 01 PRINCIPLES · HOW WE BUILD

Four ideas we won’t trade.

01

Realtime or nothing

If the user waits, the experience breaks. We measure every ship in milliseconds, not megabytes.

02

Ship on device

Servers are a tax on latency and a liability for privacy. Our models fit in your app bundle.

03

No servers, no rent

Pay-per-use pricing: $0.01 per SFX generation and $0.02 per output minute of music. Spin it up today, pay only for what plays.

04

Creators in the loop

The model is evaluated by the ears it is meant to serve — not just spectrograms on a chart.

/ 02 TIMELINE · FROM SLC TO THE EDGE

From a founder’s bedroom to a soundtrack layer.

2023
Q4
●───

CassetteAI launches

Akhil Tolani starts CassetteAI under Pixl Technologies out of Salt Lake City. First public music model goes live.

2024
Q2
●───

O'Shaughnessy grant

Backed as an OSV Fellow. Proceeds go into on-device inference R&D and the first edge SDK prototype.

2025
Q2
●───

SFX API launches

The SDK expands beyond music: CassetteAI's SFX Generator goes live, generating up to 30 seconds of sound in ~1 second of processing time.

2025
Q4
●───

Edge SDK v1

First-sample latency crosses 50ms on mobile. The hosted API goes live for developers without on-device access.

2026
Now
●───

100k requests / month

Powering audio for games, creator apps, accessibility tooling, and robotics — all through one SDK.

/ 03 BACKED & FEATURED · REAL NAMES

Supported by the people who ship audio.

O'Shaughnessy Ventures
OSV Fellows grantee · 2024
TechCrunch
Featured · 2023
Music Ally
Launch coverage · 2023
Billboard
AI companies directory · 2024
fal
Hosted inference partner
MIDiA Research
Industry analysis · 2024