All activity
Universal-3 Pro is a new class of speech language model built for Voice AI. Control transcription using instructions and domain context like names, terminology, and topics to get accurate output at the source. No custom models, no post-processing pipelines, no hallucinations. Includes 1,000 keyterms, audio tagging, and 6-language code-switching for $0.21/hr.

Universal-3 ProThe first of its kind promptable speech language model
Meredith Rauchleft a comment
We built Universal-3 Pro because we were tired of seeing developers spend 40% of their time on transcription workarounds instead of shipping features. Today, developers are stuck with rigid solutions. They can transcribe their audio, then run an increasingly complex pipeline of regex and LLM calls to extract what they need. Company names get mangled and jargon becomes gibberish. Then they have...

Universal-3 ProThe first of its kind promptable speech language model
Meredith Rauchleft a comment
The Universal-1 Speech-to-Text model was created to focus on the nuances of spoken language across accents, tone, dialect, faithfulness, and more. We hope the new capabilities of Universal-1 will help power the next generation of AI products and features built with voice data. Give it a try and let us know what you're building!

Universal-1Multilingual speech AI model trained on 12.5M hours of data
Try AssemblyAI's most capable and highly trained speech recognition model trained on 12.5M hours of multilingual audio data. Universal-1 achieves best-in-class speech-to-text accuracy, reduces word error rate and hallucinations, and improves timestamps.

Universal-1Multilingual speech AI model trained on 12.5M hours of data
LeMUR is a framework for applying Large Language Models to spoken data. In a few lines of code, you can do things like generate summaries or ask questions about your meetings, phone calls, videos, or podcasts.

LeMURThe easiest way to build LLM apps on voice data

