Word-level timestamps in milliseconds

The fastest, developer-first WhatsApp Audio API.

Convert raw .ogg voice notes into structured JSON with precise word-level timing, clean API responses, and prepaid billing that never expires.

No subscriptions
Credits never expire
GitHub or Google sign-in required

Audiex Pro overview

Clean API, predictable billing, production-ready output

Live

Input

Raw WhatsApp .ogg audio, posted as multipart/form-data or uploaded from your app.

Output

Structured JSON with transcript text, segments, and word-level timestamps in milliseconds.

Billing

Pay once, consume credits only when you transcribe, and keep the balance forever.

Access

Free minutes require GitHub or Google authentication to reduce abuse and block throwaway emails.

Response integrity Milliseconds preserved
text "Let's ship the release after lunch." words 26 aligned tokens with start_ms and end_ms.

Live Playground

Ship against a response your backend can trust.

One request in, structured JSON out. The layout below mirrors what developers expect when debugging audio pipelines.

Request

curl sending a WhatsApp .ogg file

POST /v1/transcribe
curl -X POST "https://audiex.pro/v1/transcribe" \ -H "Authorization: Bearer sk_live_xxx" \ -F "file=@whatsapp-voice-note.ogg" \ -F "language=en"

Authorization

Bearer token

Body

multipart/form-data

Audio

.ogg from WhatsApp

Response

Structured JSON with timestamps in milliseconds

200 OK
{ "text": "Let's ship the release after lunch.", "language": "en", "duration_ms": 12840, "words": [ { "word": "Let's", "start_ms": 220, "end_ms": 460 }, { "word": "ship", "start_ms": 468, "end_ms": 690 }, { "word": "release", "start_ms": 792, "end_ms": 1130 } ] }
Audiex normalizes the output for your app, so your frontend can render transcripts, timing bars, and playback cursors without extra parsing.

Pricing

Prepaid credits built for teams that want predictable spend.

Pay-as-you-go, credits never expire, and there are no subscriptions hiding in the fine print.

Lower friction, higher clarity

Starter

$7.99

5 hours included

For product validation, small integrations, and early experiments.

Credits never expire
Most popular

Growth

$19.99

15 hours included

Ideal for a real feature launch, active users, and weekly processing.

Pay-as-you-go, credits never expire

Agency

$49.99

45 hours included

Built for client work, recurring operations, and multiple projects.

Scale without subscription lock-in

Enterprise

$99.99

100 hours included

For high-volume teams that want room to grow without re-buying each month.

Credits never expire

API Documentation

A single endpoint designed to feel boring in the best way.

POST /v1/transcribe accepts multipart audio, checks your token, and returns structured data ready for product use.

Request overview

Headers
Authorization: Bearer sk_live_..., Content-Type is set automatically by multipart/form-data clients.
Body
file (required): .ogg voice note from WhatsApp. language (optional): BCP-47 code such as en, es, pt-BR.
Response
Transcript text, language, duration, segments, and words with timestamps in milliseconds.

Standard errors

Predictable failures
401

Unauthorized

Missing or invalid bearer token.

402

Payment required

No credits left or billing balance is exhausted.

400

Bad request

Empty payload, unreadable audio, or malformed form data.

Example response shape

{
  "text": "Let's ship the release after lunch.",
  "language": "en",
  "duration_ms": 12840,
  "words": [
    { "word": "Let's" , "start_ms": 220, "end_ms": 460 },
    { "word": "ship", "start_ms": 468, "end_ms": 690 },
    { "word": "release", "start_ms": 792, "end_ms": 1130 }
  ]
}

Abuse Control

Free access is gated with real identity, not throwaway inboxes.

The 10 free minutes require authentication with GitHub or Google, which keeps the trial clean and prevents disposable-email abuse.

1

Sign in with GitHub or Google

Only verified identities can claim the free tier.

2

Receive 10 free minutes

Enough to test the API on real WhatsApp audio before paying.

3

Upgrade only when it is useful

No card is required to start, and no spammy sign-up path is needed.

Disposable and temporary emails are intentionally blocked to protect the free trial from abuse.

Terms & Conditions

Audiex Pro is a prepaid transcription service. Usage is measured against processed audio duration, and credits are deducted when a request is accepted and processed.

Refund Policy

If the service does not meet your expectations, we offer a 14-day refund for unused balance. Requests are reviewed against account activity and the remaining credit balance.