Segment timestamps with detected language

The fastest, developer-first WhatsApp Audio API.

Convert raw .ogg voice notes into plain text, detected language, and clean segment timestamps, with prepaid billing valid for 3, 6, 9, or 12 months depending on the plan.

600 seconds included
No subscriptions
Simple checkout
Start with 600 seconds before you upgrade.

Audiex Pro overview

Clean API, predictable billing, production-ready output


Input

Raw WhatsApp .ogg audio, posted as multipart/form-data or uploaded from your app.

Output

Simple JSON with transcript text, detected language, and cleaned segments.

Billing

Usage is measured in processed audio seconds, with a minimum of one second per request.

Access

Account access stays tied to Google sign-in for now.

Response integrity: segment timestamps

{
  "status": "success",
  "language": "en",
  "text": "Let's ship the release after lunch.",
  "segments": [
    { "start": 0.00, "end": 2.94, "text": "Let's ship the release after lunch." }
  ]
}

Built for

WhatsApp AI Agents, CRM Automation, Customer Support, Voice Forms, Lead Qualification, and n8n Workflows.

Turn voice notes into structured data for support tickets, lead capture, and workflow automations without manually parsing audio.


Live Playground

Ship against a response your backend can trust.

One request in, plain JSON out. The layout below mirrors the request and response your production integration will use for the public text-only endpoint.

Request

Common SDKs and automation patterns

POST /v1/transcribe

cURL request with auto language detection

curl -X POST \
  "https://audiex.pro/v1/transcribe" \
  -H "Authorization: Bearer sk_live_xxx" \
  -F "file=@whatsapp-voice-note.ogg" \
  -F "language=auto"

Authorization

Bearer token

Body

multipart/form-data

Audio

.ogg from WhatsApp
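
For backends that do not shell out to cURL, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not an official SDK: the function name `build_transcribe_request` is hypothetical, and the `audio/ogg` part type is an assumption, since the source only says the file is a WhatsApp .ogg. The multipart body is assembled by hand because `urllib` has no multipart helper; most HTTP client libraries set the Content-Type and boundary for you.

```python
import urllib.request
import uuid

API_URL = "https://audiex.pro/v1/transcribe"  # endpoint from the cURL example


def build_transcribe_request(
    api_key: str,
    audio: bytes,
    filename: str = "whatsapp-voice-note.ogg",
    language: str = "auto",
) -> urllib.request.Request:
    """Build (but do not send) the multipart POST for /v1/transcribe."""
    boundary = uuid.uuid4().hex
    body = b"\r\n".join([
        f"--{boundary}".encode(),
        b'Content-Disposition: form-data; name="language"',
        b"",
        language.encode(),
        f"--{boundary}".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"'.encode(),
        b"Content-Type: audio/ogg",  # assumed MIME type, not stated in the source
        b"",
        audio,
        f"--{boundary}--".encode(),
        b"",
    ])
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


# Sending the request hits the live API, so it is left commented out:
# with open("whatsapp-voice-note.ogg", "rb") as f:
#     req = build_transcribe_request("sk_live_xxx", f.read())
# with urllib.request.urlopen(req, timeout=30) as resp:
#     print(resp.read().decode())
```

Separating request construction from sending keeps the function easy to test without network access or a real key.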

Response

Simple JSON with text and language

200 OK
{
  "status": "success",
  "text": "Let's ship the release.",
  "language": "en"
}

Pricing

Prepaid seconds built for teams that want predictable spend.

Pay as you go: seconds stay valid for 3, 6, 9, or 12 months depending on the plan, and there are no subscriptions hiding in the fine print.

Prepaid only
By purchasing a plan, you agree to our Terms of Service and Refund Policy.

Starter

$7.99

18,000 seconds included (5 hours)

For product validation, small integrations, and early experiments.

Seconds valid for 3 months.

Growth (most popular)

$19.99

54,000 seconds included (15 hours)

Ideal for a real feature launch, active users, and weekly processing.

Seconds valid for 6 months.

Agency

$49.99

162,000 seconds included (45 hours)

Built for client work, recurring operations, and multiple projects.

Seconds valid for 9 months.

Enterprise

$99.99

360,000 seconds included (100 hours)

For high-volume teams that want room to grow without re-buying each month.

Seconds valid for 12 months.
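
To compare plans on a like-for-like basis, the listed prices can be converted to an effective price per audio hour. The sketch below uses only the prices and included seconds from the plan list; the names `PLANS` and `price_per_hour` are illustrative.

```python
# Plan data copied from the pricing list: (price in USD, included seconds).
PLANS = {
    "Starter": (7.99, 18_000),
    "Growth": (19.99, 54_000),
    "Agency": (49.99, 162_000),
    "Enterprise": (99.99, 360_000),
}


def price_per_hour(plan: str) -> float:
    """Effective USD price for one hour (3,600 seconds) of processed audio."""
    price, seconds = PLANS[plan]
    return price / (seconds / 3600)


for name in PLANS:
    print(f"{name}: ${price_per_hour(name):.2f} per audio hour")
# Starter: $1.60 per audio hour
# Growth: $1.33 per audio hour
# Agency: $1.11 per audio hour
# Enterprise: $1.00 per audio hour
```

The per-hour price falls with each tier, but larger plans only pay off if the seconds are used before they expire.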

API Documentation

One public endpoint, one clean contract, and no extra ceremony.

POST /v1/transcribe is the public endpoint we document right now. It returns plain text, detected language, and cleaned segment timestamps from the live API.

Request overview

Headers
Authorization: Bearer sk_live_...
Content-Type: set automatically by multipart/form-data clients.
Body
file (required): WhatsApp .ogg voice note.
language (optional, default auto): BCP-47 code such as en, es, pt-BR.
Response
status, text, language, and segments.

Technical limits

  • Maximum upload size: 5 MB.
  • Maximum supported duration: 60 seconds.
  • Official input: WhatsApp .ogg voice notes.
  • OpenAPI: /openapi.json and /docs.
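
Some of these limits can be checked before upload, which avoids a wasted request. The sketch below is a hypothetical client-side pre-flight check, not part of the API. It reads "5 MB" as 5 × 1024 × 1024 bytes, which is an assumption, and it relies on the Ogg container format, where every file begins with the bytes `OggS`. It does not check the 60-second duration limit, which would require parsing the audio stream.

```python
from pathlib import Path

MAX_UPLOAD_BYTES = 5 * 1024 * 1024  # assumption: "5 MB" interpreted as 5 MiB
OGG_MAGIC = b"OggS"  # capture pattern that starts every Ogg page


def preflight_errors(filename: str, data: bytes) -> list[str]:
    """Return a list of problems found locally; an empty list means none found."""
    problems = []
    if Path(filename).suffix.lower() != ".ogg":
        problems.append("file extension is not .ogg")
    if not data.startswith(OGG_MAGIC):
        problems.append("data does not start with the OggS signature")
    if len(data) > MAX_UPLOAD_BYTES:
        problems.append("file is larger than 5 MB")
    return problems


# Example: a renamed MP3 fails two checks, a tiny Ogg-like payload passes.
print(preflight_errors("note.mp3", b"ID3\x03"))
print(preflight_errors("note.ogg", b"OggS\x00\x02"))
```

The server remains the source of truth; a local check like this only catches the obvious cases early.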

Standard error body

{
  "status": "error",
  "detail": "Audio demasiado largo",
  "error": {
    "code": "payload_too_large",
    "message": "Audio demasiado largo"
  }
}

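Common status codes: 400 (Bad Request), 401 (Unauthorized), 402 (Payment Required), 413 (Payload Too Large), 429 (Too Many Requests), 500 (Internal Server Error).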
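
A client can turn the standard error body into a typed exception so callers branch on `error.code` instead of parsing message strings. The sketch below is one possible approach: `TranscribeError` and `error_from_body` are hypothetical names, and treating 429 and 500 as retryable is an assumption, since the source does not state a retry policy. Pairing the sample body with HTTP 413 is also a guess based on its `payload_too_large` code.

```python
import json


class TranscribeError(Exception):
    """Error raised for a non-success response from /v1/transcribe."""

    def __init__(self, http_status: int, code: str, message: str):
        super().__init__(f"{http_status} {code}: {message}")
        self.http_status = http_status
        self.code = code
        self.message = message

    @property
    def retryable(self) -> bool:
        # Assumption: rate limits and server errors are transient.
        return self.http_status in (429, 500)


def error_from_body(http_status: int, raw_body: str) -> TranscribeError:
    """Parse the standard error body, tolerating non-JSON or partial bodies."""
    try:
        payload = json.loads(raw_body)
    except json.JSONDecodeError:
        payload = {}
    if not isinstance(payload, dict):
        payload = {}
    error = payload.get("error")
    if not isinstance(error, dict):
        error = {}
    code = error.get("code", "unknown_error")
    message = error.get("message") or payload.get("detail") or "no message"
    return TranscribeError(http_status, code, message)


sample = '{"status": "error", "detail": "Audio demasiado largo", "error": {"code": "payload_too_large", "message": "Audio demasiado largo"}}'
err = error_from_body(413, sample)
print(err.code, err.retryable)  # payload_too_large False
```

Falling back to `detail` and then to a placeholder keeps the handler usable even if a proxy or gateway returns a body that does not follow the documented shape.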

Example response shape

Public JSON with transcript text, detected language, and cleaned segment timestamps.

{
  "status": "success",
  "text": "Let's ship the release.",
  "language": "en",
  "segments": [
    {
      "start": 0.0,
      "end": 2.94,
      "text": "Let's ship the release."
    }
  ]
}
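
The response shape above maps naturally onto small typed records. The sketch below parses it into dataclasses and formats each segment as a timestamped line. The class and function names are illustrative, and treating `segments` as optional is a defensive assumption so the parser also accepts a body that carries only `text` and `language`.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    start: float  # seconds from the start of the audio
    end: float
    text: str


@dataclass(frozen=True)
class Transcript:
    text: str
    language: str
    segments: tuple[Segment, ...]


def parse_transcript(raw_body: str) -> Transcript:
    """Parse a success body; raise ValueError for any other status."""
    payload = json.loads(raw_body)
    if payload.get("status") != "success":
        raise ValueError(f"unexpected status: {payload.get('status')!r}")
    segments = tuple(
        Segment(float(s["start"]), float(s["end"]), s["text"])
        for s in payload.get("segments", [])  # tolerate a body without segments
    )
    return Transcript(payload["text"], payload["language"], segments)


def format_segment(segment: Segment) -> str:
    """Render one segment as '[start - end] text' with two decimals."""
    return f"[{segment.start:.2f}s - {segment.end:.2f}s] {segment.text}"


body = """{"status": "success", "text": "Let's ship the release.", "language": "en",
"segments": [{"start": 0.0, "end": 2.94, "text": "Let's ship the release."}]}"""
transcript = parse_transcript(body)
print(transcript.language)                      # en
print(format_segment(transcript.segments[0]))   # [0.00s - 2.94s] Let's ship the release.
```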

Account access

Google sign-in will unlock account access.

The product already supports a clean API-first flow, and the social login layer can be enabled without changing the billing model.

1. Connect your identity

Use Google once sign-in is enabled on the account portal.

2. Claim trial access

The 10-minute (600-second) trial stays tied to a real account instead of disposable email addresses.

3. Manage billing and usage

One account keeps seconds, invoices, and API access in the same place.

Disposable and temporary emails stay blocked from the public onboarding flow.

Account sign-in

Use Google to link your account and keep access in one place.

Terms of Service

Audiex Pro is a prepaid transcription service. Usage is measured in processed audio seconds: one second is deducted for each second of audio transcribed, with a minimum of one second per request. Unused seconds expire after 3, 6, 9, or 12 months, depending on the plan purchased.
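
The deduction rule can be estimated on the client side to forecast how far a balance will go. The sketch below is an approximation, not the service's billing code: the terms state the one-second minimum but not how fractional seconds are handled, so rounding up is an assumption. The function names are hypothetical.

```python
import math


def billed_seconds(duration_seconds: float) -> int:
    """Estimate seconds deducted for one request of the given audio length."""
    if duration_seconds < 0:
        raise ValueError("duration cannot be negative")
    # Assumption: fractional seconds round up; only the 1-second minimum is documented.
    return max(1, math.ceil(duration_seconds))


def requests_covered(balance_seconds: int, duration_seconds: float) -> int:
    """Estimate how many requests of one length a prepaid balance covers."""
    return balance_seconds // billed_seconds(duration_seconds)


print(billed_seconds(0.3))         # 1 (the per-request minimum applies)
print(billed_seconds(2.94))        # 3 under the round-up assumption
print(requests_covered(600, 60))   # 10 full-length notes on the 600 included seconds
```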

Refund Policy

If the service does not meet your expectations, we offer a 14-day refund for unused seconds. Requests are reviewed against account activity and the remaining seconds on the account.

Privacy & Compliance

Audio is deleted immediately after transcription. All API communication requires HTTPS. We do not retain voice data persistently, and the handling model is designed around GDPR and CCPA data-minimization and deletion principles.