Back to blog

WhatsApp Voice Message Automation: Send Voice Notes at Scale with an API

Jonathan Lis|
whatsappautomationvoice notes

Most outreach on WhatsApp looks the same: plain text, maybe a link, maybe an emoji. It gets ignored. WhatsApp voice message automation changes the equation entirely. Voice notes feel personal, demand attention, and convert at rates that text simply cannot match. The problem has always been doing it at scale without spending your entire day talking into your phone.

Svara solves this with a single API call. Generate a voice note from text, deliver it as a native WhatsApp voice message, and do it for hundreds of contacts without recording a single thing manually.

Why WhatsApp Voice Notes Outperform Text

WhatsApp has over 2 billion active users. It is the dominant messaging channel in most markets outside the US. And yet, the vast majority of business communication on WhatsApp is still plain text.

Voice notes stand out because they are fundamentally different from text:

  • They feel personal. A voice note signals that someone took the time to speak directly to the recipient. Even when generated by AI, the result feels more human than any template.
  • They are harder to ignore. A voice note sits in the chat with a play button. Curiosity alone drives a higher open rate than text.
  • They carry tone and intent. Enthusiasm, confidence, warmth. These are things that text cannot convey no matter how many exclamation marks you add.

Across early Svara customers, WhatsApp voice notes are seeing 2-3x higher reply rates compared to equivalent text messages sent to the same audience segments.

How Svara's WhatsApp Integration Works

Unlike LinkedIn or Telegram, WhatsApp requires an authenticated session tied to a real phone number. Svara handles this through a simple two-step process:

  1. Create a session by scanning a QR code (one-time setup per phone number).
  2. Send voice notes through the API using that session.

Your session stays active in the background. You authenticate once, then send as many voice notes as your plan allows.

Step 1: Authenticate Your WhatsApp Session

Create a session by calling the sessions endpoint. This returns a QR code that you scan with WhatsApp on your phone, just like WhatsApp Web.

curl -X POST https://api.svarapi.io/v1/whatsapp/sessions \
  -H "Authorization: Bearer YOUR_API_KEY"

The response includes a base64-encoded QR code image and a session ID:

{
  "session_id": "sess_abc123",
  "qr_code": "data:image/png;base64,...",
  "status": "qr_pending"
}

Open that QR code image in your browser or render it in your app. Scan it with WhatsApp on your phone (Settings > Linked Devices > Link a Device). Once scanned, the session status changes to connected and you are ready to send.

Step 2: Send a Voice Note

With an active session, sending a WhatsApp voice note is a single POST request:

curl -X POST https://api.svarapi.io/v1/send \
  -H "x-api-key: YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "platform": "whatsapp",
    "recipient": "+14155551234",
    "text": "Hey Sarah, just wanted to follow up on our conversation about the Q2 marketing budget. I think there is a real opportunity to double down on the influencer channel. Let me know if you have 15 minutes this week to chat."
  }'

Svara converts your text to natural-sounding speech and delivers it as a native WhatsApp voice note. The recipient sees the standard green waveform in their chat. It looks and sounds like you recorded it yourself.

Code Examples

Node.js

const response = await fetch("https://api.svarapi.io/v1/send", {
  method: "POST",
  headers: {
    "x-api-key": process.env.SVARA_API_KEY,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    platform: "whatsapp",
    recipient: "+14155551234",
    text: "Hi David, I saw your post about scaling customer support teams. We have been working on something that might help. Would love to share a quick demo if you are open to it.",
  }),
});

const result = await response.json();
console.log(result);
// { id: "msg_xyz789", status: "sent", platform: "whatsapp" }

Python

import requests
import os

response = requests.post(
    "https://api.svarapi.io/v1/send",
    headers={
        "x-api-key": os.environ["SVARA_API_KEY"],
        "Content-Type": "application/json",
    },
    json={
        "platform": "whatsapp",
        "recipient": "+14155551234",
        "text": "Hey Alex, I noticed you are hiring for a head of growth role. I have worked with three similar companies this year and helped them cut their cost per acquisition by 40 percent. Happy to share what worked if that is useful.",
    },
)

print(response.json())

Sending at Scale: Batch Automation

For outreach campaigns, you will likely want to send voice notes to a list of contacts. Here is a simple Node.js script that iterates through a list with a delay between each send to keep things natural:

const contacts = [
  { phone: "+14155551234", name: "Sarah" },
  { phone: "+14155555678", name: "David" },
  { phone: "+14155559012", name: "Alex" },
];

for (const contact of contacts) {
  const response = await fetch("https://api.svarapi.io/v1/send", {
    method: "POST",
    headers: {
      "x-api-key": process.env.SVARA_API_KEY,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      platform: "whatsapp",
      recipient: contact.phone,
      text: `Hi ${contact.name}, quick voice note for you about something I think you will find interesting. Would love to connect this week if you have a few minutes.`,
    }),
  });

  const result = await response.json();
  console.log(`Sent to ${contact.name}:`, result.status);

  // Wait 30-60 seconds between sends
  await new Promise((r) => setTimeout(r, 30000 + Math.random() * 30000));
}

Best Practices for WhatsApp Voice Outreach

Keep Messages Short

The sweet spot for WhatsApp voice notes is 15-30 seconds. That translates to roughly 40-80 words of text. Anything longer and the recipient is likely to skip it. Get to the point fast, mention something specific to them, and end with a clear ask.

Personalize the First Line

Starting with the recipient's name and a specific reference to their work, company, or a recent post makes the voice note feel tailored. Even a single personal detail dramatically increases the chance of a reply.

Time Your Sends

WhatsApp is a personal channel. Sending during business hours (in the recipient's timezone) and on weekdays tends to perform best. Avoid evenings and weekends unless you know the recipient well.

Monitor Delivery Status

Use the status endpoint to check whether your voice notes were delivered:

curl https://api.svarapi.io/v1/status/msg_xyz789 \
  -H "x-api-key: YOUR_API_KEY"

This helps you track which messages landed and follow up appropriately.

Getting Started

Svara offers a free tier with 50 voice notes across all platforms, no credit card required. That is enough to run a small test campaign and see the difference in reply rates for yourself.

  1. Create your account and grab your API key from the dashboard.
  2. Authenticate a WhatsApp session by scanning the QR code.
  3. Send your first voice note with the code examples above.

The entire setup takes under five minutes. Get started for free and see why voice notes are the highest-converting outreach channel on WhatsApp.

Ask Svara

Hey! I'm the Svara assistant. Ask me anything about integrating voice notes into your product.

Powered by Svara