🔓 Unlock all 10,000+ workflows & prompts free Join Newsletter →
✅ Full access unlocked — explore all 10,000 AI workflow and prompt templates Browse Templates →
Home n8n Workflow
January 22, 2026

Telegram to HeyGen, voice notes become ready shorts

Lisa Granqvist Partner Workflow Automation Expert

Recording a quick voice idea is easy. Turning it into a polished short, uploading it, writing captions, and logging it somewhere sensible is where your day disappears.

This Telegram HeyGen automation hits content creators first, honestly. But marketing managers trying to post consistently and agency owners juggling client accounts run into the same mess, just with more tabs open. The outcome is simple: you speak once, and you get a ready-to-publish avatar video plus a clean paper trail.

Below you’ll see how the workflow turns a Telegram voice note into a HeyGen video, saves the assets to Drive/Sheets, publishes via Blotato, and sends you a confirmation when it’s live.

How This Automation Works

The full n8n workflow, from trigger to final output:

n8n Workflow Template: Telegram to HeyGen, voice notes become ready shorts

The Problem: Voice ideas die in the “production gap”

You already have the hard part: ideas. They show up while you’re driving, between meetings, or right after a client call. Then the “I’ll turn this into a short later” promise turns into a mini production project. Transcribe the audio. Write a hook. Create a caption that doesn’t sound generic. Build the video. Export it correctly. Upload it. Repeat for each platform. And because there’s no consistent system, you also lose track of what shipped, what’s stuck, and what needs a rewrite.

It adds up fast. Here’s where it breaks down for most teams.

  • Transcription and first-draft copy end up happening in fragments, so your “quick post” becomes an all-afternoon task.
  • Publishing across TikTok, YouTube Shorts, and Instagram means three separate uploads, three sets of tiny requirements, and three chances to mess up.
  • Assets get scattered across desktops, Drive folders, and chat threads, which makes repurposing or auditing content a pain.
  • Without a log, consistency becomes guesswork and client reporting turns into last-minute scrambling.

The Solution: A voice-note-to-video publishing pipeline

This workflow starts with the most natural input you have: a Telegram voice note. n8n grabs the audio, transcribes it with OpenAI Whisper, and then uses GPT-5 to turn that raw thought into a publishable package (a strong title plus a caption that’s actually usable). Next, it creates an AI avatar video in HeyGen so you’re not stuck recording on camera every time. Finally, it saves the important files and metadata to Google Drive and Google Sheets, uploads the finished video to Blotato, and publishes to your selected platforms. When it’s done, you get a Telegram confirmation so you’re not constantly checking.

The workflow begins in Telegram with a voice idea. From there, OpenAI handles transcription and copy generation, HeyGen builds the avatar video, and Google Drive/Sheets keep everything organized. Blotato pushes the final video out to TikTok, YouTube Shorts, Instagram (and more if you enable them), then n8n updates your log and notifies you.

What You Get: Automation vs. Results

Example: What This Looks Like

Say you publish 5 shorts a week to three platforms (TikTok, YouTube Shorts, Instagram). Manually, you might spend about 10 minutes per platform per video just on uploading, captions, and checks, which is roughly 2.5 hours a week, and that’s before editing. With this workflow, you record a 60-second voice note, then wait for processing and publishing while you do other work. Your “hands-on” time becomes a few minutes per video, not a mini project.

What You’ll Need

  • n8n instance (try n8n Cloud free)
  • Self-hosting option if you prefer (Hostinger works well)
  • Telegram Bot for capturing voice notes as input
  • OpenAI to transcribe audio and generate captions
  • HeyGen API key (get it from your HeyGen dashboard)

Skill level: Intermediate. You’ll connect accounts, add API keys, and tweak a couple prompts and folder IDs.

Don’t want to set this up yourself? Talk to an automation expert (free 15-minute consultation).

How It Works

A Telegram voice note triggers everything. You send a voice message to your bot, and n8n receives the audio file plus basic metadata like time and message ID.

The audio becomes usable text. The workflow reads the audio as a binary file and sends it to OpenAI Whisper for transcription. If you ever tested voice-to-text tools and gave up, this part is the difference between “rough notes” and “content input.”

GPT-5 turns the transcript into a post-ready package. It generates a catchy title and a caption you can publish with minimal edits. Conditional logic (If nodes) can route different formats or prompts based on channel, content type, or even keywords in the transcript.

HeyGen creates the avatar video and Blotato publishes. n8n calls HeyGen via HTTP requests to generate the video, then organizes assets in Google Drive and logs the run in Google Sheets. Finally, it uploads the finished video to Blotato, publishes to the platforms you’ve enabled, and sends a confirmation back to Telegram.

You can easily modify the publishing destinations to focus on TikTok only (or add YouTube Shorts and Instagram) based on your needs. See the full implementation guide below for customization options.

Common Gotchas

  • Telegram Bot credentials can expire or get blocked by permissions. If things break, check your Telegram bot token in n8n credentials and confirm the bot can still receive voice messages.
  • If you’re using Wait nodes or external rendering, processing times vary. Bump up the wait duration if downstream nodes fail on empty responses.
  • HeyGen and Blotato API access can be plan-dependent. If publishing suddenly fails, confirm your HeyGen API key is active and your Blotato Pro API access is enabled in their dashboard.

Frequently Asked Questions

How long does it take to set up this Telegram HeyGen automation?

About an hour if you already have the accounts and API keys.

Do I need coding skills to automate Telegram HeyGen automation?

No. You’ll mostly paste API keys, connect Google apps, and edit prompts. The “hard” part is deciding your caption style and where you want files stored.

Is n8n free to use for this Telegram HeyGen automation workflow?

Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You’ll also need to factor in OpenAI API usage for Whisper and GPT-5, plus HeyGen and Blotato plan costs.

Where can I host n8n to run this automation?

Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.

Can I customize this Telegram HeyGen automation workflow for a human approval step before publishing?

Yes. Add a Telegram approval message after the caption/video is generated, then only continue to the Blotato publish step if you reply “Approve.” Common tweaks include changing the GPT-5 prompt for brand voice, swapping HeyGen avatars/voices, and enabling only the platforms you actually use.

Why is my HeyGen connection failing in this workflow?

Most of the time it’s an expired or inactive HeyGen API key, so regenerate it in HeyGen and update the credential in n8n. It can also be a plan limitation if API access isn’t enabled. If the video generation request is accepted but never completes, increase any waiting/polling time and confirm the HTTP request is hitting the correct HeyGen endpoint for your account.

How many videos can this Telegram HeyGen automation handle?

On self-hosted n8n, it’s basically limited by your server and the external APIs. On n8n Cloud, your plan’s monthly executions matter, and heavy workflows can consume more than one execution per published video. In practice, most small teams run dozens of videos a week comfortably, then upgrade if they start publishing at high volume.

Is this Telegram HeyGen automation better than using Zapier or Make?

Often, yes, because this is not a simple two-step Zap. You’re doing transcription, AI generation, file handling, conditional logic, and a publish step, and n8n is built for that kind of branching without you paying extra for every little path. Self-hosting is also a big deal if you plan to scale. Zapier or Make can still win for quick prototypes and very lightweight workflows, so it depends on your tolerance for setup time and how many posts you run weekly. If you’re unsure, Talk to an automation expert and you’ll get a straight recommendation.

You get a repeatable pipeline from “idea” to “posted,” with the files and logs to prove it. Set it up once, then let the workflow do the busywork.

Need Help Setting This Up?

Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.

Lisa Granqvist

Workflow Automation Expert

Expert in workflow automation and no-code tools.

×

Use template

Get instant access to this n8n workflow Json file

💬
Get a free quote today!
Get a free quote today!

Tell us what you need and we'll get back to you within one working day.

Get a free quote today!
Get a free quote today!

Tell us what you need and we'll get back to you within one working day.

Launch login modal Launch register modal