January 22, 2026

WhatsApp + Google Docs: instant support replies

Lisa Granqvist Partner Workflow Automation Expert

Get a free AI assessment → ⬇️ Use template

Your WhatsApp inbox moves fast. Then a voice note shows up. Someone sends a blurry product photo. Another customer writes in Roman Urdu. Suddenly, “quick replies” turn into a copy-paste marathon and a lot of guesswork.

WhatsApp support automation hits Support Leads first, because they own response time. But ecommerce operators and service business owners feel it too. The outcome is simple: customers get consistent answers in seconds, even when the message isn’t plain text.

This workflow connects WhatsApp to your Google Docs knowledge base, adds AI that can read images and transcribe voice notes, and replies automatically. You’ll see what it fixes, what it produces, and what you need to run it reliably.

How This Automation Works

The full n8n workflow, from trigger to final output:

n8n Workflow Template: WhatsApp + Google Docs: instant support replies

Click to explore

flowchart LR

    subgraph sg0["Incoming WhatsApp Hook Flow"]
        direction LR
        n0["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/whatsapp.svg' width='40' height='40' /></div><br/>Incoming WhatsApp Hook"]
        n1@{ icon: "mdi:robot", form: "rounded", label: "Support AI Orchestrator", pos: "b", h: 48 }
        n2@{ icon: "mdi:memory", form: "rounded", label: "Conversation Memory", pos: "b", h: 48 }
        n3@{ icon: "mdi:swap-horizontal", form: "rounded", label: "Classify Input Format", pos: "b", h: 48 }
        n4["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/whatsapp.svg' width='40' height='40' /></div><br/>Fetch Image Link"]
        n5["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/httprequest.dark.svg' width='40' height='40' /></div><br/>Retrieve Image File"]
        n6@{ icon: "mdi:robot", form: "rounded", label: "Image Content Review", pos: "b", h: 48 }
        n7@{ icon: "mdi:swap-vertical", form: "rounded", label: "Image Text Composer", pos: "b", h: 48 }
        n8@{ icon: "mdi:swap-vertical", form: "rounded", label: "Text Message Formatter", pos: "b", h: 48 }
        n9["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/whatsapp.svg' width='40' height='40' /></div><br/>Fetch Audio Link"]
        n10["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/httprequest.dark.svg' width='40' height='40' /></div><br/>Retrieve Audio File"]
        n11@{ icon: "mdi:robot", form: "rounded", label: "Audio Transcription", pos: "b", h: 48 }
        n12@{ icon: "mdi:swap-vertical", form: "rounded", label: "Audio Text Formatter", pos: "b", h: 48 }
        n13["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/whatsapp.svg' width='40' height='40' /></div><br/>Send WhatsApp Reply"]
        n14@{ icon: "mdi:cog", form: "rounded", label: "Fetch Docs Reference", pos: "b", h: 48 }
        n15@{ icon: "mdi:brain", form: "rounded", label: "OpenRouter Chat Engine", pos: "b", h: 48 }
        n1 --> n13
        n12 --> n1
        n6 --> n7
        n9 --> n10
        n4 --> n5
        n2 -.-> n1
        n10 --> n11
        n5 --> n6
        n3 --> n9
        n3 --> n4
        n3 --> n8
        n8 --> n1
        n11 --> n12
        n0 --> n3
        n7 --> n1
        n15 -.-> n1
        n14 -.-> n1
    end

    %% Styling
    classDef trigger fill:#e8f5e9,stroke:#388e3c,stroke-width:2px
    classDef ai fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
    classDef aiModel fill:#e8eaf6,stroke:#3f51b5,stroke-width:2px
    classDef decision fill:#fff8e1,stroke:#f9a825,stroke-width:2px
    classDef database fill:#fce4ec,stroke:#c2185b,stroke-width:2px
    classDef api fill:#fff3e0,stroke:#e65100,stroke-width:2px
    classDef code fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
    classDef disabled stroke-dasharray: 5 5,opacity: 0.5
    class n0 trigger
    class n1,n6,n11 ai
    class n15 aiModel
    class n2 ai
    class n3 decision
    class n5,n10 api
    classDef customIcon fill:none,stroke:none
    class n0,n4,n5,n9,n10,n13 customIcon

The Problem: WhatsApp Support Becomes Unscalable Fast

WhatsApp is great for customers because it’s effortless. For your team, it’s a constant context switch. One minute it’s “What’s your return policy?”, the next it’s a 40-second voice note with three questions inside it, and then a photo asking “Is this the right size?” If you’re relying on humans to remember your policies, pricing, and edge cases, you get slow replies, inconsistent answers, and a support backlog that grows at the worst time (after hours, weekends, launches). Honestly, the mental load is the real cost.

The friction compounds. Here’s where it breaks down in real day-to-day support.

Voice notes and images force a manual “decode” step before anyone can even start replying.
Two agents answer the same question differently, which creates refunds, arguments, and “but your team said…” screenshots.
Updating canned responses doesn’t work when your policies live in someone’s head instead of one source of truth.
Multilingual chats slow everything down, because translating and staying polite takes time and attention.

The Solution: Auto-Reply From Your Google Docs Knowledge Base

This workflow turns WhatsApp into an always-on support channel backed by your Google Docs knowledge base. It starts when a customer message hits your WhatsApp webhook in n8n. The workflow figures out what type of message it is (text, voice note, or image), then converts everything into clean text: voice gets transcribed, images get described, and plain text gets formatted. That text is sent to an AI agent that answers like a professional support rep, pulls facts from your Google Docs content, and keeps conversation context per phone number so the customer doesn’t have to repeat themselves. Finally, the reply goes straight back to WhatsApp automatically.

The workflow begins with an incoming WhatsApp message and a quick classification. From there, media is downloaded when needed and processed through AI (transcription for audio, vision analysis for images). The AI agent then uses your Google Docs reference to generate a consistent, on-brand response and sends it back within seconds.

What You Get: Automation vs. Results

What This Workflow Automates

Results You’ll Get

Classifies WhatsApp messages as text, voice, or image automatically.
Downloads voice notes and transcribes them into readable text.
Analyzes incoming images and turns them into a useful description for support.
Pulls answers from a Google Docs knowledge base and replies in the customer’s language.

Most teams go from “hours later” to replies in under a minute.
Customers get the same policy answer every time, even across different agents or shifts.
Fewer back-and-forth messages because the AI keeps context in the conversation memory.
Support coverage after hours without adding headcount.
Less time translating and rewriting, especially for English + Roman Urdu chats.

Example: What This Looks Like

Say your inbox gets about 40 WhatsApp messages a day, and roughly 10 of them are voice notes or images. Manually, those 10 messages often take about 5 minutes each (listen, interpret, check the doc, translate, reply), so you burn close to an hour just on the “hard” messages. With this workflow, the customer message triggers instantly, transcription or image reading happens in the background, and the reply is sent back automatically in under a minute. That’s about an hour back most days, and the answers stay consistent.

What You’ll Need

n8n instance (try n8n Cloud free)
Self-hosting option if you prefer (Hostinger works well)
WhatsApp Business API for receiving and sending messages
Google Docs API to query your knowledge base document
OpenAI API key (get it from your OpenAI dashboard)

Skill level: Intermediate. You’ll connect APIs, set permissions, and test a few message types end-to-end.

Don’t want to set this up yourself? Talk to an automation expert (free 15-minute consultation).

How It Works

A WhatsApp message triggers the workflow. n8n receives the incoming message via your WhatsApp webhook, tied to a verified business number.

The workflow standardizes the message into text. A classifier routes the message by type. If it’s a voice note, the audio file is fetched and transcribed with OpenAI. If it’s an image, the file is retrieved and analyzed with an AI vision step so the question becomes “plain text” the agent can answer.

The AI agent generates the support reply. The agent uses an OpenRouter chat model and a Google Docs tool to look up your latest policies, pricing, and FAQs. Conversation memory is attached per phone number, which means follow-up questions don’t reset the thread.

The response is sent back to WhatsApp. The final message is formatted for WhatsApp and delivered automatically, so the customer gets a fast answer without waiting for a human to free up.

You can easily modify the knowledge base structure to support new product lines or a different tone of voice based on your needs. See the full implementation guide below for customization options.

Step-by-Step Implementation Guide

Step 1: Configure the WhatsApp Trigger

Start by setting up the WhatsApp webhook that receives incoming customer messages.

Add the Incoming WhatsApp Hook node as your trigger.
Set Updates to messages.
Credential Required: Connect your whatsAppTriggerApi credentials in Incoming WhatsApp Hook.

Execution begins at Incoming WhatsApp Hook and routes into Classify Input Format.

Step 2: Classify the Incoming Message Type

Route incoming WhatsApp messages into text, image, or audio processing paths using the switch logic.

Add the Classify Input Format node after Incoming WhatsApp Hook.
Configure the Voice rule to check {{$json.messages[0].audio}} with the Exists operator.
Configure the Image rule to check {{$json.messages[0].image}} with the Exists operator.
Configure the Text rule to check {{$json.messages[0].text.body}} with the Exists operator.

The Classify Input Format node routes to Fetch Audio Link, Fetch Image Link, or Text Message Formatter based on the message type.

Step 3: Process Text, Image, and Audio Inputs

Set up the input-specific pipelines that transform each message type into a unified text payload.

In Text Message Formatter, set the text assignment value to {{ $('Incoming WhatsApp Hook').item.json.messages[0].text.body }}.
For audio: configure Fetch Audio Link with Resource media, Operation mediaUrlGet, and Media Get ID {{ $('Incoming WhatsApp Hook').item.json.messages[0].audio.id }}.
Connect Fetch Audio Link → Retrieve Audio File and set URL to {{$json.url}}, with Authentication genericCredentialType and Generic Auth Type httpHeaderAuth.
Connect Retrieve Audio File → Audio Transcription and set Resource to audio and Operation to transcribe.
In Audio Text Formatter, set the text assignment value to {{$json.text}}.
For images: configure Fetch Image Link with Resource media, Operation mediaUrlGet, and Media Get ID {{ $('Incoming WhatsApp Hook').item.json.messages[0].image.id }}.
Connect Fetch Image Link → Retrieve Image File and set URL to {{$json.url}}, with Authentication genericCredentialType and Generic Auth Type httpHeaderAuth.
Connect Retrieve Image File → Image Content Review and set Resource to image, Operation to analyze, Input Type to base64, and Text to Describe the image in detail..
In Image Text Composer, set the text assignment value to # The user provided the following image and text. ## IMAGE CONTENT: {{ $json.content }} ## USER MESSAGE: {{ $('Incoming WhatsApp Hook').item.json.messages[0].image.caption || "Describe the image" }}.

Credential Required: Connect your whatsAppApi credentials in Fetch Audio Link and Fetch Image Link.

Credential Required: Connect your httpHeaderAuth credentials in Retrieve Audio File and Retrieve Image File.

Credential Required: Connect your openAiApi credentials in Audio Transcription and Image Content Review.

⚠️ Common Pitfall: If the WhatsApp media URL expires quickly, test the workflow immediately after sending an audio or image message.

Step 4: Configure the AI Orchestration Layer

Set up the AI agent, memory, language model, and document tool used to generate accurate responses.

In Support AI Orchestrator, set Text to {{$json.text}} and keep Prompt Type as define.
Ensure the system prompt in Support AI Orchestrator includes dynamic values like {{ $('Incoming WhatsApp Hook').item.json.contacts[0].profile.name }} and {{ $now.toString() }}.
Connect Conversation Memory to Support AI Orchestrator via the AI Memory port and set Session Key to {{ $('Incoming WhatsApp Hook').item.json.messages[0].from }} with Context Window Length 20.
Connect Fetch Docs Reference to Support AI Orchestrator as an AI Tool and set Operation to get with Document URL [YOUR_ID].
Connect OpenRouter Chat Engine to Support AI Orchestrator as the AI Language Model and set Model to anthropic/claude-sonnet-4.

Credential Required: Connect your openRouterApi credentials for the OpenRouter Chat Engine connection on Support AI Orchestrator.

Credential Required: Connect your googleDocsOAuth2Api credentials for the Fetch Docs Reference tool connection on Support AI Orchestrator.

Tip: Conversation Memory and Fetch Docs Reference are AI sub-nodes. Manage their credentials and connections from Support AI Orchestrator, not directly on the sub-nodes.

Step 5: Configure WhatsApp Reply Delivery

Send the AI-generated response back to the WhatsApp user.

Add Send WhatsApp Reply after Support AI Orchestrator.
Set Operation to send and Text Body to {{$json.output}}.
Set Phone Number ID to [YOUR_ID].
Set Recipient Phone Number to {{ $('Incoming WhatsApp Hook').item.json.messages[0].from }}.
Credential Required: Connect your whatsAppApi credentials in Send WhatsApp Reply.

⚠️ Common Pitfall: Replace [YOUR_ID] in Send WhatsApp Reply and Fetch Docs Reference with your real WhatsApp Business Phone Number ID and Google Doc URL.

Step 6: Test and Activate Your Workflow

Validate each branch (text, image, audio) and enable the workflow for production use.

Click Execute Workflow and send a WhatsApp test message with text to trigger Text Message Formatter → Support AI Orchestrator → Send WhatsApp Reply.
Send an image and confirm the flow Fetch Image Link → Retrieve Image File → Image Content Review → Image Text Composer → Support AI Orchestrator completes with a reply.
Send a voice note and confirm the flow Fetch Audio Link → Retrieve Audio File → Audio Transcription → Audio Text Formatter → Support AI Orchestrator completes with a reply.
Verify successful execution by checking that Send WhatsApp Reply returns a message to the sender.
Toggle the workflow to Active once all branches respond correctly.

🔒

Unlock Full Step-by-Step Guide

Get the complete implementation guide + downloadable template

Common Gotchas

WhatsApp Business API credentials can expire or require specific permissions. If things break, check your Meta developer app settings and webhook subscriptions first.
If you’re using Wait nodes or external rendering, processing times vary. Bump up the wait duration if downstream nodes fail on empty responses.
Default prompts in AI nodes are generic. Add your brand voice early or you’ll be editing outputs forever.

Frequently Asked Questions

How long does it take to set up this WhatsApp support automation automation?

Plan for about 2-3 hours if your API accounts are ready.

Do I need coding skills to automate WhatsApp support automation?

No. You’ll mostly be connecting accounts and pasting API keys into n8n credentials.

Is n8n free to use for this WhatsApp support automation workflow?

Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You’ll also need to factor in OpenAI API costs for transcription and image analysis, plus OpenRouter model usage for the chat agent.

Where can I host n8n to run this automation?

Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.

Can I customize this WhatsApp support automation workflow for adding more languages?

Yes. You’ll adjust the AI agent’s system prompt to add language rules, and you can expand language detection logic in the same place you currently handle English and Roman Urdu. Many teams also customize the Google Docs structure (clear FAQ headings help) and add a “human handoff” rule when the agent isn’t confident.

Why is my WhatsApp Business API connection failing in this workflow?

Usually it’s expired credentials or a webhook/subscription issue in your Meta app. Regenerate the token, confirm the phone number is still connected, and re-check webhook permissions. If it fails only on media messages, the file download request is often the culprit (wrong URL, missing auth header, or the media link already expired).

How many messages can this WhatsApp support automation automation handle?

A lot, but it depends on where you run n8n and your API limits.

Is this WhatsApp support automation automation better than using Zapier or Make?

For media-heavy WhatsApp support, n8n is usually the better fit because you can branch logic freely, keep conversation memory, and self-host for high volume without paying per tiny step. Zapier and Make can work, but multi-step AI flows (download media, transcribe, analyze, query knowledge base, respond) get expensive and harder to maintain. n8n also makes it easier to swap models later if you want. The tradeoff is setup complexity, especially around WhatsApp Business API. Talk to an automation expert if you want help choosing.

Once this is running, your Google Doc becomes the brain and WhatsApp becomes the front desk. Set it up, tune the tone, and let the workflow handle the repetitive questions while you focus on the exceptions.

Need Help Setting This Up?

Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.

Lisa Granqvist

Workflow Automation Expert

Expert in workflow automation and no-code tools.

{
"meta": {
"instanceId": "a7b753c0364ca1eee991dfe97646e08c4f38f54553c351f09fbdede5b94cac8e",
"templateCredsSetupCompleted": null,
"templateId": null
},
"nodes": [
{
"id": "flowpast-topbar-9027",
"name": "Flowpast Branding",
"type": "n8n-nodes-base.stickyNote",
"position": [
90,
-40
],
"parameters": {
"color": 7,
"width": 1155,
"height": 80,
"content": "## Flowpast.com | Automation Workflow Library\n**\ud83d\udcd6 Full tutorial & setup guide:** flowpast.com"
},
"typeVersion": 1
},
{
"id": "ab7c3125-5bfd-4f3c-8abc-0ab47edaf85b",
"name": "Incoming WhatsApp Hook",
"type": "n8n-nodes-base.whatsAppTrigger",
"position": [
1145,
235
],
"webhookId": "d3978cae-2aca-4553-8ac7-ab89068deabc",
"parameters": {
"options": [],
"updates": [
"messages"
]
},
"credentials": {
"whatsAppTriggerApi": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 1
},
{
"id": "07fcf5e2-04ae-4f4a-a440-093f18a1e55e",
"name": "Support AI Orchestrator",
"type": "@n8n/n8n-nodes-langchain.agent",
"position": [
330,
240
],
"parameters": {
"text": "={{ $json.text }}",
"options": {
"systemMessage": "=# WhatsApp Customer Support Agent - Sycorda AI Assistant\n\n#call Fetch Docs Reference to get details of Sycorda\n\n## Core Identity\nYou are a professional customer support representative for sycorda. You communicate through WhatsApp with customers ZZZzseeking information about our services and solutions.\n\n## Communication Context\n- **Current User**: {{ $('Incoming WhatsApp Hook').item.json.contacts[0].profile.name }}\n- **Timestamp**: {{ $now.toString() }}\n- **Platform**: WhatsApp Business API\n\n## Primary Directive\nYour sole purpose is to provide accurate information about Sycorda by retrieving relevant details from the Google Doc knowledge base. You must ONLY answer based on documented information - never fabricate or assume details.\n\n## Language Protocol\n\n### Language Detection & Response Rules\n1. **English Input** \u2192 Respond in English\n2. **Urdu Input** \u2192 Respond in Roman Urdu\n3. **Voice Notes**: Transcribe first, then apply language matching\n4. **Images**: Acknowledge receipt, extract text if present, respond accordingly\n\n### Roman Urdu Guidelines\n- Write naturally: \"Aap ka sawal ka jawab yeh hai\" NOT \"\u0100p k\u0101 saw\u0101l k\u0101 jaw\u0101b yah hai\"\n- Use common spellings: \"kya\", \"kaise\", \"theek\", \"shukriya\"\n- Avoid diacritical marks and special characters\n- Match casual WhatsApp conversation style\n\n## Response Framework\n\n### Tone Requirements\n- **Laconic**: Maximum 2-3 sentences per response unless complex explanation needed\n- **Spartan**: Direct, no fluff, no unnecessary pleasantries\n- **Human**: Natural conversational flow, not robotic\n\n### Response Structure\n1. **Acknowledge** (if needed): Brief recognition of query\n2. **Answer**: Direct information from knowledge base\n3. **Clarify** (if needed): One follow-up question maximum\n\n## Tool Integration Protocol\n\n### Google Doc Access\nWhen user asks a question:\n1. Parse query for key terms\n2. Search Google Doc for relevant section\n3. Extract specific answer\n4. Reformulate in appropriate language/tone\n5. Deliver response\n\n### Information Retrieval Rules\n- ONLY use information present in Google Doc\n- If information not found: \"Is baare mein mujhe docs mein info nahi mili. Kya aap kuch aur puchna chahte hain?\" (Roman Urdu) or \"I couldn't find this information in our documentation. Would you like to know something else?\" (English)\n- Never guess or provide generic responses\n\n## Message Type Handlers\n\n### Text Messages\n\n\nDirect query \u2192 Search Doc \u2192 Concise Answer\n\n\n### Voice Notes\n\n\nTranscribe \u2192 Detect Language \u2192 Search Doc \u2192 Reply in same language\n\n\n### Images\n\n\nAcknowledge \u2192 Extract text/context \u2192 Process as text query\n\n\n## Example Interactions\n\n### English Example\n**User**: \"What services does sycorda offer?\"\n**Agent**: \"Sycorda provides [specific services from doc]. We specialize in [main specialty from doc].\"\n\n### Roman Urdu Example\n**User**: \"Sycordar ki fees kya hai?\"\n**Agent**: \"Hamari fees [doc se specific amount] hai. Payment monthly ya project basis pe ho sakti hai.\"\n\n### Mixed Context Example\n**User sends voice note in Urdu asking about contact**\n**Agent**: \"Aap humse [contact info from doc] pe rabta kar sakte hain. Office timings [timings from doc] hain.\"\n\n## Error Handling\n\n### Information Not Found\n- English: \"I don't have that specific information. Can I help with something else?\"\n- Roman Urdu: \"Yeh info mere paas nahi hai. Kuch aur puch sakte hain?\"\n\n### Unclear Query\n- English: \"Could you clarify what you need to know about [topic]?\"\n- Roman Urdu: \"[topic] ke baare mein kya janna chahte hain?\"\n\n## Constraints\n1. NEVER provide information not in the Google Doc\n2. NEVER use formal Urdu script or complex transliterations\n3. NEVER exceed 3 sentences unless absolutely necessary\n4. NEVER sound like an automated bot\n5. ALWAYS maintain consistent tone throughout conversation\n\n## Performance Metrics\n- Response accuracy: 100% from documented sources\n- Response brevity: <50 words average\n- Language matching: 100% accuracy\n- Human-like interaction: Natural flow maintained"
},
"promptType": "define"
},
"retryOnFail": true,
"typeVersion": 1.9
},
{
"id": "ff32e635-cabc-451a-8ec8-7d740996f3b5",
"name": "Conversation Memory",
"type": "@n8n/n8n-nodes-langchain.memoryBufferWindow",
"position": [
345,
140
],
"parameters": {
"sessionKey": "={{ $('Incoming WhatsApp Hook').item.json.messages[0].from }}",
"sessionIdType": "customKey",
"contextWindowLength": 20
},
"typeVersion": 1.3
},
{
"id": "212f1042-a90e-4e91-9660-97dac9dcbc7b",
"name": "Classify Input Format",
"type": "n8n-nodes-base.switch",
"position": [
910,
260
],
"parameters": {
"rules": {
"values": [
{
"outputKey": "Voice",
"conditions": {
"options": {
"version": 2,
"leftValue": "",
"caseSensitive": true,
"typeValidation": "strict"
},
"combinator": "and",
"conditions": [
{
"id": "b7b64446-f1ea-4622-990c-22f3999a8269",
"operator": {
"type": "object",
"operation": "exists",
"singleValue": true
},
"leftValue": "={{ $json.messages[0].audio }}",
"rightValue": ""
}
]
},
"renameOutput": true
},
{
"outputKey": "Image",
"conditions": {
"options": {
"version": 2,
"leftValue": "",
"caseSensitive": true,
"typeValidation": "strict"
},
"combinator": "and",
"conditions": [
{
"id": "202af928-a324-411a-bf15-68a349e7bf9e",
"operator": {
"type": "object",
"operation": "exists",
"singleValue": true
},
"leftValue": "={{ $json.messages[0].image }}",
"rightValue": ""
}
]
},
"renameOutput": true
},
{
"outputKey": "Text",
"conditions": {
"options": {
"version": 2,
"leftValue": "",
"caseSensitive": true,
"typeValidation": "strict"
},
"combinator": "and",
"conditions": [
{
"id": "08fd0c80-307e-4f45-b1de-35192ee4ec5e",
"operator": {
"type": "string",
"operation": "exists",
"singleValue": true
},
"leftValue": "={{ $json.messages[0].text.body }}",
"rightValue": ""
}
]
},
"renameOutput": true
}
]
},
"options": []
},
"typeVersion": 3.2
},
{
"id": "4d6c41e6-4f24-49b4-a27c-78ee4f0dab70",
"name": "Fetch Image Link",
"type": "n8n-nodes-base.whatsApp",
"position": [
710,
295
],
"webhookId": "280bd5de-32d7-4d8f-93d2-e91e3b0bc161",
"parameters": {
"resource": "media",
"operation": "mediaUrlGet",
"mediaGetId": "={{ $('Incoming WhatsApp Hook').item.json.messages[0].image.id }}"
},
"credentials": {
"whatsAppApi": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 1
},
{
"id": "9257518f-7e4d-4734-8afd-ea70982ea76c",
"name": "Retrieve Image File",
"type": "n8n-nodes-base.httpRequest",
"position": [
520,
250
],
"parameters": {
"url": "={{ $json.url }}",
"options": [],
"authentication": "genericCredentialType",
"genericAuthType": "httpHeaderAuth"
},
"credentials": {
"httpHeaderAuth": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 4.2
},
{
"id": "a518322c-525a-47a8-a11a-37d50fdc5c88",
"name": "Image Content Review",
"type": "@n8n/n8n-nodes-langchain.openAi",
"position": [
310,
290
],
"parameters": {
"text": "=Describe the image in detail.",
"modelId": {
"__rl": true,
"mode": "list",
"value": "chatgpt-4o-latest",
"cachedResultName": "CHATGPT-4O-LATEST"
},
"options": [],
"resource": "image",
"inputType": "base64",
"operation": "analyze"
},
"credentials": {
"openAiApi": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 1.8
},
{
"id": "c96a91dc-1f91-49db-9278-2a3393f1a6ad",
"name": "Image Text Composer",
"type": "n8n-nodes-base.set",
"position": [
120,
245
],
"parameters": {
"options": [],
"assignments": {
"assignments": [
{
"id": "67552183-de2e-494a-878e-c2948e8cb6bb",
"name": "text",
"type": "string",
"value": "=# The user provided the following image and text.\n\n## IMAGE CONTENT:\n{{ $json.content }}\n\n## USER MESSAGE:\n{{ $('Incoming WhatsApp Hook').item.json.messages[0].image.caption || \"Describe the image\" }}"
}
]
}
},
"typeVersion": 3.4
},
{
"id": "c7c85070-4c64-469d-af59-4d71cb582386",
"name": "Text Message Formatter",
"type": "n8n-nodes-base.set",
"position": [
540,
450
],
"parameters": {
"options": [],
"assignments": {
"assignments": [
{
"id": "c05a7fbf-309a-407e-9fee-7e0b03f4a5c8",
"name": "text",
"type": "string",
"value": "={{ $('Incoming WhatsApp Hook').item.json.messages[0].text.body }}"
}
]
}
},
"typeVersion": 3.4
},
{
"id": "37afcfdc-29d9-49bb-a259-c714862345bc",
"name": "Fetch Audio Link",
"type": "n8n-nodes-base.whatsApp",
"position": [
720,
70
],
"webhookId": "87caa300-7204-47b5-959a-94f4a8fbf8cf",
"parameters": {
"resource": "media",
"operation": "mediaUrlGet",
"mediaGetId": "={{ $('Incoming WhatsApp Hook').item.json.messages[0].audio.id }}"
},
"credentials": {
"whatsAppApi": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 1
},
{
"id": "0bae0627-5109-4fc4-825b-4b49287cdd25",
"name": "Retrieve Audio File",
"type": "n8n-nodes-base.httpRequest",
"position": [
515,
90
],
"parameters": {
"url": "={{ $json.url }}",
"options": [],
"authentication": "genericCredentialType",
"genericAuthType": "httpHeaderAuth"
},
"credentials": {
"httpHeaderAuth": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 4.2
},
{
"id": "261d1bd3-76c4-436e-ae62-501d3c967cab",
"name": "Audio Transcription",
"type": "@n8n/n8n-nodes-langchain.openAi",
"position": [
295,
110
],
"parameters": {
"options": [],
"resource": "audio",
"operation": "transcribe"
},
"credentials": {
"openAiApi": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 1.8
},
{
"id": "6af95a0a-c27a-4b47-abe7-3e128d96284a",
"name": "Audio Text Formatter",
"type": "n8n-nodes-base.set",
"position": [
110,
60
],
"parameters": {
"options": [],
"assignments": {
"assignments": [
{
"id": "219577d5-b028-48fc-90be-980f4171ab68",
"name": "text",
"type": "string",
"value": "={{ $json.text }}"
}
]
}
},
"typeVersion": 3.4
},
{
"id": "1d0c8c18-3bba-49fd-a929-cf9465ff23c1",
"name": "Send WhatsApp Reply",
"type": "n8n-nodes-base.whatsApp",
"position": [
90,
270
],
"webhookId": "23834751-5066-48ba-8e19-549680df2b27",
"parameters": {
"textBody": "={{ $json.output }}",
"operation": "send",
"phoneNumberId": "[YOUR_ID]",
"additionalFields": [],
"recipientPhoneNumber": "={{ $('Incoming WhatsApp Hook').item.json.messages[0].from }}"
},
"credentials": {
"whatsAppApi": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 1
},
{
"id": "e6b85721-39a8-4377-9347-7d17da22adaa",
"name": "Fetch Docs Reference",
"type": "n8n-nodes-base.googleDocsTool",
"position": [
345,
335
],
"parameters": {
"operation": "get",
"documentURL": "[YOUR_ID]"
},
"credentials": {
"googleDocsOAuth2Api": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 2
},
{
"id": "e0c0f236-b08f-44a0-9efa-89668d005040",
"name": "OpenRouter Chat Engine",
"type": "@n8n/n8n-nodes-langchain.lmChatOpenRouter",
"position": [
345,
410
],
"parameters": {
"model": "anthropic/claude-sonnet-4",
"options": []
},
"credentials": {
"openRouterApi": {
"id": "credential-id",
"name": ""
}
},
"typeVersion": 1
}
],
"pinData": [],
"connections": {
"Support AI Orchestrator": {
"main": [
[
{
"node": "Send WhatsApp Reply",
"type": "main",
"index": 0
}
]
]
},
"Audio Text Formatter": {
"main": [
[
{
"node": "Support AI Orchestrator",
"type": "main",
"index": 0
}
]
]
},
"Image Content Review": {
"main": [
[
{
"node": "Image Text Composer",
"type": "main",
"index": 0
}
]
]
},
"Fetch Audio Link": {
"main": [
[
{
"node": "Retrieve Audio File",
"type": "main",
"index": 0
}
]
]
},
"Fetch Image Link": {
"main": [
[
{
"node": "Retrieve Image File",
"type": "main",
"index": 0
}
]
]
},
"Conversation Memory": {
"ai_memory": [
[
{
"node": "Support AI Orchestrator",
"type": "ai_memory",
"index": 0
}
]
]
},
"Retrieve Audio File": {
"main": [
[
{
"node": "Audio Transcription",
"type": "main",
"index": 0
}
]
]
},
"Retrieve Image File": {
"main": [
[
{
"node": "Image Content Review",
"type": "main",
"index": 0
}
]
]
},
"Classify Input Format": {
"main": [
[
{
"node": "Fetch Audio Link",
"type": "main",
"index": 0
}
],
[
{
"node": "Fetch Image Link",
"type": "main",
"index": 0
}
],
[
{
"node": "Text Message Formatter",
"type": "main",
"index": 0
}
]
]
},
"Text Message Formatter": {
"main": [
[
{
"node": "Support AI Orchestrator",
"type": "main",
"index": 0
}
]
]
},
"Audio Transcription": {
"main": [
[
{
"node": "Audio Text Formatter",
"type": "main",
"index": 0
}
]
]
},
"Incoming WhatsApp Hook": {
"main": [
[
{
"node": "Classify Input Format",
"type": "main",
"index": 0
}
]
]
},
"Image Text Composer": {
"main": [
[
{
"node": "Support AI Orchestrator",
"type": "main",
"index": 0
}
]
]
},
"OpenRouter Chat Engine": {
"ai_languageModel": [
[
{
"node": "Support AI Orchestrator",
"type": "ai_languageModel",
"index": 0
}
]
]
},
"Fetch Docs Reference": {
"ai_tool": [
[
{
"node": "Support AI Orchestrator",
"type": "ai_tool",
"index": 0
}
]
]
}
},
"id": "",
"versionId": "",
"name": "Automated WhatsApp Support Workflow"
}