Apify to Google Sheets, competitor topic map ready
Competitor research sounds simple until you’re ten tabs deep, copying headings into a doc, and you still can’t tell what topics actually drive their traffic.
SEO specialists feel this in every content audit. A content strategist trying to build a topic map feels it too. And if you run an agency, it turns into billable hours spent on busywork. This competitor research automation replaces the manual browsing with one structured table in Google Sheets.
You’ll learn what this workflow does, what you need to run it, and how to use the output to plan content with a lot more confidence.
How This Automation Works
Here’s the complete workflow you’ll be setting up:
n8n Workflow Template: Apify to Google Sheets, competitor topic map ready
```mermaid
flowchart LR
    n0["Incoming Webhook Trigger"] --> n1["Return Form Page"]
    n0 --> n2["Assemble Crawl Payload"]
    n2 --> n3{"Validate Input Presence"}
    n3 --> n4[("Log Request Details")]
    n3 --> n5["Crawl Rival Site"]
    n5 --> n6["Retrieve Crawl Dataset"]
    n6 --> n7["Derive Page Metadata"]
    n8["Gemini Content Review"] -.-> n9["Language Analysis Agent"]
    n7 --> n9
    n9 --> n10["Normalize Model Output"]
    n10 --> n11["Generate Sheet Label"]
    n11 --> n12[("Create Results Sheet")]
    n11 --> n13["Combine Streams"]
    n12 --> n13
    n13 --> n14["Map Sheet Fields"]
    n14 --> n15[("Store Captured Results")]
    n15 --> n16{"Check Email Payload"}
    n16 --> n17["Dispatch Email Report"]
```
Why This Matters: Competitor Topic Research Is Slow (and Messy)
Manual competitor research is the kind of task that steals time in tiny pieces. You review a competitor’s blog, skim categories, open articles, copy a few headings, then realize you missed the “resources” section or the glossary that’s quietly pulling links. Next thing you know, you’ve got scattered notes, inconsistent naming, and no way to compare two competitors without starting over. Honestly, the worst part is the mental load: you spend energy collecting data instead of spotting patterns and planning what to publish.
It adds up fast. Here’s where it usually breaks down:
- Copying titles, entities, and categories into spreadsheets takes about 2 hours per competitor site, even when you move quickly.
- Two people will label the same topic differently, so your “analysis” becomes cleanup.
- Important pages get missed because navigation hides them, or the site structure is deeper than you expected.
- You can’t reliably score content depth or coverage without a consistent, repeatable extraction process.
What You’ll Build: An Automated Competitor Topic Map in Google Sheets
This workflow starts with a simple form submission through an n8n webhook. You enter the competitor domain(s), plus crawl limits like depth and max pages. From there, Apify crawls the site and produces a dataset of URLs and page content you can analyze. The workflow pulls that dataset back into n8n, derives page metadata, and then uses AI (Gemini plus an analysis agent) to turn messy page text into structured outputs like topic hierarchies, entities, and depth scores. Finally, everything is normalized into clean fields, a labeled results sheet is created in Google Sheets, and rows are saved in a format you can filter, chart, or export. If you want, it can also email a report via Gmail once the run completes.
In short: you submit a crawl request, the workflow crawls and parses competitor pages into a consistent topic structure, and it finishes by writing a tidy, analyzable table to Google Sheets (and optionally sending an email summary).
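To make the output concrete, here's the kind of row each analyzed page ends up producing. The field names match the Map Sheet Fields step later in this guide; the values are purely illustrative:

```javascript
// Illustrative only: one finished row as it lands in Google Sheets.
// Field names follow the Map Sheet Fields step; the values are made up.
const exampleRow = {
  page_url: 'https://competitor.example.com/blog/email-deliverability-guide',
  main_topics: 'Email marketing > Deliverability > Authentication',
  key_words: 'SPF, DKIM, DMARC, sender reputation',
};
console.log(exampleRow);
```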
What You’re Building
| What Gets Automated | What You’ll Achieve |
|---|---|
| Apify crawls each competitor site and n8n pulls the dataset back | No more tab-hopping or copy-pasting headings into docs |
| Gemini and the analysis agent extract topics, entities, and depth scores | Consistent labels you can compare across competitors |
| Normalized rows are written to a labeled Google Sheet, with an optional Gmail report | A filterable, chartable topic map ready for content planning |
Expected Results
Say you audit 5 competitors for a quarterly content refresh. Manually, if each site takes about 2 hours to review and log, that’s roughly 10 hours before you even start planning. With this workflow, you spend about 10 minutes submitting each crawl (around 50 minutes total), then you wait for Apify and the AI steps to process. You still review the output, but the copy-paste part is gone, and most teams get the bulk of that 10 hours back.
Before You Start
- n8n instance (try n8n Cloud free)
- Self-hosting option if you prefer (Hostinger works well)
- Google Sheets account for storing the topic map output
- Apify account to crawl competitor websites at scale
- Google Generative AI (Gemini) API key (get it from Google AI Studio)
- Gmail account (optional) for the email report
Skill level: Intermediate. You’ll connect accounts, add API keys, and tweak a few crawl settings safely.
Want someone to build this for you? Talk to an automation expert (free 15-minute consultation).
Step by Step
You submit a crawl request. The workflow is triggered by an incoming webhook, which returns a simple form page so you can provide a competitor domain and crawl limits without touching the workflow every time.
Your inputs get validated and logged. n8n checks that required fields are present, then writes a request log to Google Sheets so you have a record of what was crawled and when.
Apify crawls the competitor site and returns a dataset. The workflow runs the Apify crawler, then pulls the resulting dataset via HTTP Request and derives useful metadata from each page.
AI converts raw pages into a topic map. Gemini and the language analysis agent extract structured topics, entities, and depth signals, then a normalization step cleans everything into consistent fields for reporting.
Results are written to a fresh Google Sheet (and optionally emailed). A sheet label is generated, a new results sheet is created, streams are merged, and final rows are stored. If you include an email payload, Gmail sends a report.
You can easily modify crawl depth and max pages based on your needs. See the full implementation guide below for customization options.
Step-by-Step Implementation Guide
Step 1: Configure the Webhook Trigger
Set up the inbound request and the HTML response page for submitting competitor crawl requests.
- Open Incoming Webhook Trigger and set Path to `competitors`.
- Set Response Mode to `responseNode` and enable Multiple Methods.
- Open Return Form Page and keep Respond With set to `text` with the provided HTML in Response Body.
- Copy your production webhook URL and replace `https://[YOUR_WEBHOOK_URL]` inside the HTML in Return Form Page.
Warning: if you leave `https://[YOUR_WEBHOOK_URL]` in the HTML, the form will submit to a placeholder URL and the workflow will never trigger.

Step 2: Connect Google Sheets
Prepare the logging and results storage in Google Sheets.
- Open Log Request Details, select the target spreadsheet, and set Sheet Name to `CONFIG`.
- Credential Required: Connect your `googleSheetsOAuth2Api` credentials in Log Request Details.
- Open Create Results Sheet and set Title to `{{ $json.sheet_name }}`.
- Credential Required: Connect your `googleSheetsOAuth2Api` credentials in Create Results Sheet and select the same spreadsheet.
- Open Store Captured Results and set Sheet Name to `{{ $('Generate Sheet Label').item.json.sheet_name }}`.
- Credential Required: Connect your `googleSheetsOAuth2Api` credentials in Store Captured Results.
Step 3: Set Up Crawl Assembly, Validation, and Parallel Logging
Build the crawl payload, validate inputs, and run the crawl and logging in parallel.
- Open Assemble Crawl Payload and confirm the code handles form input normalization and Apify payload construction (a sketch follows this list).
- Open Validate Input Presence and keep the condition `{{ $json.startUrls[0].method }}` equals `GET`.
- Validate Input Presence outputs to both Crawl Rival Site and Log Request Details in parallel.
- Open Crawl Rival Site and confirm Custom Body contains `{{ JSON.stringify({ ... }) }}` with `maxRequestsPerCrawl` and `maxDepth` referencing the input values.
- Credential Required: Connect your `apifyApi` credentials in Crawl Rival Site.
- Open Retrieve Crawl Dataset and keep URL set to `{{ "https://api.apify.com/v2/datasets/" + ($json.data?.defaultDatasetId || $json.defaultDatasetId || "[YOUR_ID]") + "/items?clean=true&format=json&offset=0&limit=1" }}`.
Step 4: Set Up AI Analysis and Normalization
Extract metadata, run the AI analysis, and normalize structured output for storage.
- Open Derive Page Metadata and ensure the code generates `page_url`, `title`, and `markdown` fields from the crawl result.
- Open Language Analysis Agent and keep the Text prompt with the JSON schema and the expression `{{ $json["markdown"] }}`.
- Open Gemini Content Review and ensure it is linked as the language model to Language Analysis Agent.
- Credential Required: Connect your `googlePalmApi` credentials in Gemini Content Review. This credential powers Language Analysis Agent.
- Open Normalize Model Output and ensure it formats `main_topics_flat` and `key_entities_flat` for downstream use (a sketch follows this list).
- Open Generate Sheet Label and keep the sheet naming logic so each analysis generates a unique tab name.
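As a reference, Normalize Model Output might look like the sketch below, assuming the agent returns JSON with `main_topics` and `key_entities` arrays (the actual schema lives in the Language Analysis Agent prompt):

```javascript
// Sketch of a Normalize Model Output Code node. The agent output shape
// (main_topics / key_entities arrays, optional depth_score) is an assumption
// based on the JSON schema described in the Language Analysis Agent prompt.
const raw = $input.first().json;
const parsed = typeof raw.output === 'string' ? JSON.parse(raw.output) : (raw.output || raw);

return [{
  json: {
    page_url: parsed.page_url || $('Derive Page Metadata').first().json.page_url,
    main_topics_flat: (parsed.main_topics || []).join(', '),
    key_entities_flat: (parsed.key_entities || []).join(', '),
    depth_score: parsed.depth_score ?? '',
  },
}];
```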
Step 5: Configure Output, Merging, and Email Delivery
Merge sheet creation with analysis output, store results, and send email reports conditionally.
- Generate Sheet Label outputs to both Create Results Sheet and Combine Streams in parallel.
- Open Combine Streams to ensure it merges the created sheet with the analysis output before mapping.
- Open Map Sheet Fields and confirm the mappings:
  - page_url → `{{ $('Normalize Model Output').item.json.page_url }}`
  - main_topics → `{{ $('Normalize Model Output').item.json.main_topics_flat }}`
  - key_words → `{{ $('Normalize Model Output').item.json.key_entities_flat }}`
- Open Check Email Payload and verify the conditions:
  - `{{ $json.page_url }}` matches `^https?://`
  - `{{ $json.main_topics }}` is not empty
  - `{{ $json.key_words }}` is not empty
- Open Dispatch Email Report and set:
  - Send To to `{{ $('Assemble Crawl Payload').item.json.notify_email }}`
  - Subject to `SEO Audit Report: {{ $json.page_url }}`
  - Message to the provided HTML template
- Credential Required: Connect your `gmailOAuth2` credentials in Dispatch Email Report.
Step 6: Test and Activate Your Workflow
Verify the workflow end-to-end using the webhook form and then enable it for production use.
- Click Execute Workflow and open the Incoming Webhook Trigger test URL in your browser.
- Submit the form from Return Form Page with a valid competitor URL and email address.
- Confirm a new tab is created by Create Results Sheet and rows are appended by Store Captured Results.
- Verify that Dispatch Email Report sends a report when Check Email Payload passes.
- Once successful, toggle the workflow to Active to accept production requests.
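If you'd rather test from a script than the browser form, you can POST the same fields straight to the webhook. A minimal sketch, assuming a placeholder host and the form field names used elsewhere in this guide (domain, crawl_depth_num, max_pages_num, notify_email):

```javascript
// Minimal test sketch: submit a crawl request directly to the webhook.
// The host is a placeholder and the field names are assumptions drawn
// from the rest of this guide; adjust both to match your setup.
const res = await fetch('https://your-n8n-host.example.com/webhook/competitors', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    domain: 'https://competitor.example.com',
    crawl_depth_num: 2,
    max_pages_num: 50,
    notify_email: 'you@example.com',
  }),
});
console.log(res.status); // expect 200 once the workflow accepts the request
```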
Troubleshooting Tips
- Google Sheets credentials can expire or need specific permissions. If things break, check the Google connection in n8n’s Credentials list first.
- Apify runs can fail when crawl limits are too aggressive for a site. Review the run logs in Apify, then lower crawl_depth_num or max_pages_num and try again.
- Default AI prompts are generic. Add your exact definition of “topic,” “entity,” and “depth score” early, or you will keep editing outputs later.
Quick Answers
How long does setup take?
About 30 minutes if your keys and accounts are ready.

Do I need to know how to code?
No. You’ll mostly connect services and adjust crawl settings. The included Code steps are already written, and you can run the workflow without editing them.

Is n8n free?
Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You’ll also need to factor in Apify usage and Gemini API costs for the AI analysis.

Should I use n8n Cloud or self-host?
Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.

Can I customize the crawl and the output?
Yes, and you should. You can change the competitor domains and crawl limits in the form inputs, then adjust max_pages_num and crawl_depth_num in the crawl payload logic. If you want different columns in Google Sheets, update the “Map Sheet Fields” step (and the normalization/code steps that shape the output). Common tweaks include adding a “content type” column, capturing internal link counts, and routing a summary to Slack instead of email.
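For instance, a "content type" column could be derived during normalization before it's mapped to the sheet. A minimal sketch, with URL patterns that are pure assumptions (tune them to the sites you crawl):

```javascript
// Hypothetical tweak for the Normalize Model Output Code node: derive a
// content_type field from the page URL, then map it to a new column in
// Map Sheet Fields. The URL patterns below are assumptions.
const item = $input.first().json;
const url = String(item.page_url || '');

let content_type = 'article';
if (/\/glossary\//i.test(url)) content_type = 'glossary';
else if (/\/category\/|\/tag\//i.test(url)) content_type = 'category';
else if (/\/resources?\//i.test(url)) content_type = 'resource';

return [{ json: { ...item, content_type } }];
```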
Why would the Google Sheets step fail?
Usually it’s expired OAuth access or a Google account permission issue. Reconnect the Google Sheets credential in n8n and confirm the target spreadsheet is accessible to that account. Also check that the workflow is allowed to create new sheets if you’re using the “Create Results Sheet” action. If it only fails sometimes, you may be hitting Google API limits during big crawls, so batching writes helps.

How much volume can this handle?
It depends on your crawl limits and plan, but most teams run a few hundred pages per competitor without issue if the crawl depth is sane. On n8n Cloud, higher tiers support higher monthly execution volume, while self-hosting removes execution caps (your server becomes the limit). Apify and the AI steps usually become the bottleneck before n8n does.

Is n8n better than Zapier or Make for this?
For this use case, often yes. You’re combining a crawler run, dataset retrieval, multi-step AI parsing, normalization, and conditional email delivery, which is a lot of branching and data shaping. n8n is more comfortable with that kind of “workflow logic,” and self-hosting matters because this workflow includes community nodes that require it. Zapier or Make can still work for a simpler version, but you’ll usually end up compromising on the crawl/AI depth or paying more as volume grows. If you want help deciding, talk to an automation expert and explain your monthly crawl size.
Once this is running, competitor research stops being a recurring project and becomes a repeatable input to your content plan. The workflow collects the data. You make the calls.
Need Help Setting This Up?
Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.