Google Sheets + ScrapingBee: enriched leads, ready
Your lead sheet looks “full,” but half the rows are dead ends. Wrong websites, missing contact pages, generic directories, and emails that bounce the moment you hit send.
This is the kind of mess that slows down marketing ops first. But agency owners building prospect lists and sales teams doing weekly outreach feel it too. With Sheets lead enrichment automation, you turn a basic “business type + city + state” row into a usable company site and real emails, without spending your afternoon in Google.
This workflow pulls leads from Google Sheets, searches with Serper.dev, scrapes likely pages via ScrapingBee, extracts email addresses, and writes everything back to your sheet with clear status updates.
How This Automation Works
See how this solves the problem:
n8n Workflow Template: Google Sheets + ScrapingBee: enriched leads, ready
```mermaid
flowchart LR
subgraph sg0["Google Sheets Flow"]
direction LR
n0@{ icon: "mdi:swap-vertical", form: "rounded", label: "Loop Over Items", pos: "b", h: 48 }
n1@{ icon: "mdi:swap-horizontal", form: "rounded", label: "If", pos: "b", h: 48 }
n2["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/httprequest.dark.svg' width='40' height='40' /></div><br/>Scraping Bee"]
n3["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/code.svg' width='40' height='40' /></div><br/>Website Options"]
n4@{ icon: "mdi:play-circle", form: "rounded", label: "Google Sheets Trigger", pos: "b", h: 48 }
n5@{ icon: "mdi:swap-horizontal", form: "rounded", label: "If1", pos: "b", h: 48 }
n6@{ icon: "mdi:swap-horizontal", form: "rounded", label: "If2", pos: "b", h: 48 }
n7@{ icon: "mdi:cog", form: "rounded", label: "Wait", pos: "b", h: 48 }
n8@{ icon: "mdi:swap-horizontal", form: "rounded", label: "If3", pos: "b", h: 48 }
n9@{ icon: "mdi:swap-vertical", form: "rounded", label: "Set Information", pos: "b", h: 48 }
n10["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/httprequest.dark.svg' width='40' height='40' /></div><br/>Search Companies (Serper.dev)"]
n11["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/code.svg' width='40' height='40' /></div><br/>Extract Company & Website"]
n12@{ icon: "mdi:database", form: "rounded", label: "Update Running Status", pos: "b", h: 48 }
n13@{ icon: "mdi:database", form: "rounded", label: "Update Missing Information S..", pos: "b", h: 48 }
n14@{ icon: "mdi:database", form: "rounded", label: "Add research Results", pos: "b", h: 48 }
n15["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/httprequest.dark.svg' width='40' height='40' /></div><br/>Test pages"]
n16@{ icon: "mdi:database", form: "rounded", label: "Update Finished Status", pos: "b", h: 48 }
n17["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/code.svg' width='40' height='40' /></div><br/>Email Extractor"]
n18@{ icon: "mdi:database", form: "rounded", label: "Get Emails", pos: "b", h: 48 }
n19@{ icon: "mdi:database", form: "rounded", label: "Add Emails", pos: "b", h: 48 }
n1 --> n2
n1 --> n0
n5 --> n17
n5 --> n0
n6 --> n18
n6 --> n0
n8 --> n12
n8 --> n13
n7 --> n0
n19 --> n7
n18 --> n19
n15 --> n1
n2 --> n5
n17 --> n6
n0 --> n16
n0 --> n15
n9 --> n10
n3 --> n0
n14 --> n3
n4 --> n8
n12 --> n9
n11 --> n14
n10 --> n11
end
%% Styling
classDef trigger fill:#e8f5e9,stroke:#388e3c,stroke-width:2px
classDef ai fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
classDef aiModel fill:#e8eaf6,stroke:#3f51b5,stroke-width:2px
classDef decision fill:#fff8e1,stroke:#f9a825,stroke-width:2px
classDef database fill:#fce4ec,stroke:#c2185b,stroke-width:2px
classDef api fill:#fff3e0,stroke:#e65100,stroke-width:2px
classDef code fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
classDef disabled stroke-dasharray: 5 5,opacity: 0.5
class n4 trigger
class n1,n5,n6,n8 decision
class n12,n13,n14,n16,n18,n19 database
class n2,n10,n15 api
class n3,n11,n17 code
classDef customIcon fill:none,stroke:none
class n2,n3,n10,n11,n15,n17 customIcon
```
The Challenge: Turning “leads” into contacts you can actually email
Building a lead list is easy. Building one that’s outreach-ready is where the time disappears. You start with a few columns in Google Sheets, then you open a new tab for every row: search the company, guess which site is real, click around for a contact page, and copy-paste anything that looks like an email. Now multiply that by 200 rows. Mistakes creep in fast, and frankly it’s mentally exhausting because every business formats their site differently and directories keep showing up in search results.
It adds up fast. Here’s where it breaks down in real life.
- You waste about 5–10 minutes per lead just figuring out the “real” website versus listings and aggregator pages.
- People copy the wrong URL into the sheet, and that one bad field poisons your entire outreach sequence.
- Email hunting becomes inconsistent, so one person finds great contacts while another finds nothing and nobody knows why.
- Status tracking is usually manual, which means duplicates, skipped rows, and “Did we already do this?” meetings.
The Fix: Google Sheets lead enrichment with Serper.dev + ScrapingBee
This workflow starts inside your Google Sheet. When you “activate” a row, it first checks that the basics are present (business type, city, state). If something is missing, it flags the row so you don’t waste cycles on junk inputs. If the row looks good, it marks the status as Running, prepares search inputs like country and language, then queries Serper.dev to find likely company websites. Next, it generates a set of “site variants” and candidate contact pages, validates those URLs, and sends the best options to ScrapingBee for scraping. Emails are extracted from the scraped pages, checked against what you already have in the sheet, and then written back in a clean comma-separated format. Finally, the row gets marked Finished so your list stays organized.
The workflow kicks off from a Sheets Row Trigger. From there, Serper.dev is used to locate the best company pages, and ScrapingBee handles the messy part of pulling content reliably. The output is simple: updated columns in Google Sheets (company name, URL, emails, and status) so your outreach list stays ready to use.
What Changes: Before vs. After
| What This Eliminates | Impact You’ll See |
|---|---|
| 5–10 minutes of manual website hunting per lead | Serper.dev surfaces likely company sites automatically |
| Wrong URLs copied into the sheet | Candidate pages are validated before anything is written back |
| Inconsistent, person-dependent email hunting | Emails are extracted and deduplicated the same way on every row |
| Manual status tracking and duplicate work | Clear Running / Missing data / Finished statuses on every row |
Real-World Impact
Say you enrich 100 leads every week. Manually, you might spend about 8 minutes per lead between searching, clicking, and hunting for a usable email, which is roughly 13 hours of busywork. With this workflow, you activate rows in Google Sheets and let it run: a minute to set up the queue, then the automation searches, validates, scrapes, and updates the sheet while you do other work. Even if you still review the results for a few minutes at the end, you’re usually saving most of that day.
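The back-of-the-envelope math from that example is simple:

```javascript
// Rough ROI estimate from the example above:
// 100 leads per week at about 8 minutes of manual work each.
const leadsPerWeek = 100;
const minutesPerLead = 8;

// Total manual effort in hours per week
const manualHours = (leadsPerWeek * minutesPerLead) / 60; // roughly 13.3 hours
```

Even after subtracting a few minutes of setup and a final review pass, nearly all of that time comes back.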
Requirements
- n8n instance (try n8n Cloud free)
- Self-hosting option if you prefer (Hostinger works well)
- Google Sheets to store leads and results.
- Serper.dev to search for real company websites.
- ScrapingBee to scrape pages and extract emails.
- Google Sheets API credentials (get them from Google Cloud Console).
- Serper.dev API key (get it from your Serper.dev dashboard).
- ScrapingBee API key (get it from your ScrapingBee dashboard).
Skill level: Beginner. You’ll connect accounts, paste API keys, and match a few Google Sheets columns.
Need help implementing this? Talk to an automation expert (free 15-minute consultation).
The Workflow Flow
A Google Sheets row gets “activated.” The trigger watches your sheet for rows you want processed, so you control what runs and when (handy when you’re cleaning up inputs first).
Basic input validation happens immediately. If business type, city, or state is missing, the workflow writes a “Missing data” status and moves on. No wasted API calls.
Serper.dev finds likely company pages. n8n sends a search request, parses the results, and appends research rows so the workflow can test multiple candidates instead of trusting the first link it sees.
URLs are validated, then ScrapingBee scrapes the best options. The workflow checks that pages respond properly, scrapes content, and extracts email addresses. If emails exist, it looks up what you already have and updates the record.
Google Sheets is updated and the row is finalized. You get company name, URL, comma-separated emails, and a Finished status so your sheet stays clean.
You can easily modify country, language, or result count to fit different regions and niches. See the full implementation guide below for customization options.
Step-by-Step Implementation Guide
Step 1: Configure the Sheets Row Trigger
Set up the trigger to watch for row updates in your input sheet.
- Add the Sheets Row Trigger node and set Event to `rowUpdate`.
- Set Columns To Watch to `Activate`.
- Set Poll Times to `everyMinute`.
- Select the target Document and Sheet Name for your input sheet.
- Credential Required: Connect your `googleSheetsTriggerOAuth2Api` credentials.
Step 2: Connect Google Sheets for Status and Data Operations
Configure the Google Sheets nodes that update status and write research data.
- In Mark Status Running, set Operation to `update`, map Client to `{{ $json.Client }}`, and set Status to `Running`.
- In Flag Missing Data, set Operation to `update`, map Client to `{{ $json.Client }}`, and set Status to `Missing data`.
- In Append Research Rows, set Operation to `append` and map: City to `{{ $('Assign Search Inputs').item.json.city }}`, State to `{{ $('Assign Search Inputs').item.json.state }}`, Client to `{{ $json.client }}`, Company to `{{ $json.company }}`, and website to `{{ $json.Website }}`.
- In Lookup Existing Emails, set the filter to look up Company with `{{ $json.company }}`.
- In Update Email Records, set Operation to `update` and map emails to `{{ $json.emails ? $json.emails + ", " + $('Extract Email Addresses').item.json.email : $('Extract Email Addresses').item.json.email }}`, and Company to `{{ $('Extract Email Addresses').item.json.company }}`.
- In Mark Status Finished, set Operation to `update`, map Client to `{{ $('Generate Site Variants').item.json.client }}`, and set Status to `Finished`.
Credential Required: Connect your googleSheetsOAuth2Api credentials to Mark Status Running and Flag Missing Data.
⚠️ Common Pitfall: The other Google Sheets nodes (Append Research Rows, Lookup Existing Emails, Update Email Records, Mark Status Finished) also require Google Sheets credentials, but none are configured. Add the same googleSheetsOAuth2Api credentials to each of them.
Step 3: Set Up Validation and Search Input Preparation
Validate incoming rows and build search inputs for the Serper query.
- In Input Validation, ensure the three conditions check for non-empty values: `{{ $json.Client }}`, `{{ $json.City }}`, and `{{ $json.State }}`.
- Confirm Input Validation routes valid rows to Mark Status Running and invalid rows to Flag Missing Data.
- In Assign Search Inputs, enable Keep Only Set and set fields: state to `{{ $('Sheets Row Trigger').item.json.State }}`, city to `{{ $('Sheets Row Trigger').item.json.City }}`, client to `{{ $('Sheets Row Trigger').item.json.Client }}`, business_type to `{{ $node["Sheets Row Trigger"].json["Business Type"] }}`, country to `Argentina`, country_code to `AR`, language to `es-419`, and result_count to `10`.
Step 4: Configure Search and Link Parsing
Query Serper and filter the organic results into company candidates.
- In Serper Search Request, set URL to `https://google.serper.dev/search` and Request Method to `POST`.
- Enable JSON Parameters and set Body Parameters JSON to `{ "q": "{{ $json.business_type }} in {{ $json.city }}, {{ $json.state }}, {{ $json.country }}", "num": {{ $json.result_count }}, "gl": "{{ $json.country_code }}", "hl": "{{ $json.language }}" }`.
- Credential Required: Connect your `httpHeaderAuth` credentials for the Serper API.
- Keep Parse Company Links as-is to filter out blacklisted results and map `company`, `Website`, `client`, `state`, and `city` values.
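If you're curious what that filtering step amounts to, here is a minimal sketch of a Parse Company Links-style Code node. The blacklist entries and the handling of the Serper response are assumptions for illustration; check them against the actual node in your copy of the workflow.

```javascript
// Sketch: filter a Serper.dev response down to likely company sites.
// The blacklist below is illustrative, not the workflow's exact list.
const BLACKLIST = ['yelp.com', 'facebook.com', 'linkedin.com', 'yellowpages.com'];

function parseCompanyLinks(serperResponse, context) {
  return (serperResponse.organic || [])
    // Drop directory and aggregator results whose domain is blacklisted
    .filter(result => !BLACKLIST.some(domain => result.link.includes(domain)))
    // Map each surviving result to the columns the data sheet expects
    .map(result => ({
      company: result.title,
      Website: result.link,
      client: context.client,
      state: context.state,
      city: context.city,
    }));
}
```

Extending `BLACKLIST` is the easiest way to keep directories you never want out of the sheet.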
Step 5: Generate URL Variants and Batch Processing
Create multiple contact/support URL variants and iterate through them for scraping.
- Keep Append Research Rows connected after Parse Company Links to log candidate companies before scraping.
- In Generate Site Variants, keep the JavaScript that builds multiple URL paths for each website.
- In Batch Iterator, leave the default batching options unless you need to control throughput.
- Confirm the flow: Append Research Rows → Generate Site Variants → Batch Iterator → Validate Page URLs.
⚠️ Common Pitfall: If your input sheet uses different column names (e.g., “website” vs “Website”), adjust the mapping in Append Research Rows and Generate Site Variants accordingly.
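The variant-generation logic can be sketched like this. The specific path list below is an assumption for illustration; the workflow's own Code node may try different paths.

```javascript
// Sketch: build candidate contact/support pages from a base website URL.
// The path list is illustrative; adjust it to the pages your niche uses.
const CANDIDATE_PATHS = ['', '/contact', '/contact-us', '/about', '/support'];

function generateSiteVariants(website) {
  // Normalize: strip trailing slashes so paths join cleanly
  const base = website.replace(/\/+$/, '');
  return CANDIDATE_PATHS.map(path => base + path);
}
```

Each variant then flows through Batch Iterator so the scraper tests one candidate page at a time.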
Step 6: Configure Scraping, Parsing, and Email Updates
Validate pages, scrape HTML, extract emails, and update existing records.
- In Validate Page URLs, set URL to `{{ $('Generate Site Variants').item.json.Website }}`.
- In Scrape Result Check, keep the condition that checks `{{ $json.error.message }}` is empty before continuing to scrape.
- In ScrapingBee Request, set URL to `https://app.scrapingbee.com/api/v1/?api_key=[CONFIGURE_YOUR_API_KEY]&url={{ $('Generate Site Variants').item.json.Website }}&render_js=true` and replace `[CONFIGURE_YOUR_API_KEY]` with your ScrapingBee key.
- In Scrape Success Check, keep the condition that checks `{{ $json.error.message }}` is empty before extracting emails.
- In Extract Email Addresses, keep the JavaScript that extracts and deduplicates emails from the HTML `data` field.
- In Email Presence Check, use the not-empty condition on `{{ $('Extract Email Addresses').item.json.email }}` to decide whether to update.
- Confirm the update flow: Email Presence Check → Lookup Existing Emails → Update Email Records → Delay Pause → Batch Iterator.
⚠️ Common Pitfall: The ScrapingBee URL includes a placeholder API key. If you leave [CONFIGURE_YOUR_API_KEY] unchanged, scraping will fail and the workflow will loop back via Batch Iterator.
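As a reference point, an email-extraction Code node typically boils down to something like this. The regex is a common pragmatic pattern, not necessarily the workflow's exact one.

```javascript
// Sketch: extract and deduplicate email addresses from scraped HTML.
// The regex is a widely used approximation, not a full RFC 5322 parser.
function extractEmails(html) {
  const matches = html.match(/[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g) || [];
  // Lowercase, dedupe, and join in the comma-separated format the sheet expects
  return [...new Set(matches.map(email => email.toLowerCase()))].join(', ');
}
```

Lowercasing before deduplication matters: `Info@acme.com` and `info@acme.com` should count as one contact, not two.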
Step 7: Test and Activate Your Workflow
Verify the full enrichment pipeline and then turn it on for production updates.
- Manually run the workflow with a test row update in the input sheet and ensure Sheets Row Trigger fires.
- Check that Mark Status Running updates the input sheet status to `Running`, or that Flag Missing Data updates it to `Missing data` when fields are empty.
- Verify Append Research Rows appends results to the data sheet and Update Email Records writes email values.
- Confirm Mark Status Finished sets the final status to `Finished` after batch processing.
- Activate the workflow by toggling the Active switch in n8n.
Watch Out For
- Google Sheets credentials can expire or need specific permissions. If things break, check the n8n credential connection and your Google Cloud OAuth consent/settings first.
- If you’re using Wait nodes or external rendering, processing times vary. Bump up the wait duration if downstream nodes fail on empty responses.
- The search inputs default to Argentina (`AR`, `es-419`). Update country, country_code, and language in Assign Search Inputs before running leads from other regions, or your results will skew badly.
Common Questions
How long does this take to set up?
About 30 minutes if your API keys and Google Sheets access are ready.
Can I build this without coding skills?
Yes. You won’t write code, but you will need to map your sheet columns and paste a couple of API keys into n8n.
Is n8n free to use?
Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You’ll also need to factor in Serper.dev and ScrapingBee usage (both have free tiers, then usage-based pricing).
Where should I host n8n?
Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.
How can I customize this workflow?
Start by making country, country_code, language, and result_count come from columns in your sheet, so each row can control how it searches. You can also expand the blacklist logic in the “Generate Site Variants” / filtering code to avoid directories you hate. If you want more than emails, extend the “Extract Email Addresses” code to capture phones or social links and write them back as new columns.
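As an example of that kind of extension, a hypothetical `extractContacts` function could return phone numbers alongside emails. Both regexes here are illustrative, not the workflow's own:

```javascript
// Sketch: extend extraction to capture phone-like numbers as well as emails.
// The phone regex is deliberately loose and will need tuning per region.
function extractContacts(html) {
  const emails = html.match(/[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g) || [];
  const phones = html.match(/\+?\d[\d\s().-]{7,}\d/g) || [];
  return {
    emails: [...new Set(emails.map(e => e.toLowerCase()))].join(', '),
    phones: [...new Set(phones.map(p => p.trim()))].join(', '),
  };
}
```

You would then map the new `phones` field to an extra column in Update Email Records.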
Why isn’t my Google Sheet updating?
Usually it’s expired Google OAuth credentials or the wrong Google account connected to the n8n credential. Reconnect Google Sheets in n8n, then confirm the sheet is shared with that account and the correct spreadsheet is selected. If only updates fail, check that your column names match what the workflow expects (including the Activate field).
How many leads can this process?
If you self-host, there’s no execution cap (it’s mainly your server and API limits). On n8n Cloud, capacity depends on your plan’s monthly executions. Practically, this workflow is gated by Serper.dev and ScrapingBee rate limits, so most teams run it in batches of a few dozen to a few hundred leads at a time.
Is n8n better than Zapier or Make for this?
Often, yes, because this flow relies on branching logic (multiple checks), looping through candidate URLs, and code-based parsing, which gets awkward and pricey in many no-code tools. n8n handles split-in-batches loops cleanly, and you can self-host for unlimited executions. Zapier or Make can still be fine if your process is “search once, store one result,” but this workflow is built for real-world messiness: duplicates, bad URLs, and multiple pages per company. One more thing: keeping status fields like Running and Finished inside Google Sheets makes ops handoffs easier, and n8n fits that pattern well. If you’re on the fence, talk to an automation expert and we’ll sanity-check your setup.
Once this is running, your sheet stops being a wish list and starts being an outreach queue. The workflow handles the repetitive digging so you can focus on the message and the offer.
Need Help Setting This Up?
Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.