Bright Data + Google Sheets: research in a cell
Manual web research in a spreadsheet is a special kind of frustrating. You open ten tabs, copy a few lines, paste them back, then realize the next row needs the same thing… and you do it all again.
This is the kind of mess that slows down market researchers first, but e-commerce operators tracking prices and growth teams doing lead lists feel it too. With Bright Data research automation inside Google Sheets, you get consistent answers per row without the tab hopping, and it usually saves a few minutes per lookup.
Below is how the workflow turns a simple spreadsheet formula into a fast research “assistant”, what you need to run it, and where teams usually tweak it for their own use.
How This Automation Works
The full n8n workflow, from trigger to final output:
n8n Workflow Template: Bright Data + Google Sheets: research in a cell
flowchart LR
subgraph sg0["Bright Data Search A Flow"]
direction LR
n0["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/webhook.dark.svg' width='40' height='40' /></div><br/>Respond to Webhook"]
n1@{ icon: "mdi:robot", form: "rounded", label: "Bright Data Search Agent", pos: "b", h: 48 }
n2@{ icon: "mdi:wrench", form: "rounded", label: "Bright Data MCP", pos: "b", h: 48 }
n3["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/webhook.dark.svg' width='40' height='40' /></div><br/>Webhook Call"]
n4@{ icon: "mdi:robot", form: "rounded", label: "Adjust Query Agent", pos: "b", h: 48 }
n5["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/httprequest.dark.svg' width='40' height='40' /></div><br/>Bright Data - Data Extraction"]
n6@{ icon: "mdi:robot", form: "rounded", label: "Extract Data", pos: "b", h: 48 }
n7@{ icon: "mdi:robot", form: "rounded", label: "Summarize Information", pos: "b", h: 48 }
n8@{ icon: "mdi:cog", form: "rounded", label: "Update Logs", pos: "b", h: 48 }
n9@{ icon: "mdi:robot", form: "rounded", label: "Structured Output Parser - 1", pos: "b", h: 48 }
n10@{ icon: "mdi:robot", form: "rounded", label: "Structured Output Parser - 2", pos: "b", h: 48 }
n11@{ icon: "mdi:brain", form: "rounded", label: "GPT 4o Mini - 1", pos: "b", h: 48 }
n12@{ icon: "mdi:brain", form: "rounded", label: "GPT 4o Mini - 2", pos: "b", h: 48 }
n13@{ icon: "mdi:brain", form: "rounded", label: "GPT 4o - 1", pos: "b", h: 48 }
n14@{ icon: "mdi:brain", form: "rounded", label: "GPT 4.1 Mini - 1", pos: "b", h: 48 }
n15@{ icon: "mdi:robot", form: "rounded", label: "Structured Output Parser - 3", pos: "b", h: 48 }
n16@{ icon: "mdi:swap-vertical", form: "rounded", label: "Set Variables", pos: "b", h: 48 }
n13 -.-> n1
n6 --> n7
n3 --> n16
n16 --> n4
n2 -.-> n1
n11 -.-> n6
n12 -.-> n7
n14 -.-> n4
n4 --> n1
n7 --> n0
n7 --> n8
n1 --> n5
n9 -.-> n1
n10 -.-> n6
n15 -.-> n7
n5 --> n6
end
%% Styling
classDef trigger fill:#e8f5e9,stroke:#388e3c,stroke-width:2px
classDef ai fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
classDef aiModel fill:#e8eaf6,stroke:#3f51b5,stroke-width:2px
classDef decision fill:#fff8e1,stroke:#f9a825,stroke-width:2px
classDef database fill:#fce4ec,stroke:#c2185b,stroke-width:2px
classDef api fill:#fff3e0,stroke:#e65100,stroke-width:2px
classDef code fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
classDef disabled stroke-dasharray: 5 5,opacity: 0.5
class n1,n4,n6,n7,n9,n10,n15 ai
class n11,n12,n13,n14 aiModel
class n2 ai
class n0,n3,n5 api
classDef customIcon fill:none,stroke:none
class n0,n3,n5 customIcon
The Problem: Web Research Doesn’t Scale Past 10 Rows
If you’ve ever tried to “just research it quickly” from inside a spreadsheet, you know how it goes. You search Google, skim a few results, open pages that may or may not load, and paste a half-relevant snippet into a cell. Then you repeat for the next row. After 20 rows, you’re not researching anymore, you’re doing clerical work. Worse, results end up inconsistent because you change your phrasing, click different sources, or forget what you did three minutes ago. Small errors creep in, and suddenly your sheet looks complete but can’t be trusted.
The friction compounds. Here’s where it breaks down.
- Each lookup steals about 3–5 minutes once you include searching, reading, and pasting notes.
- Two people can research the same thing and get totally different answers, which makes reviews and QA painful.
- Spreadsheets become a graveyard of half-sourced notes because nobody has time to standardize formatting.
- Bot blocks and “access denied” pages waste time, especially when you’re checking lots of sites repeatedly.
The Solution: Bright Data Research That Runs From a Sheet Cell
This workflow turns Google Sheets into a lightweight research console. You type a custom function like =BRIGHTDATA("C3", "What is the current price of the product?") and the sheet sends a secure request to n8n. From there, an AI agent refines your query so it’s specific enough to fetch the right information, then Bright Data scrapes the relevant pages (including sites that tend to block basic scrapers). A second AI pass filters what came back, extracts the useful details, and composes a clean plain-text answer. Finally, n8n replies directly to the webhook so Google Sheets can drop the result into the cell. No copy-paste. No tab juggling. Honestly, it feels like cheating once it’s working.
The workflow starts when the Apps Script function sends a POST request from your spreadsheet. AI improves the query, Bright Data retrieves the page content, and AI summarizes it into a tight response. Then the workflow logs the request for monitoring and returns text to your sheet in under 25 seconds.
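To make the Sheets side concrete, here is a minimal sketch of what the Apps Script custom function could look like. The workflow ships its own snippet, so treat the URL, header name, and function names here as placeholders; the payload field names follow Step 2 of the guide below, where the question arrives as source and the cell reference as prompt.

```javascript
// Sketch only: the real snippet comes with the workflow. buildPayload is
// kept pure so the mapping can be tested outside of Google Sheets.
function buildPayload(cellRef, question) {
  // Per Step 2 of the guide, n8n maps body.source -> userPrompt and
  // body.prompt -> cellReference, so the question travels as "source".
  return { source: question, prompt: cellRef };
}

/**
 * Custom function: =BRIGHTDATA("C3", "What is the current price?")
 * Sends the prompt to the n8n webhook and returns the plain-text answer.
 */
function BRIGHTDATA(cellRef, question) {
  var url = 'https://YOUR-N8N-HOST/webhook/brightdata-search'; // placeholder
  var options = {
    method: 'post',
    contentType: 'application/json',
    headers: { 'X-Auth-Key': 'YOUR_HEADER_AUTH_VALUE' }, // placeholder header
    payload: JSON.stringify(buildPayload(cellRef, question)),
  };
  // UrlFetchApp is the Apps Script HTTP client; the response body is the
  // plain-text summary that lands in the cell.
  return UrlFetchApp.fetch(url, options).getContentText();
}
```

Remember that Sheets caps custom functions at roughly 30 seconds, which is why the workflow aims to finish in about 20.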
What You Get: Automation vs. Results
| What This Workflow Automates | Results You’ll Get |
|---|---|
| Query refinement, web search, scraping, extraction, and summarization for each lookup | A clean plain-text answer dropped straight into the cell, usually in about 20 seconds |
| Logging every request for monitoring | Consistent, reviewable answers across rows and teammates instead of half-sourced notes |
Example: What This Looks Like
Say you’re building a competitor sheet with 40 products, and you want a current price note for each one. Manually, even a “fast” lookup takes about 4 minutes once you open results, confirm the number, and paste a clean note, so you’re looking at roughly 2.5 hours. With this workflow, you fill a column with =BRIGHTDATA() formulas, wait about 20 seconds per row, and let it run while you work on something else. You still review the outputs, but the busywork is gone.
What You’ll Need
- n8n instance (try n8n Cloud free)
- Self-hosting option if you prefer (Hostinger works well)
- Bright Data for scraping web pages reliably.
- Google Sheets to run the custom BRIGHTDATA() function.
- OpenAI API key (get it from the OpenAI API dashboard).
Skill level: Intermediate. You’ll paste a short Apps Script snippet, add a couple API keys, and test a webhook.
Don’t want to set this up yourself? Talk to an automation expert (free 15-minute consultation).
How It Works
A spreadsheet formula triggers the request. Google Sheets runs an Apps Script function that sends your prompt (and the active cell context) to an n8n webhook using header authentication.
Your input gets cleaned and structured. n8n assigns the incoming fields (prompt, source, and context like spreadsheet ID and cell address), which keeps everything predictable for the AI and scraping steps.
AI improves the query, then the web gets scraped. A “refine query” agent uses an OpenAI chat model to tighten your wording, then Bright Data runs the scrape request so you get content even when sites try to block basic bots.
The workflow extracts and returns a plain-text answer. Another AI pass pulls out relevant details, composes a short summary, logs the run, and responds to the webhook so the text lands directly in your sheet cell.
You can easily modify the output format to return bullet points or a tighter “one-line” note based on your needs. See the full implementation guide below for customization options.
Step-by-Step Implementation Guide
Step 1: Configure the Webhook Trigger
Set up the inbound webhook that starts the workflow and passes the search prompt payload into the flow.
- Add and configure Incoming Webhook Trigger with Path set to brightdata-search, HTTP Method set to POST, and Response Mode set to responseNode.
- Set Authentication to headerAuth.
- Credential Required: Connect your httpHeaderAuth credentials in Incoming Webhook Trigger.
Note: make sure the request body includes source and prompt fields so downstream nodes can map inputs correctly.

Step 2: Map Incoming Inputs
Normalize the request payload into consistent fields for the AI agents and summarizers.
- In Assign Input Fields, set userPrompt to {{ $json.body.source }}.
- Set cellReference to {{ $json.body.prompt }}.
- Set ouputLanguage to Hebrew (or update to your preferred language).
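In plain code, this mapping amounts to the following (a sketch only; in n8n you configure this in the Assign Input Fields node's UI, and the ouputLanguage field name is spelled as it appears in the workflow):

```javascript
// Sketch of what the Assign Input Fields node does with the webhook payload.
// Field names match Step 2 of the guide; "ouputLanguage" is spelled as in
// the workflow itself.
function assignInputFields(webhookBody) {
  return {
    userPrompt: webhookBody.source,    // the user's question
    cellReference: webhookBody.prompt, // the referenced cell, e.g. "C3"
    ouputLanguage: 'Hebrew',           // change to your preferred language
  };
}
```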
Step 3: Set Up Query Refinement and Search
Use AI to refine the query, run a web search, and parse the best link.
- Configure Refine Query Agent with the prompt text:
  User prompt: {{ $json.userPrompt }}
  Prompt's referral: {{ $json.cellReference }}
- Ensure GPT-4.1 Mini Core is connected as the language model for Refine Query Agent. Credential Required: Connect your openAiApi credentials in GPT-4.1 Mini Core.
- Configure Bright Data Search Bot with its defined search prompt (keep the JSON-only output requirement intact).
- Connect GPT-4o Model Core as the language model for Bright Data Search Bot. Credential Required: Connect your openAiApi credentials in GPT-4o Model Core.
- Attach Bright Data MCP Tool as the tool for Bright Data Search Bot and set endpointUrl to https://mcp.brightdata.com/mcp?token=[CONFIGURE_YOUR_TOKEN]&pro=1.
- Attach Link JSON Parser as the output parser for Bright Data Search Bot with schema { "link": "" }.
Note: replace [CONFIGURE_YOUR_TOKEN] with your actual token or the search tool will fail.

Step 4: Configure Scraping and Detail Extraction
Scrape the selected source and extract only relevant content based on the user’s query.
- In Bright Data Scrape Request, set URL to https://api.brightdata.com/request and keep Method as POST.
- Set body parameters to include: zone mcp_unlocker, url {{ $json.output.link }}, format json, method GET, country il, and data_format markdown.
- Set the header Authorization to Bearer [CONFIGURE_YOUR_TOKEN] and replace with your Bright Data token.
- Configure Extract Relevant Details with the input text:
  ## Input.
  ### The user's original request:
  {{ $('Assign Input Fields').item.json.cellReference }} - {{ $('Assign Input Fields').item.json.userPrompt }}
  ### Full content scanned from a website:
  {{ $json.body }}
- Ensure Mini GPT Model A is connected as the language model for Extract Relevant Details. Credential Required: Connect your openAiApi credentials in Mini GPT Model A.
- Attach Summary JSON Parser as the output parser for Extract Relevant Details with schema { "summary": "" }.
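For reference, the scrape request the node sends can be sketched as a plain function that builds the fetch arguments (the token and link below are placeholders, and the builder is separated out so it can be inspected without actually calling the API):

```javascript
// Sketch of the Bright Data request that the Bright Data Scrape Request
// node issues. Token and link are placeholders; field values match Step 4.
function buildScrapeRequest(link, token) {
  return {
    url: 'https://api.brightdata.com/request',
    options: {
      method: 'POST',
      headers: {
        Authorization: 'Bearer ' + token,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        zone: 'mcp_unlocker',   // Bright Data zone used by this workflow
        url: link,              // the link chosen by the search agent
        format: 'json',
        method: 'GET',
        country: 'il',          // change to match your target market
        data_format: 'markdown' // markdown is easier for the AI to digest
      }),
    },
  };
}
```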
Note: if extraction comes back empty, confirm the scraped content actually lands in {{ $json.body }} and that the source page is accessible by Bright Data.

Step 5: Generate Final Summary and Configure Outputs
Compose the final response and send it back to the requester while logging the output.
- Configure Compose Summary Output with the prompt:
  scraping summary information: {{ $json.output.summary }}
  the actual user request/question: {{ $('Assign Input Fields').item.json.cellReference }} - {{ $('Assign Input Fields').item.json.userPrompt }}
- Ensure Mini GPT Model B is connected as the language model for Compose Summary Output. Credential Required: Connect your openAiApi credentials in Mini GPT Model B.
- Attach Final Summary Parser as the output parser for Compose Summary Output using the example schema { "summary": "Intel was founded in 1968." }.
- Configure Return Webhook Reply to respond with Respond With set to text and Response Body set to {{ $json.output.summary }}.
- Configure Append Log Records to write input_prompt as {{ $('Assign Input Fields').item.json.userPrompt }} - {{ $('Assign Input Fields').item.json.cellReference }} and output as {{ $json.output.summary }} into the Search Logs data table (replace [YOUR_ID] with your table ID).
- Confirm the parallel execution: Compose Summary Output sends its result to both Return Webhook Reply and Append Log Records in parallel.
Note: confirm the final parser returns a summary field and that Return Webhook Reply references {{ $json.output.summary }}.

Step 6: Test & Activate Your Workflow
Validate the end-to-end flow from webhook input to summary output, then activate for production use.
- Click Execute Workflow and send a POST request to the Incoming Webhook Trigger URL with a JSON body containing source and prompt.
- Confirm Bright Data Search Bot returns a JSON link and Bright Data Scrape Request receives content.
- Verify Return Webhook Reply responds with a concise summary text and Append Log Records writes a new row in your data table.
- Once validated, toggle the workflow to Active so it can receive production webhook requests.
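To exercise the webhook without touching the spreadsheet, you can fire a test request from a Node script. This is a sketch: the URL, header name, and auth value are placeholders you replace with your own, and the builder is split from the actual fetch call so the payload can be checked before anything is sent.

```javascript
// Sketch of a manual test call against the Incoming Webhook Trigger.
// Replace the URL and auth header value with your own before sending.
function buildTestRequest(webhookUrl, authValue) {
  return [webhookUrl, {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      'X-Auth-Key': authValue, // must match your headerAuth credential in n8n
    },
    body: JSON.stringify({
      source: 'What is the current price of the product?', // the question
      prompt: 'C3',                                        // the cell reference
    }),
  }];
}

// Example (uncomment to actually send the request):
// const [url, init] = buildTestRequest(
//   'https://YOUR-N8N-HOST/webhook/brightdata-search',
//   'YOUR_HEADER_AUTH_VALUE'
// );
// fetch(url, init).then(r => r.text()).then(console.log);
```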
Common Gotchas
- Bright Data credentials can expire or need specific permissions. If things break, check your Bright Data API token status in the Bright Data console first.
- If you’re using Wait nodes or external scraping, processing times vary. Bump up the wait duration if downstream nodes fail on empty responses.
- Default prompts in OpenAI nodes are generic. Add your brand voice early or you’ll be editing outputs forever.
Frequently Asked Questions
How long does setup take?
About 20 minutes if your accounts and API keys are ready.
Do I need to write code?
No. You’ll paste a provided Apps Script function and connect credentials in n8n.
Is there a free way to run this, and what does it cost?
Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You’ll also need to factor in Bright Data and OpenAI usage (this workflow is often around $0.02–0.05 per search in Bright Data, plus your OpenAI calls).
Should I use n8n Cloud or self-host?
Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.
Can I change the output format?
Yes, and it’s usually a quick change. Update the instructions in the “Compose Summary Output” agent so it returns exactly what you want (bullets, a single sentence, or a “Price: / Source: / Date:” format). If you want tighter parsing, adjust the “Summary JSON Parser” or “Final Summary Parser” so the model is forced into a consistent structure. Common tweaks include changing the output language, limiting the answer length, and prioritizing specific sources.
Why am I getting Bright Data authentication errors?
Usually it’s an invalid or expired Bright Data API key. Check the Bright Data console, regenerate the token if needed, then update the credentials used in the “Bright Data Scrape Request” step in n8n.
Are there limits on how many rows I can run?
It depends on how you run n8n and your budget. On n8n Cloud, your monthly executions are capped by plan, which matters if you fill hundreds of rows with formulas. If you self-host, there’s no platform execution limit, but Google Sheets still has a ~30-second ceiling per function call, so you want the workflow finishing in about 20 seconds. Practically, teams run this in batches (like 50–200 rows), then review results and rerun only the misses.
Why n8n instead of Zapier or Make?
For this workflow, n8n has a few advantages: more complex logic with unlimited branching at no extra cost, a self-hosting option for unlimited executions, and native AI agent patterns that are awkward (or expensive) elsewhere. Zapier or Make can be fine for simple two-step flows, but they’re not built around “a spreadsheet cell triggers web scraping + AI summarization” in one tight loop. Also, the webhook + Apps Script approach is straightforward to control, which matters when you’re running lots of rows. If you’re unsure, talk to an automation expert and describe your volume and use case.
Once this is in place, your sheet stops being a place where research goes to die. It becomes the place where research gets done, consistently, in minutes.
Need Help Setting This Up?
Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.