Google Sheets + Gemini: article summaries you can reuse
Your research process probably looks like this: a spreadsheet full of links, a dozen open tabs, and “I’ll summarize this later” turning into “Where did I read that?” by Friday. Copy-paste notes drift. Titles get shortened. Sources go missing.
This Sheets-plus-Gemini summary setup is aimed first at marketing leads and content strategists, but founders running lean and analysts doing weekly scans feel the pain too. You will turn raw URLs into consistent, structured summaries that are actually searchable and reusable.
Below is the exact n8n workflow approach: pull URLs from Google Sheets, extract the main text, let Gemini produce a clean JSON summary, then write everything back into an “output” tab you can filter, tag, and reuse.
How This Automation Works
The full n8n workflow, from trigger to final output:
n8n Workflow Template: Google Sheets + Gemini: article summaries you can reuse
```mermaid
flowchart LR
subgraph sg0["When clicking ‘Execute workflow’ Flow"]
direction LR
n0@{ icon: "mdi:play-circle", form: "rounded", label: "When clicking ‘Execute workflow’", pos: "b", h: 48 }
n1@{ icon: "mdi:cog", form: "rounded", label: "Decodo", pos: "b", h: 48 }
n2@{ icon: "mdi:code-braces", form: "rounded", label: "Code in JavaScript", pos: "b", h: 48 }
n3@{ icon: "mdi:robot", form: "rounded", label: "AI Agent", pos: "b", h: 48 }
n4@{ icon: "mdi:brain", form: "rounded", label: "Google Gemini Chat Model", pos: "b", h: 48 }
n5@{ icon: "mdi:code-braces", form: "rounded", label: "Code in JavaScript1", pos: "b", h: 48 }
n6@{ icon: "mdi:database", form: "rounded", label: "Get row(s) in sheet", pos: "b", h: 48 }
n7@{ icon: "mdi:database", form: "rounded", label: "Append row in sheet", pos: "b", h: 48 }
n8@{ icon: "mdi:swap-vertical", form: "rounded", label: "Loop Over Items", pos: "b", h: 48 }
n0 --> n6
n6 --> n8
n8 --> n1
n1 --> n2
n2 --> n3
n3 --> n5
n5 --> n7
n7 --> n8
n4 -.-> n3
end
%% Styling
classDef trigger fill:#e8f5e9,stroke:#388e3c,stroke-width:2px
classDef ai fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
classDef aiModel fill:#e8eaf6,stroke:#3f51b5,stroke-width:2px
classDef decision fill:#fff8e1,stroke:#f9a825,stroke-width:2px
classDef database fill:#fce4ec,stroke:#c2185b,stroke-width:2px
classDef api fill:#fff3e0,stroke:#e65100,stroke-width:2px
classDef code fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
classDef disabled stroke-dasharray: 5 5,opacity: 0.5
class n0 trigger
class n3 ai
class n4 aiModel
class n6,n7 database
class n2,n5 code
```
The Problem: Research Notes Become Unusable Fast
Saving links is easy. Turning them into something you can use next week is the hard part. You read an article, grab a few lines, then the next one pulls you away. Later, you can’t remember which source said what, and the same ideas get “rediscovered” in meetings like they’re new. Meanwhile, someone asks for a quick roundup, and you are stuck re-opening tabs and trying to reconstruct your thinking from scattered highlights. It’s not just slow. It’s mentally exhausting.
The friction compounds. A messy link list turns into messy decisions.
- Manual summaries vary every time, so comparing articles side-by-side becomes guesswork.
- Key details like publication date and domain often get skipped, which makes your “source of truth” not very trustworthy.
- Long-form pages waste your time because you have to hunt for the real point buried in the middle.
- When the list grows past about 30 links, you stop using it and start over in a new doc.
The Solution: Structured Summaries Written Back to Sheets
This workflow starts with a simple input: a Google Sheet named “input” with one column called url. When you run it, n8n pulls each row, visits the page, and uses Decodo to extract the main text (so you are not summarizing navigation menus and cookie banners). That cleaned text is passed to an AI Agent connected to the Gemini Chat Model, which returns a structured response in JSON. Finally, n8n parses that JSON and appends a new row into a second sheet named “output” with the fields you actually need: title, source, published date (when available), main topic, three key ideas, a short summary, and the text type.
The workflow begins when you manually start it in n8n. It loops through every URL in the input tab, extracts the readable page text, then asks Gemini to produce consistent fields you can sort and reuse. When it’s done, your “output” tab becomes a living research database instead of a link graveyard.
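For reference, one parsed summary might look like the JSON below (illustrative values; the key names mirror the columns described in this article, though `text_type` is an assumed name since the exact schema depends on your agent prompt):

```json
{
  "title": "Example Article Title",
  "source": "example.com",
  "published_date": "2024-05-12",
  "main_topic": "Content marketing",
  "three_key_insights": ["First key idea", "Second key idea", "Third key idea"],
  "short_summary": "A one- or two-sentence recap of what the article argues.",
  "text_type": "blog post"
}
```

Each of these keys becomes one column in the "output" tab, which is what makes the sheet filterable later.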
What You Get: Automation vs. Results
| What This Workflow Automates | Results You'll Get |
|---|---|
| Reading each URL from the "input" tab and fetching the page | No more re-opening a dozen tabs to reconstruct your notes |
| Extracting the main article text with Decodo | Summaries based on the content, not navigation menus and cookie banners |
| Asking Gemini for a structured JSON summary | Consistent fields (title, source, topic, key ideas) you can compare side by side |
| Appending one clean row per article to the "output" tab | A searchable, filterable research library that stays usable as it grows |
Example: What This Looks Like
Say you collect 20 links for a weekly newsletter. Manually, even a quick process is maybe 10 minutes per link between reading, pulling takeaways, and pasting notes, so that’s about 3 hours. With this workflow, you paste the 20 URLs into the “input” tab, run n8n, and wait while it processes them (often around 20 minutes total, depending on the sites). You get a filled “output” tab with titles, sources, and key ideas ready to skim, which means the “writing” part starts sooner.
What You’ll Need
- n8n instance (try n8n Cloud free)
- Self-hosting option if you prefer (Hostinger works well)
- Google Sheets for the input and output tabs.
- Decodo to extract the article’s main text.
- Decodo API key (get it from your Decodo dashboard).
Skill level: Beginner. You’re mainly connecting accounts and pasting an API key, plus light prompt tweaking if you want a specific format.
Don’t want to set this up yourself? Talk to an automation expert (free 15-minute consultation).
How It Works
You run it when you’re ready. The workflow uses a manual start trigger, so you can collect links all day (or all week) and then process them in one clean batch.
Sheets provides the queue. n8n reads the “input” sheet and loops over each row using a batch iterator, which keeps the run stable even when you have lots of URLs.
Decodo extracts the readable text. For each link, the workflow fetches the page, strips out the clutter, and produces a text version that’s much easier for an AI model to summarize well.
Gemini turns text into structured fields. The AI Agent asks the Gemini Chat Model for a JSON response containing title, source, published date when available, main topic, three key ideas, a short summary, and the content type. Then n8n parses that JSON so it can be saved reliably.
Results land back in Google Sheets. A new row is appended to the “output” sheet for each URL, which means you can filter, search, and build a repeatable research library.
You can easily modify the output fields to match your workflow, like adding “Audience” or “Action Items” based on your needs. See the full implementation guide below for customization options.
Step-by-Step Implementation Guide
Step 1: Configure the Manual Trigger
Set up the workflow to start manually so you can validate the input data and downstream processing.
- Add or select the Manual Execution Start node as the trigger.
- Leave the node parameters empty (this trigger has no required fields).
- Confirm the connection flows from Manual Execution Start to Retrieve Sheet Rows.
Step 2: Connect Google Sheets
Configure the input and output sheets used to read URLs and store the summarized results.
- Open Retrieve Sheet Rows and set Document ID to your spreadsheet, replacing `[YOUR_ID]`.
- Set Sheet Name to the input tab by selecting `input` (value `gid=0`).
- Credential Required: Connect your Google Sheets credentials in Retrieve Sheet Rows (no credentials are currently configured).
- Open Append Sheet Row and set Document ID to the same spreadsheet, replacing `[YOUR_ID]`.
- Set Sheet Name to the output tab by selecting `output` (value `60764768`).
- Confirm the column mappings in Append Sheet Row use expressions like `{{ $json.url }}`, `{{ $json.short_summary }}`, and `{{ $json.published_date }}`.
- Credential Required: Connect your Google Sheets credentials in Append Sheet Row (no credentials are currently configured).
Step 3: Set Up Scraping and Text Processing
Fetch page content and convert it into clean text that the AI agent can analyze.
- Open Iterate Records to ensure it receives data from Retrieve Sheet Rows and loops through each URL.
- Configure Web Scrape Service with Operation set to `universal`.
- Credential Required: Connect your Decodo credentials in Web Scrape Service (no credentials are currently configured).
- Review Transform HTML Text to confirm it extracts `url`, `fuente`, `titulo`, `article_text`, and `fecha_guardado` from the scrape output.
- Confirm the execution flow: Iterate Records → Web Scrape Service → Transform HTML Text.
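The text-cleanup step largely boils down to tag stripping and whitespace normalization. Here is a minimal sketch of what such a Code node could look like; the `htmlToCleanText` helper and its regex-based cleanup are assumptions for illustration, and the real node also maps fields like `fuente` and `fecha_guardado` alongside `article_text`:

```javascript
// Hypothetical sketch of an n8n Code node that turns raw scraped HTML
// into plain text an LLM can summarize. Regex cleanup is a rough
// approximation; a production node might use a proper HTML parser.
function htmlToCleanText(html) {
  return html
    .replace(/<script[\s\S]*?<\/script>/gi, ' ') // drop inline scripts
    .replace(/<style[\s\S]*?<\/style>/gi, ' ')   // drop inline styles
    .replace(/<[^>]+>/g, ' ')                    // strip remaining tags
    .replace(/&nbsp;/g, ' ')                     // basic entity cleanup
    .replace(/\s+/g, ' ')                        // collapse whitespace
    .trim();
}

// In n8n, the Code node would map this over incoming items, e.g.:
// return items.map(item => ({
//   json: { ...item.json, article_text: htmlToCleanText(item.json.content) }
// }));
```

This keeps the token count down and stops Gemini from summarizing boilerplate instead of the article body.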
Step 4: Configure the AI Analysis
Set up the agent prompt and connect the Gemini model to produce structured JSON summaries.
- Open LLM Analysis Agent and keep Prompt Type set to `define`.
- Ensure the Text field contains the full JSON schema and uses expressions like `{{ $json["url"] }}`, `{{ $json["title"] }}`, and `{{ $json.fecha_guardado }}`.
- Confirm the system message is present in LLM Analysis Agent to enforce JSON-only output.
- Open Gemini Chat Model and connect it as the language model for LLM Analysis Agent.
- Credential Required: Connect your Google Gemini credentials in Gemini Chat Model (no credentials are currently configured). Add credentials to the parent model node, not the agent.
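If you end up rewriting the system message, a minimal version along these lines works (illustrative wording, not the template's exact text):

```text
You are a research assistant. Summarize the article text you receive and
respond with ONLY a valid JSON object: no markdown fences, no commentary.
Use exactly these keys: title, source, published_date, main_topic,
three_key_insights (an array of three strings), short_summary, text_type.
If the published date cannot be found, use "unknown".
```

The "JSON only, no fences" instruction matters because the next step parses the reply programmatically, and stray prose breaks the parse.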
Step 5: Configure Output Parsing and Storage
Parse the AI output and append structured data to the output sheet, then loop to the next URL.
- Review Parse LLM JSON to ensure it parses `item.json.output` and outputs fields like `title`, `source`, and `short_summary`.
- Confirm the flow from LLM Analysis Agent → Parse LLM JSON → Append Sheet Row.
- Verify Append Sheet Row maps all required fields, including `{{ $json.main_topic }}` and `{{ $json.three_key_insights }}`.
- Check that Append Sheet Row loops back to Iterate Records to continue processing remaining URLs.
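The parsing step can be as small as a fence-stripping wrapper around `JSON.parse`. A minimal sketch, assuming the agent's reply arrives in `item.json.output` as described above (the `parseAgentOutput` name is hypothetical):

```javascript
// Hypothetical sketch of a "Parse LLM JSON" Code node. Models sometimes
// wrap JSON in markdown fences despite instructions, so strip those
// before parsing.
function parseAgentOutput(raw) {
  const cleaned = raw
    .trim()
    .replace(/^`{3}(?:json)?\s*/i, '') // drop a leading ``` or ```json fence
    .replace(/`{3}\s*$/, '')           // drop a trailing ``` fence
    .trim();
  return JSON.parse(cleaned);
}

// In n8n, the Code node would map this over incoming items, e.g.:
// return items.map(item => ({ json: parseAgentOutput(item.json.output) }));
```

If the model ever returns invalid JSON, this throws and stops the run, which is usually what you want: you notice the bad row instead of silently saving garbage to the sheet.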
Step 6: Test and Activate Your Workflow
Run a manual test to confirm the full chain works end-to-end, then enable for production use.
- Click Execute Workflow and verify Manual Execution Start triggers the run.
- Check that Retrieve Sheet Rows outputs rows from your input sheet.
- Confirm Web Scrape Service returns content and Transform HTML Text outputs clean text fields.
- Validate LLM Analysis Agent returns a JSON-only response, and Parse LLM JSON outputs structured fields.
- Verify Append Sheet Row adds a new row to the output sheet with the mapped values.
- When satisfied, switch the workflow to Active to use it in production (still manual-triggered unless you replace the trigger).
Common Gotchas
- Google Sheets credentials can expire or need specific permissions. If things break, check your n8n Credentials panel and confirm the connected Google account still has access to both “input” and “output” sheets first.
- Processing times vary by site. If you add Wait nodes or the target pages render slowly, bump up the wait duration so downstream nodes don’t fail on empty responses.
- Default prompts in AI nodes are generic. Add your brand voice early or you’ll be editing outputs forever.
Frequently Asked Questions
How long does setup take?
About 30 minutes if your Google account is already connected.

Do I need to know how to code?
No. You’ll connect Google Sheets, paste your Decodo API key, and adjust the AI prompt if you want different fields.

Is it free to run?
Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You’ll also need to factor in Decodo usage plus your Gemini API costs.

Where should I host n8n?
Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.

Can I customize the output fields?
Yes, and it’s the part most teams should tweak. Change the fields requested in the AI Agent prompt, then mirror those fields in the “Parse LLM JSON” code step so n8n can reliably map them into columns. Common customizations include adding a “Category” column, saving a one-line “Newsletter hook,” and forcing a specific tone for the summary. If you want the published date to be stricter, you can tell Gemini to return “unknown” when it can’t confidently find one.

Why isn’t anything showing up in my output sheet?
Usually it’s an expired Google auth token in n8n. Reconnect the Google Sheets credential and confirm the same account can open both the input and output spreadsheets. Also check that the sheet names match exactly (“input” and “output”), because a small mismatch can look like a permissions issue.

How many links can it process in one run?
A few hundred links in a run is realistic on a typical setup, and self-hosting can scale further if your server is sized well.

Is n8n better than Zapier or Make for this?
Often, yes, because scraping + text cleanup + structured AI output is not a simple two-step Zap. n8n handles looping, branching, and “parse JSON then map to columns” logic cleanly, and you can self-host for high volume without paying per task. Zapier or Make can still work if you only summarize a small number of links and you want the quickest possible setup. The real deciding factor is control: if you care about consistent fields and reliable parsing, n8n tends to feel less fragile. If you want help choosing, Talk to an automation expert.
Once this is running, your “research” stops being a pile of tabs and starts behaving like an asset. Set it up, feed it links, and let the sheet do its job.
Need Help Setting This Up?
Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.