Wikipedia to Google Sheets, research notes ready
Research starts simple, then turns into a mess. You open five Wikipedia tabs, copy a few paragraphs, paste them somewhere “temporary,” and somehow lose the best source right when you need it.
This is the kind of problem that hits marketers building niche campaigns first, but content creators and small-team operators feel it too. With Wikipedia-to-Sheets automation, you can turn a topic into a clean summary plus a timeline row in Google Sheets in minutes, not a whole afternoon.
Below you’ll see how the workflow runs, what it produces, and how to use it responsibly for repeatable research you can actually reuse later.
How This Automation Works
The full n8n workflow, from trigger to final output:
n8n Workflow Template: Wikipedia to Google Sheets, research notes ready
```mermaid
flowchart LR
subgraph sg0["When clicking 'Execute Workflow' Flow"]
direction LR
n0@{ icon: "mdi:play-circle", form: "rounded", label: "When clicking 'Execute Workf..", pos: "b", h: 48 }
n1@{ icon: "mdi:swap-vertical", form: "rounded", label: "Set Topic", pos: "b", h: 48 }
n2["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/httprequest.dark.svg' width='40' height='40' /></div><br/>Wikipedia Search API"]
n3@{ icon: "mdi:cog", form: "rounded", label: "ScrapeOps Scraper", pos: "b", h: 48 }
n4@{ icon: "mdi:database", form: "rounded", label: "Append row in sheet", pos: "b", h: 48 }
n5@{ icon: "mdi:robot", form: "rounded", label: "Message a model", pos: "b", h: 48 }
n6["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/code.svg' width='40' height='40' /></div><br/>Extract History Section"]
n7["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/code.svg' width='40' height='40' /></div><br/>Format AI Output"]
n8["<div style='background:#f5f5f5;padding:10px;border-radius:8px;display:inline-block;border:1px solid #e0e0e0'><img src='https://flowpast.com/wp-content/uploads/n8n-workflow-icons/code.svg' width='40' height='40' /></div><br/>Construct Page URL"]
n1 --> n2
n5 --> n7
n7 --> n4
n3 --> n6
n8 --> n3
n2 --> n8
n6 --> n5
n0 --> n1
end
%% Styling
classDef trigger fill:#e8f5e9,stroke:#388e3c,stroke-width:2px
classDef ai fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
classDef aiModel fill:#e8eaf6,stroke:#3f51b5,stroke-width:2px
classDef decision fill:#fff8e1,stroke:#f9a825,stroke-width:2px
classDef database fill:#fce4ec,stroke:#c2185b,stroke-width:2px
classDef api fill:#fff3e0,stroke:#e65100,stroke-width:2px
classDef code fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
classDef disabled stroke-dasharray: 5 5,opacity: 0.5
class n0 trigger
class n5 ai
class n4 database
class n2 api
class n6,n7,n8 code
classDef customIcon fill:none,stroke:none
class n2,n6,n7,n8 customIcon
```
The Problem: Wikipedia Research Turns Into Copy-Paste Chaos
Wikipedia is great for getting oriented fast, but manual extraction is where momentum dies. You read a page, hunt for “History” or “Background,” then pull out dates and key events by hand. Next comes the copying, the formatting, and the second-guessing (“Did I grab the right section?”). A week later, you’re back in the same rabbit hole because the notes you saved aren’t structured, searchable, or consistent. Even worse, some teams try scraping directly and run into blocks, broken requests, or HTML that’s a pain to clean up.
It adds up fast. Here’s where it breaks down in real life:
- Finding the right Wikipedia page is not always one search, especially with similar names and disambiguation pages.
- Copying “just the useful part” still means skimming long sections and reformatting text into something your team can reuse.
- Dates and milestones usually end up as vague notes, which makes content planning and research audits frustrating later.
- Scraping without a proxy can trigger rate limits or IP blocks, so your “quick script” becomes a maintenance chore.
The Solution: Turn a Topic Into a Summary + Timeline Row
This n8n workflow takes a topic, finds the most relevant Wikipedia page, pulls the page content through ScrapeOps (so you’re less likely to get blocked), and extracts the most useful “History,” “Origins,” or “Background” section. Then it sends that section to an OpenAI chat model (GPT-4o-mini in the template) to generate two things you actually want: a concise summary and a structured timeline with key dates. Finally, it appends everything into Google Sheets as a new row, so your research lives in one place and stays consistent across topics. No messy copy-paste. No “where did we put that note?” moment.
The workflow starts with a manual launch trigger and a topic value you set. From there it queries Wikipedia’s API, builds the page URL, fetches the page via ScrapeOps, extracts the right section, and lets AI convert it into clean, spreadsheet-friendly output.
What You Get: Automation vs. Results
| What This Workflow Automates | Results You'll Get |
|---|---|
| Finding the right Wikipedia page via the search API and fetching it through ScrapeOps | Fewer wrong-page errors and no rate limits or IP blocks from scraping directly |
| Extracting the History/Origins/Background section and summarizing it with GPT-4o-mini | A concise summary plus a structured timeline with key dates for every topic |
| Appending the parsed fields to Google Sheets | One consistent, searchable research row per topic, ready to reuse |
Example: What This Looks Like
Say you’re researching 10 niche topics for next month’s content calendar. Manually, it’s easy to spend about 30 minutes per topic finding the right page, pulling the history section, and turning it into a usable summary plus a few dated milestones, so roughly 5 hours total. With this workflow, you launch the run, wait for scraping and AI output, and the row lands in Google Sheets; call it about 10 minutes of hands-on time per topic. That’s roughly 3 to 4 hours back for actual planning and writing.
What You’ll Need
- n8n instance (try n8n Cloud free)
- Self-hosting option if you prefer (Hostinger works well)
- Google Sheets for storing summaries and timelines
- ScrapeOps Proxy API to fetch Wikipedia pages reliably
- OpenAI API key (get it from your OpenAI dashboard)
Skill level: Intermediate. You’ll connect accounts, paste API keys, and be comfortable editing a couple of nodes (topic input and sheet mapping).
Don’t want to set this up yourself? Talk to an automation expert (free 15-minute consultation).
How It Works
You set the topic and launch it. The workflow begins with a manual trigger, then assigns a topic value (your keyword) so every run is focused on one subject.
Wikipedia is queried, then the right page is chosen. n8n sends an HTTP request to Wikipedia’s API to find the best match, then builds a clean page URL from the result. This reduces “wrong page” errors before scraping even starts.
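If you're curious what the URL-building step does inside, here's a minimal Code-node sketch. It assumes the field names used later in this guide (topic, wikipedia_page_title, wikipedia_page_url, search_query_url); the template's actual code may differ:

```javascript
// Minimal sketch of a "Build Page URL" Code node (not the template's exact code).
// Input: the JSON response from Wikipedia's search API.
const topic = $('Assign Topic Value').first().json.topic;
const results = $json.query?.search ?? [];

if (results.length === 0) {
  throw new Error(`No Wikipedia search results for topic: ${topic}`);
}

// Take the top-ranked result and turn its title into a canonical page URL.
const title = results[0].title;
const slug = encodeURIComponent(title.replace(/ /g, '_'));

return [{
  json: {
    topic,
    wikipedia_page_title: title,
    wikipedia_page_url: `https://en.wikipedia.org/wiki/${slug}`,
    search_query_url: `https://en.wikipedia.org/w/api.php?action=query&list=search&srsearch=${encodeURIComponent(topic)}&format=json`,
  },
}];
```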
Scraping happens through ScrapeOps, not your own IP. Instead of pulling HTML directly, the workflow uses the ScrapeOps node to fetch the page content more reliably. That’s the difference between “works today” and “works whenever you need it.”
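Under the hood, a proxy fetch looks roughly like the standalone sketch below; the endpoint and parameter names follow ScrapeOps' proxy API docs, but verify them against your plan before relying on this:

```javascript
// Rough illustration of a ScrapeOps proxy fetch (the n8n node does this for you).
// Requires Node.js 18+ for the global fetch; the API key belongs in credentials.
const params = new URLSearchParams({
  api_key: process.env.SCRAPEOPS_API_KEY ?? '',
  url: 'https://en.wikipedia.org/wiki/Example', // the page URL built in the previous step
  render_js: 'true', // render JavaScript, matching the node's advancedOptions
});

const response = await fetch(`https://proxy.scrapeops.io/v1/?${params.toString()}`);
if (!response.ok) {
  throw new Error(`ScrapeOps returned ${response.status}`);
}
const html = await response.text(); // raw page HTML, ready for extraction
```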
AI turns a long section into structured output. A code step extracts the “History/Origins/Background” segment, then the OpenAI chat model generates a concise summary and a timeline with key dates. Another code step parses that response into fields that fit neatly into Google Sheets.
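Here's a simplified version of the extraction idea, assuming the page HTML arrives in $json.body (the template's parser is more thorough, and the real field name may differ):

```javascript
// Simplified sketch of a history-section extractor, not the template's exact parser.
const html = $json.body ?? '';
const wanted = ['History', 'Origins', 'Background'];

let historyRaw = '';
// Split the page at each <h2> and look for a section whose heading matches.
const sections = html.split(/<h2\b/i).slice(1);
for (const section of sections) {
  const headingEnd = section.indexOf('</h2>');
  if (headingEnd === -1) continue;
  const headingText = section.slice(0, headingEnd);
  if (wanted.some((h) => headingText.includes(h))) {
    // Strip tags and collapse whitespace into readable plain text.
    historyRaw = section
      .slice(headingEnd + '</h2>'.length)
      .replace(/<[^>]+>/g, ' ')
      .replace(/\s+/g, ' ')
      .trim();
    break;
  }
}

return [{ json: { history_raw: historyRaw } }];
```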
You can easily modify the topic input and the sheet columns to match your planning style. See the full implementation guide below for customization options.
Step-by-Step Implementation Guide
Step 1: Configure the Manual Trigger
Start the workflow with a manual trigger and define the topic that will be used to query Wikipedia.
- Add the Manual Launch Trigger node as the workflow trigger.
- Open Assign Topic Value and add a field named `topic` with the string value `n8n`.
- Connect Manual Launch Trigger → Assign Topic Value.
Step 2: Connect Wikipedia Search and Page Fetching
Query Wikipedia’s API, build a page URL, then fetch the full page HTML.
- Open Wikipedia Query Request and set URL to `https://en.wikipedia.org/w/api.php`.
- In Wikipedia Query Request, set Query Parameters: action = `query`, list = `search`, srsearch = `={{ $json.topic }}`, format = `json`.
- In Wikipedia Query Request, set Header Parameters → User-Agent to `n8n-workflow/1.0 ([YOUR_EMAIL])`.
- Connect Assign Topic Value → Wikipedia Query Request → Build Page URL.
- Open ScrapeOps Page Fetcher and set URL to `={{ $json.wikipedia_page_url }}`.
- In ScrapeOps Page Fetcher, enable `render_js` (already set in `advancedOptions`).
- Credential Required: Connect your scrapeOpsApi credentials in ScrapeOps Page Fetcher.
- Connect Build Page URL → ScrapeOps Page Fetcher.
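Before wiring up the scraper, you can paste the search URL from Build Page URL into a browser to sanity-check it. The abridged shape of the response that Build Page URL consumes looks like this (all values are placeholders):

```javascript
// Abridged shape of Wikipedia's search response (placeholder values only).
const exampleResponse = {
  query: {
    search: [
      { ns: 0, title: 'Some Page Title', pageid: 12345, snippet: '...' },
      // ...more results, ranked by relevance
    ],
  },
};
```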
Step 3: Extract History and Generate AI Summary
Extract the History/Origins section from the HTML, then summarize it with AI.
- Connect ScrapeOps Page Fetcher → Extract History Segment.
- Review the custom parser in Extract History Segment (no changes required) to ensure it returns history_raw along with the metadata from Build Page URL.
- Open AI Summary Composer and confirm the model is set to `gpt-4o-mini`.
- In AI Summary Composer, confirm the user message includes the variables `{{ $json.topic }}`, `{{ $json.wikipedia_page_title }}`, `{{ $json.wikipedia_page_url }}`, `{{ $json.search_query_url }}`, and `{{ $json.history_raw }}`.
- Credential Required: Connect your openAiApi credentials in AI Summary Composer.
- Connect Extract History Segment → AI Summary Composer → Parse AI Response.
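If you're wondering what Parse AI Response has to do, here's a minimal sketch. It assumes the prompt asks the model to return a JSON object with summary and timeline keys; the template's prompt, output keys, and field names may differ:

```javascript
// Minimal sketch of a "Parse AI Response" Code node (field names assumed).
// Depending on your OpenAI node version, the text may sit under a different key.
const raw = $json.message?.content ?? $json.output ?? '';

// Strip markdown code fences the model sometimes wraps around JSON.
const cleaned = raw.replace(/^`{3}(?:json)?\s*/i, '').replace(/`{3}\s*$/, '').trim();

let parsed;
try {
  parsed = JSON.parse(cleaned);
} catch (err) {
  throw new Error(`AI response was not valid JSON: ${err.message}`);
}

return [{
  json: {
    Topic: $('Assign Topic Value').first().json.topic,
    History_Summary: parsed.summary ?? '',
    // Flatten timeline entries like { date, event } into one cell-friendly string.
    Timeline: (parsed.timeline ?? []).map((e) => `${e.date}: ${e.event}`).join('\n'),
  },
}];
```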
Step 4: Configure Google Sheets Output
Append the AI-generated history summary to a Google Sheet.
- Open Append Sheet Row and keep Operation set to `append`.
- Set Document to your Google Sheet URL (currently `https://docs.google.com/spreadsheets/d/[YOUR_ID]/edit?gid=0#gid=0`).
- Set Sheet Name to `Sheet1` (value `gid=0`).
- Map the column values as shown: Topic = `={{ $json.Topic }}`, Timeline = `={{ $json.Timeline }}`, History_Raw = `={{ $json.History_Raw }}`, History_Cleaned = `={{ $json.History_Cleaned }}`, History_Summary = `={{ $json.History_Summary }}`, Search_Query_URL = `={{ $json.Search_Query_URL }}`, Wikipedia_Page_URL = `={{ $json.Wikipedia_Page_URL }}`, Wikipedia_Page_Title = `={{ $json.Wikipedia_Page_Title }}`.
- Credential Required: Connect your googleSheetsOAuth2Api credentials in Append Sheet Row.
- Connect Parse AI Response → Append Sheet Row.
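One detail that trips people up: for the mapping above to land in the right cells, the first row of Sheet1 needs headers that match the field names exactly (assuming the node maps columns by header):

```text
Topic | Timeline | History_Raw | History_Cleaned | History_Summary | Search_Query_URL | Wikipedia_Page_URL | Wikipedia_Page_Title
```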
Step 5: Test and Activate Your Workflow
Run the workflow end-to-end and verify the final row in Google Sheets before activating.
- Click Execute Workflow and manually run Manual Launch Trigger.
- Confirm Wikipedia Query Request returns a valid search result and Build Page URL outputs a wikipedia_page_url.
- Verify Extract History Segment outputs a non-empty history_raw value.
- Check Parse AI Response for properly parsed fields like History_Summary and Timeline.
- Open your Google Sheet and confirm a new row is appended by Append Sheet Row.
- When satisfied, toggle the workflow to Active to use it in production runs.
Common Gotchas
- Google Sheets credentials can expire or need specific permissions. If things break, check the connected Google account in n8n’s Credentials and confirm it can edit the target spreadsheet.
- If you’re using Wait nodes or external rendering, processing times vary. Bump up the wait duration if downstream nodes fail on empty responses.
- Default prompts in AI nodes are generic. Add your brand voice early or you’ll be editing outputs forever.
Frequently Asked Questions
How long does setup take?
About 10 minutes if you already have the API keys.
Do I need to know how to code?
No. You'll connect ScrapeOps, OpenAI, and Google Sheets, then edit a topic field and pick the sheet tab.
Is n8n free to use?
Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You'll also need to factor in OpenAI API costs and ScrapeOps usage, which depend on how many pages you process.
Where should I host n8n?
Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.
Can I extract sections other than History?
Yes, but you'll want to adjust the "Extract History Segment" code logic so it searches for your preferred headings. Common tweaks include extracting a different section, changing the AI prompt to output more (or fewer) timeline events, and mapping extra fields into new Google Sheets columns.
Why is the ScrapeOps step failing?
Usually it's an invalid or expired ScrapeOps API key added to the ScrapeOps node. Check the ScrapeOps dashboard for key status, then confirm the key is pasted into the correct credentials field in n8n. If the key is fine, it can be a plan limit or the target page returning a non-200 response, which your workflow should handle with a simple "If" fallback. Also, Wikipedia pages change; if the HTML structure shifts, the extraction code may need a small update.
How many topics can I run at once?
On n8n Cloud Starter, you can run a healthy volume for small teams, and self-hosting removes execution caps (your server becomes the limit). Practically, most people run this in batches of 20–50 topics at a time so they can spot-check output quality and avoid hammering any single source.
Is n8n better than Zapier or Make for this?
Often, yes, because this workflow needs multi-step logic (API lookup, proxy scraping, extraction, AI formatting, and structured parsing). That kind of flow is doable in Zapier/Make, but it tends to get expensive and harder to debug once you add branching and custom parsing. n8n also gives you a real self-host option, which matters if you're doing research at scale. The flip side: if you only need a simple "send a link, save a note" flow, Zapier or Make can be quicker. Talk to an automation expert if you want help choosing.
Once this is set up, research stops being a fragile pile of tabs and half-finished notes. Your sheet becomes the system, and you can finally build on what you learned instead of redoing it.
Need Help Setting This Up?
Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.