🔓 Unlock all 10,000+ workflows & prompts free Join Newsletter →
✅ Full access unlocked — explore all 10,000 AI workflow and prompt templates Browse Templates →
Home n8n Workflow
January 22, 2026

arXiv to Notion, with Telegram digests you trust

Lisa Granqvist Partner Workflow Automation Expert

Checking arXiv “quickly” turns into 40 open tabs, half-read PDFs, and that nagging feeling you still missed the one paper everyone will talk about tomorrow. It’s not the reading that breaks you. It’s the hunting, sorting, and saving.

Founders tracking what might shift their roadmap feel this pain first. A research lead trying to keep a small team aligned gets it too. And if you do marketing for an AI product, arXiv Notion automation is an easy way to stay sharp without living in feeds.

This workflow pulls new AI papers from arXiv, files them into Notion with clean metadata and PDF links, generates Gemini “deep summaries,” then sends a daily Telegram digest you can actually trust. You’ll see how it works, what you need, and where people usually get stuck.

How This Automation Works

The full n8n workflow, from trigger to final output:

n8n Workflow Template: arXiv to Notion, with Telegram digests you trust

The Problem: arXiv overload turns into missed papers

arXiv is an incredible resource, but it’s also a firehose. You open the AI feed intending to scan a handful of abstracts, and suddenly it’s a mini project: filter out duplicates, figure out what’s new since yesterday, click into PDFs, then save “the good ones” somewhere you will remember later. Even if you have a Notion database, getting papers in there usually means copy-pasting titles, IDs, authors, and links (and inevitably messing up one of them). The cost isn’t just time. It’s context switching, and it makes “keeping up” feel heavier than it should.

It adds up fast. Here’s where it breaks down in real life.

  • You keep re-finding the same papers because there’s no reliable dedupe across days.
  • Saving papers to Notion is manual, which means metadata gets inconsistent and searching later becomes annoying.
  • Even when you grab the PDFs, summarizing them takes long enough that you postpone it, then forget why you saved them.
  • Your “daily check” depends on willpower, so busy mornings quietly erase your research habit.

The Solution: arXiv → Notion pages + Gemini summaries + Telegram digest

This n8n workflow turns arXiv into a daily research assistant that runs at 08:00. It pulls the latest Artificial Intelligence papers from arXiv’s API, converts the feed into clean JSON, then filters to a recent time window so you’re not reprocessing yesterday’s list. Next, it removes duplicate records and builds a structured Notion page for each paper, including the core metadata and a direct PDF URL. After the Notion page exists, Gemini reads the PDF and produces a “deep research” summary in chunks, which are appended back into that same Notion page as readable blocks. Finally, the workflow posts a Telegram update with the title, a short abstract, and links to both the PDF and your Notion entry.

The workflow starts on a schedule (or manually when you’re testing). arXiv data gets cleaned, filtered, and deduped before anything is saved. Then Notion becomes your system of record, and Telegram becomes your daily reminder to actually look.

What You Get: Automation vs. Results

Example: What This Looks Like

Say your team wants to track 10 new AI papers a day. Manually, it’s maybe 6 minutes per paper to open arXiv, grab the PDF link, copy the title/authors, paste into Notion, and write even a rough 3–4 sentence takeaway. That’s about an hour, every weekday. With this workflow, you spend a couple minutes skimming the Telegram digest, then open only the 1–2 Notion pages that look relevant. The rest is already filed and summarized for you.

What You’ll Need

  • n8n instance (try n8n Cloud free)
  • Self-hosting option if you prefer (Hostinger works well)
  • Notion for your paper database and summaries.
  • Telegram to post a daily digest to a channel.
  • Google Gemini API key (get it from Google AI Studio / Google Cloud console).

Skill level: Intermediate. You’ll mostly map fields and add credentials, plus light tweaking if your Notion schema is custom.

Don’t want to set this up yourself? Talk to an automation expert (free 15-minute consultation).

How It Works

A daily scheduled trigger starts the run. The workflow is set to fire at 08:00, and there’s also a manual trigger for testing so you’re not waiting until tomorrow morning.

arXiv data is fetched and cleaned. n8n pulls the AI feed via HTTP Request, converts the XML into JSON, and splits entries into individual paper items that are easier to process.

Recent-only filtering and deduping keeps things sane. Code nodes apply a time window and remove duplicates, so your Notion database doesn’t slowly fill with repeats or stale results.

Notion becomes the archive, then Gemini fills in the insight. For each paper, the workflow creates a Notion page, inserts a “summary” heading, asks Gemini to produce a deep summary from the PDF, parses the summary into chunks, and appends them as rich-text blocks. A Wait node pauses between papers so you don’t overwhelm downstream calls.

You can easily modify the arXiv query to focus on specific topics (like agents, retrieval, or diffusion) based on your needs. See the full implementation guide below for customization options.

Step-by-Step Implementation Guide

Step 1: Configure the Trigger Type

Set up both manual and scheduled triggers so you can run on-demand or daily.

  1. Open Manual Execution Trigger to enable manual runs (no configuration required).
  2. Open Scheduled Automation Start and confirm the rule interval is set to run at 8 (hour of day).
  3. Verify both triggers connect to External API Request so either trigger can start the workflow.

Use Manual Execution Trigger for initial testing before enabling Scheduled Automation Start in production.

Step 2: Connect the Research Feed Source

Configure the arXiv API request and convert XML to JSON for downstream parsing.

  1. Open External API Request and set URL to https://export.arxiv.org/api/query.
  2. Under Query Parameters, set search_query to abs:"artificial intelligence", sortBy to submittedDate, sortOrder to descending, start to 0, and max_results to 100.
  3. Open XML to JSON Mapper and enable Merge Attributes so XML attributes are preserved.
  4. Open Split Feed Entries and set Field to Split Out to feed.entry.

Step 3: Filter and Deduplicate Papers

Keep only recent entries and remove duplicates across runs.

  1. Open Filter Recent Papers and confirm the time window is const HOURS = 24; in the JS code.
  2. Review the output fields built in Filter Recent Papers (e.g., title, summary, authors, published).
  3. Open Remove Duplicate Records and keep $getWorkflowStaticData('global') for persistent deduplication.

⚠️ Common Pitfall: The dedupe logic requires the Remove Duplicate Records node to run in “Run Once for All Items” mode. Ensure this is enabled to avoid missing duplicates.

Step 4: Set Up Notion Payload and Field Mapping

Build the Notion-ready payload and map fields before creating records.

  1. Open Compose Notion Payload and replace TU_DATABASE_ID with your Notion database ID.
  2. Confirm Map Fields for Notion assignments such as title set to {{ $json.properties.title.title[0].text.content }} and published set to {{ $json.properties.published.date.start }}.
  3. Keep the abstract_clean cleaning expression exactly as configured: {{ String($json.properties.abstract.rich_text[0].text.content || $json.summary).replace(/\\n/g, ' ').replace(/\n/g, ' ').replace(/\s+/g, ' ').trim() }}.
  4. Confirm url_pdf builds the PDF URL with the expression {{ (() => { const src = ($json.link || $json.properties.arxiv_id.rich_text[0].text.content || '').trim(); if (!src) return ''; let u = src.replace('/abs/', '/pdf/'); if (!/^https?:\/\/i.test(u)) u = 'https://' + u.replace(/^\/+/, ''); if (!u.toLowerCase().endsWith('.pdf')) u += '.pdf'; return u;})() }}.

Credential Required: Connect your Notion credentials for Create Notion Record, Insert Summary Heading, and Append Summary Blocks. These nodes have no credentials configured yet.

Step 5: Configure Notion Record Creation and Iteration

Create database pages, capture page IDs, and prepare the loop for per-paper processing.

  1. Open Create Notion Record and set Title to {{ $json.title }} with Resource set to databasePage.
  2. In Create Notion Record properties, confirm each field uses the mapped expressions, such as {{ $json.abstract_clean }} for the abstract.
  3. Open Store Page Identifier and keep Include Other Fields enabled, mapping =page_id to {{ $json.id }}.
  4. Open Insert Summary Heading and confirm the block type is heading_2 with text Deep Research Summary.
  5. Open Iterate Paper Processing to ensure batch processing is enabled for looped summary insertion.

Create Notion Record outputs to both Store Page Identifier and Dispatch Telegram Update in parallel, so both Notion and Telegram flows proceed simultaneously.

Step 6: Configure AI Summarization and Notion Block Appending

Generate deep summaries, parse them into chunks, and append to the Notion page while pacing requests.

  1. Open Normalize Page Identifier and keep the expression {{ $json.page_id || $json.parent?.page_id || $json.page_id || $json.results?.[0]?.parent?.page_id || $json.id }} to standardize page IDs.
  2. In Generate Deep Summary, confirm Resource is document and Document URLs is {{ $('Map Fields for Notion').item.json.url_pdf }}.
  3. Credential Required: Connect your googlePalmApi credentials in Generate Deep Summary and verify the model is models/gemini-2.5-pro.
  4. Confirm Normalize Page Identifier outputs to both Generate Deep Summary and Combine Page and Summary in parallel, and that Combine Page and Summary uses Mode combine with Combine By combineByPosition.
  5. Open Parse Summary Chunks and leave the JSON parsing code intact to extract chunks into slice items.
  6. Open Prepare Chunk Debug and confirm mappings like {{ $json.slice.length }} for dbg_slice_len.
  7. Open Append Summary Blocks and set block Text Content to {{ $json.slice || '—' }} with Resource block.
  8. Open Pause Between Summaries and set Amount to 0.4 to throttle Notion updates.
  9. Ensure Loop Back to Next Paper connects to Iterate Paper Processing to continue the loop.

Step 7: Configure Telegram Notifications

Send a Telegram update when each Notion page is created.

  1. Open Dispatch Telegram Update and set Chat ID to your target value, e.g., [YOUR_ID].
  2. Keep the message template as configured, including expressions like {{ $('Create Notion Record').item.json.property_title }} and {{ $('Map Fields for Notion').item.json.abstract_clean }}.
  3. Credential Required: Connect your telegramApi credentials in Dispatch Telegram Update.
  4. Confirm Dispatch Telegram Update outputs to End Telegram Path.

Dispatch Telegram Update is triggered in parallel with Store Page Identifier after Create Notion Record, so notifications are sent immediately after page creation.

Step 8: Test and Activate Your Workflow

Run a manual test, verify outputs in Notion and Telegram, then activate the schedule.

  1. Click Execute Workflow using Manual Execution Trigger and watch data pass through External API Request, XML to JSON Mapper, and Split Feed Entries.
  2. Confirm new pages appear in Notion from Create Notion Record and that blocks are appended by Append Summary Blocks.
  3. Verify Telegram messages arrive from Dispatch Telegram Update with valid links and summary text.
  4. Once successful, activate the workflow and rely on Scheduled Automation Start for daily execution.

⚠️ Common Pitfall: If Notion pages or blocks fail to create, re-check that Notion credentials are connected to Create Notion Record, Insert Summary Heading, and Append Summary Blocks.

🔒

Unlock Full Step-by-Step Guide

Get the complete implementation guide + downloadable template

Common Gotchas

  • Notion credentials can expire or need specific permissions. If things break, check your Notion integration connection inside n8n and confirm the database is shared with that integration.
  • If you’re using Wait nodes or external rendering, processing times vary. Bump up the wait duration if downstream nodes fail on empty responses.
  • Gemini API limits and PDF accessibility can cause flaky summaries. If the PDF link is blocked or too slow, the “Generate Deep Summary” step may return partial output, so review the run history for that node first.

Frequently Asked Questions

How long does it take to set up this arXiv Notion automation automation?

Plan on about 45 minutes if your Notion database and Telegram bot are ready.

Do I need coding skills to automate arXiv Notion automation?

No. You’ll connect accounts, paste in API keys, and map a few Notion properties.

Is n8n free to use for this arXiv Notion automation workflow?

Yes. n8n has a free self-hosted option and a free trial on n8n Cloud. Cloud plans start at $20/month for higher volume. You’ll also need to factor in Gemini API usage, which depends on how many PDFs you summarize per day.

Where can I host n8n to run this automation?

Two options: n8n Cloud (managed, easiest setup) or self-hosting on a VPS. For self-hosting, Hostinger VPS is affordable and handles n8n well. Self-hosting gives you unlimited executions but requires basic server management.

Can I customize this arXiv Notion automation workflow for a different arXiv category or keyword search?

Yes, and it’s one of the best tweaks to make. Update the arXiv query in the “External API Request” HTTP Request node to target your preferred categories or search terms. Common customizations include narrowing to subtopics (like agents or RAG), increasing max_results on busy days, and changing the time-window logic in the “Filter Recent Papers” code node so you catch weekend releases.

Why is my Notion connection failing in this workflow?

Usually it’s permissions, not n8n. Make sure the Notion database is shared with your Notion integration, then re-check the credentials in n8n and re-select the database in the “Create Notion Record” node. If it fails only sometimes, look at the property mapping in “Map Fields for Notion,” because a mismatched select/tag value can cause the create call to error out.

How many papers can this arXiv Notion automation automation handle?

Practically, it handles “as many as you’re willing to summarize,” because the bottleneck is the PDF summarization step. On n8n Cloud Starter you’re limited by monthly executions, while self-hosting has no execution limit (your server and API quotas matter more). If you expect 50+ papers a day, add a stricter filter (keywords, categories, or max_results) and keep the Wait node so you don’t spike API errors. Most people start with 10–20 papers daily and adjust from there. Honestly, you want fewer, better papers anyway.

Is this arXiv Notion automation automation better than using Zapier or Make?

For this workflow, n8n has a few advantages: more complex logic with unlimited branching at no extra cost, a self-hosting option for unlimited executions, and native code/looping support for chunked summaries that Zapier tends to make awkward. Zapier or Make can work if you only want “RSS in, message out,” but the Notion + PDF summarization + append-in-chunks part is where you’ll feel the limits. If you want to tune deduping, time windows, or the summary format, n8n stays flexible. Talk to an automation expert if you’re not sure which fits.

Once this is running, your research “system” stops depending on motivation. The workflow does the collecting and summarizing, and you just show up for the few papers worth your attention.

Need Help Setting This Up?

Our automation experts can build and customize this workflow for your specific needs. Free 15-minute consultation—no commitment required.

Lisa Granqvist

Workflow Automation Expert

Expert in workflow automation and no-code tools.

×

Use template

Get instant access to this n8n workflow Json file

💬
Get a free quote today!
Get a free quote today!

Tell us what you need and we'll get back to you within one working day.

Get a free quote today!
Get a free quote today!

Tell us what you need and we'll get back to you within one working day.

Launch login modal Launch register modal