✨ Vision-Based AI Agent Scraper - with Google Sheets, ScrapingBee, and Gemini — n8n Workflow

Mittel Komplexität Auslöser14 Knoten AI👁 35,213 Aufrufevon Dataki

Übersicht

Important Notes: Check Legal Regulations:
This workflow involves scraping, so ensure you comply with the legal regulations in your country before getting started. Better safe than sorry!

Workflow Description:
😮‍💨 Tired of struggling with XPath, CSS selectors, or DOM specificity when scraping ?

This AI-powered solution is here to simplify your workflow! With a vision-based AI Agent, you can extract data effortlessly without worrying about how the DOM is structured.

This workflow leverage

Verwendete Knoten

Google SheetsHTTP RequestAI AgentStructured Output ParserCall n8n Workflow ToolGoogle Gemini Chat Model

Workflow-Vorschau

Trigger
The default trigger is When clicking ‘Test workflow’, m
You can replace this by selecting a trigger of your cho
Google Sheets - List of URLs
The Google Sheet will contain two sheets:
- List of URLs to scrape
- Results page, populated with the scraping results and
Here is an *
Set Fields
This node allows you to define the fields that will be
In this template, only one field is pre-configured: url
ScrapingBee - Get Page Screenshot
This node uses ScrapingBee, a powerful scraping tool, t
You can try ScrapingBee and en
Vision-Based Scraping AI Agent
This is the central node of the workflow, powered by an
- System Prompt: Instructs the AI on how and what data
Google Gemini Chat Model
The default model is gemini-1.5-pro. It offers excellen
HTML-Based Scraping Tool
This tool is invoked when the AI Agent requires the HTM
Structured Output Parser
This node organizes the extracted data into an easy-to-
In this template, the JSON is designed for an e-commerc
Split Out
This node splits the array created by the `Structured O
Google Sheets - Create Rows
This node creates rows in the Results sheet using the e
You can use the [example Google Sheet](https://docs.goo
HTML-Scraping Tool Trigger
This node serves as the entry point for the HTML scrapi
It is triggered by the AI Agent only when it fails to e
Set Fields - From AI Agent Query
This node sets the fields from the AI Agent’s query.
In this template, the only field configured is url.
ScrapingBee - Get Page HTML
This node utilizes the ScrapingBee API to retrieve the
HTML to Markdown
This node converts the HTML from the previous node into
The converted Markdown is then automatically sent to th
✨ Vision-Based AI Agent Scraper - with Google
Important notes :
Check legal regulations:
This workflow involves scraping, so make sure to check
modeltoolparser
W
When clicking ‘Test work…
ScrapingBee- Get page HTML
Structured Output Parser
Google Gemini Chat Model
S
Split Out
Google Sheets - Get list…
S
Set fields
ScrapingBee - Get page s…
HTML-based Scraping Tool
Google Sheets - Create R…
Vision-based Scraping Ag…
H
HTML-Scraping Tool
S
Set fields - from AI age…
H
HTML to Markdown
14 nodes12 edges

So funktioniert es

  1. 1

    Auslöser

    Der Workflow startet mit einem auslöser-Auslöser.

  2. 2

    Verarbeitung

    Die Daten fließen durch 14 Knoten, connecting agent, executeworkflowtrigger, googlesheets.

  3. 3

    Ausgabe

    Der Workflow schließt seine Automatisierung ab und liefert das Ergebnis an das konfigurierte Ziel.

Knotendetails (14)

GO

Google Sheets

googleSheets

#1
HT

HTTP Request

httpRequest

#2
AI

AI Agent

n8n-nodes-langchain.agent

#3
ST

Structured Output Parser

n8n-nodes-langchain.outputParserStructured

#4
CA

Call n8n Workflow Tool

n8n-nodes-langchain.toolWorkflow

#5
GO

Google Gemini Chat Model

n8n-nodes-langchain.lmChatGoogleGemini

#6

So importieren Sie diesen Workflow

  1. 1Klicken Sie rechts auf die Schaltfläche JSON herunterladen, um die Workflow-Datei zu speichern.
  2. 2Öffnen Sie Ihre n8n-Instanz. Gehen Sie zu Workflows → Neu → Aus Datei importieren.
  3. 3Wählen Sie die heruntergeladene Datei vision-based-ai-agent-scraper-with-google-sheets-scrapingbee-and-gemini und klicken Sie auf Importieren.
  4. 4Richten Sie Anmeldedaten für jeden Dienstknoten ein (API-Schlüssel, OAuth usw.).
  5. 5Klicken Sie auf Workflow testen, um zu überprüfen, ob alles funktioniert, und aktivieren Sie es dann.

Oder direkt in n8n → Aus JSON importieren einfügen:

{ "name": "✨ Vision-Based AI Agent Scraper - with Google Sheets, ScrapingBee, and Gemini", "nodes": [...], ...}

Integrationen

agentexecuteworkflowtriggergooglesheetshttprequestlmchatgooglegeminimanualtriggermarkdownoutputparserstructuredsetsplitouttoolworkflow

Diesen Workflow holen

Herunterladen und mit einem Klick importieren

JSON herunterladenAuf n8n.io ansehen
Knoten14
Komplexitätmedium
Auslösertrigger
Aufrufe35,213
KategorieAI

Erstellt von

Dataki

Dataki

@dataki

Tags

agentexecuteworkflowtriggergooglesheetshttprequestlmchatgooglegeminimanualtriggermarkdownoutputparserstructuredsetsplitout

Neu bei n8n?

n8n ist ein kostenloses Open-Source-Workflow-Automatisierungstool. Selbst hosten oder die Cloud-Version nutzen.

n8n kostenlos holen →

Related AI Workflows

AGCOFIGM+10
high

Automate Digital Product Delivery: Stripe to Gmail via n8n

Transform your post-purchase operations with this high-performance n8n workflow designed for digital creators and SaaS founders. Instead of manual fulfillment, this automation acts as a 24/7 digital concierge. It begins by scanning Stripe for successful transactions, cross-referencing buyer data against a centralized Google Sheets inventory to identify the correct digital asset. Utilizing advanced AI via GPT-4o, the system then drafts a personalized onboarding email, including secure access credentials and custom instructions, ensuring a premium customer experience without manual intervention. This flow eliminates the 'human-in-the-middle' delay, significantly reducing support tickets related to missing downloads. By integrating an AI agent, the workflow can intelligently parse complex product variations, making it far more robust than standard linear automations. Whether you are selling automation templates, software licenses, or protected PDF guides, this system provides a scalable infrastructure that grows with your sales volume while maintaining a personal touch through LLM-generated content. **Common Use Cases:** - Scaling a niche digital marketplace for selling specialized code snippets or design assets. - Automating the distribution of unique software license keys and documentation after a SaaS subscription purchase. - Delivering personalized AI-generated consulting reports or audit results based on customer input data.

Scheduled·25 nodes
AGCHCOEX+10
high

Automated AI Resume Parser & JD Matcher via n8n & GPT-4

Transform your recruitment funnel with this advanced AI-driven candidate evaluation engine. This n8n workflow eliminates manual screening fatigue by autonomously analyzing batches of resumes against specific job descriptions. Using GPT-4 and LangChain's structured output parsers, the system extracts key qualifications, scores them against your criteria, and generates objective alignment reports. The process begins with a custom n8n form for document upload, followed by intelligent text extraction from PDFs. The data is then processed through an LLM chain to ensure unbiased scoring. Results are synchronized directly to Google Sheets for centralized tracking, while high-match alerts are dispatched via Slack and SendGrid to keep hiring managers informed in real-time. This workflow is essential for high-volume recruitment agencies and scaling startups that need to maintain a rigorous, auditable, and data-backed shortlisting process without increasing headcount or sacrificing quality of hire. **Common Use Cases:** - High-volume university recruitment and internship screening - Technical talent sourcing for niche engineering roles - Internal mobility matching for large corporate restructuring

Trigger·21 nodes
AGCHGMGM+5
medium

AI Gmail Auto-Labeler: Smart Inbox Sorting with GPT-4 & n8n

Stop drowning in a cluttered inbox and regain control of your digital communication. This advanced n8n automation leverages GPT-4's natural language processing to intelligently analyze, categorize, and label incoming Gmail messages in real-time. Unlike basic filter rules that rely on rigid keywords, this workflow understands the context and sentiment of every email, ensuring high-precision organization. The process begins with a Gmail Trigger that captures new messages. It then passes the content through a LangChain LLM chain where OpenAI evaluates the intent—distinguishing between urgent client requests, internal project updates, or low-priority newsletters. Using structured output parsing, the workflow extracts key metadata and applies the appropriate Gmail labels automatically. This eliminates the manual cognitive load of triaging emails, allowing your team to focus on high-value tasks rather than administrative upkeep. Whether you are managing high-volume support tickets or complex sales inquiries, this workflow ensures that critical messages are highlighted and organized without human intervention. **Common Use Cases:** - Automated Customer Support Triage: Instantly tag emails as 'Urgent Support', 'Feature Request', or 'Billing' to speed up response times. - Sales Lead Prioritization: Automatically identify high-intent inquiries and label them for immediate follow-up by account executives. - Project Management Sync: Categorize incoming vendor updates and stakeholder feedback by project name or department for better visibility.

Trigger·11 nodes
AGGMGOLM+3
medium

AI Dental Lead Follow-up: n8n, OpenAI & Google Sheets Sync

Stop losing high-value patients to delayed responses. This advanced n8n workflow bridges the gap between lead capture and appointment booking by deploying an AI-driven engagement layer. When a prospect submits a query via your website or landing page, the automation immediately triggers, logging the data into Google Sheets for centralized tracking. Instead of sending a generic auto-reply, the integrated LangChain agent utilizes GPT-4/3.5 to analyze the specific treatment interest—be it Invisalign, dental implants, or routine cleaning—and crafts a personalized, empathetic response delivered via Gmail. The workflow includes a strategic 'Wait' node to mimic natural human timing and a 'Memory Buffer' to maintain context if the lead replies. This system is essential for clinics looking to scale their patient acquisition without increasing administrative headcount, ensuring every inquiry is nurtured instantly with professional, clinical-grade communication. By automating the initial touchpoint, your front-desk team can focus on confirmed arrivals rather than chasing cold leads. **Common Use Cases:** - Automated Patient Triage: Categorizing and responding to specific dental treatment inquiries based on urgency and procedure type. - Medical Spa Lead Nurturing: Instantly engaging prospects interested in high-ticket aesthetic treatments to increase conversion rates. - Multi-Location Clinic Sync: Centralizing lead data from various web forms into a single Google Sheet while maintaining personalized local email follow-ups.

Trigger·8 nodes
AGCOGOHT+8
high

Automate AI UGC Video Production with Google Sheets & Veo

Transform your digital marketing strategy by automating the production of high-converting User-Generated Content (UGC) at scale. This sophisticated n8n workflow eliminates the logistical bottleneck of traditional content creation by orchestrating a seamless pipeline between Google Sheets and advanced AI video models. By leveraging NanoBanana Pro for precise image synthesis and Veo 3.1 for fluid motion, the system takes three distinct visual inputs—your product, a chosen persona, and a target environment—and blends them into hyper-realistic, selfie-style video assets. The automation begins by monitoring a Google Sheet for new campaign parameters, triggers an intelligent AI agent to handle complex image processing, and manages asynchronous API calls to ensure high-fidelity video rendering. This is an enterprise-grade solution for performance marketers who need to refresh creative assets daily without manual intervention. It effectively handles the heavy lifting of prompt engineering and file management, allowing you to focus on strategy while the workflow generates 8-second, platform-ready clips optimized for the TikTok and Instagram algorithms. **Common Use Cases:** - Scaling creative testing for TikTok and Meta Ads by generating hundreds of product variants - Automating personalized influencer-style shoutouts for e-commerce loyalty programs - Rapid prototyping of social media video content for global brand localization

Scheduled·24 nodes
@BAGCOIF+7
high

Automate AI Twitter Threads via Telegram & n8n (No-Code)

Streamline your social media presence with this enterprise-grade n8n automation that bridges the gap between raw inspiration and professional X (Twitter) publishing. By integrating Telegram as a mobile command center, this workflow allows you to dictate or type thread ideas on the go. The system leverages OpenAI’s LLMs and LangChain agents to draft high-engagement thread structures, which are then sent back to your private Telegram chat for review. Unlike basic automation, this setup features a sophisticated 'human-in-the-loop' approval system. You can interactively refine the AI-generated content, request edits, or finalize the draft. Once you provide the 'Okay' confirmation, the workflow automatically formats and publishes the entire thread to X via the Blotato integration. This eliminates the friction of manual drafting and copy-pasting, ensuring your brand stays active without the constant overhead of social media management. It is the perfect solution for creators and businesses looking to maintain a high-quality digital presence using a voice-to-thread mobile pipeline. **Common Use Cases:** - Thought Leadership: Convert spontaneous voice memos into structured educational threads while commuting. - Event Live-Tweeting: Rapidly draft and approve professional event summaries from a mobile device without opening the X app. - Content Repurposing: Send links or snippets of long-form articles to Telegram to generate 'tweetable' summaries for social distribution.

Trigger·19 nodes