✨ Vision-Based AI Agent Scraper - with Google Sheets, ScrapingBee, and Gemini — Workflow n8n

Moyen complexité Déclencheur14 nœuds AI👁 35,213 vuespar Dataki

Aperçu

Important Notes: Check Legal Regulations:
This workflow involves scraping, so ensure you comply with the legal regulations in your country before getting started. Better safe than sorry!

Workflow Description:
😮‍💨 Tired of struggling with XPath, CSS selectors, or DOM specificity when scraping ?

This AI-powered solution is here to simplify your workflow! With a vision-based AI Agent, you can extract data effortlessly without worrying about how the DOM is structured.

This workflow leverage

Nœuds utilisés

Google SheetsHTTP RequestAI AgentStructured Output ParserCall n8n Workflow ToolGoogle Gemini Chat Model

Aperçu du workflow

Trigger
The default trigger is When clicking ‘Test workflow’, m
You can replace this by selecting a trigger of your cho
Google Sheets - List of URLs
The Google Sheet will contain two sheets:
- List of URLs to scrape
- Results page, populated with the scraping results and
Here is an *
Set Fields
This node allows you to define the fields that will be
In this template, only one field is pre-configured: url
ScrapingBee - Get Page Screenshot
This node uses ScrapingBee, a powerful scraping tool, t
You can try ScrapingBee and en
Vision-Based Scraping AI Agent
This is the central node of the workflow, powered by an
- System Prompt: Instructs the AI on how and what data
Google Gemini Chat Model
The default model is gemini-1.5-pro. It offers excellen
HTML-Based Scraping Tool
This tool is invoked when the AI Agent requires the HTM
Structured Output Parser
This node organizes the extracted data into an easy-to-
In this template, the JSON is designed for an e-commerc
Split Out
This node splits the array created by the `Structured O
Google Sheets - Create Rows
This node creates rows in the Results sheet using the e
You can use the [example Google Sheet](https://docs.goo
HTML-Scraping Tool Trigger
This node serves as the entry point for the HTML scrapi
It is triggered by the AI Agent only when it fails to e
Set Fields - From AI Agent Query
This node sets the fields from the AI Agent’s query.
In this template, the only field configured is url.
ScrapingBee - Get Page HTML
This node utilizes the ScrapingBee API to retrieve the
HTML to Markdown
This node converts the HTML from the previous node into
The converted Markdown is then automatically sent to th
✨ Vision-Based AI Agent Scraper - with Google
Important notes :
Check legal regulations:
This workflow involves scraping, so make sure to check
modeltoolparser
W
When clicking ‘Test work…
ScrapingBee- Get page HTML
Structured Output Parser
Google Gemini Chat Model
S
Split Out
Google Sheets - Get list…
S
Set fields
ScrapingBee - Get page s…
HTML-based Scraping Tool
Google Sheets - Create R…
Vision-based Scraping Ag…
H
HTML-Scraping Tool
S
Set fields - from AI age…
H
HTML to Markdown
14 nodes12 edges

Comment ça fonctionne

  1. 1

    Déclencheur

    Le workflow démarre avec un déclencheur déclencheur.

  2. 2

    Traitement

    Les données transitent par 14 nœuds, connecting agent, executeworkflowtrigger, googlesheets.

  3. 3

    Sortie

    Le workflow termine son automatisation et livre le résultat à la destination configurée.

Détails des nœuds (14)

GO

Google Sheets

googleSheets

#1
HT

HTTP Request

httpRequest

#2
AI

AI Agent

n8n-nodes-langchain.agent

#3
ST

Structured Output Parser

n8n-nodes-langchain.outputParserStructured

#4
CA

Call n8n Workflow Tool

n8n-nodes-langchain.toolWorkflow

#5
GO

Google Gemini Chat Model

n8n-nodes-langchain.lmChatGoogleGemini

#6

Comment importer ce workflow

  1. 1Cliquez sur le bouton Télécharger JSON à droite pour enregistrer le fichier du workflow.
  2. 2Ouvrez votre instance n8n. Accédez à Workflows → Nouveau → Importer depuis un fichier.
  3. 3Sélectionnez le fichier vision-based-ai-agent-scraper-with-google-sheets-scrapingbee-and-gemini téléchargé et cliquez sur Importer.
  4. 4Configurez les identifiants pour chaque nœud de service (clés API, OAuth, etc.).
  5. 5Cliquez sur Tester le workflow pour vérifier que tout fonctionne, puis activez-le.

Ou collez directement dans n8n → Importer depuis JSON :

{ "name": "✨ Vision-Based AI Agent Scraper - with Google Sheets, ScrapingBee, and Gemini", "nodes": [...], ...}

Intégrations

agentexecuteworkflowtriggergooglesheetshttprequestlmchatgooglegeminimanualtriggermarkdownoutputparserstructuredsetsplitouttoolworkflow

Obtenir ce workflow

Téléchargez et importez en un clic

Télécharger JSONVoir sur n8n.io
Nœuds14
Complexitémedium
Déclencheurtrigger
Vues35,213
CatégorieAI

Créé par

Dataki

Dataki

@dataki

Tags

agentexecuteworkflowtriggergooglesheetshttprequestlmchatgooglegeminimanualtriggermarkdownoutputparserstructuredsetsplitout

Nouveau sur n8n ?

n8n est un outil d'automatisation de workflows gratuit et open-source. Hébergez-le vous-même ou utilisez la version cloud.

Obtenir n8n gratuitement →

Related AI Workflows

AGCOFIGM+10
high

Automate Digital Product Delivery: Stripe to Gmail via n8n

Transform your post-purchase operations with this high-performance n8n workflow designed for digital creators and SaaS founders. Instead of manual fulfillment, this automation acts as a 24/7 digital concierge. It begins by scanning Stripe for successful transactions, cross-referencing buyer data against a centralized Google Sheets inventory to identify the correct digital asset. Utilizing advanced AI via GPT-4o, the system then drafts a personalized onboarding email, including secure access credentials and custom instructions, ensuring a premium customer experience without manual intervention. This flow eliminates the 'human-in-the-middle' delay, significantly reducing support tickets related to missing downloads. By integrating an AI agent, the workflow can intelligently parse complex product variations, making it far more robust than standard linear automations. Whether you are selling automation templates, software licenses, or protected PDF guides, this system provides a scalable infrastructure that grows with your sales volume while maintaining a personal touch through LLM-generated content. **Common Use Cases:** - Scaling a niche digital marketplace for selling specialized code snippets or design assets. - Automating the distribution of unique software license keys and documentation after a SaaS subscription purchase. - Delivering personalized AI-generated consulting reports or audit results based on customer input data.

Scheduled·25 nodes
AGCHCOEX+10
high

Automated AI Resume Parser & JD Matcher via n8n & GPT-4

Transform your recruitment funnel with this advanced AI-driven candidate evaluation engine. This n8n workflow eliminates manual screening fatigue by autonomously analyzing batches of resumes against specific job descriptions. Using GPT-4 and LangChain's structured output parsers, the system extracts key qualifications, scores them against your criteria, and generates objective alignment reports. The process begins with a custom n8n form for document upload, followed by intelligent text extraction from PDFs. The data is then processed through an LLM chain to ensure unbiased scoring. Results are synchronized directly to Google Sheets for centralized tracking, while high-match alerts are dispatched via Slack and SendGrid to keep hiring managers informed in real-time. This workflow is essential for high-volume recruitment agencies and scaling startups that need to maintain a rigorous, auditable, and data-backed shortlisting process without increasing headcount or sacrificing quality of hire. **Common Use Cases:** - High-volume university recruitment and internship screening - Technical talent sourcing for niche engineering roles - Internal mobility matching for large corporate restructuring

Trigger·21 nodes
AGCHGMGM+5
medium

AI Gmail Auto-Labeler: Smart Inbox Sorting with GPT-4 & n8n

Stop drowning in a cluttered inbox and regain control of your digital communication. This advanced n8n automation leverages GPT-4's natural language processing to intelligently analyze, categorize, and label incoming Gmail messages in real-time. Unlike basic filter rules that rely on rigid keywords, this workflow understands the context and sentiment of every email, ensuring high-precision organization. The process begins with a Gmail Trigger that captures new messages. It then passes the content through a LangChain LLM chain where OpenAI evaluates the intent—distinguishing between urgent client requests, internal project updates, or low-priority newsletters. Using structured output parsing, the workflow extracts key metadata and applies the appropriate Gmail labels automatically. This eliminates the manual cognitive load of triaging emails, allowing your team to focus on high-value tasks rather than administrative upkeep. Whether you are managing high-volume support tickets or complex sales inquiries, this workflow ensures that critical messages are highlighted and organized without human intervention. **Common Use Cases:** - Automated Customer Support Triage: Instantly tag emails as 'Urgent Support', 'Feature Request', or 'Billing' to speed up response times. - Sales Lead Prioritization: Automatically identify high-intent inquiries and label them for immediate follow-up by account executives. - Project Management Sync: Categorize incoming vendor updates and stakeholder feedback by project name or department for better visibility.

Trigger·11 nodes
AGGMGOLM+3
medium

AI Dental Lead Follow-up: n8n, OpenAI & Google Sheets Sync

Stop losing high-value patients to delayed responses. This advanced n8n workflow bridges the gap between lead capture and appointment booking by deploying an AI-driven engagement layer. When a prospect submits a query via your website or landing page, the automation immediately triggers, logging the data into Google Sheets for centralized tracking. Instead of sending a generic auto-reply, the integrated LangChain agent utilizes GPT-4/3.5 to analyze the specific treatment interest—be it Invisalign, dental implants, or routine cleaning—and crafts a personalized, empathetic response delivered via Gmail. The workflow includes a strategic 'Wait' node to mimic natural human timing and a 'Memory Buffer' to maintain context if the lead replies. This system is essential for clinics looking to scale their patient acquisition without increasing administrative headcount, ensuring every inquiry is nurtured instantly with professional, clinical-grade communication. By automating the initial touchpoint, your front-desk team can focus on confirmed arrivals rather than chasing cold leads. **Common Use Cases:** - Automated Patient Triage: Categorizing and responding to specific dental treatment inquiries based on urgency and procedure type. - Medical Spa Lead Nurturing: Instantly engaging prospects interested in high-ticket aesthetic treatments to increase conversion rates. - Multi-Location Clinic Sync: Centralizing lead data from various web forms into a single Google Sheet while maintaining personalized local email follow-ups.

Trigger·8 nodes
AGCOGOHT+8
high

Automate AI UGC Video Production with Google Sheets & Veo

Transform your digital marketing strategy by automating the production of high-converting User-Generated Content (UGC) at scale. This sophisticated n8n workflow eliminates the logistical bottleneck of traditional content creation by orchestrating a seamless pipeline between Google Sheets and advanced AI video models. By leveraging NanoBanana Pro for precise image synthesis and Veo 3.1 for fluid motion, the system takes three distinct visual inputs—your product, a chosen persona, and a target environment—and blends them into hyper-realistic, selfie-style video assets. The automation begins by monitoring a Google Sheet for new campaign parameters, triggers an intelligent AI agent to handle complex image processing, and manages asynchronous API calls to ensure high-fidelity video rendering. This is an enterprise-grade solution for performance marketers who need to refresh creative assets daily without manual intervention. It effectively handles the heavy lifting of prompt engineering and file management, allowing you to focus on strategy while the workflow generates 8-second, platform-ready clips optimized for the TikTok and Instagram algorithms. **Common Use Cases:** - Scaling creative testing for TikTok and Meta Ads by generating hundreds of product variants - Automating personalized influencer-style shoutouts for e-commerce loyalty programs - Rapid prototyping of social media video content for global brand localization

Scheduled·24 nodes
@BAGCOIF+7
high

Automate AI Twitter Threads via Telegram & n8n (No-Code)

Streamline your social media presence with this enterprise-grade n8n automation that bridges the gap between raw inspiration and professional X (Twitter) publishing. By integrating Telegram as a mobile command center, this workflow allows you to dictate or type thread ideas on the go. The system leverages OpenAI’s LLMs and LangChain agents to draft high-engagement thread structures, which are then sent back to your private Telegram chat for review. Unlike basic automation, this setup features a sophisticated 'human-in-the-loop' approval system. You can interactively refine the AI-generated content, request edits, or finalize the draft. Once you provide the 'Okay' confirmation, the workflow automatically formats and publishes the entire thread to X via the Blotato integration. This eliminates the friction of manual drafting and copy-pasting, ensuring your brand stays active without the constant overhead of social media management. It is the perfect solution for creators and businesses looking to maintain a high-quality digital presence using a voice-to-thread mobile pipeline. **Common Use Cases:** - Thought Leadership: Convert spontaneous voice memos into structured educational threads while commuting. - Event Live-Tweeting: Rapidly draft and approve professional event summaries from a mobile device without opening the X app. - Content Repurposing: Send links or snippets of long-form articles to Telegram to generate 'tweetable' summaries for social distribution.

Trigger·19 nodes