Extract Invoice Data from PDFs to JSON with Gemini AI and XML Transformation β€” n8n Workflow

Gemiddeld complexiteit⚑ Trigger6 knooppunten🏷️ Invoice Processingdoor Mauricio Perera

Overzicht

This n8n workflow converts invoices in PDF format into a structured, ready-to-use JSON, using AI and XML transformation β€” without writing any code.

πŸš€ How it works

Upload form β†’ The user uploads a PDF file. Text extraction β†’ The PDF content is extracted as plain text. XML schema definition β†’ A standard invoice structure is defined with fields such as:

Invoice number Customer and issuer details Items with description, quantity, and price Totals and taxes Bank account details AI (

Gebruikte knooppunten

Google Gemini

Workflow-voorvertoning

PDF to text
Clean data and XML structure definition
Generate XML string
String to XML to Json
⚑
O
On form submission
E
Extract from File
Message a model
L
Limpio data
L
Limpio XML
X
XML to JSON
6 nodes5 edges

Hoe het werkt

  1. 1

    Trigger

    De workflow start met een trigger-trigger.

  2. 2

    Verwerking

    Gegevens stromen door 6 knooppunten, connecting extractfromfile, formtrigger, googlegemini.

  3. 3

    Uitvoer

    De workflow voltooit zijn automatisering en levert het resultaat aan de geconfigureerde bestemming.

Knooppuntdetails (6)

GO

Google Gemini

n8n-nodes-langchain.googleGemini

#1

Hoe deze workflow te importeren

  1. 1Klik op de knop JSON downloaden rechts om het workflowbestand op te slaan.
  2. 2Open uw n8n-instantie. Ga naar Workflows β†’ Nieuw β†’ Importeren uit bestand.
  3. 3Selecteer het gedownloade bestand extract-invoice-data-from-pdfs-to-json-with-gemini-ai-and-xml-transformation en klik op Importeren.
  4. 4Stel inloggegevens in voor elk serviceknooppunt (API-sleutels, OAuth, enz.).
  5. 5Klik op Workflow testen om te controleren of alles werkt, activeer het vervolgens.

Of plak rechtstreeks in n8n β†’ Importeren uit JSON:

{ "name": "Extract Invoice Data from PDFs to JSON with Gemini AI and XML Transformation", "nodes": [...], ...}

Integraties

extractfromfileformtriggergooglegeminisetxml

Haal deze workflow op

Download en importeer met één klik

JSON downloadenBekijken op n8n.io
Knooppunten6
Complexiteitmedium
Triggertrigger

Gemaakt door

Mauricio Perera

Mauricio Perera

@rckflr

Tags

extractfromfileformtriggergooglegeminisetxml
⚑

Nieuw bij n8n?

n8n is een gratis open-source workflow-automatiseringstool. Host het zelf of gebruik de cloudversie.

n8n gratis ophalen β†’

Related Invoice Processing Workflows

COCOEMEX+5
medium

Automate Custom QuickBooks Invoice PDFs & Email with n8n

Standard accounting templates often fail to reflect a premium brand identity. This sophisticated n8n workflow bridges the gap between financial record-keeping and professional client presentation. By moving beyond the native limitations of QuickBooks Online, this automation enables businesses to generate high-end, multi-page PDF invoices that align perfectly with their corporate styling. The process begins the moment a new invoice is generated in QuickBooks, triggering a webhook that captures real-time billing data. The workflow then utilizes advanced HTML-to-File conversion and custom Code nodes to structure data into a polished, branded layout. It handles complex logic such as line-item merging and multi-page formatting automatically. Once the document is rendered, the system bypasses generic 'no-reply' senders by routing the finalized PDF through your preferred email provider. This ensures a seamless, white-labeled experience for your clients while eliminating the manual overhead of exporting, styling, and attaching files. Ideal for agencies and service providers, this flow guarantees that your most frequent touchpointβ€”the billβ€”is as professional as your work. **Common Use Cases:** - High-end creative agencies requiring bespoke, white-labeled billing documents for premium clients. - Automated recurring subscription billing where custom tax disclosures or localized branding are required. - Service-based businesses needing to attach dynamic project reports or terms of service directly to QuickBooks invoices.

πŸ”— WebhookΒ·12 nodes