Extract Invoice Data from PDFs to JSON with Gemini AI and XML Transformation — n8n 工作流

复杂度 触发器6 个节点🏷️ Invoice Processing作者:Mauricio Perera

概览

This n8n workflow converts invoices in PDF format into a structured, ready-to-use JSON, using AI and XML transformation — without writing any code.

🚀 How it works

Upload form → The user uploads a PDF file. Text extraction → The PDF content is extracted as plain text. XML schema definition → A standard invoice structure is defined with fields such as:

Invoice number Customer and issuer details Items with description, quantity, and price Totals and taxes Bank account details AI (

使用的节点

Google Gemini

工作流预览

PDF to text
Clean data and XML structure definition
Generate XML string
String to XML to Json
O
On form submission
E
Extract from File
Message a model
L
Limpio data
L
Limpio XML
X
XML to JSON
6 nodes5 edges

工作原理

  1. 1

    触发器

    工作流由 触发器 触发器启动。

  2. 2

    处理

    数据流经 6 个节点, connecting extractfromfile, formtrigger, googlegemini。

  3. 3

    输出

    工作流完成自动化并将结果发送到配置的目标。

节点详情 (6)

GO

Google Gemini

n8n-nodes-langchain.googleGemini

#1

如何导入此工作流

  1. 1点击右侧 下载 JSON 按钮保存工作流文件。
  2. 2打开你的 n8n 实例,依次点击 工作流 → 新建 → 从文件导入
  3. 3选择下载的 extract-invoice-data-from-pdfs-to-json-with-gemini-ai-and-xml-transformation 文件并点击导入。
  4. 4为每个服务节点配置 凭证(API 密钥、OAuth 等)。
  5. 5点击 测试工作流 验证一切正常,然后激活它。

或直接在 n8n → 从 JSON 导入 中粘贴:

{ "name": "Extract Invoice Data from PDFs to JSON with Gemini AI and XML Transformation", "nodes": [...], ...}

集成

extractfromfileformtriggergooglegeminisetxml

获取此工作流

一键下载并导入

下载 JSON在 n8n.io 上查看
节点6
复杂度medium
触发器trigger

创建者

Mauricio Perera

Mauricio Perera

@rckflr

标签

extractfromfileformtriggergooglegeminisetxml

n8n 新手?

n8n 是一款免费开源的工作流自动化工具,支持自托管或使用云版本。

免费获取 n8n →

Related Invoice Processing Workflows

COCOEMEX+5
medium

Automate Custom QuickBooks Invoice PDFs & Email with n8n

Standard accounting templates often fail to reflect a premium brand identity. This sophisticated n8n workflow bridges the gap between financial record-keeping and professional client presentation. By moving beyond the native limitations of QuickBooks Online, this automation enables businesses to generate high-end, multi-page PDF invoices that align perfectly with their corporate styling. The process begins the moment a new invoice is generated in QuickBooks, triggering a webhook that captures real-time billing data. The workflow then utilizes advanced HTML-to-File conversion and custom Code nodes to structure data into a polished, branded layout. It handles complex logic such as line-item merging and multi-page formatting automatically. Once the document is rendered, the system bypasses generic 'no-reply' senders by routing the finalized PDF through your preferred email provider. This ensures a seamless, white-labeled experience for your clients while eliminating the manual overhead of exporting, styling, and attaching files. Ideal for agencies and service providers, this flow guarantees that your most frequent touchpoint—the bill—is as professional as your work. **Common Use Cases:** - High-end creative agencies requiring bespoke, white-labeled billing documents for premium clients. - Automated recurring subscription billing where custom tax disclosures or localized branding are required. - Service-based businesses needing to attach dynamic project reports or terms of service directly to QuickBooks invoices.

🔗 Webhook·12 nodes