Extract invoice data from scanned PDFs to Google Sheets with Sarvam and Gemini — рабочий процесс n8n

Высокая сложность Триггер16 узлов🏷️ Invoice Processing👁 1 просмотровот Divyanshu Gupta

Обзор

This template is designed for operations, finance, and accounting teams that need to automatically process scanned invoices and extract structured data without manual entry.

It is ideal for businesses handling vendor invoices, reimbursement forms, or bulk document intake.

What this workflow does This workflow uses Sarvam AI Vision model to perform OCR on scanned invoices and extract raw text. The extracted content is then processed using an LLM to identify key invoice fields such as:

Vendor n

Использованные узлы

Google SheetsHTTP RequestCompressionCodeGoogle Gemini Chat ModelInformation Extractor

Предпросмотр рабочего процесса

!Sarvam
Try It Out!
This template is designed for operations, finance, and
It is
Step 1 – Upload invoice to Sarvam
Creates an OCR job and uploads the invoice PDF to Sarva
Step 2 – Run OCR and monitor status
Starts invoice OCR processing and polls the job status
Step 3 – Retrieve OCR output
Downloads the processed invoice output, decompresses th
Step 4 – Convert OCR text to structured invoi
Cleans OCR text and uses an LLM to extract structured i
Step 5 – Store invoice data
Appends extracted invoice fields to Google Sheets for a
model
Get OCR Result
W
Wait
Information Extractor
Google Gemini Chat Model
T
Trigger – Invoice upload…
Create Sarvam invoice OC…
Generate Sarvam presigne…
M
Merge job details with u…
Upload invoice PDF to Sa…
Start Sarvam invoice OCR
Check Sarvam OCR status
Download Sarvam OCR outp…
Decompress OCR result file
E
Extract invoice OCR JSON
Prepare invoice text for…
Append extracted invoice…
16 nodes16 edges

Как это работает

  1. 1

    Триггер

    Рабочий процесс запускается триггером триггер.

  2. 2

    Обработка

    Данные проходят через 16 узлов, connecting code, compression, extractfromfile.

  3. 3

    Вывод

    Рабочий процесс завершает автоматизацию и доставляет результат в настроенное место назначения.

Детали узлов (16)

GO

Google Sheets

googleSheets

#1
HT

HTTP Request

httpRequest

#2
CO

Compression

compression

#3
CO

Code

code

#4
GO

Google Gemini Chat Model

n8n-nodes-langchain.lmChatGoogleGemini

#5
IN

Information Extractor

n8n-nodes-langchain.informationExtractor

#6

Как импортировать этот рабочий процесс

  1. 1Нажмите кнопку Скачать JSON справа, чтобы сохранить файл рабочего процесса.
  2. 2Откройте ваш экземпляр n8n. Перейдите в Рабочие процессы → Новый → Импорт из файла.
  3. 3Выберите скачанный файл extract-invoice-data-from-scanned-pdfs-to-google-sheets-with-sarvam-and-gemini и нажмите Импортировать.
  4. 4Настройте учётные данные для каждого узла сервиса (ключи API, OAuth и т.д.).
  5. 5Нажмите Протестировать рабочий процесс, чтобы убедиться в правильной работе, затем активируйте его.

Или вставьте напрямую в n8n → Импорт из JSON:

{ "name": "Extract invoice data from scanned PDFs to Google Sheets with Sarvam and Gemini", "nodes": [...], ...}

Интеграции

codecompressionextractfromfileformtriggergooglesheetshttprequestinformationextractorlmchatgooglegeminimergewait

Получить этот рабочий процесс

Скачайте и импортируйте одним кликом

Скачать JSONПросмотреть на n8n.io
Узлы16
Сложностьhigh
Триггерtrigger
Просмотры1
КатегорияInvoice Processing

Создан

Divyanshu Gupta

Divyanshu Gupta

@divyanshugupta

Теги

codecompressionextractfromfileformtriggergooglesheetshttprequestinformationextractorlmchatgooglegeminimergewait

Новичок в n8n?

n8n — бесплатный инструмент автоматизации рабочих процессов с открытым исходным кодом. Разверните самостоятельно или используйте облачную версию.

Получить n8n бесплатно →

Related Invoice Processing Workflows

COCOEMEX+5
medium

Automate Custom QuickBooks Invoice PDFs & Email with n8n

Standard accounting templates often fail to reflect a premium brand identity. This sophisticated n8n workflow bridges the gap between financial record-keeping and professional client presentation. By moving beyond the native limitations of QuickBooks Online, this automation enables businesses to generate high-end, multi-page PDF invoices that align perfectly with their corporate styling. The process begins the moment a new invoice is generated in QuickBooks, triggering a webhook that captures real-time billing data. The workflow then utilizes advanced HTML-to-File conversion and custom Code nodes to structure data into a polished, branded layout. It handles complex logic such as line-item merging and multi-page formatting automatically. Once the document is rendered, the system bypasses generic 'no-reply' senders by routing the finalized PDF through your preferred email provider. This ensures a seamless, white-labeled experience for your clients while eliminating the manual overhead of exporting, styling, and attaching files. Ideal for agencies and service providers, this flow guarantees that your most frequent touchpoint—the bill—is as professional as your work. **Common Use Cases:** - High-end creative agencies requiring bespoke, white-labeled billing documents for premium clients. - Automated recurring subscription billing where custom tax disclosures or localized branding are required. - Service-based businesses needing to attach dynamic project reports or terms of service directly to QuickBooks invoices.

🔗 Webhook·12 nodes