Extract invoice data from scanned PDFs to Google Sheets with Sarvam and Gemini — n8n Workflow
Overview
This template is designed for operations, finance, and accounting teams that need to automatically process scanned invoices and extract structured data without manual entry.
It is ideal for businesses handling vendor invoices, reimbursement forms, or bulk document intake.
What this workflow does This workflow uses Sarvam AI Vision model to perform OCR on scanned invoices and extract raw text. The extracted content is then processed using an LLM to identify key invoice fields such as:
Vendor n
Nodes used
Workflow Preview
How it Works
- 1
Trigger
The workflow starts with a trigger trigger.
- 2
Process
Data flows through 16 nodes, connecting code, compression, extractfromfile.
- 3
Output
The workflow completes its automation and delivers the result to the configured destination.
Node Details (16)
Google Sheets
googleSheets
HTTP Request
httpRequest
Compression
compression
Code
code
Google Gemini Chat Model
n8n-nodes-langchain.lmChatGoogleGemini
Information Extractor
n8n-nodes-langchain.informationExtractor
How to Import This Workflow
- 1Click Download JSON button on the right to save the workflow file.
- 2Open your n8n instance. Go to Workflows → New → Import from file.
- 3Select the downloaded
extract-invoice-data-from-scanned-pdfs-to-google-sheets-with-sarvam-and-geminifile and click Import. - 4Set up credentials for each service node (API keys, OAuth, etc.).
- 5Click Test Workflow to verify everything works, then activate it.
Or paste directly in n8n → Import from JSON:
Integrations
Created by
Divyanshu Gupta
@divyanshugupta
Tags
New to n8n?
n8n is a free, open-source workflow automation tool. Self-host it or use the cloud version.
Get n8n Free →