Google Drive to PGVector Knowledge Base Builder

Automate loading files (PDF, JSON, text) from Google Drive into a PGVector database using n8n. The workflow builds a RAG knowledge base from those files using OpenAI embeddings.

n8n Nodes Used

Recursive Character Text Splitter

Postgres PGVector Store

Sticky Note

Default Data Loader

Manual Trigger

Who is this best for?

This is the ideal n8n template for:

AI/ML Engineers who need reliable, scheduled data ingestion into a vector database.
Data Architects building or maintaining the knowledge base for a Retrieval-Augmented Generation (RAG) system.
Automation Specialists who want to combine cloud storage file operations with vector processing in an n8n workflow.
Content Managers who frequently upload new documentation (PDFs, text files) that should become available for AI queries after the next scheduled run.

Overview

A knowledge base for AI applications is only as good as its ingestion pipeline. This n8n workflow removes the manual work of converting PDF, text, and JSON documents into queryable vector embeddings.

The template combines LangChain nodes with standard n8n file operation nodes. Running on a Schedule Trigger, it keeps your Postgres PGVector store in sync with the latest documents in a Google Drive folder. After a file is vectorized, a cleanup step moves it out of the input folder so it is not processed again on the next run.

How it Works

The workflow runs mainly on a schedule, so new files are loaded without manual intervention:

Scheduled Start: The workflow starts from a Schedule Trigger (set for 3 AM daily) or manually from the When clicking ‘Test workflow’ trigger.

Identify Documents: The Search Folder Google Drive node scans a designated input folder for files that need processing.

Iterate and Download: The Loop Over Items node processes each file individually, and the Download File Google Drive node fetches its binary content.

Type Routing: A Switch node inspects the file's MIME type (PDF, text, or JSON) and routes the binary data to the matching extraction branch.

Content Extraction: A dedicated Extract from File node on each branch parses the PDF, plain text, or JSON content into a standardized text document.

Vectorization Pipeline: The extracted text is split into chunks by the Recursive Character Text Splitter node (with 50 characters of overlap between chunks), and each chunk is passed to the Embeddings OpenAI node, which uses the text-embedding-3-small model to generate its vector.

Database Storage: The resulting vectors and their text chunks are inserted through the Postgres PGVector Store node into the n8nvectorswfs table, under the n8n_wfs collection.

Cleanup: Finally, a Move File Google Drive node relocates each successfully vectorized file to a 'vectorized' archive folder, which completes the cycle for that file.
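The control flow of these steps can be sketched outside n8n. The following Python is an illustration only, not the workflow's actual implementation: the names DriveFile, route, and run_once are hypothetical, the ingest and archive callables stand in for the download/extract/embed/insert nodes and the Move File node, and the choice to leave failed or unsupported files in the input folder is an assumption, since the template description does not say how errors are handled.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

# MIME types the Switch node is described as handling, mapped to a branch name.
# The branch names are hypothetical labels for this sketch.
EXTRACTOR_BY_MIME: Dict[str, str] = {
    "application/pdf": "pdf",
    "text/plain": "text",
    "application/json": "json",
}


@dataclass
class DriveFile:
    file_id: str
    name: str
    mime_type: str


def route(file: DriveFile) -> str:
    """Return the extraction branch for a file, mirroring the Type Routing step."""
    try:
        return EXTRACTOR_BY_MIME[file.mime_type]
    except KeyError:
        raise ValueError(f"unsupported MIME type: {file.mime_type}") from None


def run_once(
    files: Iterable[DriveFile],
    ingest: Callable[[DriveFile, str], None],
    archive: Callable[[DriveFile], None],
) -> List[str]:
    """Process each file and archive it only after ingestion succeeds.

    Returns the IDs of files that failed or were unsupported. Those files are
    not archived, so they remain in the input folder for the next run
    (an assumed policy, not something the template documents).
    """
    failed: List[str] = []
    for file in files:
        try:
            branch = route(file)
            ingest(file, branch)  # stands in for download, extract, chunk, embed, insert
        except Exception:
            failed.append(file.file_id)
            continue
        archive(file)  # stands in for the Move File step
    return failed
```

Archiving only after a successful insert is what prevents duplicate processing: a file leaves the input folder exactly when its vectors are already stored.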

Installation Guide

To deploy this workflow template, follow these steps:

Import: Copy the provided JSON data and paste it directly into your n8n instance using the 'New' -> 'Import from JSON' function.

Google Drive Setup: Configure the Google Drive credentials for the Search Folder, Download File, and Move File n8n nodes. Ensure the account has access to both the source (input) and destination (vectorized) folders.

OpenAI Credentials: Set up the OpenAI credential within the Embeddings OpenAI node. Without it, the workflow cannot generate vector embeddings.

Postgres PGVector Setup: Establish the connection to your PostgreSQL database in the Postgres PGVector Store n8n node. Verify the database name, table name (n8nvectorswfs), and collection name (n8n_wfs) match your required vector store schema.

Folder IDs: Update the Google Drive Folder IDs in the Search Folder (input) and Move File (archive) n8n nodes to match your specific Google Drive directory structure.

Activation: Enable the workflow by toggling the 'Active' switch. Once active, the Schedule Trigger runs the workflow at its configured time.
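To reason about when the first run will happen after activation, the "3 AM daily" schedule can be modeled in a few lines of Python. This is a sketch under assumptions: the helper name next_daily_run is hypothetical, the cron string is the conventional five-field form of "03:00 every day" rather than a value taken from the template, and the time zone that applies in practice depends on how your n8n instance is configured.

```python
from datetime import datetime, time, timedelta

# Conventional five-field cron form of "every day at 03:00":
# minute hour day-of-month month day-of-week
DAILY_CRON = "0 3 * * *"


def next_daily_run(now: datetime, run_at: time = time(3, 0)) -> datetime:
    """Return the next occurrence of run_at strictly after now.

    The result carries the same tzinfo as now (or none, if now is naive).
    """
    candidate = datetime.combine(now.date(), run_at, tzinfo=now.tzinfo)
    if candidate <= now:
        # Today's slot has already passed (or is exactly now): use tomorrow's.
        candidate += timedelta(days=1)
    return candidate
```

For example, a workflow activated at 14:00 would, under these assumptions, first run at 03:00 the following day, so files uploaded in the meantime wait until then unless the workflow is started manually.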

Node Details

This workflow template uses several core and specialized n8n nodes:

Schedule Trigger / Manual Trigger: The entry points of the flow, allowing scheduled or immediate execution.
Google Drive Nodes (Search Folder, Download File, Move File): Handle the cloud storage operations: locating source files, downloading their content, and archiving processed files. Search Folder uses a specific folder ID to limit which files it returns.
Switch Node: A flow control node that inspects the MIME type of the downloaded file (application/pdf, text/plain, application/json) and selects the matching extraction path.
Extract from File Nodes (PDF, Text, JSON): Convert the binary data into usable text based on the detected file type.
Embeddings OpenAI Node: Connects to OpenAI to generate vectors, configured to use the text-embedding-3-small model.
Recursive Character Text Splitter Node: A LangChain node that splits documents into chunks, configured with a chunkOverlap of 50 so that adjacent chunks share 50 characters of context.
Postgres PGVector Store Node: The destination database handler. It is set to insert mode and targets the table n8nvectorswfs with the collection name n8n_wfs.
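The effect of a chunkOverlap of 50 can be illustrated with a simplified splitter. Note that this is a plain fixed-size sliding window, not the recursive, separator-aware logic of the Recursive Character Text Splitter itself. The function name chunk_with_overlap is hypothetical, and the default chunk_size of 1000 is an assumption, since the template description states only the overlap.

```python
from typing import List


def chunk_with_overlap(
    text: str, chunk_size: int = 1000, chunk_overlap: int = 50
) -> List[str]:
    """Split text into fixed-size character windows that overlap.

    Each chunk starts (chunk_size - chunk_overlap) characters after the
    previous one, so consecutive chunks share chunk_overlap characters.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")

    step = chunk_size - chunk_overlap
    chunks: List[str] = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reached the end of the text
        start += step
    return chunks
```

With overlap, a sentence that straddles a chunk boundary appears at least partly in both neighboring chunks, which is why the overlap helps preserve context when each chunk is embedded on its own.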

Free

Nodes: 11

Updated: December 26 2025

Created by

Alex Kim

n8n Ambassador & Verified Partner
