Skip to main content

RAG Pipeline Utils

Enterprise-grade composable RAG pipelines with advanced AI capabilities

104
Exports
2050+
Tests
<200ms
Retrieval Latency
Zero
Production Vulnerabilities

Modular Plugin Architecture

5 plugin types with JSON Schema contracts and 7 provider connectors (OpenAI, Anthropic, Cohere, Ollama, and more). Swap any component without rewriting your pipeline.

Smart Chunking & Retrieval

5 chunking strategies, hybrid vector + BM25 search, and reciprocal rank fusion for highly relevant context retrieval across large document sets.

RAG Evaluation

Faithfulness, relevance, context precision/recall, and groundedness metrics to measure and improve RAG pipeline quality.

Quick Start

Build a complete RAG pipeline in minutes. Install the package, configure your plugins, and start querying your documents with AI-powered retrieval.

npm install @devilsdev/rag-pipeline-utils

const { createRagPipeline } = require('@devilsdev/rag-pipeline-utils');

const pipeline = createRagPipeline({
  loader: new PDFLoader(),
  embedder: new OpenAIEmbedder(),
  retriever: new PineconeRetriever(),
  llm: new OpenAILLM()
});

// Documents are loaded via the loader plugin configured above
const result = await pipeline.run({ query: 'What is the vacation policy?' });

Production-Ready Features

Modular Plugin Architecture

5 plugin types with JSON Schema contracts and 7 provider connectors (OpenAI, Anthropic, Cohere, Ollama, and more).

Learn more →

Smart Chunking & Retrieval

5 chunking strategies, hybrid vector + BM25 search, and reciprocal rank fusion for highly relevant context retrieval.

Learn more →

RAG Evaluation

Faithfulness, relevance, context precision/recall, and groundedness metrics to measure and improve pipeline quality.

Learn more →

Citation & Grounding

Source attribution, hallucination detection, and groundedness scoring to ensure trustworthy AI-generated answers.

Learn more →

Agentic RAG & GraphRAG

Query planning, iterative retrieval, self-critique loops, and knowledge graph traversal for complex multi-hop questions.

Learn more →

3-Layer Guardrails

Prompt injection detection, ACL filtering, and PII detection to protect your pipeline at input, retrieval, and output stages.

Learn more →

Streaming & Connectors

SSE/WebSocket streaming plus built-in OpenAI, Anthropic, Cohere, and Ollama connectors for flexible deployment.

Learn more →

Cost & Debugging

Token tracking, budget enforcement, execution tracing, and bottleneck detection for full pipeline observability.

Learn more →

Enterprise Security

JWT replay protection, plugin sandboxing, audit logging, and multi-tenancy support for production deployments.

Learn more →

Ready to Build Production RAG Systems?

Join developers building enterprise-grade RAG pipelines with modular architecture, comprehensive security, and production observability.