MCP profile

Github HatmanStack Ragstack

Search, chat, upload, and scrape a serverless RAGStack knowledge base on AWS.

Content & Media · Package · Python · Open Source · External
Last updated: March 16, 2026
Visibility: Public
By: Registry

About This MCP Server


Serverless document and media processing with AI chat. Scale-to-zero architecture — no vector database fees, no idle costs. Upload documents, images, video, and audio — extract text with OCR or transcription — query using Amazon Bedrock or your AI assistant via MCP.

Capabilities
  • ☁️ Fully serverless architecture (Lambda, Step Functions, S3, DynamoDB)
  • 🧠 NEW Amazon Nova multimodal embeddings for text and image vectorization
  • 📄 Document processing & vectorization (PDF, images, Office docs, HTML, CSV, JSON, XML, EML, EPUB) → stored in managed knowledge base
  • 🎬 NEW Video/audio processing - transcribe speech with Amazon Transcribe, searchable by timestamp
  • 💬 AI chat with retrieval-augmented context and source attribution
  • 📎 Collapsible source citations with optional document downloads
  • ⏱️ NEW Media sources with timestamp links - click to play at exact position
  • 🔍 Metadata filtering - auto-discover document metadata and filter search results
  • 🎯 Relevancy boost for filtered results - prioritize matches from metadata filters
  • 🔄 Knowledge Base reindex - regenerate metadata for existing documents with updated settings
  • 🗑️ Document management - reprocess, reindex, or delete documents from the dashboard
  • 🌐 Web component for any framework (React, Vue, Angular, Svelte)
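The metadata filtering and relevancy boost capabilities can be pictured with a small ranking sketch. This is only an illustration of the general idea (filter-matching results get their score multiplied, then everything is re-sorted); it is not RAGStack's actual implementation. The `Hit` shape, the `rerank` function, and the `boost` value of 1.25 are all hypothetical names and numbers chosen for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Hit:
    """One retrieved chunk. Hypothetical shape, not RAGStack's real schema."""
    doc_id: str
    score: float  # similarity score returned by the vector search
    metadata: dict = field(default_factory=dict)


def matches(hit: Hit, filters: dict) -> bool:
    """True when every filter key is present on the hit with an equal value."""
    return all(hit.metadata.get(key) == value for key, value in filters.items())


def rerank(hits: list[Hit], filters: dict, boost: float = 1.25) -> list[Hit]:
    """Multiply the score of filter-matching hits by `boost`, then sort descending.

    Non-matching hits are kept rather than dropped, so the filter acts as a
    preference instead of a hard constraint. `boost` is an assumed knob.
    """
    if not filters:
        # No filters given: nothing to boost, just order by raw score.
        return sorted(hits, key=lambda h: h.score, reverse=True)
    boosted = [
        Hit(h.doc_id, h.score * boost if matches(h, filters) else h.score, h.metadata)
        for h in hits
    ]
    return sorted(boosted, key=lambda h: h.score, reverse=True)


hits = [
    Hit("report.pdf", 0.80, {"year": 2023}),
    Hit("memo.docx", 0.70, {"year": 2024}),
    Hit("talk.mp4", 0.50, {"year": 2024}),
]
ranked = rerank(hits, {"year": 2024})
print([h.doc_id for h in ranked])
```

With the sample data, the boosted `memo.docx` overtakes `report.pdf`, while `talk.mp4` is boosted but still ranks last because its raw score is much lower.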

Tools & Endpoints (1)

Example Workflow

  • Base Pipeline: The core document processing tool - upload, OCR, and query documents.
  • Project Showcase: See RAGStack powering a real application.

Specifications

Status: live
Industry: Content & Media
Category: General
Server type: Package
Language: Python
License: Open Source
Verified: Yes

Requirements

  • source venv/bin/activate (on Windows: venv\Scripts\activate)
  • pip install -r requirements.txt
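The same setup can be scripted without activating the environment, by calling the environment's own interpreter directly. The sketch below uses only the Python standard library. The directory name `venv` and the file name `requirements.txt` come from the commands above; the helper names `venv_python`, `install_command`, and `setup` are hypothetical and introduced only for illustration.

```python
import subprocess
import sys
import venv
from pathlib import Path


def venv_python(env_dir: Path, platform: str = sys.platform) -> Path:
    """Path of the interpreter inside a virtual environment.

    Windows environments place it under Scripts/, other platforms under bin/.
    """
    if platform.startswith("win"):
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def install_command(env_dir: Path, requirements: str = "requirements.txt") -> list[str]:
    """Build the pip invocation that runs inside the environment."""
    return [str(venv_python(env_dir)), "-m", "pip", "install", "-r", requirements]


def setup(env_dir: Path = Path("venv")) -> None:
    """Create the environment (with pip) and install the requirements file."""
    venv.create(env_dir, with_pip=True)
    subprocess.run(install_command(env_dir), check=True)


# Show the command that setup() would run; call setup() yourself from the
# repository root, where requirements.txt is expected to live.
print(install_command(Path("venv")))
```

Running the interpreter inside the environment avoids the platform-specific activation step, which is why the Windows and POSIX cases differ only in the path returned by `venv_python`.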

Hosting


Hosting Options

  • Package

API


Integrate this server into your application. Choose a connection method below.

1. Install

Install command (Python):
pip install -r requirements.txt

Quick Reference


Name: Github HatmanStack Ragstack
Function: Search, chat, upload, and scrape a serverless RAGStack knowledge base on AWS.
Available Tools: Each UI tab shows server-side API examples in an expandable section.
Transport: Package
Language: Python
Install: pip install -r requirements.txt
Source: External (Registry)
License: Open Source