thunderbit-mcp-server: Thunderbit MCP Server is an open-source toolkit for the Thunderbit Open API that ships three packages: a CLI for scripted extraction, an MCP server exposing seven scraping and distillation tools, and a Claude Code plugin. It converts any web page to clean LLM-ready Markdown, extracts structured data via JSON Schema, and supports batch processing — all backed by a free API key.; agentql-mcp: AgentQL MCP Server integrates AgentQL's data extraction capabilities into the Model Context Protocol. It allows AI assistants like Claude, VS Code, Cursor, and Windsurf to extract structured data from web pages using natural language prompts. Setup requires npm installation and an API key from AgentQL.
Feeding clean web content into LLM pipelines for RAG or summarization
Extract video lists from YouTube search results