webclaw
活跃·★ 1.2k·MIT·更新于 2026-05-29
★ 浏览器自动化★ 数据处理
最快的AI代理网页爬取器。
webclaw 是一个针对AI代理优化的快速网页爬取器。它使用Chrome级别的TLS指纹从任何URL提取干净、结构化的内容,比原始HTML减少67%的令牌。
#人工智能#AI 智能体#ai-scraping#命令行工具#crawler#data-extraction
最快的AI代理网页爬取器。
webclaw 是一个针对AI代理优化的快速网页爬取器。它使用Chrome级别的TLS指纹从任何URL提取干净、结构化的内容,比原始HTML减少67%的令牌。
Structured data extraction from web content is accurate. The Rust implementation handles edge cases well.
CLI, REST API, and MCP server in one package covers the use cases without multiple tools.
Rust-based web extraction is fast — noticeably lower latency than Python scrapers for large crawls.
Fast, local-first web content extraction is the feature combination the market was missing.
Local-first extraction means no data leaving the machine. Right for sensitive content.