Ingest public web records at scale
Ingest public web records at scale Crawl, extract, and deliver structured web intelligence via API.
Pain points
- Government sites are slow and heterogeneous
- Compliance requires robots respect
- Batch exports for analysts
Architecture
- Respect robots.txt and rate limits
- Queue-backed crawl workers
- Structured extraction
- Audit logs per job
Example output
{ "record_type": "permit", "jurisdiction": "CA", "fields": { "id": "P-9912" } }FAQ
How fast can I start?
Sign up free, create an API key, and call /graph/domain-context or /scrape in minutes. See /docs for curl examples.
Is output AI-ready?
Yes — structured JSON, context_for_ai summaries, and link graphs designed for agents and RAG pipelines.