— Use case

Ingest public web records at scale

Ingest public web records at scale Crawl, extract, and deliver structured web intelligence via API.

Pain points

  • Government sites are slow and heterogeneous
  • Compliance requires robots respect
  • Batch exports for analysts

Architecture

  1. Respect robots.txt and rate limits
  2. Queue-backed crawl workers
  3. Structured extraction
  4. Audit logs per job

Example output

{ "record_type": "permit", "jurisdiction": "CA", "fields": { "id": "P-9912" } }

FAQ

How fast can I start?

Sign up free, create an API key, and call /graph/domain-context or /scrape in minutes. See /docs for curl examples.

Is output AI-ready?

Yes — structured JSON, context_for_ai summaries, and link graphs designed for agents and RAG pipelines.