# Micro Data API Factory — Full Documentation

> Public-source structured data APIs for AI agents and developers.
> Each record includes source URLs, observation dates, and confidence scores.

> This dataset provides source-linked observational data compiled from public sources. Accuracy, completeness, and freshness are not guaranteed. Each record may include sources, timestamps, confidence levels, and known conflicts.

## Base URL

https://micro-data-api-factory.kasanegi123.workers.dev

## Access Policy

- Humans: Free
- Search engines: Free
- AI crawlers: Free for basic endpoints, x402 payment for premium endpoints
- Content-Signal: ai-train=no, search=yes, ai-input=yes

## Available Datasets

### Dataset: AI Crawler User-Agent Database (ai-crawler-ua-db)

Observational database of AI crawler User-Agent strings, their operating companies, purposes, and known x402 payment support status. Compiled from public sources including robots.txt documentation, Cloudflare reports, and x402 ecosystem data.

**Free endpoints:**
- GET /api/datasets/ai-crawler-ua-db/stats — Database statistics
- GET /api/datasets/ai-crawler-ua-db/ranking — Items ranked by confidence
- GET /api/datasets/ai-crawler-ua-db/items — List/search items (supports ?q= and ?category=)
- GET /api/datasets/ai-crawler-ua-db/items/{id} — Item detail

**Premium endpoints (x402 payment for AI crawlers):**
- GET /api/datasets/ai-crawler-ua-db/items/{id}/sources — Source citations ($0.01)
- GET /api/datasets/ai-crawler-ua-db/items/{id}/conflicts — Known data conflicts ($0.01)
- GET /api/datasets/ai-crawler-ua-db/export — Bulk data export ($0.01)

### Dataset: HTTP Header Fields Registry (AI-Enhanced) (http-headers-iana)

Complete IANA HTTP Field Name Registry with AI-oriented annotations. Each header includes status, RFC reference, structured type, AI explanation, common usage patterns, security notes, and related headers.
317 registered fields from the official IANA registry, enhanced with practical guidance for developers and AI agents.

**Free endpoints:**
- GET /api/datasets/http-headers-iana/stats — Database statistics
- GET /api/datasets/http-headers-iana/ranking — Items ranked by confidence
- GET /api/datasets/http-headers-iana/items — List/search items (supports ?q= and ?category=)
- GET /api/datasets/http-headers-iana/items/{id} — Item detail

**Premium endpoints (x402 payment for AI crawlers):**
- GET /api/datasets/http-headers-iana/items/{id}/sources — Source citations ($0.01)
- GET /api/datasets/http-headers-iana/items/{id}/conflicts — Known data conflicts ($0.01)
- GET /api/datasets/http-headers-iana/export — Bulk data export ($0.01)

### Dataset: Standard Tables of Food Composition in Japan (Eighth Revised Edition, 2023 Supplement) AI-Enhanced (jp-food-composition)

AI-oriented structured data that extracts the main nutrients from the official Excel data of the Ministry of Education, Culture, Sports, Science and Technology (MEXT) publication 「日本食品標準成分表(八訂)増補2023年」 (Standard Tables of Food Composition in Japan, Eighth Revised Edition, 2023 Supplement). Nutrient values are given per 100 g of edible portion. This is the complete edition, covering 2,538 foods across all 18 food groups. The data is reference information only; do not use it as the final basis for medical or nutritional guidance decisions.

**Free endpoints:**
- GET /api/datasets/jp-food-composition/stats — Database statistics
- GET /api/datasets/jp-food-composition/ranking — Items ranked by confidence
- GET /api/datasets/jp-food-composition/items — List/search items (supports ?q= and ?category=)
- GET /api/datasets/jp-food-composition/items/{id} — Item detail

**Premium endpoints (x402 payment for AI crawlers):**
- GET /api/datasets/jp-food-composition/items/{id}/sources — Source citations ($0.01)
- GET /api/datasets/jp-food-composition/items/{id}/conflicts — Known data conflicts ($0.01)
- GET /api/datasets/jp-food-composition/export — Bulk data export ($0.01)

### Dataset: Labor Standards Act Article Structure Index (laws-labor-standards-act)

A source-linked retrieval view of all 251 articles of the Labor Standards Act (労働基準法, Act No. 49 of 1947), retrieved from the e-Gov Laws API and converted into structured JSON organized by article, paragraph, and item. This is not legal consultation, legal advice, or a legal judgment. For the interpretation and application of the articles, consult a legal professional.

**Free endpoints:**
- GET /api/datasets/laws-labor-standards-act/stats — Database statistics
- GET /api/datasets/laws-labor-standards-act/ranking — Items ranked by confidence
- GET /api/datasets/laws-labor-standards-act/items — List/search items (supports ?q= and ?category=)
- GET /api/datasets/laws-labor-standards-act/items/{id} — Item detail

**Premium endpoints (x402 payment for AI crawlers):**
- GET /api/datasets/laws-labor-standards-act/items/{id}/sources — Source citations ($0.01)
- GET /api/datasets/laws-labor-standards-act/items/{id}/conflicts — Known data conflicts ($0.01)
- GET /api/datasets/laws-labor-standards-act/export — Bulk data export ($0.01)

### Dataset: Public Data Source Candidates for AI Packaging (public-data-candidates)

Evaluated list of public data sources that could be transformed into AI-readable structured data APIs. Each candidate is scored for AI demand, extraction difficulty, rights risk, and packaging potential. This is the seed list for the Micro Data API Factory's expansion pipeline.

**Free endpoints:**
- GET /api/datasets/public-data-candidates/stats — Database statistics
- GET /api/datasets/public-data-candidates/ranking — Items ranked by confidence
- GET /api/datasets/public-data-candidates/items — List/search items (supports ?q= and ?category=)
- GET /api/datasets/public-data-candidates/items/{id} — Item detail

**Premium endpoints (x402 payment for AI crawlers):**
- GET /api/datasets/public-data-candidates/items/{id}/sources — Source citations ($0.01)
- GET /api/datasets/public-data-candidates/items/{id}/conflicts — Known data conflicts ($0.01)
- GET /api/datasets/public-data-candidates/export — Bulk data export ($0.01)

### Dataset: SPDX License List — OSS License Comparison Database (spdx-license-list)

Complete SPDX license list (727 licenses) with OSI approval status, FSF libre status, deprecation flags, and reference URLs. Useful for AI answering questions about license selection, compatibility, and open source compliance. Includes permissive, copyleft, and proprietary licenses.
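
The endpoints for this dataset follow the same `/api/datasets/{dataset-id}/...` layout as every other dataset in this document. A minimal Python sketch of building those URLs is shown here. The helper names `dataset_url` and `item_url` are hypothetical, and so are the search term `MIT` and the item id `MIT`: the document does not state what item ids look like, so check the `items` listing before relying on any id format.

```python
from urllib.parse import quote, urlencode

BASE_URL = "https://micro-data-api-factory.kasanegi123.workers.dev"


def dataset_url(dataset_id, path="items", **params):
    """Build a URL under /api/datasets/{dataset_id}/ with optional query parameters."""
    url = f"{BASE_URL}/api/datasets/{quote(dataset_id, safe='')}/{path}"
    # Drop unset parameters so the query string only carries what was asked for.
    query = {key: value for key, value in params.items() if value is not None}
    return f"{url}?{urlencode(query)}" if query else url


def item_url(dataset_id, item_id, sub=None):
    """Build an item detail URL; `sub` may be 'sources' or 'conflicts' (premium endpoints)."""
    path = f"items/{quote(str(item_id), safe='')}"
    if sub is not None:
        path = f"{path}/{sub}"
    return dataset_url(dataset_id, path)


# Illustrative values only: "MIT" as a search term and as an item id is an assumption.
print(dataset_url("spdx-license-list", "stats"))
print(dataset_url("spdx-license-list", q="MIT"))
print(item_url("spdx-license-list", "MIT", "sources"))
```

The same helpers should work for any dataset id listed in this document, since only the `{dataset-id}` segment changes between datasets.
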

**Free endpoints:**
- GET /api/datasets/spdx-license-list/stats — Database statistics
- GET /api/datasets/spdx-license-list/ranking — Items ranked by confidence
- GET /api/datasets/spdx-license-list/items — List/search items (supports ?q= and ?category=)
- GET /api/datasets/spdx-license-list/items/{id} — Item detail

**Premium endpoints (x402 payment for AI crawlers):**
- GET /api/datasets/spdx-license-list/items/{id}/sources — Source citations ($0.01)
- GET /api/datasets/spdx-license-list/items/{id}/conflicts — Known data conflicts ($0.01)
- GET /api/datasets/spdx-license-list/export — Bulk data export ($0.01)

### Dataset: x402 / Pay Per Crawl Ecosystem Database (x402-ecosystem-db)

Directory of projects, services, and tools in the x402 payment protocol and Cloudflare Pay Per Crawl ecosystem. Tracks who builds what, which networks are supported, and the current state of AI crawler micropayment infrastructure.

**Free endpoints:**
- GET /api/datasets/x402-ecosystem-db/stats — Database statistics
- GET /api/datasets/x402-ecosystem-db/ranking — Items ranked by confidence
- GET /api/datasets/x402-ecosystem-db/items — List/search items (supports ?q= and ?category=)
- GET /api/datasets/x402-ecosystem-db/items/{id} — Item detail

**Premium endpoints (x402 payment for AI crawlers):**
- GET /api/datasets/x402-ecosystem-db/items/{id}/sources — Source citations ($0.01)
- GET /api/datasets/x402-ecosystem-db/items/{id}/conflicts — Known data conflicts ($0.01)
- GET /api/datasets/x402-ecosystem-db/export — Bulk data export ($0.01)

## Weather API (D041)

Single-call weather data for AI agents. Backed by Open-Meteo (CC BY 4.0).

- GET /weather/current?city={name} — current temp, humidity, wind, precipitation, condition ($0.001)
- GET /weather/current?lat={lat}&lon={lon} — same, by coordinates
- GET /weather/forecast?city={name}&days={1-7} — daily forecast ($0.001)
- GET /weather/forecast?lat={lat}&lon={lon}&days={1-7}

Pass a city name, get structured JSON. No API keys, no geocoding.
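
The four weather URL shapes above can be generated from one small helper. This is a sketch under stated assumptions: the function name `weather_url` is hypothetical, the city takes precedence when both a city and coordinates are passed, and `days` is always required for forecasts because the document does not say what the server does when it is omitted.

```python
from urllib.parse import urlencode

BASE_URL = "https://micro-data-api-factory.kasanegi123.workers.dev"


def weather_url(kind, city=None, lat=None, lon=None, days=None):
    """Build a /weather/current or /weather/forecast URL from a city or coordinates."""
    if kind not in ("current", "forecast"):
        raise ValueError("kind must be 'current' or 'forecast'")
    if city is not None:
        # Assumption of this sketch: a city name wins over coordinates.
        params = {"city": city}
    elif lat is not None and lon is not None:
        params = {"lat": lat, "lon": lon}
    else:
        raise ValueError("pass either city or both lat and lon")
    if kind == "forecast":
        # The document gives days as {1-7}; this sketch rejects anything else.
        if days is None or not 1 <= days <= 7:
            raise ValueError("days must be between 1 and 7")
        params["days"] = days
    return f"{BASE_URL}/weather/{kind}?{urlencode(params)}"


print(weather_url("current", city="Tokyo"))
print(weather_url("forecast", lat=35.68, lon=139.69, days=3))
```

Validating `days` on the client side avoids spending a paid request on a call the documented range already rules out.
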

Attribution: Weather data by Open-Meteo.com (CC BY 4.0).

## Global Endpoints

- GET /api/datasets — List all available datasets
- GET /api/public/crawl-summary — AI crawler transparency dashboard (public, no auth)
- GET /llms.txt — Concise version of this file
- GET /llms-full.txt — Extended documentation (this file)
- GET /openapi.json — OpenAPI 3.1.0 specification
- GET /.well-known/x402 — x402 payment configuration
- GET /robots.txt — Crawler directives
- GET /sitemap.xml — Sitemap

## x402 Payment

For AI crawlers, premium endpoints return HTTP 402 with payment instructions.

- Protocol: x402 (https://x402.org)
- Network: Base mainnet (Ethereum L2)
- Currency: USDC
- Facilitator: Coinbase CDP

## Content Negotiation

All HTML pages support Accept: text/markdown for AI-friendly markdown responses.

## Rate Limits

No rate limits. Please be reasonable.
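
The 402 behavior and the `Accept: text/markdown` negotiation described above can be handled in one small client helper. The following is a sketch using only the Python standard library. The function names `build_request` and `fetch` are hypothetical, the format of the 402 response body is not specified in this document, and the sketch only hands that body back to the caller; it does not perform an x402 payment.

```python
import urllib.error
import urllib.request

BASE_URL = "https://micro-data-api-factory.kasanegi123.workers.dev"


def build_request(path, markdown=False):
    """Create a GET request; optionally ask for markdown via the Accept header."""
    headers = {"Accept": "text/markdown"} if markdown else {}
    return urllib.request.Request(BASE_URL + path, headers=headers, method="GET")


def fetch(path, markdown=False, opener=urllib.request.urlopen):
    """Return (status, body). A 402 is returned to the caller instead of raised."""
    try:
        with opener(build_request(path, markdown)) as response:
            return response.status, response.read().decode("utf-8")
    except urllib.error.HTTPError as err:
        if err.code == 402:
            # Premium endpoint: the body is expected to carry payment instructions,
            # whose exact format this document does not describe.
            return 402, err.read().decode("utf-8")
        # Any other HTTP error propagates unchanged.
        raise
```

A caller might use `status, body = fetch("/api/datasets")` for a free endpoint, and treat a returned status of 402 as the signal to consult `/.well-known/x402` before retrying. The `opener` parameter exists so the helper can be exercised without network access.
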