Datasets

Public JSON artifacts and release metadata

Versioned public JSON under /api/public/v1, built from the same promoted release dataset as the visible pages.

  • Public university records: 811
  • Source-backed claims: 4391
  • Official source attributions: 2780
  • Public JSON schema version: v1
  • Release download artifacts: 6
  • Analysis profiles: 811

What can be reused

Tracker metadata, record URLs, review states, citation fields, and public JSON artifacts are reusable under the tracker metadata license.

What remains external

Official source documents, page text, PDFs, and university policy language retain their original rights and terms.

How agents retrieve data

Start with the API index, resolve entities with search, fetch the canonical university JSON, then cite claim evidence and source URLs.
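The four retrieval steps above can be sketched as follows. This is a minimal illustration, not the tracker's documented client: the host, the `index.json` file name, the `search_url`, `results`, and `json_url` keys, the `q` query parameter, and the claim and evidence field names are all assumptions made for the example. Only the /api/public/v1 prefix comes from this page.

```python
import json
from typing import Callable
from urllib.parse import quote
from urllib.request import urlopen

# Placeholder host. Only the /api/public/v1 prefix comes from this page.
BASE = "https://tracker.example/api/public/v1"


def fetch_json(url: str) -> dict:
    """Download one read-only JSON artifact."""
    with urlopen(url, timeout=30) as resp:
        return json.load(resp)


def retrieve_citations(name: str, fetch: Callable[[str], dict] = fetch_json) -> list[dict]:
    """Index -> search -> canonical university JSON -> citable evidence."""
    # 1. Start with the API index (hypothetical file name and key).
    index = fetch(f"{BASE}/index.json")
    # 2. Resolve the entity with search (hypothetical query parameter).
    hits = fetch(f"{index['search_url']}?q={quote(name)}")["results"]
    if not hits:
        return []
    # 3. Fetch the canonical university JSON for the best match.
    record = fetch(hits[0]["json_url"])
    # 4. Keep each claim together with its evidence and source URL.
    citations = []
    for claim in record.get("claims", []):
        for evidence in claim.get("evidence", []):
            citations.append({
                "university": record.get("name"),
                "claim_text": claim.get("claim_text"),
                "review_state": claim.get("review_state"),
                "source_url": evidence.get("source_url"),
                "snippet": evidence.get("snippet"),
            })
    return citations
```

Passing `fetch` as a parameter keeps the flow testable offline and lets an agent swap in its own HTTP client, caching, or rate limiting.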

Versioned public JSON

Live read-only artifacts grouped by use:

  • Core records
  • Search and analysis
  • Reports and embeds
  • Review and integrations

Dataset release downloads

Bulk files with row counts, sizes, and checksums.

Release: public-release-20260614-001; Period: 2026-06; Published: Jun 15, 2026
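The published sizes and checksums make it possible to verify a bulk file after download. The sketch below assumes the checksum is a hex-encoded SHA-256 digest and the size is given in bytes; this page does not state the algorithm or units, so confirm both against the release metadata. The function name and parameters are illustrative.

```python
import hashlib
from pathlib import Path


def verify_artifact(path: str, expected_size: int, expected_sha256: str) -> bool:
    """Return True when a downloaded file matches its published size and digest.

    Assumes a hex SHA-256 checksum and a size in bytes (not confirmed by this page).
    """
    file_path = Path(path)
    # Cheap check first: a size mismatch means the download is incomplete or wrong.
    if file_path.stat().st_size != expected_size:
        return False
    digest = hashlib.sha256()
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so large bulk files never sit fully in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256.lower()
```

A row-count check would follow the same pattern once the file format (for example CSV or JSON Lines) is known.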

Ranking and index boundaries

Discovery inputs, not policy conclusions.

  • QS 2026 remains the main source for batching crawls that expand coverage.
  • THE 2026, ARWU 2025, U.S. News 2025-2026, and CWTS Leiden 2025 are supported as ranking, index, and filter sources.
  • CWTS Leiden 2025 is a derived metric order, not an overall global university rank.
  • Different ranking years are not presented as one unified 2026 ranking.

GitHub trust assets

Repository-level trust assets.

  • README.md: Project positioning, public data surfaces, local development, and validation commands.
  • DATA_DICTIONARY.md: Field-level explanation for public JSON, claims, evidence, sources, changes, and multilingual display rules.
  • CITATION.cff: Machine-readable citation metadata for GitHub and research workflows.
  • CONTRIBUTING.md: Contribution rules for source URLs, staged OpenClaw artifacts, review boundaries, and pull requests.

Current release manifest

This release promotes the latest validated staged university AI-policy runs into the public dataset and keeps maintenance-only source-health output out of canonical publication.

Release: public-release-20260614-001; Published: Jun 15, 2026; Promoted runs: 797

The manifest controls which reviewed staged artifact directories are promoted into public pages and public JSON.

  • Only directories listed here are promoted into public pages and /api/public/v1 JSON.
  • This release promotes 21 additional validated staged university runs beyond the previous public release.
  • uapt-qs200-stage2-batch1-20260517 remains excluded because it is a source-health maintenance run; dataset release validation forbids publishing maintenance-only runs as public claims.
  • New crawler output should remain in staging until reviewed, repaired if needed, validated, and then added to this manifest.
  • Raw source documents are not published as tracker metadata; official source materials retain their original rights.
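The promotion rules above amount to an allowlist with one hard exclusion. A minimal sketch, assuming the manifest and the staging area can both be read as lists of run directory names; the function name and the example run names other than uapt-qs200-stage2-batch1-20260517 are hypothetical, and the real validation may check more than this.

```python
def select_promoted(staged: list[str], manifest: list[str], maintenance_only: list[str]) -> list[str]:
    """Return the staged runs that the manifest promotes, in staging order.

    Raises ValueError if the manifest lists a maintenance-only run, mirroring
    the rule that such runs must never be published as public claims.
    """
    listed = set(manifest)
    forbidden = sorted(listed & set(maintenance_only))
    if forbidden:
        raise ValueError(f"maintenance-only runs listed in manifest: {forbidden}")
    # Anything staged but not listed stays in staging.
    return [run for run in staged if run in listed]


# Hypothetical run names, except the excluded maintenance run named on this page.
staged_runs = ["run-a", "run-b", "uapt-qs200-stage2-batch1-20260517"]
promoted = select_promoted(
    staged_runs,
    manifest=["run-a"],
    maintenance_only=["uapt-qs200-stage2-batch1-20260517"],
)
print(promoted)
```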

Dataset concepts

What the v1 records expose today

Universities

Available now

Canonical university records are available as visible pages and per-university public JSON records.

Claims

Available inside university JSON

Claims include claim text, claim type, confidence, review state, dates, and evidence arrays.
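A consumer might model a claim along these lines. The attribute and key names below are assumptions chosen to match the description above, not the published schema; DATA_DICTIONARY.md is the authority for actual field names and types.

```python
from dataclasses import dataclass, field


@dataclass
class Evidence:
    source_url: str
    snippet: str = ""


@dataclass
class Claim:
    claim_text: str
    claim_type: str
    confidence: object  # type not stated on this page (could be a label or a number)
    review_state: str
    dates: dict = field(default_factory=dict)
    evidence: list = field(default_factory=list)


def parse_claim(raw: dict) -> Claim:
    """Build a Claim from one claim object, using hypothetical key names."""
    return Claim(
        claim_text=raw["claim_text"],
        claim_type=raw["claim_type"],
        confidence=raw.get("confidence"),
        review_state=raw["review_state"],
        dates=dict(raw.get("dates", {})),
        evidence=[
            Evidence(item["source_url"], item.get("snippet", ""))
            for item in raw.get("evidence", [])
        ],
    )
```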

Sources

Available inside public records

Official sources appear as source attributions and evidence source URLs with rights caveats.

Snapshots

Hash metadata available now

Public records expose source snapshot hashes. Raw HTML, PDFs, and screenshots are not published as tracker metadata.

Recent changes

Available now

The recent changes JSON feed summarizes checked and changed university records with review states.

Analysis profiles

Available now

Deterministic policy analysis profiles derive dimensions and coverage scores from existing public claim/evidence records.

Coverage dashboards

Available now

QS coverage, source-health, and review-queue metadata expose collection status and crawler/review work without publishing staging claims.

Entity resolution and search

Available now

Canonical entity aliases and safe search indexes improve recall without creating policy facts or exposing unpublished artifacts.

License, rights, and citation

CC-BY-4.0 tracker metadata

Tracker metadata

Tracker metadata, including normalized entities, claim records, review states, and public JSON fields, is intended for CC-BY-4.0 reuse with attribution.

Official source rights

Tracker metadata is open licensed. Official source documents, page text, PDFs, and other source materials retain their original rights and terms.

Citation expectations

Cite the canonical page and public JSON together. For claim-level reuse, retain source URL, source language, snapshot hash, review state, confidence, and the original evidence snippet.
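One way to enforce the claim-level expectation is to check that a citation record still carries all six items before it is reused. The key names here are illustrative stand-ins for the fields listed above, not the tracker's schema.

```python
# Hypothetical key names for the six items a claim-level citation should retain.
REQUIRED_CITATION_FIELDS = (
    "source_url",
    "source_language",
    "snapshot_hash",
    "review_state",
    "confidence",
    "evidence_snippet",
)


def missing_citation_fields(citation: dict) -> list[str]:
    """List required fields that are absent, None, or empty strings."""
    return [
        key
        for key in REQUIRED_CITATION_FIELDS
        if citation.get(key) is None or citation.get(key) == ""
    ]
```

An empty list means the record is complete; anything else names what was dropped along the way.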

This tracker is not legal advice, not academic integrity advice, and not an official university statement unless a linked source is the university's own official page.

Citation rules are documented at /citation. Recent data freshness is visible at /changes.