Datasets

Public JSON files and release metadata

Versioned public JSON under /api/public/v1, built from the same published dataset as the visible pages.

788

public university records

4180

source-backed claims

2619

official sources

v1

public JSON schema version

6

release download artifacts

788

analysis profiles

What is reusable

Tracker metadata, record URLs, review states, citation fields, and public JSON artifacts are reusable under the tracker metadata license.

What remains external

Official source documents, page text, PDFs, and university policy language retain their original rights and terms.

How agents retrieve data

Start with the API index, resolve entities with search, fetch the canonical university JSON, then cite claim evidence and source URLs.
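The retrieval order above can be sketched roughly as follows. Only the /api/public/v1 prefix comes from this page. The index file name, the `search` and `json_url` keys, the `q` query parameter, and the claim field names (`text`, `review_state`, `evidence`, `source_url`) are hypothetical placeholders, so check the API index and DATA_DICTIONARY.md for the real ones. The `fetch_json` argument is an assumed helper that maps a path to parsed JSON.

```python
from typing import Any, Callable
from urllib.parse import quote

API_ROOT = "/api/public/v1"  # the only path taken from this page

# Any callable that maps a path to parsed JSON (assumed helper).
FetchJson = Callable[[str], Any]


def collect_citations(fetch_json: FetchJson, name: str) -> list[dict[str, Any]]:
    """Resolve a university by name and return its claims with source URLs."""
    # Step 1: start with the API index (hypothetical file name and keys).
    index = fetch_json(f"{API_ROOT}/index.json")

    # Step 2: resolve the entity through the search endpoint the index lists.
    hits = fetch_json(f"{index['search']}?q={quote(name)}")["results"]
    if not hits:
        return []  # nothing resolved, so there is nothing to cite

    # Step 3: fetch the canonical university JSON for the best match.
    record = fetch_json(hits[0]["json_url"])

    # Step 4: keep each claim together with the source URLs of its evidence.
    return [
        {
            "claim": claim["text"],
            "review_state": claim["review_state"],
            "source_urls": [ev["source_url"] for ev in claim["evidence"]],
        }
        for claim in record["claims"]
    ]
```

In practice `fetch_json` could wrap `urllib.request.urlopen` and `json.load` against the tracker's host. Passing it in as an argument keeps the sketch independent of any particular HTTP client.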

Versioned public JSON

Live read-only artifacts grouped by use.

Kernrecords

Zoeken en analyse

Rapporten en embeds

Review en integraties

Dataset release downloads

Bulk files with row counts, sizes, and checksums.
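A downloaded bulk file can be checked against its listed size and checksum before use. The sketch below assumes the published checksum is a hex-encoded SHA-256 digest, which this page does not state. The function name and parameters are illustrative only.

```python
import hashlib
from pathlib import Path


def verify_download(path: Path, expected_sha256: str, expected_size: int) -> bool:
    """Return True if the file matches the listed size and SHA-256 digest."""
    # Cheap check first: a size mismatch means the download is incomplete or wrong.
    if path.stat().st_size != expected_size:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large bulk files do not have to fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    # Compare case-insensitively, since listings may print hex in either case.
    return digest.hexdigest() == expected_sha256.lower()
```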

Release: public-release-20260526-003; Period: 2026-05; Published: May 26, 2026

Ranking and index boundaries

Discovery inputs, not policy conclusions.

  • QS 2026 currently remains the main crawl batching source for expanding coverage.
  • THE 2026, ARWU 2025, U.S. News 2025-2026, and CWTS Leiden 2025 are supported as ranking, index, and filter sources.
  • CWTS Leiden 2025 is a derived metric order, not an overall global university rank.
  • Different ranking years are not presented as one unified 2026 ranking.

GitHub trust assets

Repository-level trust assets.

  • README.md: Project positioning, public data surfaces, local development, and validation commands.
  • DATA_DICTIONARY.md: Field-level explanation for public JSON, claims, evidence, sources, changes, and multilingual display rules.
  • CITATION.cff: Machine-readable citation metadata for GitHub and research workflows.
  • CONTRIBUTING.md: Contribution rules for source URLs, staged OpenClaw artifacts, review boundaries, and pull requests.

Current release manifest

Promotes the latest validated staged university AI-policy runs into the public dataset, including the QS rows 776-785 batch, while keeping source-health maintenance-only output out of canonical publication.

Release: public-release-20260526-003; Published: May 26, 2026; Promoted runs: 774

The manifest controls which reviewed staged artifact directories are promoted into public pages and public JSON.

  • Only directories listed here are promoted into public pages and /api/public/v1 JSON.
  • This release promotes 10 additional validated staged university runs beyond public-release-20260526-002.
  • uapt-qs200-stage2-batch1-20260517 remains excluded because it is a source-health maintenance run; dataset release validation forbids publishing maintenance-only runs as public claims.
  • The previous public-release-20260526-002 promotion added 40 university runs plus one claim-evidence delta bundle for already-published universities.
  • New crawler output should remain in staging until reviewed, repaired if needed, validated, and then added to this manifest.
  • Raw source documents are not published as tracker metadata; official source materials retain their original rights.
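The promotion rules in this list can be illustrated with a small filter. This is a sketch under assumptions, not the tracker's actual release tooling: the `StagedRun` type, its fields, and the `promoted_runs` function are hypothetical, and the manifest is modeled simply as a set of directory names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StagedRun:
    directory: str          # staged artifact directory name
    validated: bool         # reviewed, repaired if needed, and validated
    maintenance_only: bool  # source-health maintenance run, never public


def promoted_runs(staged: list[StagedRun], manifest: set[str]) -> list[StagedRun]:
    """Return the staged runs that a release would publish."""
    selected = []
    for run in staged:
        if run.directory not in manifest:
            continue  # not listed in the manifest, so it stays in staging
        if run.maintenance_only:
            # Release validation forbids publishing maintenance-only runs.
            raise ValueError(f"{run.directory}: maintenance-only run listed in manifest")
        if not run.validated:
            raise ValueError(f"{run.directory}: listed in manifest but not validated")
        selected.append(run)
    return selected
```

Raising an error rather than silently skipping a bad manifest entry mirrors the idea that validation blocks the release. An excluded run such as uapt-qs200-stage2-batch1-20260517 simply never appears in the manifest set.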

Dataset concepts

What the v1 records expose today

Universities

Available now

Canonical university records are available as visible pages and per-university public JSON records.

Claims

Available inside university JSON

Claims include claim text, claim type, confidence, review state, dates, and evidence arrays.

Sources

Available inside public records

Official sources appear as source attributions and evidence source URLs with rights caveats.

Snapshots

Hash metadata available now

Public records expose source snapshot hashes. Raw HTML, PDFs, and screenshots are not published as tracker metadata.

Recent changes

Available now

The recent changes JSON feed summarizes checked and changed university records with review states.

Analysis profiles

Available now

Deterministic policy analysis profiles derive dimensions and coverage scores from existing public claim/evidence records.

Coverage dashboards

Available now

QS coverage, source-health, and review-queue metadata expose collection status and crawler/review work without publishing staging claims.

Entity resolution and search

Available now

Canonical entity aliases and safe search indexes improve recall without creating policy facts or exposing unpublished artifacts.

License, rights, and citation

CC-BY-4.0 tracker metadata

Tracker metadata

Tracker metadata, including normalized entities, claim records, review states, and public JSON fields, is intended for CC-BY-4.0 reuse with attribution.

Official source rights

Tracker metadata is openly licensed. Official source documents, page text, PDFs, and other source materials retain their original rights and terms.

Citation expectations

Cite the canonical page and public JSON together. For claim-level reuse, retain source URL, source language, snapshot hash, review state, confidence, and the original evidence snippet.
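For claim-level reuse, a citation record might be assembled along these lines. The snake_case key names are hypothetical stand-ins for the fields named above, so consult DATA_DICTIONARY.md for the actual field names. `build_claim_citation` is an illustrative helper, not part of the tracker.

```python
from typing import Any

# Fields the citation guidance asks reusers to retain (hypothetical key names).
REQUIRED_FIELDS = (
    "source_url",
    "source_language",
    "snapshot_hash",
    "review_state",
    "confidence",
    "evidence_snippet",
)


def build_claim_citation(page_url: str, json_url: str, claim: dict[str, Any]) -> dict[str, Any]:
    """Bundle the canonical page, the public JSON, and the retained claim fields."""
    missing = [field for field in REQUIRED_FIELDS if field not in claim]
    if missing:
        # Refuse to build a partial citation that drops provenance fields.
        raise KeyError(f"claim is missing required fields: {', '.join(missing)}")
    citation = {"canonical_page": page_url, "public_json": json_url}
    citation.update({field: claim[field] for field in REQUIRED_FIELDS})
    return citation
```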

This tracker is not legal advice, not academic integrity advice, and not an official university statement unless a linked source is the university's own official page.

Citation rules are documented at /citation. Recent data freshness is visible at /changes.