Monitoring, Understanding, and Mapping Large Language Models

A scientific initiative to observe the landscape of modern AI models

Layer 1 Benchmark: Leaderboard

Latest Insights

Layer 1 Benchmark

Explore what LLMs know about observational distributions across different tasks and domains.

Methodology | Results

Qualitative Model Performance

Compare models' answers with the real world in an interactive panel.

Compare LLM Answers vs. Real World

Papers & Reports

Access the latest papers and reports from the observatory.
