Semi-structured Retrieval Benchmark (STaRK) Leaderboard

Refer to the STaRK paper for details on metrics, tasks and models.

Model types

Submit Your Results

Submit your results to be included in the leaderboard. Please ensure your submission meets all requirements. For questions, contact stark-qa@cs.stanford.edu. Detailed instructions can be referred at submission instructions.

Dataset*
Split*
Model Type*

Select the appropriate category for your model


Recent Submissions and Updates

2025-02-12 19:49:39
✅ Status update: Paprv1 has been approved

2025-02-05 06:50:41
📥 New submission: Paprv1 on test-0.1/mag

2024-11-21 02:00:17
📥 New submission: abc on human_generated_eval/mag

2024-11-21 01:16:54
📥 New submission: debug_test on human_generated_eval/mag

2024-11-20 17:09:52
❌ Status update: debug_test has been rejected

2024-11-20 17:09:16
❌ Status update: abc has been rejected