Refer to the STaRK paper for details on metrics, tasks and models.
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
AMAZON MAG PRIME
AMAZON MAG PRIME
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
AMAZON MAG PRIME
AMAZON MAG PRIME
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
1
⋮
2
⋮
3
⋮
Submit Your Results
Submit your results to be included in the leaderboard. Please ensure your submission meets all requirements.
For questions, contact stark-qa@cs.stanford.edu. Detailed instructions can be referred at submission instructions.
Dataset*
Split*
Model Type*
Select the appropriate category for your model
Drop File Here - or - Click to Upload
Submit
Recent Submissions and Updates
2025-02-12 19:49:39 ✅ Status update: Paprv1 has been approved
2025-02-05 06:50:41 📥 New submission: Paprv1 on test-0.1/mag
2024-11-21 02:00:17 📥 New submission: abc on human_generated_eval/mag
2024-11-21 01:16:54 📥 New submission: debug_test on human_generated_eval/mag
2024-11-20 17:09:52 ❌ Status update: debug_test has been rejected
2024-11-20 17:09:16 ❌ Status update: abc has been rejected