Submission Portal

Submit your predictions.

Evaluate your model using the official evaluation harness to generate predictions, then upload them here. Predictions are scored against held-out ground truth, and results will be emailed to you.

Limits: 2 submissions per day per email  ·  30 lifetime per email

01Identity

The name that will appear on the leaderboard.

Your company, university, or team name.

Link to your model (Hugging Face, arXiv, or project page). Required for leaderboard linking, optional otherwise.

Results and updates will be sent here.

02Submission type

What is the most restrictive access level required to reproduce this submission?

03Model configuration

Provide details for the primary or most representative model in your submission. If multiple models are used equally, you may write multiple and describe them in the pipeline description in Section 02.

The main or representative model in your system.

Write undisclosed if not public.

Numerical precision used during inference.

04Inference setup

Hardware used for the main reasoning component. For API-based systems, write API.

Did you use the official VANTAGE-Bench evaluation harness to generate these predictions?

Non-default inference settings. Leave blank if using harness defaults.

05Pillars submitted

At least one required. Submissions must cover all tasks within a pillar — partial pillar submissions are not accepted. Overall score is computed only over submitted pillars.

06Predictions file

Upload a .tar.gz archive containing one .jsonl file per task. Maximum size: 500MB.

📦
Click to upload or drag and drop
.tar.gz · max 500 MB
07Acknowledgements

Please confirm all statements:

Questions? Email vantage.bench.competition@gmail.com