Contribute to the UQ project (uq.stanford.edu) and become a co-author on the next version of the paper!
UQ is an effort to create a new evaluation paradigm for LLMs. Instead of crafting exam-style benchmark questions where we know the answers, we test LLMs on organic, unsolved problems via automated LLM-based validation & community verification. Our current paper is at https://arxiv.org/abs/2508.17580.
| Contribution | Point awarded | Description |
|---|---|---|
| Verify model answers in your expert domain | 1 for every accepted verification | Provide original, human-written verification to a model's answer. Post the verification on our website and contact a project lead. Many questions/answers are too challenging for our team to verify. We need your expertise! |
| Write reviews for proposed questions | 1 for every question | Review the question that passes LLM-based filtering to decide whether to include it into the new version of UQ-Dataset. |
| Help with development and maintenance | 1-10 | A great way to contribute to UQ is to help with engineering tasks, especially for UQ-Platform. Contact a project lead for detailed information. |
Ready to contribute? Get in touch with our project leads to get started.
Contact Us