Add HLE evaluation result

#11 opened by burtenshaw (HF Staff)

Evaluation Results

This PR adds structured evaluation results using the new .eval_results/ format.

What This Enables

  • Model Page: Results appear on the model page with benchmark links
  • Leaderboards: Scores are aggregated into benchmark dataset leaderboards
  • Verification: Support for cryptographic verification of evaluation runs

Model Evaluation Results

Format Details

Results are stored as YAML files in the .eval_results/ folder. See the Eval Results Documentation for the full specification.
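
For orientation, below is a minimal sketch of what one of these YAML files could look like. The file name and every field in it are assumptions made for illustration only, not the official schema; consult the Eval Results Documentation for the actual specification.

```yaml
# Illustrative sketch of a .eval_results/ entry; all field names are assumed,
# not taken from the official spec (see the Eval Results Documentation).
benchmark: hle                   # benchmark dataset identifier (assumed field)
metrics:
  - name: accuracy               # metric name (assumed)
    value: 0.0                   # placeholder; a real run records the measured score
source:
  generated_by: community-evals  # tooling noted in the PR footer
  verified: false                # whether a verification record is attached (assumed)
```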


Generated by community-evals

LG AI Research org

Hello, @burtenshaw. Sorry for the delay!
Looks great to me. Would it make sense to also add the other benchmark results, in addition to the one you suggested?

