Suggestions
Show the actual number of samples run, including as a percentage
Not all eval runs cover the full dataset, usually for reasons of cost and time. The number and percentage of samples actually run should be surfaced in the cards, even when all samples were run. Alternatively, a policy should be stated somewhere that only runs with 100% of samples will be shown or accepted.
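
To make the request concrete, here is a rough sketch of how a card label could be built from the two numbers. The function name `format_sample_coverage` and its parameters are hypothetical, made up for illustration; this is not how the cards are currently generated.

```python
def format_sample_coverage(samples_run: int, dataset_size: int) -> str:
    """Build a label showing samples run as a count and a percentage.

    Hypothetical helper: both names and the label layout are assumptions.
    """
    if dataset_size <= 0:
        raise ValueError("dataset_size must be positive")
    if not 0 <= samples_run <= dataset_size:
        raise ValueError("samples_run must be between 0 and dataset_size")
    # Percentage of the full dataset that was actually evaluated.
    pct = 100.0 * samples_run / dataset_size
    return f"{samples_run:,} / {dataset_size:,} samples ({pct:.1f}%)"


# Partial run and full run both get an explicit label.
print(format_sample_coverage(250, 1000))
print(format_sample_coverage(1000, 1000))
```

Showing the label even for full runs means a reader never has to guess whether a missing number implies 100%.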