Skip to content

Actions: stanford-crfm/helm

Test

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,978 workflow runs
2,978 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update unitxt tables benchmark run entries (#3290)
Test #7913: Commit 5b456b3 pushed by yifanmai
January 23, 2025 05:01 10m 4s main
January 23, 2025 05:01 10m 4s
Fix incorrect handling of labels in ClassificationMetric (#3289)
Test #7910: Commit 2ef0d9b pushed by yifanmai
January 23, 2025 04:37 10m 44s main
January 23, 2025 04:37 10m 44s
Fix metrics in run specs and schema for IBM enterprise benchmark (#3288)
Test #7909: Commit 43685b8 pushed by yifanmai
January 23, 2025 04:37 9m 45s main
January 23, 2025 04:37 9m 45s
Refactor FLEURS audio scenario to ASR task (#3287)
Test #7908: Commit 1a79234 pushed by teetone
January 23, 2025 01:09 10m 16s main
January 23, 2025 01:09 10m 16s
Include multiple annotators for WildBench (#3283)
Test #7907: Commit 80432dc pushed by yifanmai
January 23, 2025 00:18 10m 5s main
January 23, 2025 00:18 10m 5s
Improve classification metrics (#3285)
Test #7904: Commit 2c14291 pushed by yifanmai
January 22, 2025 23:39 10m 14s main
January 22, 2025 23:39 10m 14s
Add Legal Opinion Sentiment Classification scenario
Test #7903: Pull request #3286 synchronize by yifanmai
January 22, 2025 23:35 10m 4s yifanmai/legal-opinion
January 22, 2025 23:35 10m 4s
Refactor FLEURS audio scenario to ASR task
Test #7902: Pull request #3287 opened by ImKeTT
January 22, 2025 23:34 10m 1s ImKeTT:asr_fix
January 22, 2025 23:34 10m 1s
Add Legal Opinion Sentiment Classification scenario
Test #7901: Pull request #3286 synchronize by yifanmai
January 22, 2025 23:30 10m 1s yifanmai/legal-opinion
January 22, 2025 23:30 10m 1s
Add Legal Opinion Sentiment Classification scenario
Test #7900: Pull request #3286 opened by yifanmai
January 22, 2025 23:29 10m 9s yifanmai/legal-opinion
January 22, 2025 23:29 10m 9s
Improve classification metrics
Test #7899: Pull request #3285 synchronize by yifanmai
January 22, 2025 23:06 9m 46s yifanmai/fix-classification-metrics
January 22, 2025 23:06 9m 46s
Include multiple annotators for WildBench
Test #7894: Pull request #3283 opened by liamjxu
January 22, 2025 06:39 10m 18s jialiang/multiple_annotator
January 22, 2025 06:39 10m 18s
refactor fleurs asrscenario
Test #7893: Pull request #3281 opened by ImKeTT
January 22, 2025 01:25 9m 48s ImKeTT:asr_haoqin
January 22, 2025 01:25 9m 48s
Switch table_benchmark wikitq to use 1 shot instead of 5
Test #7892: Pull request #3280 opened by yifanmai
January 21, 2025 23:09 10m 32s yifanmai/unitxt-1-shot
January 21, 2025 23:09 10m 32s
Add Llama 3.1 Instruct on Vertex AI (#3278)
Test #7891: Commit c617a57 pushed by yifanmai
January 20, 2025 02:30 10m 18s main
January 20, 2025 02:30 10m 18s
Add run entries for HELM Tables with only the base variants (#3279)
Test #7890: Commit ef82a87 pushed by yifanmai
January 17, 2025 06:06 10m 48s main
January 17, 2025 06:06 10m 48s