Test

Actions

Test

Actions

Loading...
Loading

test.yml

2,978 workflow runs

Add multiple annotators to Omni-MATH and rename shared modules Test #7914: Pull request #3291 opened by liamjxu

January 23, 2025 19:06

10m 7s jialiang/multiple_annotator

jialiang/multiple_annotator

January 23, 2025 19:06

10m 7s

Update unitxt tables benchmark run entries (#3290) Test #7913: Commit 5b456b3 pushed by yifanmai

January 23, 2025 05:01

10m 4s main

main

January 23, 2025 05:01

10m 4s

Update unitxt tables benchmark run entries Test #7912: Pull request #3290 synchronize by yifanmai

January 23, 2025 04:40

10m 1s yifanmai/tables_run_entries_groups

yifanmai/tables_run_entries_groups

January 23, 2025 04:40

10m 1s

Update unitxt tables benchmark run entries Test #7911: Pull request #3290 opened by yifanmai

January 23, 2025 04:40

9m 52s yifanmai/tables_run_entries_groups

yifanmai/tables_run_entries_groups

January 23, 2025 04:40

9m 52s

Fix incorrect handling of labels in ClassificationMetric (#3289) Test #7910: Commit 2ef0d9b pushed by yifanmai

January 23, 2025 04:37

10m 44s main

main

January 23, 2025 04:37

10m 44s

Fix metrics in run specs and schema for IBM enterprise benchmark (#3288) Test #7909: Commit 43685b8 pushed by yifanmai

January 23, 2025 04:37

9m 45s main

main

January 23, 2025 04:37

9m 45s

Refactor FLEURS audio scenario to ASR task (#3287) Test #7908: Commit 1a79234 pushed by teetone

January 23, 2025 01:09

10m 16s main

main

January 23, 2025 01:09

10m 16s

Include multiple annotators for WildBench (#3283) Test #7907: Commit 80432dc pushed by yifanmai

January 23, 2025 00:18

10m 5s main

main

January 23, 2025 00:18

10m 5s

Fix incorrect handling of labels in ClassificationMetric Test #7906: Pull request #3289 opened by yifanmai

January 23, 2025 00:13

10m 49s yifanmai/fix-classification-labels-bugs

yifanmai/fix-classification-labels-bugs

January 23, 2025 00:13

10m 49s

Fix metrics in run specs and schema for IBM enterprise benchmark Test #7905: Pull request #3288 opened by yifanmai

January 23, 2025 00:09

9m 42s yifanmai/fix-enterprise-metrics

yifanmai/fix-enterprise-metrics

January 23, 2025 00:09

9m 42s

Improve classification metrics (#3285) Test #7904: Commit 2c14291 pushed by yifanmai

January 22, 2025 23:39

10m 14s main

main

January 22, 2025 23:39

10m 14s

Add Legal Opinion Sentiment Classification scenario Test #7903: Pull request #3286 synchronize by yifanmai

January 22, 2025 23:35

10m 4s yifanmai/legal-opinion

yifanmai/legal-opinion

January 22, 2025 23:35

10m 4s

Refactor FLEURS audio scenario to ASR task Test #7902: Pull request #3287 opened by ImKeTT

January 22, 2025 23:34

10m 1s ImKeTT:asr_fix

ImKeTT:asr_fix

January 22, 2025 23:34

10m 1s

Add Legal Opinion Sentiment Classification scenario Test #7901: Pull request #3286 synchronize by yifanmai

January 22, 2025 23:30

10m 1s yifanmai/legal-opinion

yifanmai/legal-opinion

January 22, 2025 23:30

10m 1s

Add Legal Opinion Sentiment Classification scenario Test #7900: Pull request #3286 opened by yifanmai

January 22, 2025 23:29

10m 9s yifanmai/legal-opinion

yifanmai/legal-opinion

January 22, 2025 23:29

10m 9s

Improve classification metrics Test #7899: Pull request #3285 synchronize by yifanmai

January 22, 2025 23:06

9m 46s yifanmai/fix-classification-metrics

yifanmai/fix-classification-metrics

January 22, 2025 23:06

9m 46s

Improve classification metrics Test #7898: Pull request #3285 opened by yifanmai

January 22, 2025 22:48

9m 50s yifanmai/fix-classification-metrics

yifanmai/fix-classification-metrics

January 22, 2025 22:48

9m 50s

Adding IMDB_PTBR Scenario Test #7897: Pull request #3284 synchronize by thallysonjsa

January 22, 2025 19:45

10m 24s llm-pt-ibm:thallysonjsa/imdb_ptbr_scenario

llm-pt-ibm:thallysonjsa/imdb_ptbr_scenario

January 22, 2025 19:45

10m 24s

Adding IMDB_PTBR Scenario Test #7896: Pull request #3284 synchronize by thallysonjsa

January 22, 2025 18:08

9m 40s llm-pt-ibm:thallysonjsa/imdb_ptbr_scenario

llm-pt-ibm:thallysonjsa/imdb_ptbr_scenario

January 22, 2025 18:08

9m 40s

Adding IMDB_PTBR Scenario Test #7895: Pull request #3284 opened by thallysonjsa

January 22, 2025 17:49

3m 42s llm-pt-ibm:thallysonjsa/imdb_ptbr_scenario

llm-pt-ibm:thallysonjsa/imdb_ptbr_scenario

January 22, 2025 17:49

3m 42s

Include multiple annotators for WildBench Test #7894: Pull request #3283 opened by liamjxu

January 22, 2025 06:39

10m 18s jialiang/multiple_annotator

jialiang/multiple_annotator

January 22, 2025 06:39

10m 18s

refactor fleurs asrscenario Test #7893: Pull request #3281 opened by ImKeTT

January 22, 2025 01:25

9m 48s ImKeTT:asr_haoqin

ImKeTT:asr_haoqin

January 22, 2025 01:25

9m 48s

Switch table_benchmark wikitq to use 1 shot instead of 5 Test #7892: Pull request #3280 opened by yifanmai

January 21, 2025 23:09

10m 32s yifanmai/unitxt-1-shot

yifanmai/unitxt-1-shot

January 21, 2025 23:09

10m 32s

Add Llama 3.1 Instruct on Vertex AI (#3278) Test #7891: Commit c617a57 pushed by yifanmai

January 20, 2025 02:30

10m 18s main

main

January 20, 2025 02:30

10m 18s

Add run entries for HELM Tables with only the base variants (#3279) Test #7890: Commit ef82a87 pushed by yifanmai

January 17, 2025 06:06

10m 48s main

main

January 17, 2025 06:06

10m 48s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management

Test

Actions

Loading...
Loading

Create status badge

Filter by Event

Sorry, something went wrong.

Sorry, something went wrong.

No matching events.

Filter by Status

Sorry, something went wrong.

Sorry, something went wrong.

No matching statuses.

Filter by Branch

Sorry, something went wrong.

Sorry, something went wrong.

No matching branches.

Filter by Actor

Sorry, something went wrong.

Sorry, something went wrong.

No matching users.

Actions: stanford-crfm/helm

Actions

Test Test Actions Loading... Loading Sorry, something went wrong.

Test

Test

Actions

Loading...
Loading