This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

[BesTLA] First-token inference optimization #979

Triggered via pull request May 31, 2024 05:27
Status Cancelled
Total duration 3m 42s
Artifacts

cpp-graph-test.yml

on: pull_request
Matrix: CPP-Graph-Workflow
Genreate-Report
0s

Annotations

4 errors
CPP-Graph-Workflow (llama3-8b)
Canceling since a higher priority waiting request for 'CPP Graph Test-271' exists
CPP-Graph-Workflow (llama3-8b)
The operation was canceled.
CPP-Graph-Workflow (gptj-6b)
Canceling since a higher priority waiting request for 'CPP Graph Test-271' exists
CPP-Graph-Workflow (gptj-6b)
The operation was canceled.