Fix wrong token latency when batch size is greater than 1 #4708
Job | Run time |
---|---|
14m 36s | |
16m 5s | |
33m 14s | |
17m 19s | |
14m 19s | |
18m 26s | |
19m 16s | |
15m 1s | |
25m 18s | |
10m 18s | |
15m 51s | |
24m 30s | |
7m 42s | |
13m 22s | |
14m 57s | |
12m 13s | |
14m 8s | |
7m 56s | |
31m 22s | |
18m 30s | |
1s | |
5h 44m 24s |