Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
pass batch_dim_idx to deepspeed sequence parallel distributed attention for supporting batch size larger than 1 (#433)

Co-authored-by: Jinghan Yao <yjhmitweb@ascend-rw02.ten.osc.edu>