Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add CP support to Neva in NeMo2 (#11850)
* api updates and fixes Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fix arg Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * update seq packing in mock ds Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * save Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * update preprocess_data Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * update seq packing Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix sp Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * save Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fix seq packing Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * add truncation and padding Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Fix issues Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * change LLaVATemplateConfig variables to class variables * change to use field with default attributes * Apply isort and black reformatting Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Initial support for CP * Add seq packing option in energon Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix energon conversation Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * add energon option in neva training script Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: parthmannan <parthmannan@users.noreply.github.com> * Improvements * add ci test for packed seq Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix for PP+CP * Max seq len fix * fix mock dataset seq packing Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix mock dataset seq packing Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix lint and update seq pack func Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fix energon module Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix comments Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * address lightning issues Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Update sequence_packing.py Signed-off-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com> * update energon requirements Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix for energon update Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fix for test Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * revert overlap config change Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> --------- Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> Signed-off-by: parthmannan <parthmannan@users.noreply.github.com> Signed-off-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> Co-authored-by: ykarnati <ykarnati@nvidia.com> Co-authored-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> Co-authored-by: Parth Mannan <pmannan@nvidia.com> Co-authored-by: parthmannan <parthmannan@users.noreply.github.com> Co-authored-by: Parth Mannan <parth.mannan95@gmail.com>
- Loading branch information