We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DocOwl2 cannot be loaded without flash_attn because the implementation of the compressor mandatorily uses flash attention.
This issue causes DocOwl2 to not run on many non-NVIDIA devices. Please consider adding an option to disable or enable flash attention.
https://huggingface.co/mPLUG/DocOwl2/blob/205b9e18b0cb503c9ef0dde1e7b120e6925778d9/visual_compressor.py#L106
The text was updated successfully, but these errors were encountered:
No branches or pull requests
DocOwl2 cannot be loaded without flash_attn because the implementation of the compressor mandatorily uses flash attention.
This issue causes DocOwl2 to not run on many non-NVIDIA devices. Please consider adding an option to disable or enable flash attention.
https://huggingface.co/mPLUG/DocOwl2/blob/205b9e18b0cb503c9ef0dde1e7b120e6925778d9/visual_compressor.py#L106
The text was updated successfully, but these errors were encountered: