Disable strictness for export of llama #168

rsuderman · 2024-09-05T18:53:08Z

Strict export validates correctness, but doing so loads the tensors into memory. Disabling strictness speeds up export.

stellaraccident

I feel like there's an issue to file here on the PyTorch side. It isn't clear to me that the overhead ballooning is intended on their side, and if you can describe the issue/have a simplified reproducer, it might help them.

Disable strictness for export of llama

9213468

Strict export validates correctness, but doing so loads the tensors into memory. Disabling strictness speeds up loading.

rsuderman force-pushed the enable_strictness branch from 6baad65 to 9213468 Compare September 5, 2024 18:53

stellaraccident approved these changes Sep 5, 2024

rsuderman added 2 commits September 5, 2024 13:19

Merge branch 'main' into enable_strictness

f026a65

Merge branch 'main' into enable_strictness

4ef7b35

rsuderman merged commit a038133 into nod-ai:main Sep 6, 2024
7 checks passed

rsuderman deleted the enable_strictness branch September 6, 2024 04:06
