Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
attention attention-mechanism attention-model linear-attention linear-attention-model heinsen-attention
Updated Jun 6, 2024 - Python