Sunday, March 09, 2025

Flash Attention


https://github.com/Dao-AILab/flash-attention 

https://github.com/ollama/ollama/blob/main/docs/faq.md#how-can-i-enable-flash-attention


OLLAMA_FLASH_ATTENTION=1

No comments:

Related Posts Plugin for WordPress, Blogger...