[readme] update flash attention installation instruction on win platform (#6788)

* Update README_zh.md

* Update README.md

Former-commit-id: e48d1327fb39cc95f8fbfc746494f67a79471893
This commit is contained in:
neavo
2025-02-01 12:43:29 +08:00
committed by GitHub
parent 5bb447b118
commit 34746d6151
2 changed files with 2 additions and 2 deletions

View File

@@ -441,7 +441,7 @@ pip install https://github.com/jllllll/bitsandbytes-windows-webui/releases/downl
#### Install Flash Attention-2
To enable FlashAttention-2 on the Windows platform, you need to install the precompiled `flash-attn` library, which supports CUDA 12.1 to 12.2. Please download the corresponding version from [flash-attention](https://github.com/bdashore3/flash-attention/releases) based on your requirements.
To enable FlashAttention-2 on the Windows platform, please use the script from [flash-attention-windows-wheel](https://huggingface.co/lldacing/flash-attention-windows-wheel) to compile and install it by yourself.
</details>