Add llama-cpp-python wheels with tensor cores support (#5003)

2023-12-19 17:30:53 -03:00
commit de138b8ba6
parent 0a299d5959
9 changed files with 69 additions and 21 deletions
--- a/README.md
+++ b/README.md
@@ -252,6 +252,7 @@ List of command-line flags

 | Flag        | Description |
 |-------------|-------------|
+| `--tensorcores`  | Use llama-cpp-python compiled with tensor cores support. This increases performance on RTX cards. NVIDIA only. |
 | `--n_ctx N_CTX` | Size of the prompt context. |
 | `--threads` | Number of threads to use. |
 | `--threads-batch THREADS_BATCH` | Number of threads to use for batches/prompt processing. |