add n_batch support for llama.cpp (#1115)

This commit is contained in:
eiery 2023-04-24 02:46:18 -04:00 committed by GitHub
parent 2f6e2ddeac
commit 78d1977ebf
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23
3 changed files with 4 additions and 1 deletions

@@ -220,6 +220,7 @@ Optionally, you can use the following command-line flags:
| Flag | Description |
|-------------|-------------|
| `--threads` | Number of threads to use in llama.cpp. |
| `--n_batch` | Processing batch size for llama.cpp. |
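
The new flag can be combined with the existing `--threads` flag when launching the web UI. A minimal sketch, assuming the project's usual `python server.py` entry point and illustrative values (both the entry point and the numbers are assumptions, not part of this diff):

```shell
# Hypothetical launch command: --threads and --n_batch are the llama.cpp
# flags from the table above; 8 threads and a batch size of 512 are
# illustrative values only.
python server.py --threads 8 --n_batch 512
```

A larger `--n_batch` lets llama.cpp evaluate more prompt tokens per batch, which typically speeds up prompt processing at the cost of more memory.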
#### GPTQ