Support LLaVA v1.5 7B (#4348)

2023-10-22 11:49:04 -04:00 · 2023-10-22 11:49:04 -04:00 · c544f5cc51
commit c544f5cc51
parent 05741821a5
3 changed files with 22 additions and 1 deletions
--- a/extensions/multimodal/README.md
+++ b/extensions/multimodal/README.md
@ -13,7 +13,10 @@ https://user-images.githubusercontent.com/3718215/233817203-69b57e77-0c55-4fd6-b
 To run this extension, download a LLM that supports multimodality, and then start server.py with the appropriate `--multimodal-pipeline` argument. Examples:

 ```
+# LLaVA 1.5 13B has the best performance
 python server.py --model liuhaotian_llava-v1.5-13b --multimodal-pipeline llava-v1.5-13b --load-in-4bit
+# LLaVA 1.5 7B is relatively weaker, but requires less memory
+python server.py --model liuhaotian_llava-v1.5-7b --multimodal-pipeline llava-v1.5-7b --load-in-4bit
 python server.py --model TheBloke_llava-v1.5-13B-GPTQ_gptq-4bit-32g-actorder_True --multimodal-pipeline llava-v1.5-13b --disable_exllama --loader autogptq
 python server.py --model wojtab_llava-7b-v0-4bit-128g --multimodal-pipeline llava-7b
 python server.py --model wojtab_llava-13b-v0-4bit-128g --multimodal-pipeline llava-13b