
data_process: error when loading liuhaotian/llava-v1.6-34b with load_4bit #408

Closed
sunfan1997 opened this issue May 17, 2024 · 2 comments

@sunfan1997

I only have 16 GB of VRAM and want to process a batch of my own data. How can I load the liuhaotian/llava-v1.6-34b model quantized? I get the error: `ValueError: Calling cuda() is not supported for 4-bit or 8-bit quantized models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct dtype.` Is there any way to reduce VRAM usage, or what is the minimum VRAM needed to run the data process step?
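For reference, this `ValueError` is raised by `transformers`/`accelerate` whenever `.cuda()` or `.to(device)` is called on a model that was loaded with 4-bit or 8-bit quantization, because `device_map` has already placed the weights. Below is a minimal sketch of the usual 4-bit loading pattern with the Hugging Face `transformers` API; the model id comes from the issue, the quantization settings are illustrative assumptions, and the LLaVA checkpoint may additionally require the LLaVA repo's own model classes.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Illustrative 4-bit config; nf4 + fp16 compute are common choices,
# not something mandated by the data_process script.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

model = AutoModelForCausalLM.from_pretrained(
    "liuhaotian/llava-v1.6-34b",
    quantization_config=bnb_config,
    device_map="auto",  # accelerate places the layers on the available GPU(s)/CPU
)

# The model is already on the correct devices and dtype.
# Do NOT call model.cuda() or model.to("cuda") afterwards --
# that call is exactly what raises the ValueError quoted above.
```

If the data_process script instead loads the model through the LLaVA repo's `load_pretrained_model(..., load_4bit=True)` helper, the same rule applies: remove any subsequent `model.cuda()` / `model.to(device)` call and use the returned model as-is.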


This issue is stale because it has been open for 7 days with no activity.

@github-actions github-actions bot added the stale label May 25, 2024

github-actions bot commented Jun 1, 2024

This issue was closed because it has been inactive for 7 days since being marked as stale.

@github-actions github-actions bot closed this as not planned (stale) Jun 1, 2024