
Are embeddings not supported with the mistral-7b-instruct-v0.2 model? #414

Closed
norteo opened this issue May 12, 2024 · 4 comments

norteo commented May 12, 2024

I run llamafile with the Mistral model as follows:

./mistral-7b-instruct-v0.2.Q5_K_M.llamafile -ngl 9999 --port 8080 --host 0.0.0.0 --embedding --threads 16

I don't have a GPU.

If I run

curl http://localhost:8080/embedding \
        -H "Authorization: Bearer no-key" \
        -H "Content-Type: application/json" \
        -d '{ "content": "The food was delicious and the waiter..." }'

llamafile "crashes" with the message:

{"function":"launch_slot_with_data","level":"INFO","line":875,"msg":"slot is processing task","slot_id":0,"task_id":0,"tid":"9434528","timestamp":1715505285}
{"function":"update_slots","level":"INFO","line":1890,"msg":"kv cache rm [p0, end)","p0":0,"slot_id":0,"task_id":0,"tid":"9434528","timestamp":1715505285}
llama_get_embeddings_ith: invalid embeddings id 0, reason: batch.logits[0] != true
GGML_ASSERT: llama.cpp/llama.cpp:16631: false

mofosyne added the bug label May 21, 2024
lovenemesis commented

If you don't have a GPU, you probably don't need to pass -ngl 9999.
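
For context, -ngl sets how many model layers are offloaded to the GPU, so on a CPU-only machine it can simply be dropped. A CPU-only variant of the original invocation (a sketch, otherwise the same flags):

./mistral-7b-instruct-v0.2.Q5_K_M.llamafile --port 8080 --host 0.0.0.0 --embedding --threads 16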

k8si (Collaborator) commented May 28, 2024

This should only be an issue in older versions of llamafile. Which version of llamafile was this binary built with? To find out, you can run

./mistral-7b-instruct-v0.2.Q5_K_M.llamafile --version
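
A version-aware check can then gate on the first release this thread confirms working (a shell sketch: v0.8.5 is the version norteo reports working below, the true minimum fixed version isn't confirmed here, and sort -V needs GNU coreutils):

ver=$(./mistral-7b-instruct-v0.2.Q5_K_M.llamafile --version | grep -o 'v[0-9][0-9.]*' | tr -d v)
if [ "$(printf '%s\n' "$ver" 0.8.5 | sort -V | head -n1)" != "0.8.5" ]; then
    echo "llamafile $ver may be affected by this bug; consider updating"
fi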

norteo (Author) commented May 28, 2024

Thank you for the reply.

user@fe9e8ccdc306:~$ ./mistral-7b-instruct-v0.2.Q5_K_M.llamafile --version
llamafile v0.8.0

It seems I was not using the latest version.
I redownloaded the file, re-ran the curl command, and it now works fine.
The version I am using now is 0.8.5.
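
A quick way to confirm the endpoint now returns a vector (a sketch; it assumes the response is JSON with an "embedding" array, which is the usual llama.cpp server shape and not something confirmed in this thread):

curl -s http://localhost:8080/embedding \
        -H "Authorization: Bearer no-key" \
        -H "Content-Type: application/json" \
        -d '{ "content": "The food was delicious and the waiter..." }' \
    | python3 -c 'import json, sys; print(len(json.load(sys.stdin)["embedding"]), "dimensions")'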
I just downloaded it from https://huggingface.co/Mozilla/Mistral-7B-Instruct-v0.2-llamafile/resolve/main/mistral-7b-instruct-v0.2.Q5_K_M.llamafile?download=true
The Hugging Face repository does not seem to have the latest version, and there appears to be no way to tell which llamafile version you are downloading.
Maybe something should be done about that?
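
As a workaround until that is addressed, the embedded version can be verified right after downloading (a sketch using the URL above):

curl -L -o mistral-7b-instruct-v0.2.Q5_K_M.llamafile \
    'https://huggingface.co/Mozilla/Mistral-7B-Instruct-v0.2-llamafile/resolve/main/mistral-7b-instruct-v0.2.Q5_K_M.llamafile?download=true'
chmod +x mistral-7b-instruct-v0.2.Q5_K_M.llamafile
./mistral-7b-instruct-v0.2.Q5_K_M.llamafile --version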

k8si (Collaborator) commented May 28, 2024

> The Hugging Face repository does not seem to have the latest version, and there appears to be no way to tell which llamafile version you are downloading.
> Maybe something should be done about that?

Would you be willing to post this as a separate issue?

k8si closed this as completed May 28, 2024