Do you have a guide to convert this to GGUF/GGML format?

by qhkm - opened Oct 19, 2023

Oct 19, 2023

Hi there! This is super cool! I saw it on Twitter and was amazed by its performance compared to sentence embeddings. I'd love to use this with llama.cpp, which would mean converting it to GGUF. Do you have any idea how to do that? Thanks!

andersonbcdefg

Taylor org Oct 27, 2023

I would not recommend using this with llama.cpp. It's a BERT model, so I looked into bert.cpp, but I don't really see the benefits of that over ONNX. I provided ONNX checkpoints, so you should just use those. Most of the benefits of llama.cpp apply to text generation rather than to embeddings.
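
For anyone following the ONNX suggestion, here is a minimal sketch of what that could look like in Python with onnxruntime. It is not taken from the model card. The model path, the assumption that the first output holds per-token hidden states of shape (batch, seq_len, hidden), and the mean-pooling plus L2-normalization step are all assumptions typical of BERT-style embedding models, so check them against the model's own documentation. Tokenization is left out: `input_ids` and `attention_mask` are expected to come from the model's tokenizer.

```python
import math


def mean_pool(token_embeddings, attention_mask):
    """Average token vectors, skipping padding positions (mask == 0)."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vec, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, v in enumerate(vec):
                total[i] += v
    return [t / max(count, 1) for t in total]


def l2_normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)


def embed(model_path, input_ids, attention_mask):
    """Embed one tokenized sentence with an ONNX checkpoint.

    model_path is a hypothetical local path to the downloaded .onnx file.
    """
    # Imported lazily: only needed when real inference is run.
    import numpy as np
    import onnxruntime as ort

    session = ort.InferenceSession(model_path)
    candidates = {
        "input_ids": np.array([input_ids], dtype=np.int64),
        "attention_mask": np.array([attention_mask], dtype=np.int64),
        "token_type_ids": np.zeros((1, len(input_ids)), dtype=np.int64),
    }
    # Feed only the inputs this particular export declares.
    feeds = {
        inp.name: candidates[inp.name]
        for inp in session.get_inputs()
        if inp.name in candidates
    }
    # Assumption: the first output is (batch, seq_len, hidden).
    last_hidden = session.run(None, feeds)[0][0]
    return l2_normalize(mean_pool(last_hidden.tolist(), attention_mask))
```

The pooling helpers are plain Python so they can be checked without the model file. Two sentences embedded this way can be compared with a dot product, since the vectors are unit length.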
