You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is there a way of receiving the embeddings back in BQ format? Right now, I receive the full precision embedding and quantize it in the client, but wondering if I'm missing a way to get this directly from the server in binary format? If static quantization is done at zero, then there's no need to profile the data so the server can still remain stateless.
Motivation
More compact response from server.
Your contribution
N/A
The text was updated successfully, but these errors were encountered:
There is a '--embedding_dtype-' parameter for this cli. You can discover new parameters via --help command in the cli, they also come with a description.
For embedding_dtype, there are few other open source implementations. Let me know if the results from it are good. It uses an English dataset to quantize.
michaelfeil
changed the title
Binary quantization
Binary quantization - evaluate quality
Dec 2, 2024
Feature request
Is there a way of receiving the embeddings back in BQ format? Right now, I receive the full precision embedding and quantize it in the client, but wondering if I'm missing a way to get this directly from the server in binary format? If static quantization is done at zero, then there's no need to profile the data so the server can still remain stateless.
Motivation
More compact response from server.
Your contribution
N/A
The text was updated successfully, but these errors were encountered: