Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scalar quantization reliability test #228

Open
wants to merge 18 commits into
base: main
Choose a base branch
from

Conversation

donomii
Copy link
Contributor

@donomii donomii commented Jul 12, 2024

Adds a test that imports long sparse vectors to a cluster in Scalar Quantization mode, and randomly crashes a node to simulate cluster failure.

@donomii donomii self-assigned this Jul 12, 2024
"vectorIndexConfig": {
"sq": {"enabled": True, "trainingLimit": 10000, "rescoreLimit": 20},
Copy link
Member

@trengrj trengrj Jul 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @donomii, this should fail due to additional validation weaviate/weaviate#5341. Both PQ and SQ require that either async indexing is enabled or that they are enabled post loading some data. You can use this import script on the other PR to show how importing is paused while PQ/SQ is enabled

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added the ASYNC_INDEXING environment variable, since this seemed to make the most sense for the test. So now the test confirms that a node can crash while objects are being queued for later indexing, and import still works.

donomii added 2 commits July 18, 2024 21:26
Update schema to use bigram vectors and scalar quantization

Use bigram vectorizer in trigram mode to give (sparse) 16k vectors

Print container name

?

Add the actual shell script

linter

Actually run the test
@donomii donomii force-pushed the scalar-quantization-reliability-2 branch from 3166308 to 260d734 Compare July 18, 2024 12:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants