Add collection IDs to relevant text reuse passages #28

Open
theorm opened this issue Sep 10, 2020 · 1 comment
theorm commented Sep 10, 2020

To enable filtering of text reuse clusters by user collections, we have added a collection filter that works with the ucoll_ss field in the tr_passages index. At the moment, Maud has added a couple of test collections to the development index in preprod Solr. We need a Celery job that updates text reuse passages whenever collections are created or changed. We will also need to tag passages with the collections that have already been created (possibly with a small script?).
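
As a rough illustration of what such a job could send to Solr, here is a minimal sketch that builds an atomic-update payload for the ucoll_ss field. It assumes the passage unique key field is called `id`, and the function name `build_collection_updates` is hypothetical. The `add-distinct` and `remove` modifiers are standard Solr atomic-update operations; how the job discovers the affected passage IDs is not covered here.

```python
import json
from typing import Dict, Iterable, List


def build_collection_updates(
    passage_ids: Iterable[str],
    collection_id: str,
    remove: bool = False,
) -> List[Dict]:
    """Build Solr atomic-update documents that tag (or untag) passages.

    Assumption: the unique key field of the passages index is "id".
    """
    # "add-distinct" avoids duplicate values in the multi-valued field;
    # "remove" drops the collection ID when an item leaves a collection.
    modifier = "remove" if remove else "add-distinct"
    return [
        {"id": pid, "ucoll_ss": {modifier: [collection_id]}}
        for pid in passage_ids
    ]


if __name__ == "__main__":
    docs = build_collection_updates(["passage-1", "passage-2"], "collection-a")
    # The JSON body would be POSTed to the index's /update handler.
    print(json.dumps(docs, indent=2))
```

A Celery task could wrap this function and post the resulting JSON to the update handler of the passages index. Note that Solr atomic updates generally require the other fields of the document to be stored or to have docValues, which would need checking for tr_passages.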

PS: memo for @e-maud: check the read/write rights on this index for the c2dh user.

e-maud commented Oct 16, 2020

Actions:

  • Maud ingests the TR data into a prod Solr (the current app in prod points to the dev Solr)
  • Maud grants the c2dh user write access on the passages indexes (dev and prod)
  • Daniele prepares the bulk script
  • Daniele ingests the collection IDs into the newly ingested TR data in prod (and in dev for testing if needed)
  • Daniele sets up the Celery task
  • Matteo tests everything :)
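
The bulk step in the list above could be sketched roughly as follows. This is only an illustration under stated assumptions: the input is a hypothetical mapping from collection ID to passage IDs (how that mapping is derived from existing collections is not specified in this thread), the unique key field is assumed to be `id`, and the function names and batch size are made up for the example.

```python
from collections import defaultdict
from itertools import islice
from typing import Dict, Iterable, Iterator, List, Set


def invert(collection_passages: Dict[str, Iterable[str]]) -> Dict[str, Set[str]]:
    """Turn {collection_id: passage_ids} into {passage_id: collection_ids}."""
    by_passage: Dict[str, Set[str]] = defaultdict(set)
    for collection_id, passage_ids in collection_passages.items():
        for passage_id in passage_ids:
            by_passage[passage_id].add(collection_id)
    return by_passage


def batches(items: Iterable[dict], size: int) -> Iterator[List[dict]]:
    """Yield consecutive lists of at most `size` items."""
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


def bulk_payloads(
    collection_passages: Dict[str, Iterable[str]],
    batch_size: int = 500,
) -> Iterator[List[dict]]:
    """Yield batches of atomic-update documents, one document per passage."""
    by_passage = invert(collection_passages)
    docs = [
        # One update per passage, carrying all of its collection IDs.
        {"id": pid, "ucoll_ss": {"add-distinct": sorted(colls)}}
        for pid, colls in sorted(by_passage.items())
    ]
    yield from batches(docs, batch_size)


if __name__ == "__main__":
    sample = {"collection-a": ["p1", "p2"], "collection-b": ["p2"]}
    for payload in bulk_payloads(sample, batch_size=1):
        print(payload)
```

Grouping by passage first means each passage is updated once even if it belongs to several collections, and batching keeps individual update requests small.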
