
[WIP] Update in background job #49

Merged 5 commits into natcap:master on Dec 20, 2024

Conversation

@ebrelsford commented Dec 19, 2024

This is a work-in-progress PR; please do not merge.

The intention here is to use background jobs to update the dataset metadata needed for zip file expansion and map previews. Currently we run a separate script to do this, which can lead to confusion or errors.

To do:

  • update zip file sources
  • update map preview metadata
  • remove sync script

@ebrelsford (Author) commented:
@phargogh before I wrap this up I thought I'd show you what I have so far. It feels like it will be a bit more convenient than what we have right now.

Comment on lines +3 to +4
echo "Starting background jobs worker"
ckan jobs worker &
@ebrelsford (Author):
Start the jobs worker in the background.

@phargogh (Member):
Would this need to be nohup'd to keep it running past the end of the shell process, while still running in the background (thanks to &)?

@ebrelsford (Author):
Ah yeah, that could be necessary. This works locally, but I can see how it might not when deployed. I can try it on staging first.

Comment on lines +265 to +267
```python
def after_dataset_update(self, context, package):
    resources = [res.as_dict(core_columns_only=False) for res in context['package'].resources]
    toolkit.enqueue_job(update_dataset, [context['user'], package, resources])
```
@ebrelsford (Author):
Hook that is fired on dataset update via the interface or API.
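
For context, here is a minimal sketch of how this hook fits into a CKAN plugin. `toolkit.enqueue_job` and `IPackageController` are CKAN's standard APIs; the `NatcapPlugin` class name and the empty `update_dataset` body are illustrative, not the actual code from this PR.

```python
import ckan.plugins as plugins
import ckan.plugins.toolkit as toolkit


def update_dataset(user, package, resources):
    # Runs later in the `ckan jobs worker` process, so slow metadata
    # work does not block the web request that saved the dataset.
    pass


class NatcapPlugin(plugins.SingletonPlugin):
    plugins.implements(plugins.IPackageController, inherit=True)

    def after_dataset_update(self, context, package):
        # Serialize resources now: job arguments are pickled for the
        # worker, so plain dicts travel better than ORM objects.
        resources = [res.as_dict(core_columns_only=False)
                     for res in context['package'].resources]
        toolkit.enqueue_job(update_dataset,
                            [context['user'], package, resources])
```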

Comment on lines +107 to +109
```python
ctx = {'user': user}
updates = {'id': dataset['id'], 'extras': extras}
toolkit.get_action('package_patch')(ctx, updates)
```
@ebrelsford (Author):
We patch the dataset with our updated extras to avoid overwriting other fields.
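
A minimal sketch of the distinction, assuming a `user` name and a `dataset` dict with an `id` (the helper name `save_extras` is hypothetical): `package_patch` merges only the supplied keys, whereas `package_update` treats the payload as the complete dataset and would drop any field not resent.

```python
import ckan.plugins.toolkit as toolkit


def save_extras(user, dataset, extras):
    ctx = {'user': user}

    # package_patch merges the given keys into the existing dataset,
    # so title, notes, resources, etc. are left untouched.
    toolkit.get_action('package_patch')(ctx, {
        'id': dataset['id'],
        'extras': extras,
    })
    # By contrast, package_update would interpret omitted fields as
    # deletions and clear them.
```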

Comment on lines 82 to 84
```python
def update_mappreview(dataset, metadata, extras):
    # TODO
    return extras
```
@ebrelsford (Author):
Will port over from sync-datasets.py.

@ebrelsford (Author) commented:
@phargogh I believe this is ready now! I tried this on staging without a nohup and it appears to work fine.

Steps to test:

  1. Deploy with these changes.
  2. Edit a dataset and save it.
  3. In the background, the worker should at least update an extras field called `natcap_last_updated`, which you should see when you edit the dataset again.

So now adding/editing via the interface or API should have the same result as running `sync-datasets`.
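
Based on those test steps, the enqueued job presumably ends with something like the sketch below: stamp `natcap_last_updated` into the dataset's extras and save via `package_patch`. The helper name and the timestamp format are assumptions, not the PR's actual code.

```python
import datetime

import ckan.plugins.toolkit as toolkit


def stamp_last_updated(user, package):
    # Extras arrive as a list of {'key': ..., 'value': ...} dicts.
    extras = {e['key']: e['value'] for e in package.get('extras', [])}
    extras['natcap_last_updated'] = datetime.datetime.utcnow().isoformat()

    toolkit.get_action('package_patch')(
        {'user': user},
        {
            'id': package['id'],
            'extras': [{'key': k, 'value': v} for k, v in extras.items()],
        },
    )
```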

@phargogh (Member) left a comment:
Awesome, thanks @ebrelsford!

@phargogh merged commit de6e983 into natcap:master on Dec 20, 2024
1 check failed