Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Occurrence Downloads in Airflow #320

Open
5 tasks
fmendezh opened this issue Oct 16, 2023 · 0 comments
Open
5 tasks

Occurrence Downloads in Airflow #320

fmendezh opened this issue Oct 16, 2023 · 0 comments
Assignees
Labels
proof of concept this issues is technical proof of concept of a new approach or techniology

Comments

@fmendezh
Copy link
Contributor

fmendezh commented Oct 16, 2023

Create a simple Airflow DAG that trigger an occurrence downloads and tracks it status.
The DAG must trigger a Spark job that and its settings must be applied dynamically to assign cores, executors and memory accordingly to the amount of data to process.
Once the DAGs finishes its status must be updated in the registry database.

  • Implement a simple DAG that triggers an download Spark job.
  • Implement a basic mechanism to assign resources to spark jobs based on the amount of records to process by a download.
  • Implement a Airflow notifier to update the status of downloads.
  • Implement a mechanism to cancel a running Download DAG.
  • Implement a service call to get the status of Download DAG.
@fmendezh fmendezh self-assigned this Oct 16, 2023
@fmendezh fmendezh added the proof of concept this issues is technical proof of concept of a new approach or techniology label Oct 16, 2023
fmendezh added a commit to gbif/stackable that referenced this issue Oct 18, 2023
fmendezh added a commit to gbif/stackable that referenced this issue Oct 18, 2023
fmendezh added a commit to gbif/stackable that referenced this issue Oct 18, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
proof of concept this issues is technical proof of concept of a new approach or techniology
Projects
None yet
Development

No branches or pull requests

1 participant