Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

📈 Update Create-A-Derived-Table to newest DBT-Core/DBT-Athena Versions #3290

Closed
3 of 4 tasks
jhpyke opened this issue Feb 12, 2024 · 12 comments
Closed
3 of 4 tasks
Assignees
Labels
data-platform-apps-and-tools This issue is owned by Data Platform Apps and Tools enhancement enhancing an existing feature

Comments

@jhpyke
Copy link
Contributor

jhpyke commented Feb 12, 2024

User Story

As a… user of Create-A-Derived-Table
I expect to… be able to use the latest features enabled by DBT.
So that… I can be empowered to do cutting edge analysis within the organisation.

Value / Purpose

We are currently using

dbt-core==1.6.5
dbt-athena-community==1.6.2

as our locked package versions. As of writing this ticket, DBT-Athena is available at v1.8.2, and dbt-core is at v1.8.3. We should bring our packages up to date, to ensure users can use the latest features of DBT and to ensure we do not develop tech debt.

Please note that although dbt-athena-community enabled Python Models (I.E. Athena with PySpark) as functionality for DBT, we will NOT be including this functionality in the scope of this work. This will require further investigation, including understanding of cost implications of supporting this functionality.

Useful Contacts

@jhpyke

User Types

No response

Hypothesis

No response

Proposal

  1. Bump requirements.txt to latest versions
  2. Test deployments against the sandpit environment. Work with the #data-modelling team to identify some models to test deploy in sandpit.
  3. Create a branch for users to do acceptance testing with the updated requirements. This branch should NOT contain any file changes from main other than the changes to requirements.
  4. Put comms in #ask-data-modelling and #data-and-analytics-engineering channels to encourage users to test their in progress and existing prod models using the branch. See previous comms for examples of structure.
  5. If users identify issues with updated packages, work with them to proactively understand (and where appropriate intervene to fix) to ensure that the upgrade can occur smoothly.
  6. After an acceptance period (no more than 7 days unless issues are identified), merge the acceptance testing branch into main.

Additional Information

No response

Definition of Done

  • Internal test deployments have been made successfully
  • If any functionality has changed, docs are updated accordingly
  • User acceptance testing run
  • Changes merged to prod
@jhpyke jhpyke added enhancement enhancing an existing feature data-platform-apps-and-tools This issue is owned by Data Platform Apps and Tools labels Feb 12, 2024
@murad-ali-MoJ murad-ali-MoJ self-assigned this Feb 14, 2024
@julialawrence julialawrence moved this to 👀 TODO in Analytical Platform Feb 15, 2024
@murad-ali-MoJ murad-ali-MoJ moved this from 👀 TODO to 🚀 In Progress in Analytical Platform Feb 15, 2024
@murad-ali-MoJ
Copy link
Contributor

murad-ali-MoJ commented Feb 15, 2024

Progression:

  • I have tested the newer version of DBT and the Athena connector locally with smaller models on the sandpit, yielding promising results.
  • Our next objective is to reach out to the modeler and conduct further testing with additional models.

@murad-ali-MoJ
Copy link
Contributor

murad-ali-MoJ commented Feb 19, 2024

Update:
Posted about the new version release to the data-modeling channel and asked the modeler to provide some example databases for testing.

  • Test deployments against sandpit
  • making a branch available for users to test within the dev

@murad-ali-MoJ
Copy link
Contributor

murad-ali-MoJ commented Feb 21, 2024

@murad-ali-MoJ
Copy link
Contributor

update the user guidance for testing. https://github.com/moj-analytical-services/user-guidance/actions/runs/7985351178

@murad-ali-MoJ
Copy link
Contributor

  • The new workflow branch merged Testing-deploy-with-updated-dbt-athena moj-analytical-services/create-a-derived-table#1163
  • This branch posted to data-modelling channel for user test
  • Update the user-guidance for the new testing.

@Emterry Emterry moved this from 🚀 In Progress to 🚫 Blocked in Analytical Platform Mar 1, 2024
@julialawrence julialawrence moved this from 🚫 Blocked to 🚀 In Progress in Analytical Platform Mar 4, 2024
@murad-ali-MoJ
Copy link
Contributor

murad-ali-MoJ commented Mar 6, 2024

Some of the tests fail here. I don't know why they are failing. I posted the problem analytical-platform channel. If anyone can help with it.

@murad-ali-MoJ murad-ali-MoJ moved this from 🚀 In Progress to 🚫 Blocked in Analytical Platform Mar 6, 2024
@tamsinforbes tamsinforbes mentioned this issue Mar 25, 2024
4 tasks
@AntFMoJ
Copy link
Contributor

AntFMoJ commented Mar 26, 2024

Results from Tamsin following collaboration:

Outcome
successfully deployed general domain and all seeds to sandpit
deployed and tested dervied_delius_stg and derived_delius_dim

Next steps
build probation snapshots
derived_delius itself

Error with snapshots
more information/input from the derived_delius team; Ben W and Chris S

Suggested to team
liaise closely with derived_delius team
get them tested their stuff on the dbt_athena_update branch

Longer term
liaise with other key stakeholder / model owners; opg etc
open up testing to users for them to test their own models on the dbt_athena_update branch, using the new version of dbt_core and dbt_athena_community

Things to consider
life cycle rules on sandpit; make these longer during testing phase

@AntFMoJ AntFMoJ moved this from 🚫 Blocked to 🚀 In Progress in Analytical Platform Mar 26, 2024
@AntFMoJ
Copy link
Contributor

AntFMoJ commented Mar 28, 2024

Arranged a call with probation modelling contact to discuss issues with creating snapshots for derived-delius.

Working through building daily deployment models in sandpit, currently on OPG.

@AntFMoJ
Copy link
Contributor

AntFMoJ commented Apr 3, 2024

All daily deployment models have been built in sandpit.
Put another message out to data-modelling channel to ask for user feedback on the new dbt versions.

dbt-core v.1.7.10 needs to be used, as v1.7.7 has an issue with incompatibility with the latest version of protobuf.

@BrianEllwood BrianEllwood moved this from 🚀 In Progress to 🛂 In Review in Analytical Platform Apr 4, 2024
@AntFMoJ
Copy link
Contributor

AntFMoJ commented Apr 11, 2024

Follow up call for feedback on new versions has been put out to the data-modelling channel. The new versions have not been put into production.

Any issues raised through feedback will be raised as support tickets. This ticket will be taken out of sprint for now, although new versions have not yet been put in production.

@AntFMoJ AntFMoJ moved this from 🛂 In Review to 👀 TODO in Analytical Platform Apr 11, 2024
@jhpyke jhpyke self-assigned this Jul 22, 2024
@jhpyke
Copy link
Contributor Author

jhpyke commented Jul 22, 2024

Currently tested via recent Sandpit deploy - Awaiting final user acceptance by #data-modelling team to sign-off prod deployment.

@jhpyke jhpyke moved this from 👀 TODO to 🚀 In Progress in Analytical Platform Jul 22, 2024
@EO510 EO510 moved this from 🚀 In Progress to 🛂 In Review in Analytical Platform Jul 29, 2024
@jhpyke
Copy link
Contributor Author

jhpyke commented Aug 5, 2024

Closing as has been deployed in main for a week with all issues being transient/resolved. Many thanks for @jacobwoffenden for his quick work on the runners!

@jhpyke jhpyke moved this from 🛂 In Review to 🎉 Done in Analytical Platform Aug 5, 2024
@jhpyke jhpyke closed this as completed by moving to 🎉 Done in Analytical Platform Aug 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data-platform-apps-and-tools This issue is owned by Data Platform Apps and Tools enhancement enhancing an existing feature
Projects
Archived in project
Development

No branches or pull requests

3 participants