-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow customizing names of metadata fields in snapshots #10185
Comments
up vote! this should have been considered already at implementation... |
We need to think a bit about migration:
Idea from @aranke : add comment to the |
Migration Example:
|
Opened a new issue in dbt-labs/docs.getdbt.com: dbt-labs/docs.getdbt.com#6112 |
{{
}} Running dbt-core = 1.7.18 with snowflake = 1.7.5 snapshot completed with no error but column didn't changed. |
@GRVsati Hi! To avoid any unintentional data modification, dbt will not automatically apply any column renames. So if a user applies |
It does not work with spark databricks too. Even if i added in project.yml a snippet with default snapshot column names convention or to specific snapshot sql|yml file it does not work properly. Snapshot target is missing configured columns (missing "dbt_scd_id", "dbt_valid_from", "dbt_valid_to"). See https://docs.getdbt.com/docs/build/snapshots#snapshot-meta-fields for more information. I guess that macro for databricks spark is not preparing a rendered sql correctly and is trying to use a default snapshot names. This is snapshot created (running correctly only when i am invoking |
@czasoprzestrzenny Hi - as I mentioned above, we recommend either only using these settings for net-new snapshots, or arranging an update of pre-existing tables prior to committing a column name change. You are getting this error, because dbt does not automatically manage the schema change from one column set to another. |
Hi @graciegoheen not sure whether i understand it correctly. I have created a snapshot from scratch with adjusted columns using a dictionary:
Snapshot has been created with that adjusted dbt column names. Issue is if i am trying to run snapshot second time. Query is not even sent to databrick cluster- it fails during compilation (Compilation Error in snapshot snapshot__animals_scd2_1 (snapshots/snapshot__animals_scd2_1.sql) Can you advice me how to avoid that? |
That is strange! I'm going to open a new bug report so we can triage the issue |
Thank you very much @graciegoheen , if you need a more information let me know. |
@graciegoheen @czasoprzestrzenny I've investigate this issue and found that the issue is that the dbt-databricks adapter has not yet added support for custom meta column names. There is already an issue open against dbt-databricks: databricks/dbt-databricks#887 I will follow up with further details in that issue. |
From #9848
Description
There are a handful of Snapshot meta fields:
dbt_valid_from
dbt_valid_to
dbt_scd_id
dbt_updated_at
These are fields dbt creates when creating the snapshot.
Folks want the ability to rename these meta fields - “I want to rename
dbt_valid_from
tograce_valid_from
”- because of some corporate standardCustomers currently have to create a view on top of the snapshot
We should instead provide the ability to rename these meta fields within the actual snapshot configuration.
draft of YML config:
Open questions/thoughts:
Acceptance Critera
dbt_field_name_overrides
adapters.get_columns_in_relation()
on the existing snapshot and comparing the meta fields that exist in the current config to what exists in the warehouseThe text was updated successfully, but these errors were encountered: