Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Drop head tracker for summaries DAG #6744

Open
wants to merge 2 commits into
base: unstable
Choose a base branch
from
Open

Conversation

dapplion
Copy link
Collaborator

@dapplion dapplion commented Dec 28, 2024

Issue Addressed

The head tracker is a persisted piece of state that must be kept in sync with the fork-choice. It has been a source of pruning issues in the past, so we want to remove it

When implementing tree-states in the hot DB we have to change the pruning routine (more details below) so we want to do those changes first in isolation.

Closes #1785

Proposed Changes

Current DB migration routine

  • Locate abandoned heads with head tracker
  • Use a roots iterator to collect the ancestors of those heads can be pruned
  • Delete those abandoned blocks / states
  • Migrate the newly finalized chain to the freezer

In summary, it computes what it has to delete and keeps the rest. Then it migrates data to the freezer. If the abandoned forks routine has a bug it can break the freezer migration.

Proposed migration routine (this PR)

  • Migrate the newly finalized chain to the freezer
  • Load all state summaries from disk
  • From those, just knowing the head and finalized block compute two sets: (1) descendants of finalized (2) newly finalized chain
  • Iterate all summaries, if a summary does not belong to set (1) or (2), delete

This strategy is more sound as it just checks what's there in the hot DB, computes what it has to keep and deletes the rest. Because it does not rely and 3rd pieces of data we can drop the head tracker and pruning checkpoint. Since the DB migration happens first now, as long as the computation of the sets to keep is correct we won't have pruning issues.

@dapplion
Copy link
Collaborator Author

@michaelsproul If we stop updating the head_tracker in the persisted head, how can we downgrade safely? If you run an old version of Lighthouse it will read the persisted head which contains an empty head tracker.

@dapplion dapplion added work-in-progress PR is a work-in-progress tree-states Upcoming state and database overhaul labels Jan 2, 2025
@dapplion
Copy link
Collaborator Author

dapplion commented Jan 2, 2025

Test store_tests::finalizes_non_epoch_start_slot panics at

---- store_tests::finalizes_non_epoch_start_slot stdout ----
thread 'store_tests::finalizes_non_epoch_start_slot' panicked at /Users/lion/code/sigp/lighthouse/beacon_node/beacon_chain/src/test_utils.rs:836:14:
called `Result::unwrap()` on an `Err` value: HotColdDBError(MissingEpochBoundaryState(0x8da1c38d20071452e39a518001b61aa94ad871c6b95896203ddb0b68cfb89e5f))
stack backtrace:
   0: _rust_begin_unwind
   1: core::panicking::panic_fmt
   2: core::result::unwrap_failed
   3: beacon_chain::test_utils::BeaconChainHarness<beacon_chain::builder::Witness<slot_clock::manual_slot_clock::ManualSlotClock,beacon_chain::eth1_chain::CachingEth1Backend<E>,E,Hot,Cold>>::hot_state_exists
   4: beacon_chain_tests::store_tests::finalizes_non_epoch_start_slot::{{closure}}
   5: tokio::runtime::runtime::Runtime::block_on
   6: core::ops::function::FnOnce::call_once
note: Some details are omitted, run with `RUST_BACKTRACE=full` for a verbose backtrace.

This is part of an assertion in the test calling hot_state_exists, which now instead of returning an Ok(bool) returns an Err.

This state is not pruned, but is the new_finalized_state_hash

Jan 02 10:31:34.615 DEBG Starting database pruning, new_finalized_state_hash: 0x8da1c38d20071452e39a518001b61aa94ad871c6b95896203ddb0b68cfb89e5f, new_finalized_checkpoint: Checkpoint { epoch: Epoch(1), root: 0x401d573d920244cb9b9988da85f8093f6bedb1e68156774b87c173f6ded6623b }, module: beacon_chain::migrate:485
Jan 02 10:31:34.619 DEBG Extra pruning information, states_to_prune: {}, blocks_to_prune: {}, states_to_prune_count: 0, blocks_to_prune_count: 0, finalized_and_descendant_block_roots: 40, state_summaries_count: 40, newly_finalized_states_min_slot: 8, newly_finalized_state_roots: 1, newly_finalized_blocks_min_slot: 7, newly_finalized_blocks: 1, new_finalized_checkpoint: Checkpoint { epoch: Epoch(1), root: 0x401d573d920244cb9b9988da85f8093f6bedb1e68156774b87c173f6ded6623b }, module: beacon_chain::migrate:645

Michael: figure out which of the hot_state_exists cases triggers, plus what state root and slot

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
tree-states Upcoming state and database overhaul work-in-progress PR is a work-in-progress
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant