# Releases: pinax-network/firehose-antelope
## v1.3.4

### Substreams

- Fixed a possible panic() when a request is interrupted during the file loading phase of a squashing operation.
- Fixed a rare possibility of stalling if only some full KV store caches were deleted, but further segments were still present.
- Fixed the stats counters for store operation times.
## v1.3.3

### Substreams

- Bumped Substreams to v1.5.3.
- Added a `--block-type` flag that can be specified when creating a Substreams tier1. If not specified, tier1 will auto-detect the block type from the source.
- Fixed a memory leak on Substreams execution (by bumping the wazero dependency).
- Prevented substreams-tier1 from stopping if block type auto-detection times out.
- Fixed missing error handling when writing output data to files. This could result in a tier1 request just "hanging", waiting for a file never produced by tier2.
- Fixed handling of dstore errors in the tier1 'execout walker' causing stalling issues on S3 or on unexpected storage errors.
- Increased the number of retries on storage when writing states or execouts (5 -> 10).
- Prevented slow squashing when loading each segment from the full KV store (can happen when a stage contains multiple stores).
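As a sketch, the new flag might be used like this when starting tier1 (the invocation style and the Antelope block type shown are assumptions, not taken from the release notes):

```shell
# Illustrative: pin the block type explicitly instead of relying on
# auto-detection from the source.
fireantelope start substreams-tier1 --block-type=sf.antelope.type.v1.Block
```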
## v1.3.2

## v1.3.1

### Substreams

- Fixed another panic on the substreams-tier2 service.
- Fixed a thread leak in metering affecting Substreams.
- Reverted a Substreams scheduler optimisation that caused slow restarts when close to head.
- Added `substreams_tier2_active_requests` and `substreams_tier2_request_counter` Prometheus metrics.
## v1.3.0

### Substreams

- Substreams bumped to v1.5.0: see https://github.com/streamingfast/substreams/releases/tag/v1.5.0 for details.

#### Chain-agnostic tier2

- A single substreams-tier2 instance can now serve requests for multiple chains or networks. All network-specific parameters are now passed from tier1 to tier2 in the internal `ProcessRange` request.
- This allows you to make better use of your computing resources by pooling all the networks together.
> [!IMPORTANT]
> Since the tier2 services will now get the network information from the tier1 request, you must make sure that the file paths and network addresses are the same for both tiers. For example, if `--common-merged-blocks-store-url=/data/merged` is set on tier1, make sure the merged blocks are also available from tier2 under the path `/data/merged`.
>
> The flags `--substreams-state-store-url`, `--substreams-state-store-default-tag` and `--common-merged-blocks-store-url` are now ignored on tier2. The flag `--common-first-streamable-block` should be set to 0 to accommodate every chain.
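A minimal sketch of a consistent tier1/tier2 setup (paths, the invocation style, and the state store flag value are assumptions, not prescribed by the release notes):

```shell
# tier1: owns the network-specific configuration.
fireantelope start substreams-tier1 \
  --common-merged-blocks-store-url=/data/merged \
  --substreams-state-store-url=/data/states \
  --common-first-streamable-block=0

# tier2: the store flags above are ignored here, but /data/merged and
# /data/states must resolve to the same data from this host.
fireantelope start substreams-tier2 \
  --common-first-streamable-block=0
```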
> [!TIP]
> The cached 'partial' files no longer contain the "trace ID" in their filename, preventing accumulation of "unsquashed" partial store files. The system will delete files under `{modulehash}/state` named in this format `{blocknumber}-{blocknumber}.{hexadecimal}.partial.zst` when it runs into them.
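The legacy naming can be recognised with a simple pattern match; a sketch in shell (the lowercase hexadecimal trace ID and the sample block ranges are assumptions based on the format above — the actual cleanup is done by the system itself):

```shell
# Legacy partial files carry an extra trace-ID segment before .partial.zst;
# current ones do not, so the legacy names can be told apart by pattern.
pattern='^[0-9]+-[0-9]+\.[0-9a-f]+\.partial\.zst$'

for name in '0001230000-0001240000.8f3acde1.partial.zst' \
            '0001230000-0001240000.partial.zst'; do
  if echo "$name" | grep -Eq "$pattern"; then
    echo "$name -> legacy trace-ID name, deleted on sight"
  else
    echo "$name -> current name, kept"
  fi
done
```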
### Performance improvements

- All module outputs are now cached (previously, only the last module was cached, along with the "store snapshots", to allow parallel processing).
- Tier2 will now read back mapper outputs (if they exist) to prevent running them again. Additionally, it will not read back the full blocks if its inputs can be satisfied from existing cached mapper outputs.
- Tier2 will skip processing completely if it's processing the last stage and the `output_module` is a mapper that has already been processed (e.g. when multiple requests are indexing the same data at the same time).
- Tier2 will skip processing completely if it's processing a stage where all the stores and outputs have been processed and cached.
- Scheduler modification: a stage now waits for the previous stage to have completed the same segment before running, to take advantage of the cached intermediate layers.
- Improved file listing performance for Google Storage backends by 25%.
> [!TIP]
> Concurrent requests on the same module hashes may benefit from the other requests' work to a certain extent (up to 75%) -- the very first request does most of the work for the other ones.
> [!TIP]
> More caches will increase disk usage and there is no automatic removal of old module caches. The operator is responsible for deleting old module caches.
### Metrics

- The readiness metric for the Substreams tier1 app is now named `substreams_tier1` (it was mistakenly called `firehose` before).
- Added back the readiness metric for the Substreams tier2 app (named `substreams_tier2`).
- Added metric `substreams_tier1_active_worker_requests`, which gives the number of active Substreams worker requests a tier1 app is currently doing against tier2 nodes.
- Added metric `substreams_tier1_worker_request_counter`, which gives the total Substreams worker requests a tier1 app made against tier2 nodes.
### Flags

- Added `--merger-delete-threads` to customize the number of threads the merger will use to delete files. It's recommended to increase this to 25 or higher when using Ceph as the S3 storage provider (due to performance issues with deletes, the merger might otherwise not be able to delete one-block files fast enough).
- Added `--substreams-tier2-max-concurrent-requests` to limit the number of concurrent requests to the tier2 Substreams service.
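A hedged sketch of how these flags could be set when starting the relevant apps (the values and the invocation style are illustrative, not recommendations beyond what the notes above state):

```shell
# Illustrative values: 25 delete threads for a Ceph-backed S3 store,
# and a cap of 50 concurrent tier2 requests.
fireantelope start merger --merger-delete-threads=25
fireantelope start substreams-tier2 --substreams-tier2-max-concurrent-requests=50
```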
## v1.2.4

- Fixed `tools check merged-blocks` default range when `-r <range>` is not provided to now be `[0, +∞]` (was previously `[HEAD, +∞]`).
- Fixed `tools check merged-blocks` to be able to run without a block range provided.
- Added API key authentication to `client.NewFirehoseFetchClient` and `client.NewFirehoseClient`.

  > [!NOTE]
  > If you were using `github.com/streamingfast/firehose-core/firehose/client.NewFirehoseFetchClient` or `github.com/streamingfast/firehose-core/firehose/client.NewFirehoseStreamClient`, this is a minor breaking change; refer to the upgrade notes for details if it affects you.

- Fixed `tools check merged-blocks` examples using a block range (the range should be specified as `[<start>]?:[<end>]`).
- Added `--substreams-tier2-max-concurrent-requests` to limit the number of concurrent requests to the tier2 Substreams service.
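The `[<start>]?:[<end>]` range syntax can be sketched as follows (the store path is an assumption; the binary name follows the firehose-antelope convention):

```shell
# Illustrative invocations of the merged-blocks checker.
fireantelope tools check merged-blocks /data/merged                 # no -r: full store, range [0, +∞]
fireantelope tools check merged-blocks /data/merged -r "1000:2000"  # explicit range
fireantelope tools check merged-blocks /data/merged -r ":2000"      # open start
fireantelope tools check merged-blocks /data/merged -r "1000:"      # open end
```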
## v1.2.3

### Substreams improvements

- Performance: prevent reprocessing jobs when there is only a mapper in production mode and everything is already cached.
- Performance: prevent "UpdateStats" from running too often and stalling other operations when running with a high parallel jobs count.
- Performance: fixed a bug in the scheduler ramp-up function sometimes waiting before raising the number of workers.
- Added the output module's hash to the "incoming request" log.
### Reader node

- The `reader-node-bootstrap-url` gained the ability to be bootstrapped from a `bash` script.

  If the bootstrap URL is of the form `bash:///<path/to/script>?<parameters>`, the bash script at `<path/to/script>` will be executed. The script receives the resolved reader node variables as environment variables of the form `READER_NODE_<VARIABLE_NAME>`. The fully resolved node arguments (from `reader-node-arguments`) are passed as args to the bash script. The accepted query parameters are:

  - `arg=<value>`: pass an extra argument to the script, prepended to the list of resolved node arguments
  - `env=<key>%3d<value>`: pass an extra environment variable as `<key>=<value>`, with the key upper-cased (multiple(s) allowed)
  - `env_<key>=<value>`: pass an extra environment variable as `<key>=<value>`, with the key upper-cased (multiple(s) allowed)
  - `cwd=<path>`: change the working directory to `<path>` before running the script
  - `interpreter=<path>`: use `<path>` as the interpreter to run the script
  - `interpreter_arg=<arg>`: pass `<interpreter_arg>` as arguments to the interpreter before the script path (multiple(s) allowed)

  > [!NOTE]
  > The `bash:///` script support is currently experimental and might change in upcoming releases; the behavior changes will be clearly documented here.

- The `reader-node-bootstrap-url` gained the ability to be bootstrapped from a pre-made archive file ending with `tar.zst` or `tar.zstd`.

- The `reader-node-bootstrap-data-url` flag is now added automatically if `firecore.Chain#ReaderNodeBootstrapperFactory` is non-`nil`.

  If the bootstrap URL ends with `tar.zst` or `tar.zstd`, the archive is read and extracted into the `reader-node-data-dir` location. The archive is expected to contain the full content of the `reader-node-data-dir` and is expanded as is.
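A sketch of what such a bootstrap script could look like. The URL in the comment, the `READER_NODE_DATA_DIR` variable (an assumed instance of the `READER_NODE_<VARIABLE_NAME>` form), and the `SOURCE` variable (from the hypothetical `env=` parameter) are all illustrative:

```shell
#!/usr/bin/env bash
# Hypothetical bootstrap script, wired through a URL such as:
#   bash:///etc/reader/bootstrap.sh?arg=--snapshot&env=SOURCE%3ds3://bucket/snap
set -eu

# Resolved reader node variables arrive as READER_NODE_* environment
# variables; extra 'env=' parameters arrive as plain variables.
echo "data dir: ${READER_NODE_DATA_DIR:-unset}"
echo "snapshot source: ${SOURCE:-unset}"

# "$@" holds the extra 'arg=' values, prepended to the resolved node arguments.
echo "node arguments: $*"
```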
## v1.2.2

## v1.2.1

### Auth and metering

- Added missing metering events for `sf.firehose.v2.Fetch/Block` responses.
- Changed the default polling interval in 'continuous authentication' from 10s to 60s; added an 'interval' query param to the URL.

### Substreams

- Fixed a bug in the scheduler ramp-up function sometimes waiting before raising the number of workers.
- Fixed load-balancing from tier1 to tier2 when using `dns:///` (the round-robin policy was not set correctly).
- Added `trace_id` in gRPC authentication calls.
- Bumped the connect-go library to its new `connectrpc.com/connect` location.
## v1.2.0

### What's Changed

#### Breaking

- Removed the Silkworm block poller in favour of the generic EVM poller from firehose-ethereum.

#### Added

- Added the `x-trace-id` header to auth requests when using `--common-auth-plugin=grpc`.
- Added the `--api-key-env-var` flag to Firehose clients, which allows you to pass your API key from an environment variable (HTTP header `x-api-key`) instead of a JWT (`Authorization: bearer`), where supported.
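A hedged sketch of the flag in use (the endpoint, variable name, and block range are illustrative, as is the exact subcommand):

```shell
# The client reads the key from the named environment variable and sends it
# as the x-api-key HTTP header instead of an Authorization: bearer JWT.
export MY_API_KEY="..."
fireantelope tools firehose-client \
  --api-key-env-var=MY_API_KEY \
  eos.firehose.pinax.network:443 150000000:150000010
```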