Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce redis storage which syncs between multiple instances #4074

Merged
merged 26 commits into from
Jan 13, 2025

Conversation

rodja
Copy link
Member

@rodja rodja commented Dec 8, 2024

This PR tries to solve #1606 by implementing a redis variant of the persistent dictionaries currently used for app.storage. This PR will not solve every remote-syncing-problem out there. It is meant as a transparent (eg. non-api-breaking) drop in replacement for the existing local storage mechanism.

NOTE: this is in very early development.

ToDos

  • make redis server url configureable via NICEGUI_REDIS_URL environment variable, if none is given we use the local storage solution
  • think about an elegant way to not sync the whole data set (and hence decide if we can do this in the scope of this PR) -> we will implement more sophisticated solutions with mongodb or etcd in other PRs
  • make it possible to use app.storage.user with redis
  • introduce optional redis dependency in pyproject.toml
  • how do we cope with the neccessary async init? -> by doing the necessary async init in the page decorator
  • fix tests
  • make it possible to use app.storage.tab with redis
  • make db prefix configurable?
  • documentation
  • fix demo
  • mention limitations
  • clean up the example (remove Dockerfile, use newest NiceGUI image with redis support)

@rodja rodja added the enhancement New feature or request label Dec 8, 2024
@rodja
Copy link
Member Author

rodja commented Dec 8, 2024

I started experimenting and found a simple testing setup:

  1. start a redis server locally with docker run -d --name redis -p 6379:6379 redis
  2. use the code in this PR and start two NiceGUI servers (with different ports):
    from nicegui import ui, app
    
    @ui.page('/')
    def index():
        ui.input().bind_value(app.storage.general, 'text')
    
    ui.run(port=int(sys.argv[1]))
  3. if you type something in the input it appears also in the input of the other instances

@Alyxion
Copy link
Contributor

Alyxion commented Dec 8, 2024

Very interesting feature. We are currently synching manually with redis, using the browser tab uid as part of the key, if this could be automated some point this were great of course.

As changes in such "shared" variables are in real application likely not as consequenceless like synching two edit fields but might need to trigger a programmatical reaction if some data is changed, is this integrated already?

@rodja
Copy link
Member Author

rodja commented Dec 8, 2024

@Alyxion app.storage uses ObservableDict and hence can have code which executes when something changes. To make this available between instances we would need to publish only the data which changed instead of the full dict. This is not easy because redis only allows a flat key/value hierarchy. Another way would be to always publish the whole dict but on the subscriber side walk the whole dict and trigger events only for keys which have changed.

@Alyxion
Copy link
Contributor

Alyxion commented Dec 9, 2024

Thanks, just found the time to do a full code review, really good job so far.

The redis server url can not be configured yet nor can the prefix but I assume thats due to the fact its still work in progress.

Conceptual though my 2 cents:

  • The RedisDict is always synching the whole dict in one go, I think this will massively limit the real-world scenarios for this feature beyond some visitor or traffic counters or hello world samples for the nicegui documentation.

    Imagine for example you have one big shared setting, 500 kb in size which shall be synched via the redis storage. And then you have e.g. a call meter, token counter for an LLM or what ever dynamic kind of live data. When ever you now just want to change the call meter, may be thousand times per hour, the 500kb will get distributed over and over again as well, the big JSON needs to be parsed burning performance. Or with other words: I definitlely think at least a minimal approach, e..g for different "groups" of data (changing thrice a day vs may be 10 times a second) should be considered.

  • If I interpreted the code right the lazy background task for publishing to REDIS will trigger "without remorse" for every minor change. This could / will easily pile up to a bombardement of broadcast publishs and resulting REDIS gets. Just imagine 1000 concurrent users, each just triggering a change just once all 5 seconds, then each server would sync 200 times per second.
    Basically the same as for the previous point: I think there might be variables which need an asap synch and others which might also be totally fine just be synched once all 10 seconds.

  • Potential changes in format of the data, e.g. version changes, are not considered yet. This could for example be part of a configurable prefix.

  • One remark independent of this feature but general about NiceGUIs storage mechanisms: Most modern Python solutions from FastAPI itself via OpenAI, HuggingFace etc. nowadays use Pydantic to define data models, document them, ensure type safety when parsing them, transporting them between different languages and/or systems etc.

    This is in my opinion also the way to go and thus mandatory for all projects at our company - and accessing dictionaries by string constants definitely a relict of the past. Unfortunately also this new feature does not embrace "modern Python" even though it would benefit a lot from it. For example if the user would provide a Pydantic model for the redis storage, then you could easily create an MD5 of the schema to ensure compatibility between "whats stored in REDIS" and what the new software actually awaits, e.g. by putting it into the prefix or by storing it in the data itself to ensure the versions are still compatible. Pydantics Field metadata could also e.g be used to solve the points above, e.g. "how often shall this sub object be synchronized" such as

    class MyRedisData(BaseModel):
       my_dynamic_data: MyNiceGuiAppsDynamicData = Field(..., description="Very often changing data...", sync_interval_s=5.0)
       my_large_data: MyNiceGuisLazyUpdatingData = Field(..., sync_interval=60)

    And yes, of course you can somehow "hack" Pydantic models also in the current ObservableDics by always dumping them completely and always parsing them completely and storing there somewhere else but its not very elegant.

@rodja
Copy link
Member Author

rodja commented Dec 9, 2024

Thanks for this excellent observations. You are right. There are still a lot of things to be done. I've started a checklist in the PR description to clarify what is missing to go from draft to review.

The RedisDict is always synching the whole dict in one go

Yes. After further investigation, I believe it’s beyond the scope of this pull request to handle it differently. The underlying ObservableDict simply fires an on_change event without indicating what has changed. We either need to add these infos or override all dict modification operators (pop, clear, update, ...) in the redis persistent dict.

If I interpreted the code right the lazy background task for publishing to REDIS will trigger "without remorse" for every minor change.

Yes, this is by design to keep things simple. See below.

Potential changes in format of the data, e.g. version changes, are not considered yet.

This is also true for the existing local persistence.app.storage is just a storage mechanism and does not care about migrations. These should be done by the user code.

Most modern Python solutions from FastAPI itself via OpenAI, HuggingFace etc. nowadays use Pydantic to define data models

You are right. But Pydantic is quite slow compared to json etc. We used to have Pydantic everywhere a few years back and need to throw it out the window because of performance cosiderations in our robotics applications. Pydantic is great when you get external data where you can not be sure about the right structure, types etc. But I don't think this is the case when using app.storage.

As a closing comment, I want to clarify a point which I'll also put in the description of the PR because it is so fundamental:

This PR will not solve every remote-syncing-problem out there. It is meant as a transparent (eg. non-api-breaking) drop in replacement for the existing local storage mechanism.

Your points are all valid but some are just out of scope of what we want to provide here. In the future we may provide a more flexible storage api. See #1606 (reply in thread)

@Alyxion
Copy link
Contributor

Alyxion commented Dec 9, 2024

Thank you for the through explanation.

I personally wasn't aware of the performance implications of Pydantic but I think thats because we are more thinking here in dozens of messages exchanged between our servers per second rather than thousands per second on an edge device which might be sent to your Feldfreund, good to know though, will keep an eye on it.

Regarding your closing comment: Totally aware of that. Though I still think not being to somehow sync the storage "smarter" than always dumping and parsing that whole thing and very quickly having 95% garbage and just 5% relevant update data is sub optimal :).

@rodja
Copy link
Member Author

rodja commented Dec 10, 2024

I think there are quite some use cases which would work fine with a full dictionary update. Our existing local storage is also doing it that way because app.storage was not meant to be a database but rather a quick way to save some settings. We can add smarter and more powerful storage options later on. These could be done with RedisJSON, etcd or even MongoDB. But for this PR I would like to keep things simple to allow easy setup and quickly have something to work with until more sophisticated solutions are implemented.

@rodja
Copy link
Member Author

rodja commented Dec 15, 2024

Ok. I made quite some progress:

  1. app.storage.user works with redis
  2. the redis server can be configured by defining NICEGUI_REDIS_URL in the environment
  3. initial loading of the storage is done async, while still allowing synchronous access to app.storage.user -- this is possible due to doing the async stuff in the middleware

@rodja rodja requested a review from falkoschindler December 15, 2024 18:32
@rodja
Copy link
Member Author

rodja commented Dec 15, 2024

Ok, I think @falkoschindler and others can review the code now. @alydersen what do you think? Does that work for you?

There is still some cleanup to be done in the new redis_storage example, but that should be done after everything else is ready for merge.


@ui.page('/')
def index():
# ui.label(f'Tab storage age: {app.storage.tab.age} seconds')
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tab.age seems to be wrong...

@alydersen
Copy link

Ok, I think @falkoschindler and others can review the code now. @alydersen what do you think? Does that work for you?

There is still some cleanup to be done in the new redis_storage example, but that should be done after everything else is ready for merge.

Looking at the examples it seems to meet our requirements nicely @rodja - great work!

@rodja rodja marked this pull request as ready for review January 11, 2025 06:18
@rodja rodja requested a review from falkoschindler January 11, 2025 06:19
@rodja
Copy link
Member Author

rodja commented Jan 11, 2025

Ok, I've completed the rest of the todos except cleaning up the example which should only be done right before we do the merge.

@falkoschindler falkoschindler merged commit 36a9cd6 into main Jan 13, 2025
9 checks passed
@falkoschindler falkoschindler deleted the redis-storage branch January 13, 2025 10:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants