RFC: Infinite Tasks #29025
Replies: 12 comments 8 replies
-
Yes please. Setting up things like SSH tunnels / etc would be so nice to do in a NX ecosystem |
Beta Was this translation helpful? Give feedback.
-
Regarding the naming, I'd opt for something that is already widely used, such as
In the end we'll have
|
Beta Was this translation helpful? Give feedback.
-
Thanks a lot for this RFC, this is one of the most exciting changes I'm eager to use from Nx. Long running task has been an issue for a while, especially with package.json#workspace resolution! In terms of use cases, I'm very eager to use it for:
It opens soo many doors! :D One thought regarding
|
Beta Was this translation helpful? Give feedback.
-
Until it's rewritten in Rust, it can be relatively easy done with https://github.com/vadimdemedes/ink |
Beta Was this translation helpful? Give feedback.
-
I'm wondering if something special can be done to handle this type of situation: Multiple Unrelated Infinite TasksWhen working within a large monorepo and you have multiple systems running. Interlacing all of the terminal outputs at once can get very confusing and cumbersome. I'm thinking of the current CLI that nx uses right. nx run-many -t serve --cli --projects frontend,backend The output provides a command interface which allows the user to select what stream they want to see. Just my 2cents but I think that would clean up all of the noise and allow the user to decided what they want to see when it comes to the terminal output |
Beta Was this translation helpful? Give feedback.
-
Lifecycle hooksI am curious if we can leverage the lifecycle hooks further and create a user API for it or some sort of notification stream. I am currently on v19, using custom task runners for hand-rolled telemetry, collecting all kinds of data, such as the platform, task duration and much more. Due too the deprecation of custom task runners in v20 I am blocked from upgrading and there is no migration path for this use case yet. If you could produce a log with the lifecycles, or a pub/sub service, whatever, just something that I can use to trigger my custom telemetry service. Do you think something like this could be possible? |
Beta Was this translation helpful? Give feedback.
-
TerminologyI like persistent tasks the most, as it is a common term in turborepo already and when someone new to monorepos would start to compare features, it would be easier to have common terminology among monorepo tools. But I find mortal tasks for finite/short-lived tasks also really funny :) But yeah, I think persistent and non-persistent is pretty straightforward. |
Beta Was this translation helpful? Give feedback.
-
Naming/ Terminology: 1️⃣ Note: I am afraid that "immortal" might sound like a "daemon" process that Nx will keep up and running even after tasks finish. |
Beta Was this translation helpful? Give feedback.
-
I don't know if I am more excited about the new terminal UI or the infinite tasks 😊 |
Beta Was this translation helpful? Give feedback.
-
Will infinite tasks be reused between tasks executed in different graphs? Here is the use case:
If the Nx daemon could reuse the infinite tasks triggered in another graph:
|
Beta Was this translation helpful? Give feedback.
-
Hey all, I thought of a new name which I think is simpler but still specific. Continuous Tasks These tasks run continuously while other tasks run and until they are stopped. What do you all think? |
Beta Was this translation helpful? Give feedback.
-
Use Case: Federated
|
Beta Was this translation helpful? Give feedback.
-
RFC: Continuous Tasks
Some tasks run through Nx are tasks which run continuously until they are terminated. In this RFC, these tasks will be referred to as continuous tasks. Examples of continuous tasks are tasks which start servers, build and test tasks which have watch modes, and many more. Currently, Nx expects all tasks to end and does not really have much special handling for continuous tasks. In many cases, tasks have a relationship with these continuous tasks where they depend on them being running. If this is specified as a
dependsOn
today, Nx will never continue as it will wait for the depended upon continuous tasks to finish; which it will not.Concept
Nx should have handling for continuous tasks. Continuous tasks are a subset of tasks which never end on their own accord; they must be terminated by something else.
Continuous tasks are much like other tasks. Continuous tasks can be run like any other tasks, either on their own, as part of a set of tasks, or as a dependency of another task. Continuous tasks can depend on other tasks; continuous and discrete. And other tasks, continuous or not, can depend on continuous tasks.
Continuous tasks are different from discrete tasks in some ways though. Continuous tasks do not end on their own and need to be terminated by something else. Continuous tasks also do not yield an output; this means that they cannot be cached. However, continuous tasks do yield a side effect which is likely the purpose why the task is kept alive.
Example Use Cases
Starting a web-server to e2e test
Currently in Nx, the e2e task is responsible for starting the web-server which it is testing. With dependencies on continuous tasks, Nx can takeover the responsibility of starting the web server while the e2e task is responsible for waiting for the server to be ready. Cypress, Playwright and other e2e tasks runners already do this.
Starting backends alongside a web server
This would start the required backends while running a frontend application.
Starting a db alongside a backend server
Publishing to a local-registry before running e2e tests
Feedback Requested
While you read this RFC and dive into the details, the Nx core team would like some feedback on this design before we implement it. Please let us know your thoughts after you have read it.
Is this a feature you would like to see implemented?
Firstly, is this a feature that excites you and does it solve the problems you face? And would this feature improve or enable the kinds of workflows that you need?
Naming/ Terminology
After this discussion, it seems like continuous tasks is a good term to move forward with. If you have any objections to this, please continue to discuss.
Secondly, naming things is hard. "continuous tasks" is terminology which is subject to change. We would like to hear feedback on the terminology we should use. We'd like the term to be easily understood and intuitively capture this type of task and the differences with the existing concept of tasks.
Infinite Tasks was a considered name. Infinite refers to the infinite duration of the tasks but other terms could be more specific. Is it obvious what infinite vs finite pertains to?
Perpetual could make sense because the tasks remain running as does a perpetual motion machine. Unfortunately, there is no real antonym as there is for infinite vs finite. Talking about not perpetual tasks is not as nice.
We have also considered some other terms.
Tasks can be classified as neverending or endless as they will not end on their own accord. They would end though, when they are terminated.
These tasks could be considered long running tasks. But some tasks such as e2e tests can also run for a long time while not being continuously running. And if these tasks don't last for that long... is it fair to call them long running?
Immortal vs Mortal is another way of describing these tasks. Immortallity is specific to life though but tasks aren't living. It also has religious connotations which may cause uncomfort.
Tasks should eventually complete. If something does not complete, is it even still a task? Should we introduce a new term altogether? Rather than referring to them as tasks, should we refer to them as processes? This would be easily confused for OS processes. Comment below if you think it's confusing for both of these concepts to be referred to as tasks.
Timeline
We hope to begin implementation starting January 2025 so please provide as much feedback as you can before then.
Lifecycle
The lifecycle for a continuous task would have the following lifecycle:
Now we'll contrast this to the lifecycle for a discrete task:
There are 2 main differences from what is currently in Nx.
Tasks (both discrete and continuous) depending on continuous tasks, Nx will not wait for the task to complete. It will instead, wait for the task to start. The task may not be fully ready to produce side effects yet. This will be discussed later on.
Continuous tasks need to be terminated. Nx would handle this but when this occurs depends on why Nx is running that continuous task.
Use Cases
Below, we'll dive into some different cases Nx will have to handle. Each of them will detail some current workarounds and their flaws. @vsavkin also produced a video with diagrams to go over this topic here: https://www.youtube.com/watch?v=-gezeX9zxuM
Isolated Continuous Task
Running a continuous task on its own with no dependencies and without other tasks depending on it is already fairly well supported by Nx. If you're using Nx to run
app:serve
,app:build --watch
, orapp:test --watch
then these are already continuous tasks. From the user's point of view, this should be the exact same as if the user were to run that task outside of Nx.When a continuous task starts running, it is expected to continue running until Nx ends it. Thus, Nx would throw an error if a continuous task were to exit before Nx terminates it. This will be true for the following cases as well.
Nx starts the continuous task solely because the user instructed it to. There is not much other known information as there will be in other use cases. Nx will thus wait for the user to interrupt Nx before it terminates the continuous task.
Continuous Tasks Depending on Discrete Tasks
Continuous tasks can depend on discrete tasks.
For example,
app:serve
could depend on itself and it's dependencies to be built before starting up.Running a continuous task on its own with dependencies on other tasks but no dependencies on itself will be much like running a continuous task in isolation. The only difference is that Nx will wait for the discrete tasks to complete before the continuous task is started. It is expected to continue running until Nx is interrupted (Ctrl + C).
Multiple Unrelated Continuous Tasks
Running multiple continuous tasks may be something users already do.
An example of this is serving two applications via
nx run-many -t serve --projects frontend,backend
such as a frontend and a backend.As it already does currently, Nx will wait dependent tasks of each continuous task before starting them respectively. As with the prior cases, Nx would also terminate these tasks when the user interrupts Nx.
The main change here will be how Nx handle the terminal outputs. Currently, the user has a few different options for how to show the output. Only 2 of the 3 really make sense.
Nx would be updated to handle multiple streams of outputs differently. What would be ideal in this case is if the user could switch focus between different continuous task outputs. None of the continuous tasks are auxilliary and all of them are main tasks. Terminal output will be covered further in more detail below.
Single Discrete Task Depending on Continuous Task(s)
A discrete task can depend on continuous task(s).
For example, e2e tests can depend on the application server. Nx would start the application server first, then start the e2e tests. Until the e2e tests complete, the application server will stay running. When the e2e tests finish, Nx will terminate the application server.
Currently, Nx does not handle this case. Because tasks wait for all dependent tasks to end and continuous tasks do not end, Nx will just stop at running the continuous task.
Nx plugins can do their best to handle this case by configuring tools to wait for another Nx task to start and produce side effects. Doing this does handle the same issue but is inefficient as it will yield multiple task graphs as opposed to being combined in a single task graph. This would mean that tasks which exist in multiple graphs, are run once for each graph being executed. Caching can be utilized to reuse as much existing work as possible. Not all work can be cached though so this could result in a lot of wasted work.
When a discrete task depends on any continuous task(s), Nx will start the continuous task(s) before running the discrete task. Unlike the previous cases, the user starting the continuous task directly. The continuous task is started to accompany the discrete task. The discrete task will eventually complete. When this happens, the continuous task(s) become orphaned and Nx will terminate them since they are no longer necessary.
As for the actual nature of the dependency, the discrete task depending on the continuous task(s) likely needs some sort of side effect to be produced before the discrete task can truly begin. If Nx were to start the discrete task immediately as soon as the continuous task is started, the discrete task may encounter issues if it does not handle the side effect not being present. For the initial implementation, Nx will classify that as an issue with the discrete task. When the tasks are run outside of Nx, they still likely need some handling for when the side effect does not exist. Application servers need to handle when their accompanying services are unavailable. And test runners are already currently configured to wait for an application server to come up. However, Nx can still prioritize starting the continuous task(s) being depended upon as soon as it can. This would give more time for the side-effect to become available while Nx runs other dependent tasks. But ultimately, Nx will not further delay the execution of tasks depending on continuous task(s) after the continuous task has started.
The terminal output of the discrete task is more important than the output of the continuous task(s) output in this case. The user initiated directly. Nx would have the discrete task's terminal output shown most prominently. The continuous task(s) output could either be hidden entirely or optionally shown. Ideally, Nx would allow the user to see two terminal outputs at once. Again, terminal output will be covered in more detail further below.
Multiple Discrete Tasks depending on a particular Continuous Task(s)
An extension of the previous case is when multiple discrete tasks depend on a single continuous task or set of continuous tasks.
An example of this is multiple e2e test suites (
app1-e2e:e2e
andapp2-e2e:e2e
both dependingapp:serve
) or atomized test suites (app1-e2e:e2e-ci--feature1.spec.ts
andapp1-e2e:e2e-ci--feature2.spec.ts
both depending onapp:serve
).As with the previous case, Nx Plugins do their best to handle this case by configuring the testing tool to start the continuous task. This has the same issues as the previous case but it is amplified because each and every task depending on the continuous task would waste some work. A lot of compute is wasted here.
When multiple discrete tasks depend on a particular continuous task or a set of continuous tasks, Nx can reuse the continuous task(s). As with the previous case, Nx will start the continuous task(s) being depended upon as soon as possible. However, Nx will not terminate the continuous task(s) until all tasks which depend upon the continuous task completes.
This case is more complicated to distribute across different machines. Multiple discrete tasks would be in the execution queue. An agent does not know which if any of those discrete tasks it will receive. Agents could prematurely run any continuous tasks any task in the execution queue depends on but this may waste some work if some agents end up not being assigned any tasks depending on those continuous tasks. Agents could only run the continuous task necessary for tasks it is assigned when it is assigned but this would likely leave the tasks depending on the continuous tasks waiting for a bit until the side effects are available. Ideally, the orchestrator would accurately make the decision upfront about which subset of agents will receive the dependent tasks and run them efficiently utilizing only those agents. There's a lot of optimization which can be done but to start, it should be sufficient for the orchestrator to send continuous task(s) to an agent before it wants to send it a task which depends on it. We'll start there and explore more optimizations later on.
As for the terminal output, now multiple discrete tasks would need to have their outputs shown while also showing the output of continuous tasks at the same time. Nx already handles showing multiple discrete tasks' output, but not the output of dependent continuous task's. The mortal task output should still be more prominent and the dependent tasks could be optionally shown. Again, terminal output will be covered in more detail further below.
Continuous Task(s) Depending on Continuous Task(s)
An extension of all the previous cases, is when continuous task(s) depend on a particular continuous task or a set of continuous tasks.
Examples of this would be if:
frontend:serve
depends onbackend:serve
frontend:serve
depends onbackend1:serve
andbackend2:serve
app1:serve
andapp2:serve
both depend onbackend:serve
app2:serve
andapp2:serve
both depend on bothbackend1:serve
andbackend2:serve
backend:serve
depends ondatabase:serve
app1:serve
andapp2:serve
both depend onbackend1:serve
andbackend2:serve
which depend ondatabase1:serve
anddatabase2:serve
respectivelyCurrently, Nx does not handle this because the continuous tasks do not end and Nx will not continue.
Nx plugins again can do their best to handle this case using an executor to optionally start the dependent servers only of they are not available. This has a lot of intricacies. When multiple tasks are started simultaneously, there is still a chance multiple tasks will start the dependent servers as they are not available. There's a lot of wasted work as it's not included in the task graph. In theory, this can be handled perfectly.. but honestly, this case is not handled well for the most part at the moment.
Nx could handle this case with the same basic principles discussed in the previous sections. Dependent Continuous tasks are started first before any continuous tasks depending on them are started. The multiple continuous tasks started directly by the user remain running until the user interrupts Nx. And when this happens, the dependent continuous tasks are also terminated.
Solutions
Configuration
A target will declare that the task that it creates is a continuous task.
Other targets can specify that its task depend on the continuous task
With this configuration, Nx will know that
app:serve
is infinite.Readiness
Earlier we touched on the fact that tasks can take some time to produce side effects which parent tasks are expecting. For instance, starting servers might not be instantaneous. Continuous tasks may need a way to indicate that the side effects have been produced. Parent tasks would then wait for their dependent tasks to be ready rather than started. This concept of "readiness" does not currently exist.
Option 1: Do not handle it
Tasks are likely already built in a way where they can be started without their dependent continuous tasks being truly ready and wait until the side effects are available. It is definitely an option to not introduce the notion of "ready" to Nx and let the task itself handle this. Given this, Nx would start the parent task possibly immediately after the dependent continuous tasks have started.
Applications would need this sort of logic when running in production (not via Nx), so most applications should already be built to handle this kind of startup. For example, frontends would need to handle if their backends either haven't started or went down. Not delaying parent tasks allows multiple continuous tasks begin any pre work at the same time.
For tests, the plugins that Nx provides for Cypress and Playwright already handle waiting for the web server to startup. This waiting would continue to exist without Cypress and Playwright explicitly responsible for starting and terminating the dependent continuous tasks.
If the parent task doesn't handle waiting for the side effect, the parent task would fail and Nx would leave it up to developers to change the behavior of the parent task.
Option 2: Add configuration
Tasks can report to Nx that the sidecar task produces a side effect. Nx can handle waiting for these side effects. Doing so would delay parent task's from executing. Readiness might not be a boolean state though. Readiness may be more gradual as more side effects are produced. It is hard to configure more gradual startups where different features become ready at different times. The most generic way of specifying readiness is to allow a task to determine a stream or iterable of readiness states. And this stream would need to originate within the actual application itself and communicate with Nx. Even if readiness were simplified to be a boolean when all side effects are available, it would still have to be built into the application itself and cause delays in task execution.
Decision
For the initial implementation, option 1 of not introducing readiness state should suffice. Nx would still be able to add readiness handling in the future while remaining backwards compatible. Nx would default tasks to immediately become ready unless it has some other indicator for Nx to use. Readiness definitely needs to be handled but this will continue to stay within the tasks. As discussed before, both application servers and test tools already likely have some handling for this.
Terminal Output
As discussed before in the different use cases, terminal output is a large part of how Nx would need to handle continuous tasks.
In general, terminal outputs of tasks can either be main or auxilliary. Main tasks are those directly requested by users and should be given prominent treatment. Tasks which main tasks depend on are auxilliary and can be optionally shown possibly at the same time. Nx does not currently have the infrastructure in place to support this.
However, utilizing Rust has already allowed Nx to be more transparent terminal output from individual tasks. In the coming year, we will be rewriting the orchestration of multiple task outputs via Rust. There are powerful terminal libraries in Rust that Nx could utilize to create a new terminal UI which can handle the above use cases.
This new UI would have 2 new capabilities that the current UI does not have:
Separate areas of terminal output
Nx should be able to output terminal output to separate areas of the terminal. This is necessary to view both main and auxilliary output at the same time. The auxilliary output should be handled differently depending on the width of the terminal. Terminals which are not wide enough will be limited to switching between main and auxilliary output.
Switch between outputs of different tasks
Multiple tasks can be main or auxilliary so Nx should allow for developers to switch between different tasks. This prevents the outputs from being interlaced which could make the output hard to understand.
Current Terminal Output
The above terminal output will take some time to create so in the mean time, there still remains the existing display solutions for continuous tasks for the time being. When the new Terminal UI is ready, the developer experience using continuous tasks will improve. The following options would be switched by the developer with the current
--outputStyle
flag.Option 1: Do not show terminal output for dependent tasks
In some situations, dependent tasks can very well be ignored. If all dependent tasks are working as they are expected to, then terminal outputs of dependent tasks could be hidden altogether. Nx already handles this today.
Option 2: Show terminal output interlaced with parent tasks
Interlacing terminal outputs is messy. Nx would prefix the outputs but either task's outputs could still be broken up in a way where it becomes hard to understand. In situations where a dependent task is not behaving as expected, this option can be used to debug the issue. This is not a great experience but Nx already currently has support for this.
EDITS
I renamed the terminology to be "Continuous Tasks"
Prior Art
https://kubernetes.io/docs/concepts/workloads/pods/sidecar-containers/
Kubernetes allows containers to specify other containers to run alongside it much like Nx's tasks expect continuous tasks to run alongside them.
Previous Sidecars RFC
This previous RFC designs out sidecars which are a subset of continuous tasks. In this design, continuous tasks depended upon by other tasks are synonymous to the sidecars mentioned in the previous design. This design encompasses more than just sidecar tasks.
Beta Was this translation helpful? Give feedback.
All reactions