-
Notifications
You must be signed in to change notification settings - Fork 336
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added unit section to status endpoint #928 #1089
base: master
Are you sure you want to change the base?
Added unit section to status endpoint #928 #1089
Conversation
Thanks for the PR! OK, first things first, this is going to need a commit message. What is this commit introducing and why do we want/need it. In this case perhaps also some example output from the endpoint. Could you move the GH issue number from the commit subject and put it at the bottom of the commit message (separated by a blank line) in the form
Cheers, |
@ac000 Do you need a commit message like this?
|
.gitignore
Outdated
@@ -2,3 +2,4 @@ | |||
Makefile | |||
*.pyc | |||
__pycache__/ | |||
/.idea/ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not entirely sure what thiis is. However I suspect this is better handled locally your end by either adding it into your global .gitignore or in the repository under .git/info/exclude
docs/changes.xml
Outdated
<change type="feature"> | ||
<para> | ||
added unit section to /status endpoint. | ||
unit section is about web-server version, config last load time and config update generation | ||
</para> | ||
</change> | ||
|
||
|
||
<change type="feature"> | ||
<para> | ||
$request_id variable contains a string that is formed using random data and |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we need two of these!?
docs/changes.xml
Outdated
date="2023-08-31" time="18:00:00 +0300" | ||
packager="Nginx Packaging <nginx-packaging@f5.com>"> | ||
|
||
<change type="feature"> | ||
<para> | ||
added unit section to /status endpoint. | ||
unit section is about web-server version, control_socket and etc. | ||
</para> | ||
</change> | ||
|
||
<change type="change"> | ||
<para> | ||
if building with njs, version 0.8.0 or later is now required. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And here's the other one.
I would also prefer the changelog entry to be in its own commit. This can give things like git-revert(1) a better chance of working if we ever need to for some reason...
src/nxt_controller.c
Outdated
nxt_buf_t *b; | ||
nxt_port_t *main_port; | ||
nxt_runtime_t *rt; | ||
time_t rawtime; | ||
struct tm *timeinfo; | ||
u_char buffer[25]; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Variables should be sorted in christmias tree order and variable types of the same length should be sorted alphabetically, yeah, I know, I didn't make the rule...
src/nxt_runtime.c
Outdated
nxt_sockaddr_t *sa; | ||
nxt_file_name_str_t file_name; | ||
const nxt_event_interface_t *interface; | ||
time_t rawtime; | ||
struct tm *timeinfo; | ||
u_char buffer[25]; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ditto variables...
src/nxt_controller.c
Outdated
strftime((char*)buffer, 25, "%Y-%m-%dT%H:%M:%S.000Z", timeinfo); | ||
|
||
rt->conf_ltime.length = strlen((char*)buffer); | ||
rt->conf_ltime.start = (u_char*) malloc(rt->conf_ltime.length + 1); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't tend to call malloc(3) directly. There is a nxt_malloc() you can use.
Also please don't cast the return, in C it simply isn't required and (in that past at least) can hide bugs.
src/nxt_runtime.c
Outdated
strftime((char*)buffer, 25, "%Y-%m-%dT%H:%M:%S.000Z", timeinfo); | ||
|
||
rt->conf_ltime.length = strlen((char*)buffer); | ||
rt->conf_ltime.start = (u_char*) malloc(rt->conf_ltime.length + 1); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Diit malloc(3).
Hmm, duplicated code...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ac000 How I can fix code duplicate? I'm using this code to get time and convert it to nxt_str_t. I can create nxt_time.c with function to reuse it
time_t rawtime;
u_char buffer[25];
struct tm *timeinfo;
time(&rawtime);
timeinfo = gmtime(&rawtime); //convert to UTC
strftime((char*)buffer, 25, "%Y-%m-%dT%H:%M:%S.000Z", timeinfo);
rt->conf_ltime.length = strlen((char*)buffer);
rt->conf_ltime.start = nxt_malloc(rt->conf_ltime.length + 1);
src/nxt_status.c
Outdated
rt = task->thread->runtime; | ||
|
||
static nxt_str_t unit_str = nxt_string("unit"); | ||
static nxt_str_t ver_str = nxt_string("version"); | ||
static nxt_str_t gen_str = nxt_string("generation"); | ||
static nxt_str_t ltime_str = nxt_string("load_time"); | ||
static nxt_str_t conns_str = nxt_string("connections"); | ||
static nxt_str_t acc_str = nxt_string("accepted"); | ||
static nxt_str_t active_str = nxt_string("active"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Variable declarations should come before code... (I know there are some places that break this).
src/nxt_status.c
Outdated
size_t i; | ||
nxt_str_t name; | ||
nxt_str_t version; | ||
nxt_int_t ret; | ||
nxt_status_app_t *app; | ||
nxt_conf_value_t *status, *obj, *apps, *app_obj; | ||
nxt_runtime_t *rt; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ditto variables...
Hi.
Just about!. Something like this at the minimum
Wrapping lines around the 72/73 charcter mark. Thanks! |
Thanks for that, quick couple of questions. "unit": {
"version": "1.32.0",
"load_time": "2024-01-25T13:24:08.000Z",
"generation": 0
}, Is What does |
load_time shows when last configured |
Some changes are ready after review. I will push it asap |
2e85ebd
to
9611a3b
Compare
@ac000 Can you help me with changes.xml or I can push my version with new commit? |
If you have it split out locally, you can force push it here with |
From review context Pardon. Should I merge with commit and push or push with new commit? |
After you split the changelog changes into their own commit, you should then have two commits for this PR, at which point you can force push them to your feature/unit_about_section branch. |
Hmm, we probably need a better name for this. 'last_configured' or somesuch... Although a 'start_time' would perhaps be useful. |
I can add it. Ask me if you know how to name it better. Do I need add |
strftime((char*)buffer, 25, "%Y-%m-%dT%H:%M:%S.000Z", timeinfo); | ||
|
||
rt->conf_ltime.length = strlen((char*)buffer); | ||
rt->conf_ltime.start = nxt_malloc(rt->conf_ltime.length + 1); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for changing the malloc(3), but now my next question is, where does this memory get free(3)'d?.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know when memory should be released. Can you explain me?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We usually use nxt_mp_xxx api instead that you don't need to explicitly do memory free.
It will be released together with the mp destroy.
src/nxt_runtime.c
Outdated
rt->conf_ltime.length = strlen((char*)buffer); | ||
rt->conf_ltime.start = nxt_malloc(rt->conf_ltime.length + 1); | ||
|
||
strcpy((char*)rt->conf_ltime.start, (char*)buffer); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should probably be nxt_copystr()
My current suggestion is 'load_time' -> 'last_configured' but maybe others will have better suggestions. Don't worry about that one for now... |
Added unit section to /status endpoint. Unit section is about web-server version, config last load time and config update generation Response example below: {"unit":{"version":"1.32.0","load_time":"2024-01-25T13:24:08.000Z","generation":0},"connections":{"accepted":0,"active":0,"idle":0,"closed":0},"requests":{"total":0},"applications":{"laravel":{"processes":{"running":1,"starting":0,"idle":1},"requests":{"active":0}}}} Closes: nginx#928
9611a3b
to
2cb99ca
Compare
It looks good to contain |
@Pavlusha311245 |
According to the Node.js documenation this variable should only include numbering scheme. Thanks to @dbit-xia. Closes: nginx#1085
This is in preparation for adding conditional access logging. No functional changes.
This feature allows users to specify conditions to control if access log should be recorded. The "if" option supports a string and JavaScript code. If its value is empty, 0, false, null, or undefined, the logs will not be recorded. And the '!' as a prefix inverses the condition. Example 1: Only log requests that sent a session cookie. { "access_log": { "if": "$cookie_session", "path": "..." } } Example 2: Do not log health check requests. { "access_log": { "if": "`${uri == '/health' ? false : true}`", "path": "..." } } Example 3: Only log requests when the time is before 22:00. { "access_log": { "if": "`${new Date().getHours() < 22}`", "path": "..." } } or { "access_log": { "if": "!`${new Date().getHours() >= 22}`", "path": "..." } } Closes: nginx#594
Conditional access logging was introduced here: nginx@4c91beb
This is a generic type to represent a uid_t/gid_t on Linux when user namespaces are in use. Technically this only needs to be an unsigned int, but we make it an int64_t so we can make use of the existing NXT_CONF_MAP_INT64 type. This will be used in subsequent commits. Reviewed-by: Zhidao Hong <z.hong@f5.com> Signed-off-by: Andrew Clayton <a.clayton@nginx.com>
Andrei reported an issue on arm64 where he was seeing the following error message when running the tests 2024/01/17 18:32:31.109 [error] 54904#54904 "gidmap" field has an entry with "size": 1, but for unprivileged unit it must be 1. This error message is guarded by the following if statement if (nxt_slow_path(m.size > 1) Turns out size was indeed > 1, in this case it was 289356276058554369, m.size is defined as a nxt_int_t, which on arm64 is actually 8 bytes, but was being printed as a signed int (4 bytes) and by chance/undefined behaviour comes out as 1. But why is size so big? In this case it should have just been 1 with a config of 'gidmap': [{'container': 0, 'host': os.getegid(), 'size': 1}], This is due to nxt_int_t being 64bits on arm64 but using a conf type of NXT_CONF_MAP_INT which means in nxt_conf_map_object() we would do (using our m.size variable as an example) ptr = nxt_pointer_to(data, map[i].offset); ... ptr->i = num; Where ptr is a union pointer and is now pointing at our m.size Next we set m.size to the value of num (which is 1 in this case), via ptr->i where i is a member of that union of type int. So here we are setting a 64bit memory location (nxt_int_t on arm64) through a 32bit (int) union alias, this means we are only setting the lower half (4) of the bytes. Whatever happens to be in the upper 4 bytes will remain, giving us our exceptionally large value. This is demonstrated by this program #include <stdio.h> #include <stdint.h> int main(void) { int64_t num = -1; /* All 1's in two's complement */ union { int32_t i32; int64_t i64; } *ptr; ptr = (void *)# ptr->i32 = 1; printf("num : %lu / %ld\n", num, num); ptr->i64 = 1; printf("num : %ld\n", num); return 0; } $ make union-32-64-issue cc union-32-64-issue.c -o union-32-64-issue $ ./union-32-64-issue num : 18446744069414584321 / -4294967295 num : 1 However that is not the only issue, because the members of nxt_clone_map_entry_t were specified as nxt_int_t's on the likes of x86_64 this would be a 32bit signed integer. However uid/gids on Linux at least are defined as unsigned integers, so a nxt_int_t would not be big enough to hold all potential values. We could make the nxt_uint_t's but then we're back to the above union aliasing problem. We could just set the memory for these variables to 0 and that would work, however that's really just papering over the problem. The right thing is to use a large enough sized type to store these things, hence the previously introduced nxt_cred_t. This is an int64_t which is plenty large enough. So we switch the nxt_clone_map_entry_t structure members over to nxt_cred_t's and use NXT_CONF_MAP_INT64 as the conf type, which then uses the right sized union member in nxt_conf_map_object() to set these variables. Reported-by: Andrei Zeliankou <zelenkov@nginx.com> Reviewed-by: Zhidao Hong <z.hong@f5.com> Signed-off-by: Andrew Clayton <a.clayton@nginx.com>
Use the NXT_CONF_VLDT_REQUIRED flag on the app_procmap members. These three settings are required. These are for the uidmap & gidmap settings in the config. Suggested-by: Zhidao HONG <z.hong@f5.com> Reviewed-by: Zhidao Hong <z.hong@f5.com> Signed-off-by: Andrew Clayton <a.clayton@nginx.com>
The commit that added support for Unix sockets accepts abstract sockets using '@' in the config, but we stored it internally using '\0'. We want to support abstract sockets transparently to the user, so that if the user configures unitd with '@', if we receive a query about the current configuration, the user should see the same exact thing that was configured. So, this commit avoids the transformation in the internal state file, storing user input pristine, and we only transform the '@' in temporary strings. This commit fixes another bug, where we try to connect to abstract sockets with a trailing '\0' in their name due to calling twice nxt_sockaddr_parse() on the same string. By calling that function only once with each copy of the string, we have fixed that bug. The following code was responsible for this bug, which the second time it was called, considered these sockets as file-backed (not abstract) Unix socket, and so appended a '\0' to the socket name. $ grepc -tfd nxt_sockaddr_unix_parse . | grep -A10 @ if (path[0] == '@') { path[0] = '\0'; socklen--; #if !(NXT_LINUX) nxt_thread_log_error(NXT_LOG_ERR, "abstract unix domain sockets are not supported"); return NULL; #endif } sa = nxt_sockaddr_alloc(mp, socklen, addr->length); This bug was found thanks to some experiment about using 'const' for some strings. And here's some history: - 9041d27 ("nxt_sockaddr_parse() introducted.") This commit introduced support for abstract Unix sockets, but they only worked as "servers", and not as "listeners". We corrupted the JSON config file, and stored a \u0000. This also caused calling connect(2) with a bogus trailing null byte, which tried to connect to a different abstract socket. - d8e0768 ("Fixed support for abstract Unix sockets.") This commit (partially) fixed support for abstract Unix sockets, so they they worked also as listeners. We still corrupted the JSON config file, and stored a \u0000. This caused calling connect(2) (and now bind(2) too) with a bogus trailing null byte. - e2aec66 ("Storing abstract sockets with @ internally.") This commit fixed the problem by which we were corrupting the config file, but only for "listeners", not for "servers". (It also fixes the issue about the terminating '\0'.) We completely forgot about "servers", and other callers of the same function. To reproduce the problem, I used the following config: ```json { "listeners": { "*:80": { "pass": "routes/u" }, "unix:@abstract": { "pass": "routes/a" } }, "routes": { "u": [{ "action": { "pass": "upstreams/u" } }], "a": [{ "action": { "return": 302, "location": "/i/am/not/at/home/" } }] }, "upstreams": { "u": { "servers": { "unix:@abstract": {} } } } } ``` And then check the state file: $ sudo cat /opt/local/nginx/unit/master/var/lib/unit/conf.json \ | jq . \ | grep unix; "unix:@abstract": { "unix:\u0000abstract": {} After this patch, the state file has a '@' as expected: $ sudo cat /opt/local/nginx/unit/unix/var/lib/unit/conf.json \ | jq . \ | grep unix; "unix:@abstract": { "unix:@abstract": {} Regarding the trailing null byte, here are some tests: $ sudo strace -f -e 'bind,connect' /opt/local/nginx/unit/d8e0/sbin/unitd \ |& grep abstract; [pid 22406] bind(10, {sa_family=AF_UNIX, sun_path=@"abstract\0"}, 12) = 0 [pid 22410] connect(134, {sa_family=AF_UNIX, sun_path=@"abstract\0"}, 12) = 0 ^C $ sudo killall unitd $ sudo strace -f -e 'bind,connect' /opt/local/nginx/unit/master/sbin/unitd \ |& grep abstract; [pid 22449] bind(10, {sa_family=AF_UNIX, sun_path=@"abstract"}, 11) = 0 [pid 22453] connect(134, {sa_family=AF_UNIX, sun_path=@"abstract\0"}, 12) = -1 ECONNREFUSED (Connection refused) ^C $ sudo killall unitd $ sudo strace -f -e 'bind,connect' /opt/local/nginx/unit/unix/sbin/unitd \ |& grep abstract; [pid 22488] bind(10, {sa_family=AF_UNIX, sun_path=@"abstract"}, 11) = 0 [pid 22492] connect(134, {sa_family=AF_UNIX, sun_path=@"abstract"}, 11) = 0 ^C Fixes: 9041d27 ("nxt_sockaddr_parse() introducted.") Fixes: d8e0768 ("Fixed support for abstract Unix sockets.") Fixes: e2aec66 ("Storing abstract sockets with @ internally.") Link: <nginx#1108> Reviewed-by: Andrew Clayton <a.clayton@nginx.com> Cc: Liam Crilly <liam.crilly@nginx.com> Cc: Zhidao Hong <z.hong@f5.com> Signed-off-by: Alejandro Colomar <alx@kernel.org>
It's an integer, not a floating number. Fixes: 68c6b67 ("Configuration: support for rational numbers.") Closes: nginx#1115 Link: <nginx#1116> Reviewed-by: Zhidao Hong <z.hong@f5.com> Reviewed-by: Andrew Clayton <a.clayton@nginx.com> Cc: Dan Callahan <d.callahan@f5.com> Cc: Valentin Bartenev <vbartenev@gmail.com> Signed-off-by: Alejandro Colomar <alx@kernel.org>
Added unit section to /status endpoint. Unit section is about web-server version, config last load time and config update generation Response example below: {"unit":{"version":"1.32.0","load_time":"2024-01-25T13:24:08.000Z","generation":0},"connections":{"accepted":0,"active":0,"idle":0,"closed":0},"requests":{"total":0},"applications":{"laravel":{"processes":{"running":1,"starting":0,"idle":1},"requests":{"active":0}}}} Closes: nginx#928
Hi @hongzhidao I can start working on this pr again. Can you describe in more detail what you meant by giving links to methods? Is there a problem with the formatting or the coding itself? |
I mean there is |
Is this all the comments on the code for now? I would like to know if there is anything else, then I can fully understand what to do so as not to return to the dialogue after each completed stage 🙂 |
But... How to use nxt_time_strint_t? I use function |
I'm afraid we can only point out that these are obvious first. Not sure if there is anyone else. And in our experience, it's common to rework patches after review.
I'd suggest you look into the source code, it's too detailed, but I usually look at its usage by doing something like |
Hi @hongzhidao Can a It's not immediately obvious to me as the handler function seems to |
Yes, I think so. Here's an example from
Correct. And The other example is related to http variable.
|
Hi @Pavlusha311245, |
It's a convoluted interface, but I see how it would work now. However I think it could end up doing quite a lot of work at start up At least I think it might be best to simply store either a That's the bit I'm not sure about, if you can use an arbitrary Failing that just use the standard libc API at the time of showing That way you do less work at start up (only storing a You also use a little less memory for each timestamp, 8 or 16 bytes vs |
Are you suggesting to change the format of a timestamp in the If so, I'm not sure that's a good idea, people may be relying on the |
Nope, it's for learning the usage of |
I think so. Here's the related source code.
Sorry that I don't understand it. |
Yes, but how do you get that into the The handler function is called from the likes of So I can't see how you could store a Looks like you'd need to create a new function for that case...
Rather than storing a string representing the timestamp, you simply store But I suppose it's a toss up between doing perhaps unnecessary work at |
Yes, you need a specific function, for example.
Note that the function
There are serval ways to do it like what happens in
I guess it's not often, but anyway, I feel |
[...]
My point is these all operate on the current time. So you'd need a
I wasn't asking how to do it, I was suggesting it may be a preferable
What is cached exactly? I think there's a lot over-optimisation in Unit... However reading the clock should generally be pretty efficient as it's
Where it makes sense... |
Take the error log as an example, if there are lots of logs generated in a very short term, for example, 1s, these logs may count the time only once, but not calling something like
Good to know, thanks. Either way, we have to do two things: get the current time and format time-string. |
nxt_int_t conf_gen; | ||
nxt_str_t conf_ltime; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is a nxt_status_report_t
structure in src/nxt_status.h
. I wonder if these would be better placed there?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is a
nxt_status_report_t
structure insrc/nxt_status.h
. I wonder if these would be better placed there?
Hmm... I can try 🙂 But I thought of leaving them in runtime so that I could use them anywhere else. Also I need it here to known time when config initialized. Have you suggestion how I can do it? @ac000
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems they don't belong to nxt_runtime_t
and nxt_status_report_t
.
The nxt_status_report_t
is a temporary structure to collect the status information from every engine between worker threads in the router process.
The configuration generation needs to increase when a new configuration is generated. But I can't see where it happens in the PR. The configuration time needs to be recorded on its corresponding configuration.
I feel you need to handle them first.
No clear idea on it yet, but you might start with nxt_router_conf_data_handler()
which generates configuration structure through control API.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The configuration generation needs to increase when a new configuration is generated. But I can't see where it happens in the PR. The configuration time needs to be recorded on its corresponding configuration. I feel you need to handle them first.
The update of the counter and time is now located in nxt_controller
in thenxt_controller_conf_store()
method
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The configuration is updated by the control API which is created in the controller process.
I feel it has nothing to do with the structure nxt_runtime_t
.
When a new configuration is applied, the last JSON value is stored as nxt_controller_conf
.
We could consider putting the generation and updated time in this structure nxt_controller_conf_t
.
Btw, debug information in error.log
is useful.
On Thu, 21 Mar 2024 06:40:46 -0700 Pavel Zavadski ***@***.***> wrote:
@Pavlusha311245 commented on this pull request.
> + nxt_int_t conf_gen;
+ nxt_str_t conf_ltime;
But I thought of leaving them in runtime so that I could use them anywhere else
Do you have a specific use case in mind?
I think it would be nice to have _all_ (or at least most of them, I
can see the odd one possibly coming from some other pre-existing
place) the status variables stored in the same place.
If you did then find that you needed them for some other purpose, then
I would look at making the status structure more accessible.
I know this may sound like a lot of extra work, but I think it's
important to get the fundamentals right now, before this API is
expanded any further...
|
@ac000 Hi. Unfortunately, I can’t think of any case, although I think it might suddenly be needed. As I understand from the message. If I need to save a given use case, will I need to make a structure for this? |
@Pavlusha311245 Hello there! If I understand correctly it looks like left to do is to use the nxt_mp_alloc instead of nxt_alloc and to refactor to extend the nxt_controller module instead of keeping the new data in the nxt_runtime_t object? Please let us know if there is anything blocking you! |
There is not enough time to complete this task. I want to close it as soon as possible so that the changes go to the main branch. If there is such a possibility, I would ask for help |
…eature/unit_about_section
Added unit section to /status endpoint. Unit section is about web-server version, config last load time and config update generation
Response example below:
Closes: #928