Fix: mdsip services zombie processes #2645

santorofer · 2023-10-25T19:35:44Z

When executing a shot cycle using D-Tacq's digitizers, sometimes mdsip processes become zombie processes. This, in turn, will break the shot cycle for the user.

Those mdsip processes hold sockets that were kept open even though no data was being received.

Solution: the while loop in the device driver that receives data from the socket now also checks the value of the running node (true or false), so the loop exits correctly when the stop method is executed.

For this to work, the user must make sure the stop method is run for the device.
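
A minimal sketch of the idea, not the driver code itself: a streaming thread that re-checks a stop flag every time the socket times out, so that calling stop ends the loop and releases the socket even when the peer is idle. The class name `StreamWorker`, the use of `threading.Event` in place of the device's running node, and the 0.1 s timeout are all assumptions made for illustration.

```python
import socket
import threading
import time


class StreamWorker(threading.Thread):
    """Hypothetical stand-in for a driver's streaming thread."""

    def __init__(self, sock):
        super().__init__(daemon=True)
        self.sock = sock
        # Assumption: an Event plays the role of the device's running node.
        self.running = threading.Event()
        self.running.set()
        self.received = 0

    def run(self):
        # A short timeout makes recv() return control regularly,
        # so the flag is re-checked even when no data arrives.
        self.sock.settimeout(0.1)
        while self.running.is_set():
            try:
                chunk = self.sock.recv(4096)
            except socket.timeout:
                continue  # idle socket: go back and test the flag again
            if not chunk:
                break  # peer closed the connection
            self.received += len(chunk)
        self.sock.close()  # release the socket on every exit path

    def stop(self):
        self.running.clear()


def demo():
    digitizer, driver_side = socket.socketpair()
    worker = StreamWorker(driver_side)
    worker.start()
    digitizer.sendall(b"x" * 10)
    # Wait (bounded) until the worker has consumed the data.
    deadline = time.monotonic() + 2.0
    while worker.received < 10 and time.monotonic() < deadline:
        time.sleep(0.01)
    # The peer now stays open but silent; only stop() can end the loop.
    worker.stop()
    worker.join(timeout=2.0)
    digitizer.close()
    return worker.received, worker.is_alive()
```

If `stop()` is never called in this sketch, the thread keeps polling the idle socket indefinitely, which mirrors why the fix depends on the stop method being run.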


zack-vii

I assume you intended to fix both devices the same way? Since this is a common problem with streaming sockets, I have had to deal with this very problem myself. I would solve it like this:

# reduce delay after stop
s.settimeout(1)
while toread and self.running:
    try:
        nbytes = s.recv_into(
            view, min(self.io_buffer_size, toread))
    except socket.timeout:
        # timed out: keep the partial buffer, re-check the running flag
        if self.running:
            continue
        self.running = False
        break
    if nbytes == 0:
        # the peer closed the socket
        break
    if first:
        self.trig_time = time.time()
        first = False
    view = view[nbytes:]  # slicing views is cheap
    toread -= nbytes

If a blocking socket returns 0 bytes, it has been closed, and you stop the stream.
If running is false, you stop the stream.
If the socket times out, you do not discard partial buffers but check whether the device is still running.
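
The three exit conditions above can be exercised in isolation. The following is a sketch under stated assumptions, not the driver's actual code: `fill_buffer`, its `is_running` callback (standing in for `self.running`), and the timeout values are hypothetical names and numbers chosen for the example.

```python
import socket
import time


def fill_buffer(sock, buf, is_running, chunk_size=4096, timeout=1.0):
    """Read into buf until it is full, the peer closes, or is_running()
    turns false. Returns the number of bytes actually read."""
    sock.settimeout(timeout)
    view = memoryview(buf)
    toread = len(buf)
    while toread and is_running():
        try:
            nbytes = sock.recv_into(view, min(chunk_size, toread))
        except socket.timeout:
            # Keep what was read so far; the loop condition
            # re-checks is_running() before the next attempt.
            continue
        if nbytes == 0:
            break  # peer closed the socket
        view = view[nbytes:]  # slicing views is cheap
        toread -= nbytes
    return len(buf) - toread


def demo():
    # Case 1: the buffer fills completely.
    a, b = socket.socketpair()
    b.sendall(b"12345678")
    full = fill_buffer(a, bytearray(8), lambda: True)
    a.close()
    b.close()

    # Case 2: the peer closes after a partial buffer.
    a, b = socket.socketpair()
    b.sendall(b"abc")
    b.close()
    closed = fill_buffer(a, bytearray(8), lambda: True)
    a.close()

    # Case 3: the peer goes silent and "running" turns false later.
    a, b = socket.socketpair()
    b.sendall(b"xy")
    stop_at = time.monotonic() + 0.3
    stopped = fill_buffer(a, bytearray(8),
                          lambda: time.monotonic() < stop_at,
                          timeout=0.05)
    a.close()
    b.close()
    return full, closed, stopped
```

In the third case the partial data is kept rather than discarded, and the function returns shortly after the flag turns false instead of blocking on the silent socket.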

santorofer · 2024-06-10T19:03:07Z

Yes, that is correct: we basically fixed two drivers, for modules 435 and 423. Now, I think our changes to that part of the code are the same as your suggestion, am I right? We also catch socket.timeout in the try block that reads into the memoryview, which is also where nbytes is checked. I must be missing something.

zack-vii · 2024-06-10T20:17:37Z

pydevices/HtsDevices/acq2106_435st.py

@@ -357,9 +357,14 @@ def run(self):
                        while toread:


This line seems to differ from line 299 of the other device, which reads "while toread and self.running:".
Given that the complexity of the run method is quite high, it would probably be best to share one implementation where possible.

santorofer requested a review from WhoBrokeTheBuild October 31, 2023 16:51

WhoBrokeTheBuild assigned santorofer Apr 29, 2024

WhoBrokeTheBuild added the bug and devices labels Apr 29, 2024

santorofer added 2 commits April 29, 2024 14:32

Fix: added fix for the mdsip spinning issues

5b46bb9

Changed the way we checked nbytes to avoid mdsip services from becoming zombies

f122145

WhoBrokeTheBuild force-pushed the mdsip-zombies branch from e98a66c to f122145 Compare April 29, 2024 18:32

zack-vii requested changes Jun 1, 2024


zack-vii reviewed Jun 10, 2024

