Remove multiplier from resource docs, add it instead as an extra first dimension in descriptor shape #576

jwlodek · 2024-09-13T14:01:00Z

Per discussion today, it was decided that in order to keep the dimensionality of shape in the event descriptor consistent, that the multiplier should be added there as the first dimension. So for example in the case of two detectors w/ x-y dimensions of 10x10, with one collecting a single frame per acq, and one collecting 5, we would have

shape: (1, 10, 10)

and

shape: (5, 10, 10)

respectively in the descriptor. Since this tells you the multiplier, and you can fetch this information given a resource doc by looking back at the associated descriptor, the multiplier field can be removed from the resource doc creation.

The text was updated successfully, but these errors were encountered:

jwlodek · 2024-09-13T14:01:23Z

@coretl @genematx @danielballan

coretl · 2024-11-22T11:36:11Z

@jwlodek was this something you were going to do, or should someone else pick it up?

thomashopkins32 · 2024-12-26T21:51:34Z

Going to open a PR for this change soon.

From my understanding, the way this operates is that it holds off on producing a StreamResource until the index has been incremented by one. The index does not get incremented until multiplier number of "captures" or "exposures" have been processed.

So the only work to be completed for this issue is to do the following:

Add the multiplier as the first dimension of DataKey.shape
Ensure that the index provided by DetectorWriter.get_indices_written() and DetectorWriter.observe_indices_written() is divided by this multiplier so that it actually captures the correct amount of exposures
Add unit tests showing that describe() works as intended
Add unit tests showing that stream resources are actually batches of exposures

Also, I think batch_size is a better name for this...multiplier could mean anything and it took me a few passes to try and figure out what it actually meant.

thomashopkins32 · 2024-12-27T15:46:56Z

Blocked on the merge of #606 since that reworks a lot of the overlapping code

coretl · 2025-01-06T10:00:29Z

* Ensure that the index provided by `DetectorWriter.get_indices_written()` and `DetectorWriter.observe_indices_written()` is divided by this `multiplier` so that it actually captures the correct amount of exposures

This should already be done in the code...

* Add unit tests showing that `describe()` works as intended

* Add unit tests showing that stream resources are actually batches of exposures

...although when you add these unit tests you may find it doesn't work as indended!

Also, I think batch_size is a better name for this...multiplier could mean anything and it took me a few passes to try and figure out what it actually meant.

Good idea to change the name, but we're trying to convey number of detector triggers/exposures/frames that make up a single index in a StreamDatum, so maybe frames_per_event would be better?

thomashopkins32 · 2025-01-06T14:45:47Z

@coretl frames_per_event sounds good as well. I decided on batch_size since that lines up with what typical deep learning frameworks in Python use for the first dimension. Instinctively when I see a 4-d array that is supposed to be image data I assume that the first dimension is a batch size. I'm not sure how true this is for the would-be users of ophyd-async though. Let me know if you think frames_per_event would be better and I can change it to that.

This should already be done in the code...

I believe I found a few places where this was not done. Either way, I did not notice any unit tests that actually examine the behavior when the multiplier > 1.

coretl · 2025-01-06T16:33:57Z

Decided on frames_per_event. Also decided to rename StandardDetector -> FramingDetector.

coretl · 2025-01-07T11:12:08Z

Decided on frames_per_event. Also decided to rename StandardDetector -> FramingDetector.

Since then we have had second thoughts on the FramingDetector name, especially given that the unique thing about StandardDetector is that it is prepared with the number of triggers and type it takes, and the writer makes stream_resource and stream_datum. On balance it is better to leave it as StandardDetector and explain what that entails. If we have something later on that doesn't fit we will revisit that name then.

The frames_per_event name change should still take place.

thomashopkins32 self-assigned this Dec 26, 2024

thomashopkins32 mentioned this issue Dec 27, 2024

Split out common AD file plugin logic into core writer class, create ADTiffWriter #606

Merged

3 tasks

thomashopkins32 linked a pull request Jan 8, 2025 that will close this issue

Rename multiplier to frames_per_event and move to first dim of shape #726

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove multiplier from resource docs, add it instead as an extra first dimension in descriptor shape #576

Remove multiplier from resource docs, add it instead as an extra first dimension in descriptor shape #576

jwlodek commented Sep 13, 2024

jwlodek commented Sep 13, 2024

coretl commented Nov 22, 2024

thomashopkins32 commented Dec 26, 2024

thomashopkins32 commented Dec 27, 2024

coretl commented Jan 6, 2025

thomashopkins32 commented Jan 6, 2025

coretl commented Jan 6, 2025

coretl commented Jan 7, 2025

Remove multiplier from resource docs, add it instead as an extra first dimension in descriptor shape #576

Remove multiplier from resource docs, add it instead as an extra first dimension in descriptor shape #576

Comments

jwlodek commented Sep 13, 2024

jwlodek commented Sep 13, 2024

coretl commented Nov 22, 2024

thomashopkins32 commented Dec 26, 2024

thomashopkins32 commented Dec 27, 2024

coretl commented Jan 6, 2025

thomashopkins32 commented Jan 6, 2025

coretl commented Jan 6, 2025

coretl commented Jan 7, 2025