Add a baseline CFIS dataset #40

EiffL · 2021-11-07T02:28:04Z

This PR adds a first pass of a tf dataset of CFIS-like images, this aims at solving #39.

This is a first pass because we will probably want to add additional variants on that dataset in the future, but it is a starting point

EiffL · 2021-11-07T02:36:47Z

Super easy to use:

import autometacal
import tensorflow_dataset as tfds
import tensorflow as tf

# We create a function to add noise on the fly for data augmentation
@tf.function
def add_noise(example):
    im_noise = example['obs'] + example['noise_std'] * tf.random.normal([51,51])
    return im_noise, example

dset = tfds.load('CFIS/parametric_shear_1k', split='train')
dset = dset.map(add_noise)

for im, example in dset:
  # im contains the noisy observation
  # example contains psf image, observation without noise, and galaxy magnitude
  ....

aguinot · 2021-11-08T13:40:41Z

I checked the notebook, I am a bit confused by the results:

Do you know why autometacal is "slower"? (from the tqdm print)
The response are different from ngmix and autocal (and they are not negligible).
Autocal is 1 magnitude better than ngmix, do you think this is because of autodiff? If so, this is very cool!
I think I would make a test at very high SNR (like 50000-100000) just to make sure everything is working well. Maybe you have already tried?

EiffL · 2021-11-09T08:40:14Z

Dont look too closely at the notebook. The response computed by autometacal is wrong, cf #26.
The tqdm looks slower but that's because I do like 10000 instead of 1000 galaxies.

The main point of this PR is to add the dataset, the notebook is to illustrate how to load and use the dataset, but it's optional, if you think it's not useful I can drop it for now.

EiffL

I have a few comments

EiffL · 2021-11-16T13:49:40Z

autometacal/python/datasets/CFIS.py

+          'psf': tfds.features.Tensor(shape=[self.builder_config.stamp_size,
+                                                   self.builder_config.stamp_size],
+                                        dtype=tf.float32),    
+          # 'gal_kimage': tfds.features.Tensor(shape=[2, self.builder_config.kstamp_size,


what do you have these commented lines? If they are not needed we should remove to avoid cluttering the code

that;s a good point, I'll rmove this stuff for now

andrevitorelli

gtg

EiffL added 2 commits November 7, 2021 03:01

adds prototype cfis dataset

1191bb1

Update demo notebook

074f0a6

EiffL requested review from andrevitorelli and aguinot November 7, 2021 02:28

EiffL mentioned this pull request Nov 7, 2021

Build a realistic galaxy sample, with some typical SNR, shape, size #39

Closed

andrevitorelli linked an issue Nov 10, 2021 that may be closed by this pull request

Build a realistic galaxy sample, with some typical SNR, shape, size #39

Closed

EiffL commented Nov 16, 2021

View reviewed changes

andrevitorelli approved these changes Nov 18, 2021

View reviewed changes

andrevitorelli merged commit 974b271 into main Nov 18, 2021

andrevitorelli mentioned this pull request Nov 18, 2021

New data generators #34

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a baseline CFIS dataset #40

Add a baseline CFIS dataset #40

EiffL commented Nov 7, 2021

EiffL commented Nov 7, 2021

aguinot commented Nov 8, 2021

EiffL commented Nov 9, 2021

EiffL left a comment

EiffL Nov 16, 2021

EiffL Nov 16, 2021

andrevitorelli left a comment

Add a baseline CFIS dataset #40

Add a baseline CFIS dataset #40

Conversation

EiffL commented Nov 7, 2021

EiffL commented Nov 7, 2021

aguinot commented Nov 8, 2021

EiffL commented Nov 9, 2021

EiffL left a comment

Choose a reason for hiding this comment

EiffL Nov 16, 2021

Choose a reason for hiding this comment

EiffL Nov 16, 2021

Choose a reason for hiding this comment

andrevitorelli left a comment

Choose a reason for hiding this comment