Conversation

@MarkDaoust MarkDaoust (Member) commented Oct 17, 2018

It still needs an introduction and conclusion paragraph.

But this won't be visible on the site unless we add it to the "_toc.yaml"

Staging:

https://github.com/MarkDaoust/docs/blob/load_data/site/en/tutorials/load_data/images.ipynb

https://colab.sandbox.google.com/github/MarkDaoust/docs/blob/load_data/site/en/tutorials/load_data/images.ipynb

@random-forests random-forests (Member) commented Oct 17, 2018

Thank you for writing this! Good timing, too. Together with Kiran's notebook on TFRecords, this would make a great start to a "Loading data" collection.

Suggestions:

Under "Load and format the images"

  • Instead of normalizing the image with "image = (image/128.0) - 1", how about "image = image / 255"?

  • This would also simplify the code to display the images: instead of "plt.imshow((image+1)/2)", we can write "plt.imshow(image)".

  • How about demoing the "load_and_format_image" function, just to show that it works like regular Python code when eager execution is enabled? To do so, you could add a short block under it that calls the function with something like "load_and_format_image('/root/.keras/datasets/flower_photos/tulips/7166644048_b00a14f01b.jpg', 0)". (I know there's a bunch of output; maybe capture it in an object, show how to get the .numpy() value, and print out the shape.)
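For illustration, a minimal sketch of such a load-and-format step. This is not the notebook's actual code: the function name, the 192x192 target size, and the synthetic image (used so the example needs no file on disk) are all assumptions.

```python
import numpy as np
import tensorflow as tf

# Hypothetical stand-in for the notebook's `load_and_format_image`;
# the real function reads a file path, but a synthetic image keeps
# this sketch self-contained.
def format_image(image, label):
    image = tf.image.resize(image, [192, 192])  # returns float32
    image = image / 255.0                       # normalize to [0, 1]
    return image, label

raw = np.random.randint(0, 256, size=(300, 400, 3), dtype=np.uint8)
image, label = format_image(tf.constant(raw), 0)

print(image.shape)                 # (192, 192, 3)
print(image.numpy().min() >= 0.0)  # True
print(image.numpy().max() <= 1.0)  # True
```

Because eager execution is on, the result is an ordinary tensor whose value you can inspect immediately with `.numpy()`, which is exactly the point of the suggested demo block.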

Under "Pipe to a model for training"

  • Explain the prefetch parameter.
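To make the suggestion concrete, a minimal sketch of what prefetch does: it decouples the producer (the input pipeline) from the consumer (the training loop), so the next batches are prepared in the background while the current one is being used. The toy dataset here is illustrative, and `tf.data.AUTOTUNE` is the current spelling (at the time of this PR it lived under `tf.data.experimental.AUTOTUNE`).

```python
import tensorflow as tf

# Toy pipeline: batches are produced ahead of time into a buffer
# while downstream code consumes them.
ds = tf.data.Dataset.range(10).batch(2)
ds = ds.prefetch(buffer_size=tf.data.AUTOTUNE)  # let tf.data pick the buffer size

first = next(iter(ds))
print(first.numpy())  # [0 1]
```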

Under "Quick transfer learning with keras.Applications"

  • Leave the depth_multiplier at the default value.

  • Finally, split the dataset into train / test, so we can show a quick evaluation after the transfer learning section, which is really cool.
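A sketch of one way the suggested split could look with tf.data, using `Dataset.take`/`Dataset.skip`; the toy dataset and the 20/80 proportions are illustrative, not from the notebook.

```python
import tensorflow as tf

# Split a dataset into test (first 20 elements) and train (the rest).
# With real data you would shuffle with a fixed seed before splitting.
ds = tf.data.Dataset.range(100)
test_ds = ds.take(20)
train_ds = ds.skip(20)

print(sum(1 for _ in test_ds), sum(1 for _ in train_ds))  # 20 80
```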

Overall this is great!

@MarkDaoust MarkDaoust (Member, Author) replied:

Instead of normalizing the image with "image = (image/128.0) - 1", how about "image = image / 255"?

Done.
The tf.keras.applications.mobilenet model expects data normalized to [-1, 1].
I've switched most of the notebook to use [0, 1], with a transform to [-1, 1] in the transfer learning section.
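The transform described here is a one-liner; a minimal sketch (the function name is illustrative, not the notebook's):

```python
import tensorflow as tf

# Map images from the notebook's [0, 1] range to the [-1, 1] range
# that the MobileNet application model expects.
def to_mobilenet_range(image, label):
    return 2 * image - 1, label

image = tf.constant([0.0, 0.5, 1.0])
rescaled, label = to_mobilenet_range(image, 0)
print(rescaled.numpy())  # [-1.  0.  1.]
```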

demo'ing the "load_and_format_image"

Done.

Explain the prefetch parameter.

Done.

Leave the depth_multiplier at the default value.

Done.

Finally, split the dataset into train / test, so we can show a quick evaluation after the transfer learning section, which is really cool.

I'd rather avoid going into too much detail here. I just want to get to "see how easy that was!" and leave the rest for a real tutorial.

@MarkDaoust MarkDaoust added the "cla: yes" (CLA has been signed) label Oct 23, 2018
+Split H2/H3. +minor cleanup and clarifications.
@MarkDaoust MarkDaoust requested review from jsimsa and mrry and removed request for wolffg November 28, 2018 18:53
@MarkDaoust MarkDaoust (Member, Author) commented:

+tf.data

Hey Derek, Jiri,

Does this seem like a reasonable approach to loading image directories with tf.data?

@jsimsa jsimsa (Contributor) commented Nov 29, 2018

Thanks @MarkDaoust, I did a pass and left some comments.

@mrry mrry (Contributor) left a comment:

Thanks for getting this started, Mark!

One disadvantage of using an in-memory cache is that the cache must be rebuilt on each run, giving the same startup delay each time the dataset is started:
A contributor replied:

I think @rohan100jain has some changes in flight that will help this no longer be the case.
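One existing workaround is worth noting: passing a filename to `Dataset.cache` writes the cache to disk, so later runs can reuse it instead of rebuilding an in-memory cache. A minimal sketch (the temp-file path and toy dataset are illustrative):

```python
import os
import tempfile

import tensorflow as tf

# Cache to a file instead of memory; the cache persists across runs,
# so the startup delay is paid only on the first pass.
cache_path = os.path.join(tempfile.mkdtemp(), "example.cache")
ds = tf.data.Dataset.range(5).cache(cache_path)

print([int(x) for x in ds])  # first pass populates the cache file
print([int(x) for x in ds])  # later passes read back from disk
```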

@yifeif yifeif added the "ready to pull" (Start merge process) label Dec 4, 2018
@MarkDaoust
Copy link
Member Author

(not really merging, just testing copybara)

@MarkDaoust MarkDaoust removed the "ready to pull" (Start merge process) label Dec 4, 2018
@MarkDaoust MarkDaoust added and then removed the "ready to pull" (Start merge process) label Dec 4, 2018
@rcrowe-google rcrowe-google added the "ready to pull" (Start merge process) label Dec 4, 2018
@lamberta lamberta removed the "ready to pull" (Start merge process) label Dec 4, 2018
@lamberta lamberta added the "notebook: new" (New Colab notebook tutorial) label Dec 26, 2018
@MarkDaoust MarkDaoust added the "ready to pull" (Start merge process) label Jan 14, 2019
@MarkDaoust MarkDaoust merged commit 262f768 into tensorflow:master Jan 18, 2019
MarkDaoust added a commit that referenced this pull request Jan 18, 2019
PiperOrigin-RevId: 229997672

Labels

cla: yes (CLA has been signed) · notebook: new (New Colab notebook tutorial) · ready to pull (Start merge process)

7 participants