[maskedtensor] Add missing nan ops tutorial #2046
Conversation
[ghstack-poisoned]
Please address the comments on masked_tensor vs MaskedTensor. I'm also not sure that this tutorial is all that helpful as a standalone. Would it make more sense to just merge it into the overall tutorial, or to have a second tutorial that aggregates advanced features? This feels like a pretty minor demonstration of nanmean with Tensors vs mean with MT.
>>> y
tensor([nan, 1., 4., 9., nan, 5., 12., 21., nan, 9., 20., 33., nan, 13.,
        28., 45.])
>>> y.nanmean()
It might be useful to inline some comments on what you're trying to show here.
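For example, a lightly commented version of that snippet might read as follows (a sketch, assuming the prototype masked_tensor factory is importable from torch.masked; the exact import path may differ by release):

>>> import torch
>>> from torch.masked import masked_tensor  # prototype API; import path may differ
>>> y = torch.tensor([float('nan'), 1., 4., 9., float('nan'), 5., 12., 21.,
...                   float('nan'), 9., 20., 33., float('nan'), 13., 28., 45.])
>>> # Tensor route: a dedicated nan-aware reduction that skips the nan entries
>>> y.nanmean()
tensor(16.6667)
>>> # MaskedTensor route: mask out the nan entries once, then use the ordinary mean
>>> torch.mean(masked_tensor(y, ~torch.isnan(y)))  # same value, as a 0-dim MaskedTensor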
        28., 45.])
>>> y.nanmean()
tensor(16.6667)
>>> torch.mean(masked_tensor(y, ~torch.isnan(y)))
Is the goal to have sequential tutorials or to keep each one self-contained? If the latter, can you add the relevant imports up top?
This tutorial will be merged with the overview!
        28., 45.])
>>> y.nanmean()
tensor(16.6667)
>>> torch.mean(masked_tensor(y, ~torch.isnan(y)))
Did you replace masked_tensor with MaskedTensor in pytorch/pytorch@5e9c26c? If so, can you update the tutorial here?
masked_tensor is the preferred function to use!
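A minimal illustration of the distinction (a sketch; assuming masked_tensor is the factory function and MaskedTensor the class, analogous to torch.tensor vs torch.Tensor):

>>> import torch
>>> from torch.masked import masked_tensor  # prototype API
>>> data = torch.tensor([1., 2., 3.])
>>> mask = torch.tensor([True, False, True])
>>> mt = masked_tensor(data, mask)  # preferred: factory function, like torch.tensor
>>> # MaskedTensor(data, mask) would construct via the class directly, like torch.Tensor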
tensor([nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan])
>>> torch.nanmean(x)
tensor(nan)
>>> torch.mean(masked_tensor(x, ~torch.isnan(x)))
Same comment as above regarding MaskedTensor.
>>> y
tensor([nan, 1., 4., 9., nan, 5., 12., 21., nan, 9., 20., 33., nan, 13.,
        28., 45.])
>>> y.nanmean()
Probably outside the scope of this review, but why do we have nanmean() as an API instead of the pandas-style mean(..., skipna=True)?
Not sure.
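For reference, the two styles being contrasted look roughly like this (a sketch; pandas is used only to illustrate the skipna form):

>>> import torch
>>> import pandas as pd
>>> t = [1., float('nan'), 3.]
>>> torch.nanmean(torch.tensor(t))  # dedicated nan-aware op
tensor(2.)
>>> pd.Series(t).mean(skipna=True)  # pandas-style keyword on the ordinary mean
2.0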
        28., 45.])
>>> y.nanmean()
tensor(16.6667)
>>> torch.mean(masked_tensor(y, ~torch.isnan(y)))
Have we considered API sugar:

(1) instantiating an MT from a Tensor, assuming NaN is the mask:

>>> MaskedTensor(y)
MaskedTensor(
  [ --, 1.0000, 4.0000, 9.0000, --, 5.0000, 12.0000, 21.0000, --, 9.0000, 20.0000, 33.0000, --, 13.0000, 28.0000, 45.0000]
)

(2) instantiating an MT where the user just states the mask value instead of passing the mask:

y = MaskedTensor(y, mask_value=float(1))
Not yet! I think an unspecified mask could also be an indication that they would like an all-True mask, so that could be a third option as well. Another one would be to allow just MaskedTensor(y) if y is a sparse tensor, because then the mask is "implied". This has all been discussed, and I'll take note to add it in :)
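Until such sugar lands, the NaN-as-mask construction can be wrapped in a small helper; a sketch (the helper name is hypothetical, not an existing API):

>>> import torch
>>> from torch.masked import masked_tensor  # prototype API
>>> def masked_from_nan(t):
...     # hypothetical convenience: mask out every NaN entry of t
...     return masked_tensor(t, ~torch.isnan(t))
...
>>> y = torch.tensor([float('nan'), 1., 4., 9.])
>>> torch.mean(masked_from_nan(y))  # mean over the three unmasked values, 14 / 3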
Where are you tracking feature requests?
Stack from ghstack (oldest at bottom):