Description
Would the kernel call in lltm_cuda_forward
in the tutorial tutorials/advanced_source/cpp_extension.rst
fail on multi-GPU systems if the inputs are not on the default device, i.e., device 0?
To my understanding, some "magic" takes care of setting the right device context when we add functionality to PyTorch via custom kernels, see here.
However, it seems that this machinery is not used in the tutorial.
Explicit usage of at::OptionalDeviceGuard in the tutorial should resolve the issue (?).
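
For illustration, a minimal sketch of what I have in mind, assuming the function signature from the tutorial; using at::device_of to obtain the inputs' device is my assumption, and the rest of the body is the tutorial's code, elided here:

```cpp
#include <torch/extension.h>

std::vector<torch::Tensor> lltm_cuda_forward(
    torch::Tensor input,
    torch::Tensor weights,
    torch::Tensor bias,
    torch::Tensor old_h,
    torch::Tensor old_cell) {
  // Pin the current CUDA device to the device the inputs live on for
  // the lifetime of this scope, so the kernel launch below targets
  // that GPU instead of the default device 0.
  const at::OptionalDeviceGuard device_guard(at::device_of(input));

  // ... rest of the tutorial's implementation (gate computation and
  // kernel launch) unchanged ...
}
```

Since the guard is RAII-based, the previously active device is restored automatically when the function returns, so callers on other devices are unaffected.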