You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[mlir][gpu] Add the OffloadEmbeddingAttr offloading translation attr
This patch adds the offloading translation attribute. This attribute uses LLVM
offloading infrastructure to embed GPU binaries in the IR. At the program start,
the LLVM offloading mechanism registers kernels and variables with the runtime
library: CUDA RT, HIP RT, or LibOMPTarget.
The offloading mechanism relies on the runtime library to dispatch the correct
kernel based on the registered symbols.
This patch is 3/4 on introducing the OffloadEmbeddingAttr GPU translation
attribute.
Note: Ignore the base commits; those are being reviewed in PRs #78057, #78098,
and #78073.
0 commit comments