[libcxx] Add necessary compile flags for targeting the GPU #99333

jhuber6 · 2024-07-17T14:50:50Z

Summary:
The GPU target will always be a sufficiently new clang, so we can
assume these flags are present. We need to first set
CMAKE_REQUIRED_FLAGS to these values so that the
check_cxx_compile_flag utilities work. Then, we need to add several
things to the compiler flags that are necessary for correctness and
optimal code output.

llvmbot · 2024-07-17T14:51:23Z

@llvm/pr-subscribers-libcxx

Author: Joseph Huber (jhuber6)

Changes

Summary:
The GPU target will always be a sufficiently new clang, so we can
assume these flags are present. We need to first set
CMAKE_REQUIRED_FLAGS to these values so that the
check_cxx_compile_flag utilities work. Then, we need to add several
things to the compiler flags that are necessary for correctness and
optimal code output.

Full diff: https://github.com/llvm/llvm-project/pull/99333.diff

1 Files Affected:

(modified) libcxx/CMakeLists.txt (+9)

diff --git a/libcxx/CMakeLists.txt b/libcxx/CMakeLists.txt
index 190a97db9462f..a72e5aca0903f 100644
--- a/libcxx/CMakeLists.txt
+++ b/libcxx/CMakeLists.txt
@@ -491,6 +491,15 @@ include(HandleLibcxxFlags)
 # 'config-ix' use them during feature checks. It also adds them to both
 # 'LIBCXX_COMPILE_FLAGS' and 'LIBCXX_LINK_FLAGS'
 
+# Targeting the GPU requires a clang compiler and several extra flags.
+if (${LLVM_RUNTIMES_TARGET} MATCHES "^amdgcn")
+  set(CMAKE_REQUIRED_FLAGS "${CMAKE_REQUIRED_FLAGS} -nogpulib")
+  add_flags("-nogpulib" "-flto" "-fconvergent-functions" "-Xclang" "-mcode-object-version=none")
+elseif (${LLVM_RUNTIMES_TARGET} MATCHES "^nvptx")
+  set(CMAKE_REQUIRED_FLAGS "${CMAKE_REQUIRED_FLAGS} -flto -c -Wno-unused-command-line-argument")
+  add_flags("-nogpulib" "-flto" "-fconvergent-functions" "--cuda-feature=+ptx63")
+endif()
+
 if (${CMAKE_SYSTEM_NAME} MATCHES "AIX")
   add_flags_if_supported("-mdefault-visibility-export-mapping=explicit")
   set(CMAKE_AIX_EXPORT_ALL_SYMBOLS OFF)

Summary: This patch adds a CMake cache config file for the GPU build. This cache will set the default required options when used from the LLVM runtime interface or directly. These options pretty much disable everything the GPU can't handle. With this and the fllowing patches: llvm#99259, llvm#99243, llvm#99287, and llvm#99333, we will be able to build `libc++` targeting the GPU with an invocation like this. ``` $ cmake ../llvm -C${LLVM_ROOT}/libcxx/cmake/caches/GPU.cmake \ -DRUNTIMES_nvptx64-nvidia-cuda_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DRUNTIMES_amdgcn-amd-amdhsa_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DLLVM_RUNTIME_TARGETS=amdgcn-amd-amdhsa;nvptx64-nvidia-cuda \ ``` This will then install the libraries and headers into the appropriate locations for use with `clang`.

jhuber6 · 2024-07-31T14:02:32Z

Ping

ldionne · 2024-07-31T14:17:19Z

libcxx/CMakeLists.txt

@@ -491,6 +491,15 @@ include(HandleLibcxxFlags)
 # 'config-ix' use them during feature checks. It also adds them to both
 # 'LIBCXX_COMPILE_FLAGS' and 'LIBCXX_LINK_FLAGS'

+# Targeting the GPU requires a clang compiler and several extra flags.


Why does Clang require -nogpulib? Don't we already pass -nodefaultlibs?

Also, why -flto?

We should set this in the CMake caches -- we don't want to add any more target-specific flags in our CMake files, this is too messy.

-nogpulib is separate, it controls whether or not the GPU toolchain pulls in external vendor libraries.

-flto is 100% required because GPUs have almost no backwards compatibility, so unless you want to build libc++ over 30 times you need to use LLVM-IR + LTO to defer creating object code until it's linked into the user's application. Also ELF linking isn't supported at all for AMDGPU.

I can move this bit to the CMake cache, but CMAKE_REQUIRED_FLAGS might need to stay, otherwise check_cxx_flags will not work.

Summary: The GPU target will always be a sufficiently new `clang`, so we can assume these flags are present. We need to first set `CMAKE_REQUIRED_FLAGS` to these values so that the `check_cxx_compile_flag` utilities work. Then, we need to add several things to the compiler flags that are necessary for correctness and optimal code output.

jhuber6 · 2024-07-31T16:04:57Z

Seems I can do this in the cache like you said, will update that shortly.

Summary: This patch adds a CMake cache config file for the GPU build. This cache will set the default required options when used from the LLVM runtime interface or directly. These options pretty much disable everything the GPU can't handle. With this and the fllowing patches: llvm#99259, llvm#99243, llvm#99287, and llvm#99333, we will be able to build `libc++` targeting the GPU with an invocation like this. ``` $ cmake ../llvm -C${LLVM_ROOT}/libcxx/cmake/caches/GPU.cmake \ -DRUNTIMES_nvptx64-nvidia-cuda_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DRUNTIMES_amdgcn-amd-amdhsa_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DLLVM_RUNTIME_TARGETS=amdgcn-amd-amdhsa;nvptx64-nvidia-cuda \ ``` This will then install the libraries and headers into the appropriate locations for use with `clang`.

Summary: This patch adds a CMake cache config file for the GPU build. This cache will set the default required options when used from the LLVM runtime interface or directly. These options pretty much disable everything the GPU can't handle. With this and the fllowing patches: llvm#99259, llvm#99243, llvm#99287, and llvm#99333, we will be able to build `libc++` targeting the GPU with an invocation like this. ``` $ cmake ../llvm -C${LLVM_ROOT}/libcxx/cmake/caches/GPU.cmake \ -DRUNTIMES_nvptx64-nvidia-cuda_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DRUNTIMES_amdgcn-amd-amdhsa_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DLLVM_RUNTIME_TARGETS=amdgcn-amd-amdhsa;nvptx64-nvidia-cuda \ ``` This will then install the libraries and headers into the appropriate locations for use with `clang`. Move to separate files

Summary: This patch adds a CMake cache config file for the GPU build. This cache will set the default required options when used from the LLVM runtime interface or directly. These options pretty much disable everything the GPU can't handle. With this and the fllowing patches: llvm#99259, llvm#99243, llvm#99287, and llvm#99333, we will be able to build `libc++` targeting the GPU with an invocation like this. ``` $ cmake ../llvm -C${LLVM_ROOT}/libcxx/cmake/caches/GPU.cmake \ -DRUNTIMES_nvptx64-nvidia-cuda_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DRUNTIMES_amdgcn-amd-amdhsa_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DLLVM_RUNTIME_TARGETS=amdgcn-amd-amdhsa;nvptx64-nvidia-cuda \ ``` This will then install the libraries and headers into the appropriate locations for use with `clang`.

Summary: This patch adds a CMake cache config file for the GPU build. This cache will set the default required options when used from the LLVM runtime interface or directly. These options pretty much disable everything the GPU can't handle. With this and the following patches: #99259, #99243, #99287, and #99333, we will be able to build `libc++` targeting the GPU with an invocation like this. ``` $ cmake ../llvm -DRUNTIMES_nvptx64-nvidia-cuda_CACHE_FILES=${LLVM_SRC}/../libcxx/cmake/caches/NVPTX.cmake \ -DRUNTIMES_amdgcn-amd-amdhsa_CACHE_FILES=${LLVM_SRC}/../libcxx/cmake/caches/AMDGPU.cmake \ -DRUNTIMES_nvptx64-nvidia-cuda_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DRUNTIMES_amdgcn-amd-amdhsa_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DLLVM_RUNTIME_TARGETS=amdgcn-amd-amdhsa;nvptx64-nvidia-cuda \ ``` This will then install the libraries and headers into the appropriate locations for use with `clang`.

Summary: This patch adds a CMake cache config file for the GPU build. This cache will set the default required options when used from the LLVM runtime interface or directly. These options pretty much disable everything the GPU can't handle. With this and the following patches: llvm#99259, llvm#99243, llvm#99287, and llvm#99333, we will be able to build `libc++` targeting the GPU with an invocation like this. ``` $ cmake ../llvm -DRUNTIMES_nvptx64-nvidia-cuda_CACHE_FILES=${LLVM_SRC}/../libcxx/cmake/caches/NVPTX.cmake \ -DRUNTIMES_amdgcn-amd-amdhsa_CACHE_FILES=${LLVM_SRC}/../libcxx/cmake/caches/AMDGPU.cmake \ -DRUNTIMES_nvptx64-nvidia-cuda_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DRUNTIMES_amdgcn-amd-amdhsa_LLVM_ENABLE_RUNTIMES=compiler-rt;libc;libcxx \ -DLLVM_RUNTIME_TARGETS=amdgcn-amd-amdhsa;nvptx64-nvidia-cuda \ ``` This will then install the libraries and headers into the appropriate locations for use with `clang`.

jhuber6 requested a review from a team as a code owner July 17, 2024 14:50

jhuber6 requested review from ldionne, mordante and philnik777 July 17, 2024 14:50

llvmbot added the libc++ libc++ C++ Standard Library. Not GNU libstdc++. Not libc++abi. label Jul 17, 2024

This was referenced Jul 17, 2024

[libcxx] Set _LIBCPP_HAS_CLOCK_GETTIME for GPU targets #99243

Merged

[libcxx] Add cache file for the GPU build #99348

Merged

ldionne requested changes Jul 31, 2024

View reviewed changes

jhuber6 force-pushed the libcxx-flags branch from 77be018 to c9a9d02 Compare July 31, 2024 14:26

jhuber6 closed this Jul 31, 2024

jhuber6 deleted the libcxx-flags branch October 14, 2024 19:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[libcxx] Add necessary compile flags for targeting the GPU #99333

[libcxx] Add necessary compile flags for targeting the GPU #99333

Uh oh!

jhuber6 commented Jul 17, 2024

Uh oh!

llvmbot commented Jul 17, 2024

Uh oh!

jhuber6 commented Jul 31, 2024

Uh oh!

ldionne Jul 31, 2024

Uh oh!

jhuber6 Jul 31, 2024

Uh oh!

jhuber6 commented Jul 31, 2024

Uh oh!

Uh oh!

[libcxx] Add necessary compile flags for targeting the GPU #99333

[libcxx] Add necessary compile flags for targeting the GPU #99333

Uh oh!

Conversation

jhuber6 commented Jul 17, 2024

Uh oh!

llvmbot commented Jul 17, 2024

Uh oh!

jhuber6 commented Jul 31, 2024

Uh oh!

ldionne Jul 31, 2024

Choose a reason for hiding this comment

Uh oh!

jhuber6 Jul 31, 2024

Choose a reason for hiding this comment

Uh oh!

jhuber6 commented Jul 31, 2024

Uh oh!

Uh oh!