Conversation
Greptile SummaryThis PR fixes a build failure when compiling TransformerEngine with
Confidence Score: 5/5Safe to merge — both changes are minimal, narrowly scoped, and together fully resolve the reported build failure with no side effects. The fix is a two-line correction: adding the missing No files require special attention. Important Files Changed
Sequence DiagramsequenceDiagram
participant cu as nvshmem_waitkernel.cu
participant cdr as cuda_driver.h
participant cmn as common.h
participant te as transformer_engine.h
cu->>cdr: include ../util/cuda_driver.h (NEW)
cdr->>cmn: include ../common.h
cmn->>te: include transformer_engine/transformer_engine.h
Note over cmn,te: Resolved via ../include added to CMakeLists.txt
te-->>cmn: DType, NVTETensor definitions
cmn-->>cdr: transformer_engine namespace, NVTE_ERROR
cdr-->>cu: NVTE_CHECK_CUDA_DRIVER macro defined
Reviews (2): Last reviewed commit: "Fix nvshmem build" | Re-trigger Greptile |
Signed-off-by: Gaetan Lepage <gaetan@glepage.com>
e51ea4d to
60cfe87
Compare
Description
When building TE with
NVTE_ENABLE_NVSHMEM=1, the build fails with:Type of change
Changes
Please list the changes introduced in this PR:
transformer_engine/common/nvshmem_api/nvshmem_waitkernel.cu${CMAKE_CURRENT_SOURCE_DIR}/../includetotarget_include_directoriesintransformer_engine/common/nvshmem_api/CMakeLists.txtChecklist: