Add numba-cuda-mlir docs, cuda-core/cuda-cccl deps, cuda-toolkit extras by leofang · Pull Request #2101 · NVIDIA/cuda-python

leofang · 2026-05-18T14:47:52Z

Summary

Mention numba-cuda-mlir in README.md and DESCRIPTION.rst (above the existing numba.cuda entry)
Add cuda-core~=1.0.0 and cuda-cccl~=1.0.0 as required dependencies of the cuda-python metapackage (closes #148 "Add cuda-core as a required dependency to cuda-python"; closes #691 "Add cuda-cccl as a required dependency to cuda-python")
Add bare cuda-toolkit==13.* to cuda-bindings[all] and remove cudla from the component-specific extras (closes #903 "RFC: Make our cuda-* packages depend on the new cuda-toolkit metapackage without version constraints?")
Add cuda-toolkit==12.* / cuda-toolkit==13.* to cuda-core's cu12 / cu13 extras
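
For readers less familiar with the `~=` operator used in the summary, here is a minimal sketch of what `~=1.0.0` admits. It is a simplified stand-in for the compatible-release rule in the Python version specifier spec, not the resolver pip actually uses, and it only handles plain dotted numeric versions. The helper names `parse` and `compatible_release` are hypothetical.

```python
def parse(version: str) -> tuple[int, ...]:
    """Split a plain dotted release such as '1.0.3' into integers."""
    return tuple(int(part) for part in version.split("."))


def compatible_release(candidate: str, spec: str) -> bool:
    """Simplified '~=spec' check: candidate >= spec with the same leading segments.

    Assumption: no pre-releases, post-releases, epochs, or local versions.
    """
    cand, base = parse(candidate), parse(spec)
    prefix = base[:-1]  # for '~=1.0.0' the fixed prefix is (1, 0)
    return cand >= base and cand[: len(prefix)] == prefix


# '~=1.0.0' behaves like '>=1.0.0, ==1.0.*'
print(compatible_release("1.0.0", "1.0.0"))  # True
print(compatible_release("1.0.7", "1.0.0"))  # True
print(compatible_release("1.1.0", "1.0.0"))  # False
```

In other words, the pins proposed here would accept patch releases of cuda-core and cuda-cccl 1.0 but not a 1.1 release.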

copy-pr-bot · 2026-05-18T14:47:56Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

leofang · 2026-05-18T14:57:05Z

 [project.optional-dependencies]
 all = [
-    "cuda-toolkit[nvrtc,nvjitlink,nvvm,nvfatbin,cudla]==13.*",
+    "cuda-toolkit[nvrtc,nvjitlink,nvvm,nvfatbin]==13.*",


I noticed this change slipped in from PR #2034. This is a bit problematic: cuDLA is not a universally available component in the CTK, so the majority of users cannot use it, yet they would see this at install time:

WARNING: cuda-toolkit 13.1.2.0 does not provide the extra 'cudla'

We should just document that DLA users need to install cuDLA if they haven't already (which I suspect is not the case, because there should be a system CTK already installed on embedded devices).

I approved this knowingly, after discovering that the metapackage has the filters we need.

I missed that warning message though. If we want to get rid of it, we have to add conditions similar to what we're doing under cuda_pathfinder/pyproject.toml:

"nvidia-cudla; platform_system == 'Linux' and platform_machine == 'aarch64'",

That would still fire on aarch64 server machines, where I don't think we have a way to differentiate Jetson / Tegra from other hosts with current pip capabilities.
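
To illustrate the limitation described above, here is a small sketch that mirrors the marker's two conditions in plain Python. Under PEP 508, `platform_system` and `platform_machine` correspond to `platform.system()` and `platform.machine()`. The function name `cudla_marker_matches` and the host descriptions are hypothetical, chosen only for illustration.

```python
import platform
from typing import Optional


def cudla_marker_matches(system: Optional[str] = None,
                         machine: Optional[str] = None) -> bool:
    """Mirror: platform_system == 'Linux' and platform_machine == 'aarch64'."""
    # Fall back to the running interpreter's values when none are given.
    system = platform.system() if system is None else system
    machine = platform.machine() if machine is None else machine
    return system == "Linux" and machine == "aarch64"


# Hypothetical hosts, described only by the two fields the marker can see.
hosts = {
    "x86_64 Linux workstation": ("Linux", "x86_64"),
    "Windows desktop": ("Windows", "AMD64"),
    "Jetson / Tegra board": ("Linux", "aarch64"),
    "aarch64 Linux server": ("Linux", "aarch64"),
}

for name, (system, machine) in hosts.items():
    print(f"{name}: {cudla_marker_matches(system, machine)}")
```

The last two hosts produce the same result, because the marker has no field that separates a Tegra device from any other aarch64 Linux machine.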

My 2c: I think we should still include cuDLA in all, as it would only error at runtime if someone tried to use a cuDLA binding on a non-Tegra system. The cuDLA binary is tiny (< 1MB), so binary size impact isn't a big concern.

@kkraus14 My concerns:

the warning (see above)

Would cuda-toolkit==13.0 still work for constraining the installed packages? The cuDLA wheel was not shipped until 13.2.

If we guard it using platform_system and platform_machine, as in @rwgk's comment, does it still warn?

I assume that if we added it to all and someone ran pip install cuda-bindings[all] cuda-toolkit==13.0, it would error because there isn't a cuda-toolkit[cudla] package for 13.0?
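
The version concern can be made concrete with a short sketch. It assumes, based only on the statements in this thread, that the cudla extra first appears in cuda-toolkit 13.2. The helpers `release`, `matches_13_wildcard`, and `ships_cudla_extra` are hypothetical, and the version strings other than 13.1.2.0 (taken from the warning above) are illustrative. It does not model what pip does when an extra is missing, which is the open question here.

```python
def release(version: str) -> tuple[int, ...]:
    """Split a plain dotted version such as '13.1.2.0' into integers."""
    return tuple(int(part) for part in version.split("."))


def matches_13_wildcard(version: str) -> bool:
    """Simplified check for the '==13.*' pin: the major version must be 13."""
    return release(version)[0] == 13


def ships_cudla_extra(version: str) -> bool:
    """Assumption from the thread: the cuDLA wheel was not shipped until 13.2."""
    return release(version)[:2] >= (13, 2)


for version in ("13.0.0", "13.1.2.0", "13.2.0"):
    print(version, matches_13_wildcard(version), ships_cudla_extra(version))
```

Under these assumptions, every listed version satisfies the `==13.*` pin, but only the last one would carry the cudla extra.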

leofang · 2026-05-22T18:40:41Z

/ok to test 0587448

github-actions · 2026-05-22T19:01:31Z

Doc Preview CI
🚀 View preview at https://nvidia.github.io/cuda-python/pr-preview/pr-2101/
https://nvidia.github.io/cuda-python/pr-preview/pr-2101/cuda-core/
https://nvidia.github.io/cuda-python/pr-preview/pr-2101/cuda-bindings/
https://nvidia.github.io/cuda-python/pr-preview/pr-2101/cuda-pathfinder/
Preview will be ready when the GitHub Pages deployment is complete.

leofang · 2026-05-22T20:42:45Z

    install_requires=[
        f"cuda-bindings{matcher}{version}",
+        "cuda-core~=1.0.0",
+        "cuda-cccl~=1.0.0",


Life has not been very kind to us. Because cuda-cccl does not yet have a 3.14t build, and because there is no environment marker that would allow us to append a condition here (PEP 780 is still in draft status and the DPO discussion is brutal), all Python 3.14t pipelines failed. We have no choice but to drop cuda-cccl here:

Suggested change

-        "cuda-cccl~=1.0.0",

Let's add cuda-cccl in a follow-up because of this.
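
For context on the 3.14t problem: a free-threaded build can be detected at runtime, but, as noted above, there is currently no environment marker that expresses the same thing in a requirement string. The sketch below shows only the runtime check, using the `Py_GIL_DISABLED` build variable described in the CPython free-threading documentation. The function name `is_free_threaded_build` is hypothetical, and this is not a proposal for how setup.py should be written.

```python
import sys
import sysconfig


def is_free_threaded_build() -> bool:
    """True when the interpreter was built with the GIL disabled (e.g. 3.14t)."""
    # The variable is 1 on free-threaded builds; it is 0 or unset otherwise.
    return bool(sysconfig.get_config_var("Py_GIL_DISABLED"))


print(sys.version_info[:2], is_free_threaded_build())
```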

kkraus14 · 2026-05-23T01:42:18Z

 * Python 3.10 - 3.14
 * Driver: Linux (580.65.06 or later) Windows (580.88 or later)
-* Optionally, NVRTC, nvJitLink, NVVM, and cuFile from CUDA Toolkit 13.x
+* Optionally, NVRTC, nvJitLink, nvFatBin, NVVM, and cuFile from CUDA Toolkit 13.x


Should we include cuDLA here now as well?

kkraus14 · 2026-05-23T01:42:49Z


 * ``nvidia-cuda-nvrtc`` (NVRTC runtime compilation library)
 * ``nvidia-nvjitlink`` (nvJitLink library)
+* ``nvidia-nvfatbin`` (nvFatBin library)


+1 cuDLA here?

kkraus14 · 2026-05-23T01:46:21Z

 [project.optional-dependencies]
 all = [
-    "cuda-toolkit[nvrtc,nvjitlink,nvvm,nvfatbin,cudla]==13.*",
+    "cuda-toolkit[nvrtc,nvjitlink,nvvm,nvfatbin]==13.*",


My 2c: I think we should still include cuDLA in all, as it would only error at runtime if someone tried to use a cuDLA binding on a non-Tegra system. The cuDLA binary is tiny (< 1MB), so binary size impact isn't a big concern.

kkraus14 · 2026-05-23T01:47:24Z

    install_requires=[
        f"cuda-bindings{matcher}{version}",
+        "cuda-core~=1.0.0",
+        "cuda-cccl~=1.0.0",


Let's add cuda-cccl in a follow-up because of this.

github-actions Bot added labels May 18, 2026: cuda.bindings (Everything related to the cuda.bindings module), cuda.core (Everything related to the cuda.core module)

Add numba-cuda-mlir to cuda-python docs index.rst

9dcb465

leofang added labels May 18, 2026: documentation (Improvements or additions to documentation), P0 (High priority - Must do!), packaging (Anything related to wheels or Conda packages)

leofang added this to the cuda.bindings 13.3.0 & 12.9.7 milestone May 18, 2026

leofang self-assigned this May 19, 2026

Comment thread cuda_python/setup.py

leofang requested review from aterrel, danielfrg and kkraus14 May 19, 2026 21:57

leofang added 2 commits May 22, 2026 01:05

Update install.rst

06afca8

Merge branch 'main' into meta-deps

0587448

leofang marked this pull request as ready for review May 22, 2026 05:07

gmarkall approved these changes May 22, 2026

leofang enabled auto-merge (squash) May 22, 2026 18:41

rparolin approved these changes May 22, 2026

kkraus14 reviewed May 23, 2026
