Replies: 11 comments 1 reply
-
+1 to this, it would be nice to get performance comparable to TensorRT without having to export models to ONNX etc. first!
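For context, a rough sketch of the conversion step this would avoid, assuming the usual PyTorch → ONNX → TensorRT flow (the model and input shape below are placeholders, not anything from this thread):

```python
import torch
import torchvision.models as models

# Placeholder model; any exportable nn.Module follows the same path.
model = models.resnet18(weights=None).eval()
example_input = torch.randn(1, 3, 224, 224)

# The extra serialization step: write out ONNX so TensorRT can
# consume it (e.g. via `trtexec --onnx=model.onnx`).
torch.onnx.export(model, (example_input,), "model.onnx")
```

A native CUDA backend would let an exported ExecuTorch program target the GPU without this intermediate format.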
-
+1
-
@mindbeast @bionictoucan @hietalajulius Hi, thanks for the comments. Yes, that makes sense in general. Right now we are integrating Vulkan into ExecuTorch, since it is a suitable solution for mobile GPUs, and enabling mobile use cases is our primary goal at the moment. We will revisit CUDA, but probably in the second half of the year. Curious: what are your current product needs?
-
Apologies for opening a similar feature request in #5263.
@mergennachin We want to deploy LLMs in cars, but Python-based inference frameworks like vLLM and SGLang are not suitable for edge devices.
Nearly five months have passed; is there any progress on this?
-
Thank you for following up, @DzAvril.
I guess this is using a platform similar to Jetson?
No update on a CUDA backend for ET at the moment; we will get back to you here once we plan something.
-
@digantdesai Yes, Jetson Orin for now, and possibly Thor in the future. Looking forward to your update. |
-
For a mobile CUDA backend, does torch_tensorrt already cover this?
-
@DuinoDu My expectation is that compatibility with torch_tensorrt is poor. I expect that a more compliant backend in ExecuTorch would help a lot of developers.
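For comparison, a rough sketch of the torch_tensorrt path under discussion (the toy model, input shape, and precision settings are illustrative only):

```python
import torch
import torch.nn as nn
import torch_tensorrt

# Toy model standing in for whatever is actually being deployed.
model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU()).eval().cuda()

# Compile with Torch-TensorRT; ops TensorRT cannot convert fall back
# to PyTorch, which is where the compatibility gaps tend to show up.
trt_model = torch_tensorrt.compile(
    model,
    inputs=[torch_tensorrt.Input((1, 3, 224, 224))],
    enabled_precisions={torch.half},
)
out = trt_model(torch.randn(1, 3, 224, 224, device="cuda"))
```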
-
Any update on this? What is the best alternative so far for running on CUDA on Jetson? PyTorch directly? ONNX?
-
Perhaps @Gasoonjia would like to update this discussion?
-
We have a WIP CUDA backend, backed by AOTInductor: https://docs.pytorch.org/docs/stable/torch.compiler_aot_inductor.html. We have enabled some popular models (whisper, voxtral, gemma3, etc.); please check out this README.md to give whisper a try! You can also find voxtral instructions here: https://github.com/pytorch/executorch/tree/main/examples/models/voxtral#readme, and gemma3 instructions here: https://github.com/pytorch/executorch/blob/main/examples/models/gemma3/README.md
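For anyone who wants a feel for the AOTInductor flow this backend builds on, here is a minimal sketch based on the linked docs (the toy model is a placeholder, and the ExecuTorch-side entry points may differ while the backend is WIP):

```python
import torch

# Placeholder model standing in for whisper/voxtral/gemma3 above.
class MLP(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(10, 10)

    def forward(self, x):
        return torch.relu(self.fc(x))

model = MLP().eval().cuda()
example_inputs = (torch.randn(8, 10, device="cuda"),)

# Export, then ahead-of-time compile and package with AOTInductor.
ep = torch.export.export(model, example_inputs)
pkg = torch._inductor.aoti_compile_and_package(ep, package_path="mlp.pt2")

# The packaged artifact loads and runs without the Python model code.
compiled = torch._inductor.aoti_load_package(pkg)
out = compiled(*example_inputs)
```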
-
Does it make sense for ExecuTorch to have a mobile CUDA backend? There are many edge devices in NVIDIA's Jetson lineup that have a CUDA GPU but would benefit from not having to link an enormous libtorch dependency.