
[#3334][feat] Support of CPU Inference for Scaffolding via PyTorch #4639


Draft: wants to merge 1 commit into base branch main
Conversation

@amemov (Contributor) commented May 25, 2025

Description

Part of #3706. Addresses #3334 and #3333.

Adds a worker, pytorch_worker.py, that runs inference directly on the CPU via PyTorch.

Examples

Usage is shown in examples/scaffolding/contrib/PytorchCPU/pytorch_worker_run.py; the worker is initialized in the same way as TRTLLMWorker.
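As a rough illustration of the pattern the PR describes (a worker initialized like TRTLLMWorker but generating on CPU), here is a minimal sketch. The class name, the `init_with_new_model` factory, the `GenerationTask` fields, and the model name are assumptions for illustration, not the actual scaffolding API; a real implementation would load a Hugging Face model with torch and call `model.generate()` where the placeholder comment sits.

```python
# Hypothetical sketch of a CPU-based scaffolding worker. Names below
# (PyTorchWorker, init_with_new_model, GenerationTask) are assumptions
# mirroring the TRTLLMWorker-style initialization, not the real API.
from dataclasses import dataclass
from typing import Optional


@dataclass
class GenerationTask:
    """A single generation request passed to the worker."""
    input_str: str
    output_str: Optional[str] = None


class PyTorchWorker:
    """Runs generation on CPU. A real version would load the tokenizer
    and model with torch/transformers instead of stubbing the call."""

    def __init__(self, model_name: str, device: str = "cpu"):
        self.model_name = model_name
        self.device = device

    @classmethod
    def init_with_new_model(cls, model_name: str, device: str = "cpu"):
        # Factory initialization in the style of TRTLLMWorker.
        return cls(model_name, device=device)

    def run_task(self, task: GenerationTask) -> GenerationTask:
        # Placeholder for: tokenize -> model.generate() -> decode, all on CPU.
        task.output_str = f"[{self.model_name}@{self.device}] {task.input_str}"
        return task


# Example usage (model name is illustrative only).
worker = PyTorchWorker.init_with_new_model("Qwen/Qwen2-0.5B-Instruct")
result = worker.run_task(GenerationTask(input_str="Hello"))
```

The factory classmethod keeps construction uniform across worker types, so scaffolding code can swap a CPU worker in for a TRT-LLM worker without changing call sites.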

@amemov force-pushed the cpu-inference-w-scaffolding branch from 09a46af to ad2bcbf on May 25, 2025
@juney-nvidia added the labels "Community want to contribute" (PRs initiated from Community) and "Community Engagement" (help/insights needed from community) on May 26, 2025
@amemov force-pushed the cpu-inference-w-scaffolding branch from ad2bcbf to 5c1f5e7 on May 27, 2025
@poweiw added the label "Generic Runtime" (general operational aspects of TRTLLM execution not in other categories) on Jun 5, 2025
@poweiw requested a review from dcampora on Jun 5, 2025
@poweiw added the label "triaged" (issue has been triaged by maintainers) on Jun 5, 2025
Labels: Community Engagement, Community want to contribute, Generic Runtime, triaged
3 participants