Skip to content

Role of save_cuda_graph and the uncommented native capture #5

@fengyuanyu1

Description

@fengyuanyu1

Hello, @ShaoxunZeng I read the code and have two questions:

  1. I find the save_cuda_graph at https://github.com/ShaoxunZeng/PyTorch-Medusa/blob/de68b8092d45893e45489b39a219d66b7897c73d/aten/src/ATen/cuda/CUDAGraph.cpp#L216. This function reads the information of CUDAGraph from files generated in offline execution. Right?
  2. During online execution, the native capture still exists at https://github.com/ShaoxunZeng/PyTorch-Medusa/blob/de68b8092d45893e45489b39a219d66b7897c73d/aten/src/ATen/cuda/CUDAGraph.cpp#L128. Why Medusa can boot faster with additional reading operations?

Maybe I understand it wrong, can you explain it in more detail?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions