Non-blocking gather_csr_cuda() #305

Open
@rBenke

Is it possible to make gather_csr_cuda() run without a CPU-GPU sync?

My guess is that the sync comes from line 248 of csrc/cuda/segment_csr_cuda.cu, where the last element of indptr is copied to the CPU to size the output:

sizes[dim] = indptr.flatten()[-1].cpu().data_ptr<int64_t>()[0];
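
To illustrate what I mean, here is a minimal CPU-only sketch in standard C++ (no PyTorch) of a 1-D gather_csr. It only shows why the output length depends on the last entry of indptr, and how a length supplied by the caller would make that read unnecessary. The function name `gather_csr_ref` and the `known_size` parameter are hypothetical names of mine, not part of the library, and I am assuming the `.cpu()` call above is what forces the sync.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical CPU reference for a 1-D gather_csr (not the library's code):
// element i of src is repeated indptr[i + 1] - indptr[i] times in the output.
// known_size is an illustrative parameter: an output length the caller
// already has on the host (for example, a number of edges).
std::vector<float> gather_csr_ref(
    const std::vector<float>& src,
    const std::vector<std::int64_t>& indptr,
    std::optional<std::int64_t> known_size = std::nullopt) {
  assert(indptr.size() == src.size() + 1);

  // Without known_size, the length has to be read from indptr itself.
  // On the GPU, this read is the device-to-host copy in the quoted line.
  const std::int64_t n = known_size.value_or(indptr.back());

  std::vector<float> out(static_cast<std::size_t>(n));
  for (std::size_t row = 0; row < src.size(); ++row) {
    for (std::int64_t j = indptr[row]; j < indptr[row + 1]; ++j) {
      assert(j >= 0 && j < n);  // known_size must cover every index
      out[static_cast<std::size_t>(j)] = src[row];
    }
  }
  return out;
}
```

With `known_size` set, nothing in the function needs the value of `indptr.back()` before the output is allocated. Something similar on the CUDA side (an explicit size argument, or taking the shape from a preallocated `out` tensor if that path already skips this line, which I have not verified) might be enough to avoid the sync.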
