API for # of cores/multiprocessors

This may not be the best approach, but to improve the heuristic that decides whether to reduce with blocks or with threads, I think there should be a way to expose the number of cores (multiprocessors) on the device.

See https://github.com/JuliaGPU/CUDA.jl/blob/e561e7a106684f8e4be59cad98a51cc304c671d2/src/mapreduce.jl#L163-L167 and https://github.com/JuliaGPU/Metal.jl/pull/626

We would probably also need a way to query the maximum number of threads per block/group. Maybe we could expose an API specifically for reductions that is essentially an interface to what CUDA defines as `big_mapreduce_threshold`?
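As a rough sketch of what I have in mind: every name below (`DeviceInfo`, `multiprocessor_count`, `max_workgroup_size`, `reduction_threshold`, `reduce_with_threads`) is a hypothetical placeholder, not an existing API in any of the packages. The threshold as "multiprocessors times max threads per group" is my assumption about what such a heuristic would look like, not a copy of the CUDA.jl code.

```julia
# Hypothetical stand-in for whatever a backend would report about its device.
struct DeviceInfo
    multiprocessors::Int        # number of cores / multiprocessors / compute units
    max_threads_per_group::Int  # max threads per block / threadgroup
end

# The two low-level queries this issue asks for (placeholder names).
multiprocessor_count(dev::DeviceInfo) = dev.multiprocessors
max_workgroup_size(dev::DeviceInfo) = dev.max_threads_per_group

# Reduction-specific query built on top of them. Assumption: the threshold is
# roughly the number of threads the device can keep busy at once.
reduction_threshold(dev::DeviceInfo) =
    multiprocessor_count(dev) * max_workgroup_size(dev)

# Heuristic: with at least `reduction_threshold` independent outputs, let each
# thread reduce one output on its own; otherwise reduce cooperatively with blocks.
reduce_with_threads(dev::DeviceInfo, n_outputs::Integer) =
    n_outputs >= reduction_threshold(dev)

dev = DeviceInfo(80, 1024)  # made-up numbers for illustration
println(reduction_threshold(dev))
println(reduce_with_threads(dev, 1_000))
```

Each backend would then only need to implement the two low-level queries, and the reduction code could stay backend-agnostic.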

@vchuravy @maleadt @anicusan 


Once this is resolved, we should probably update https://discourse.julialang.org/t/how-to-get-the-device-name-and-the-number-of-compute-units-when-using-oneapi-jl-or-amdgpu-jl/128361

API for # of cores/multiprocessors #631
