Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SIMD slower than NoParallelization #4

Open
Octogonapus opened this issue Jun 18, 2021 · 0 comments
Open

SIMD slower than NoParallelization #4

Octogonapus opened this issue Jun 18, 2021 · 0 comments
Assignees
Labels
bug Something isn't working

Comments

@Octogonapus
Copy link
Owner

Octogonapus commented Jun 18, 2021

Describe the bug

The SIMD implementation is slower than NoParallelization across the board. I don't want to jump to conclusions and blame LoopVectorization here; I probably have written my loop wrong.

Steps To Reproduce

This happens on all benchmarks (in the README).

Expected behavior

SIMD should be faster than NoParallelization. My "gold standard" (i.e. the fastest histogram implementation I can find so far) is AHTL. I would like to match AHTL's performance as much as possible.
As of writing, FastHistograms (without SIMD) needs 2.57596e8 ns to do what AHTL can do in 1.08249e8 ns.

Additional Information

Relevant code in AHTL: https://github.com/pcjung/AHTL/blob/master/src/fixed.cpp

Also, debatably I should remove SIMD from this package (for now) because it's slower.

@Octogonapus Octogonapus added the bug Something isn't working label Jun 18, 2021
@Octogonapus Octogonapus self-assigned this Jun 18, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant