-
Notifications
You must be signed in to change notification settings - Fork 95
Calling parallelproj forward Segmentation fault (core dumped) for LAFOV #1567
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Sadly, there is no line number information in any of your traces, which seems to say you should recompile STIR in Neverthless, the interesting bit in the valgrind log seems to be
Confusingly, the gdb and valgrind outputs are in different places. In
Of course, these are huge in this case: it implies that we need 2 std::vector<float> of 3 * total_of_elements_in_the_Quadra_sinogram. It's not so surprising to me that it fails, and it "properly" throws an exception (except that valgrind can't handle that).
For your gdb case, it looks like it did succesfully allocate the ParallelprojHelper data structures, but then segfaults in ForwardProjectorByBinParallelproj::set_input. That's of course a large function, so it's hard to know where it segfaults ATM. Even the non-CUDA version does need another "sinogram-sized allocation" due to a "transpose" between the parallelproj TOF dimension (fastest) and the STIR one (slowest). This extra allocation might be avoidable. But of course, Summary: please recompile such that we get line information. Sorry. |
Hi Kris, Sorry I didn't get the notification of your reply. WARNING: Expected the number of views (50) to be related to the number of detectors per ring (798), but this is not the case. Continuing anyway (but without adjusting the azimuthal angle offset). INFO: Tbin -16: -354.252 - -332.782 mm (-2363.31 - -2220.08 ps) = 21.4698 INFO: Tbin -15: -332.782 - -311.312 mm (-2220.08 - -2076.85 ps) = 21.4698 INFO: Tbin -14: -311.312 - -289.843 mm (-2076.85 - -1933.62 ps) = 21.4698 INFO: Tbin -13: -289.843 - -268.373 mm (-1933.62 - -1790.39 ps) = 21.4698 INFO: Tbin -12: -268.373 - -246.903 mm (-1790.39 - -1647.16 ps) = 21.4698 INFO: Tbin -11: -246.903 - -225.433 mm (-1647.16 - -1503.93 ps) = 21.4698 INFO: Tbin -10: -225.433 - -203.963 mm (-1503.93 - -1360.7 ps) = 21.4698 INFO: Tbin -9: -203.963 - -182.493 mm (-1360.7 - -1217.47 ps) = 21.4698 INFO: Tbin -8: -182.493 - -161.024 mm (-1217.47 - -1074.23 ps) = 21.4698 INFO: Tbin -7: -161.024 - -139.554 mm (-1074.23 - -931.003 ps) = 21.4698 INFO: Tbin -6: -139.554 - -118.084 mm (-931.003 - -787.772 ps) = 21.4698 INFO: Tbin -5: -118.084 - -96.6142 mm (-787.772 - -644.54 ps) = 21.4698 INFO: Tbin -4: -96.6142 - -75.1444 mm (-644.54 - -501.309 ps) = 21.4698 INFO: Tbin -3: -75.1444 - -53.6745 mm (-501.309 - -358.078 ps) = 21.4698 INFO: Tbin -2: -53.6745 - -32.2047 mm (-358.078 - -214.847 ps) = 21.4698 INFO: Tbin -1: -32.2047 - -10.7349 mm (-214.847 - -71.6156 ps) = 21.4698 INFO: Tbin 0: -10.7349 - 10.7349 mm (-71.6156 - 71.6156 ps) = 21.4698 INFO: Tbin 1: 10.7349 - 32.2047 mm (71.6156 - 214.847 ps) = 21.4698 INFO: Tbin 2: 32.2047 - 53.6745 mm (214.847 - 358.078 ps) = 21.4698 INFO: Tbin 3: 53.6745 - 75.1444 mm (358.078 - 501.309 ps) = 21.4698 INFO: Tbin 4: 75.1444 - 96.6142 mm (501.309 - 644.54 ps) = 21.4698 INFO: Tbin 5: 96.6142 - 118.084 mm (644.54 - 787.772 ps) = 21.4698 INFO: Tbin 6: 118.084 - 139.554 mm (787.772 - 931.003 ps) = 21.4698 INFO: Tbin 7: 139.554 - 161.024 mm (931.003 - 1074.23 ps) = 21.4698 INFO: Tbin 8: 161.024 - 182.493 mm (1074.23 - 1217.47 ps) = 21.4698 INFO: Tbin 9: 182.493 - 203.963 mm (1217.47 - 1360.7 ps) = 21.4698 INFO: Tbin 10: 203.963 - 225.433 mm (1360.7 - 1503.93 ps) = 21.4698 INFO: Tbin 11: 225.433 - 246.903 mm (1503.93 - 1647.16 ps) = 21.4698 INFO: Tbin 12: 246.903 - 268.373 mm (1647.16 - 1790.39 ps) = 21.4698 INFO: Tbin 13: 268.373 - 289.843 mm (1790.39 - 1933.62 ps) = 21.4698 INFO: Tbin 14: 289.843 - 311.312 mm (1933.62 - 2076.85 ps) = 21.4698 INFO: Tbin 15: 311.312 - 332.782 mm (2076.85 - 2220.08 ps) = 21.4698 INFO: Tbin 16: 332.782 - 354.252 mm (2220.08 - 2363.31 ps) = 21.4698 INFO: Creating parallelproj data-structures INFO: done INFO: Calling parallelproj forward Thread 1 "forward_project" received signal SIGSEGV, Segmentation fault. Please let me know if more information is needed. |
Hi Kris and all,
I'm encounting INFO: Calling parallelproj forward Segmentation fault (core dumped) when I did forward projection for a large scanner.
I am using CPU version of parallelproj: 1.10.1, STIR: 6.2.
The error occured several minutes after created parallelproj data-structures. From resource monitor, the task used 274GB RAM before it crashed, in total we have 1.5 TB RAM.
Please see 1. gdb info from Debug_release mode
INFO: Calling parallelproj forward
Thread 1 "forward_project" received signal SIGSEGV, Segmentation fault.
0x00005555557afb3b in stir::ForwardProjectorByBinParallelproj::set_input(stir::DiscretisedDensity<3, float> const&) ()
(gdb) backtrace
#0 0x00005555557afb3b in stir::ForwardProjectorByBinParallelproj::set_input(stir::DiscretisedDensity<3, float> const&) ()
#1 0x00005555556b93d4 in stir::ForwardProjectorByBin::forward_project(stir::ProjData&, stir::DiscretisedDensity<3, float> const&, int, int, bool) ()
#2 0x00005555555af71d in main ()
2. attached valgrind_output (very long).
valgrind_output.log
Please let me if the information is enough, if not I can insert cout to see what happens inside.
Thank you!
The text was updated successfully, but these errors were encountered: