Hello everyone,
After compiling the solver and submitting jobs to our cluster (managed by Slurm), the application crashes with segmentation faults in multiple processes. Here is the error log from my most recent run:
[R7625-1:2001585:0:2001585] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001581:0:2001581] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001573:0:2001573] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001561:0:2001561] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001591:0:2001591] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001575:0:2001575] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001593:0:2001593] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001598:0:2001598] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001578:0:2001578] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001600:0:2001600] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001602:0:2001602] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001565:0:2001565] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001558:0:2001558] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001567:0:2001567] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001588:0:2001588] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001583:0:2001583] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001569:0:2001569] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001572:0:2001572] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001605:0:2001605] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001589:0:2001589] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001559:0:2001559] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001604:0:2001604] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001564:0:2001564] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001595:0:2001595] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001580:0:2001580] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
[R7625-1:2001557:0:2001557] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x3ff00000001)
Segmentation fault
Segmentation fault
Segmentation fault
prterun detected that one or more processes exited with non-zero status,
thus causing the job to be terminated. The first process to do so was:
Process name: [prterun-R7625-1-2001550@1,15]
Exit code: 139
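To make the log easier to read, here is a minimal Python sketch that summarizes it. It assumes every fault line follows the same `[host:pid:...] Caught signal N (... at address 0x...)` layout as above, and that the exit code follows the usual shell convention of 128 plus the signal number (so 139 would be signal 11). The function names `summarize_faults` and `decode_exit_code` are my own and not part of any tool.

```python
import re
import signal
from collections import Counter

# Pattern inferred from the fault lines in the log above (an assumption,
# not an official format specification).
FAULT_RE = re.compile(
    r"\[(?P<host>[^:\]]+):(?P<pid>\d+):\d+:\d+\] Caught signal (?P<sig>\d+) "
    r"\(.*? at address (?P<addr>0x[0-9a-fA-F]+)\)"
)


def summarize_faults(log_text):
    """Return (number of distinct PIDs, Counter of fault addresses)."""
    pids = set()
    addresses = Counter()
    for match in FAULT_RE.finditer(log_text):
        pids.add(match.group("pid"))
        addresses[match.group("addr")] += 1
    return len(pids), addresses


def decode_exit_code(code):
    """Map an exit code of the form 128 + N to the name of signal N, else None."""
    if code <= 128:
        return None
    try:
        return signal.Signals(code - 128).name
    except ValueError:
        # Not a known signal number on this platform.
        return None


if __name__ == "__main__":
    sample = (
        "[R7625-1:2001585:0:2001585] Caught signal 11 (Segmentation fault: "
        "address not mapped to object at address 0x3ff00000001)\n"
        "[R7625-1:2001581:0:2001581] Caught signal 11 (Segmentation fault: "
        "address not mapped to object at address 0x3ff00000001)\n"
    )
    n_pids, addresses = summarize_faults(sample)
    print(n_pids, dict(addresses))
    print(decode_exit_code(139))
```

In my log every fault line reports the same address, 0x3ff00000001, and the exit code 139 is consistent with signal 11, so all ranks seem to fail in the same way rather than at random locations.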
I have tried debugging this but have not been able to pinpoint the cause.
Could anyone offer guidance on how to diagnose and resolve it?
Best regards,