log in

Recent Errors in Lasieved

Message boards : Questions/Problems/Bugs : Recent Errors in Lasieved
Message board moderation

To post messages, you must log in.

AuthorMessage
entity

Send message
Joined: 27 Jul 17
Posts: 3
Credit: 52,651,322
RAC: 58,031
Message 2498 - Posted: 7 Oct 2024, 14:56:39 UTC
Last modified: 7 Oct 2024, 14:58:26 UTC

Would it be possible to test batches in lasieved before submitting to the grid? There have been two recent batches that all contain computation errors and cause my machines to become unreliable and then only get 1 WU per day per quota. Just as they become reliable again, another batch of ~2000 WUs get submitted and all contain errors again and I'm back to unreliable again. To make the situation even worse, nobody recognizes that the batch is totally messed up and when the errors are returned to the server another erroneous WU is submitted again. This work isn't labeled as beta!

Respectfully request:

1. Test before submitting batch
2. Monitor the batch after submission and if erroneous, cancel remaining work

Edited to add error data:
<core_client_version>8.0.4</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)</message>
<stderr_txt>
boinc initialized
work files resolved, now working
-> ../../projects/escatter11.fullerton.edu_nfs/lasieved_1.11_x86_64-pc-linux-gnu
-> -r
-> -f
-> 42432000
-> -c
-> 16000
-> -R
-> ../../projects/escatter11.fullerton.edu_nfs/177473_41m1.poly
-> -o
-> ../../projects/escatter11.fullerton.edu_nfs/177473_41m1_42432_0_r1249026950_0
Expected to create 12 schedules on side 0, have 7
09:43:32 (35873): called boinc_finish(1)

</stderr_txt>
]]>
ID: 2498 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
JZD

Send message
Joined: 23 Jul 18
Posts: 1
Credit: 1,509,958
RAC: 1,305
Message 2499 - Posted: 9 Oct 2024, 19:05:40 UTC

Linux bad.
<core_client_version>7.22.2</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)</message>
<stderr_txt>
boinc initialized
work files resolved, now working
-> ../../projects/escatter11.fullerton.edu_nfs/lasieved_1.11_x86_64-pc-linux-gnu
-> -r
-> -f
-> 48624000
-> -c
-> 16000
-> -R
-> ../../projects/escatter11.fullerton.edu_nfs/177473_41m1.poly
-> -o
-> ../../projects/escatter11.fullerton.edu_nfs/177473_41m1_48624_1_r315520415_0
Expected to create 12  schedules on side 0, have 7
05:49:51 (2634649): called boinc_finish(1)

</stderr_txt>
]]>

Windows good.
<core_client_version>8.0.2</core_client_version>
<![CDATA[
<stderr_txt>
boinc initialized
work files resolved, now working
-> projects/escatter11.fullerton.edu_nfs/lasieved_1.08_windows_intelx86.exe
-> -r
-> -f
-> 48624000
-> -c
-> 16000
-> -R
-> ../../projects/escatter11.fullerton.edu_nfs/177473_41m1.poly
-> -o
-> ../../projects/escatter11.fullerton.edu_nfs/177473_41m1_48624_3_r440598530_0

total yield: 24126, q=48640001 (0.22471 sec/rel,  100.00625 % done of 16000)called boinc_finish

</stderr_txt>
]]>

Why Linux is bad and Windows is fine? Workunit https://escatter11.fullerton.edu/nfs/workunit.php?wuid=355476863.
ID: 2499 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Questions/Problems/Bugs : Recent Errors in Lasieved


Home | My Account | Message Boards