Up and Running Again
log in

Advanced search

Message boards : News : Up and Running Again

1 · 2 · 3 · 4 . . . 5 · Next
Author Message
Profile Slicker
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 11 Jun 09
Posts: 2525
Credit: 740,580,099
RAC: 1
Message 17102 - Posted: 26 Jul 2013, 19:30:46 UTC

The new server is finally up and running and everything that could be recovered from the old server was moved over to it. Total credit and RAC were updated from the last stats dump since it was several hours newer than the database backup. Unfortunately, the server has no knowledge of any work units sent out between the database backup and the crash. While I can recreate the missing work units, I cannot create the missing result records because I have no way of knowing which host was sent which work unit. As is often the case with a new deployment, there will likely be a few glitches so please let me know of any issues. Thanks.

Profile Pooh Bear 27
Avatar
Send message
Joined: 1 Aug 10
Posts: 54
Credit: 108,227,920
RAC: 0
Message 17104 - Posted: 26 Jul 2013, 19:40:06 UTC

On the Collatz Preference page:

Warning: Creating default object from empty value in /home/boincadm/projects/collatz/html/project/project_specific_prefs.inc on line 233

Profile JayPi
Send message
Joined: 25 Sep 11
Posts: 5
Credit: 1,691,845,339
RAC: 3,358,832
Message 17105 - Posted: 26 Jul 2013, 20:01:50 UTC
Last modified: 26 Jul 2013, 20:08:16 UTC

While trying to return finished WUs the following message appears:

26.07.2013 21:41:43 | Collatz Conjecture | Started upload of solo_collatz_2380223520128266643816_824633720832_0_0
26.07.2013 21:41:45 | Collatz Conjecture | Temporarily failed upload of solo_collatz_2380223520128266643816_824633720832_0_0: transient HTTP error
26.07.2013 21:41:45 | Collatz Conjecture | Backing off 4 hr 12 min 40 sec on upload of solo_collatz_2380223520128266643816_824633720832_0_0

Rebooting of the PC doesn't help. The WUs are still listed on my account page. Here one can see all running WUs with the Computer where the tasks resides.

Have I to reattach the Collatz Project?

JayPi

Kombizahl
Send message
Joined: 29 Sep 09
Posts: 12
Credit: 158,620,174
RAC: 137
Message 17106 - Posted: 26 Jul 2013, 20:12:13 UTC

Can not upload the finished tasks.I've this message:

26.07.2013 22:05:26 | Collatz Conjecture | Temporarily failed upload of mini_collatz_2380204697966667868520_103079215104_2_0: transient HTTP error
26.07.2013 22:05:26 | Collatz Conjecture | Backing off 4 min 17 sec on upload of mini_collatz_2380204697966667868520_103079215104_2_0
Greetings
____________
Greetings

Charles Paul
Send message
Joined: 16 Jul 10
Posts: 1
Credit: 968,005,481
RAC: 52,447
Message 17107 - Posted: 26 Jul 2013, 20:56:22 UTC

I removed the project and then added it and it is working fine. Too bad all the uploads were lost.

mauro1[veneto]
Send message
Joined: 19 Mar 10
Posts: 2
Credit: 8,059,414
RAC: 0
Message 17108 - Posted: 26 Jul 2013, 21:06:15 UTC

cannot upload
mini_collatz_2380301240207187618152_103079215104_1

Profile David Riese
Send message
Joined: 23 Sep 12
Posts: 132
Credit: 4,029,600,829
RAC: 5,603,195
Message 17109 - Posted: 26 Jul 2013, 21:20:25 UTC - in response to Message 17102.

I can't imagine the challenges involved in porting the project to the new server and bringing that unit online. Thanks for all that you do on behalf of the project and its volunteers! - Dave

Kombizahl
Send message
Joined: 29 Sep 09
Posts: 12
Credit: 158,620,174
RAC: 137
Message 17110 - Posted: 26 Jul 2013, 21:25:41 UTC

I detach and retach the project but i can't upload the tasks.
____________
Greetings

Profile Slicker
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 11 Jun 09
Posts: 2525
Credit: 740,580,099
RAC: 1
Message 17111 - Posted: 26 Jul 2013, 21:27:28 UTC - in response to Message 17106.

Can not upload the finished tasks.I've this message:

26.07.2013 22:05:26 | Collatz Conjecture | Temporarily failed upload of mini_collatz_2380204697966667868520_103079215104_2_0: transient HTTP error
26.07.2013 22:05:26 | Collatz Conjecture | Backing off 4 min 17 sec on upload of mini_collatz_2380204697966667868520_103079215104_2_0
Greetings


It seems that any WUs stuck in the middle of a file transfer won't upload. I had it happen to me as well on one WU. My other WUs all uploaded OK though.

Profile Slicker
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 11 Jun 09
Posts: 2525
Credit: 740,580,099
RAC: 1
Message 17112 - Posted: 26 Jul 2013, 21:37:50 UTC - in response to Message 17108.

cannot upload
mini_collatz_2380301240207187618152_103079215104_1


Hmm... "transient HTTP errors". It would be nice if the BOINC error messages weren't dumbed down since it makes them totally useless for solving the problem.

Any WU sent between the time the server was backed up (midnight CST) and the server crashed (between 5 and 6 am CST) on July 12th will ether fail to upload or will fail to validate. I cannot fix that because there is no way to reverse engineer the data.

Profile Jimbo
Send message
Joined: 25 Apr 10
Posts: 23
Credit: 1,102,655,543
RAC: 1,080,815
Message 17113 - Posted: 26 Jul 2013, 21:43:09 UTC
Last modified: 26 Jul 2013, 21:43:40 UTC

i had about 35 uploads left- wouldn't upload so i reset project. didn't see an increase in credit:
(7/26/2013 4:33:53 PM | Collatz Conjecture | Resent lost task collatz_2380301113728990685544_824633720832_1)

but all is running now.

Profile Slicker
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 11 Jun 09
Posts: 2525
Credit: 740,580,099
RAC: 1
Message 17114 - Posted: 26 Jul 2013, 21:44:43 UTC - in response to Message 17104.

On the Collatz Preference page:

Warning: Creating default object from empty value in /home/boincadm/projects/collatz/html/project/project_specific_prefs.inc on line 233


That section of code has to do with the max graphics cpu percentage. Since collatz doesn't use a screensaver (makes no sense with GPU apps and slows down CPU apps which seems like a bad idea to me) the setting is useless. The fix, at least for now, was to just disable the preference.

Profile Zydor
Avatar
Send message
Joined: 19 Aug 09
Posts: 364
Credit: 840,811,292
RAC: 0
Message 17115 - Posted: 26 Jul 2013, 22:31:12 UTC

I have a mix of "Timed out no response" but still marked as "in progress"

http://boinc.thesonntags.com/collatz/results.php?hostid=128929&offset=20&show_names=0&state=0&appid=

If they are still marked as "in progress" - do I assume that (despite the time out marking) or go by the time out?

I am just fishing for the ones I need to dump, as there are around 200 completed stacked up this end not yet uploaded, and I might as well dump the ones that will not validate with the Data gap circa midnight-6am

Patrick Harnett*
Send message
Joined: 11 Mar 10
Posts: 3
Credit: 271,625,783
RAC: 0
Message 17117 - Posted: 26 Jul 2013, 22:45:27 UTC

You might just have to be patient - there were over 80,000 WUs in progress, and probably most of those were trying to report at once when the system came back on.

Profile Mankka*
Avatar
Send message
Joined: 3 Jun 10
Posts: 14
Credit: 5,538,135,572
RAC: 102,887
Message 17120 - Posted: 26 Jul 2013, 23:41:35 UTC
Last modified: 26 Jul 2013, 23:49:46 UTC

I had to ditch 'em all & reset the project to get it working...
But I'm very happy that you got the new server up, thanks for your hard work Slicker =)

Profile Slicker
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 11 Jun 09
Posts: 2525
Credit: 740,580,099
RAC: 1
Message 17124 - Posted: 27 Jul 2013, 0:49:01 UTC - in response to Message 17115.

I have a mix of "Timed out no response" but still marked as "in progress"

http://boinc.thesonntags.com/collatz/results.php?hostid=128929&offset=20&show_names=0&state=0&appid=

If they are still marked as "in progress" - do I assume that (despite the time out marking) or go by the time out?

I am just fishing for the ones I need to dump, as there are around 200 completed stacked up this end not yet uploaded, and I might as well dump the ones that will not validate with the Data gap circa midnight-6am


There is a 3 day delay after they get marked as late so you should be able to upload them and get credit for most of them now that I fixed a library out of sync problem with the file upload handler.

Profile Zydor
Avatar
Send message
Joined: 19 Aug 09
Posts: 364
Credit: 840,811,292
RAC: 0
Message 17125 - Posted: 27 Jul 2013, 1:22:06 UTC - in response to Message 17124.
Last modified: 27 Jul 2013, 1:22:29 UTC

I fixed a library out of sync problem with the file upload handler


I had a stack of them suddenly upload on both machines, so whatever you did it appears to have worked .... many thanks :)

Profile Jimbo
Send message
Joined: 25 Apr 10
Posts: 23
Credit: 1,102,655,543
RAC: 1,080,815
Message 17127 - Posted: 27 Jul 2013, 2:26:15 UTC

mine will not upload:

7/26/2013 9:20:07 PM | Collatz Conjecture | [error] Error reported by file upload server: can't open file /home/boincadm/projects/collatz/upload/285/collatz_2380301120016822806888_824633720832_0_0: Permission denied

A.M.
Send message
Joined: 19 Jun 11
Posts: 7
Credit: 195,055,901
RAC: 0
Message 17128 - Posted: 27 Jul 2013, 2:27:08 UTC - in response to Message 17102.

Is the validator not working, or just buried under backlog? I have several tasks that seem like they should be validating, but are not:

http://boinc.thesonntags.com/collatz/result.php?resultid=143475319
http://boinc.thesonntags.com/collatz/result.php?resultid=143473902

I noticed that the second one has been re-sent to another computer. Both of these examples were sent to me well before the crash.

Profile Pooh Bear 27
Avatar
Send message
Joined: 1 Aug 10
Posts: 54
Credit: 108,227,920
RAC: 0
Message 17130 - Posted: 27 Jul 2013, 2:45:00 UTC

Some are uploading, but not all. Did the uploader become overloaded and crash? Have same errors as others on all new uploads again, after several made it back earlier.

____________

1 · 2 · 3 · 4 . . . 5 · Next
Post to thread

Message boards : News : Up and Running Again


Main page · Your account · Message boards


Copyright © 2018 Jon Sonntag; All rights reserved.