Out of Memory errors
log in

Advanced search

Message boards : Number crunching : Out of Memory errors

Author Message
Martin
Send message
Joined: 27 Sep 16
Posts: 3
Credit: 10,318,329
RAC: 27,965
Message 23063 - Posted: 8 Oct 2016, 11:31:55 UTC

Nearly every wu I have run in the last few days has crashed during restarts. Previously there had been no problems, even when running two wu's simultaneously.

The program reports:
_____________________________________________________________

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x000007FEFD82A06D

Engaging BOINC Windows Runtime Debugger...
______________________________________________________________

The computer has 2GB of memory and the task manager reports that there is nearly 1GB free, so I do not believe that I am actually running out of memory unless the Collatz Sieve 1.20 suddenly, and only for a short time, has a grab at a lot of memory.

Has anyone had a similar problem, or can suggest a solution.
____________

Dr Who Fan
Avatar
Send message
Joined: 27 May 14
Posts: 21
Credit: 4,562,054
RAC: 0
Message 23093 - Posted: 14 Oct 2016, 19:05:21 UTC - in response to Message 23063.

Have seen the SAME thing on a quite a few Collatz tasks (I run CPU ONLY tasks)in past month. Win 8.1, Win 7 and Win Vista ALL seem to be affected.

Not sure if there is a BUG in the latest Microsoft C++ run times files since the September Windows update (on 9/13/2016) or if it is ONLY Collatz related.

Upon starting it seems they use ALL the CPU resources, gobble up ALL free (RAM) memory & ALL free swap file space Windows has available then crash a second or three later with out of memory error.
____________

Redstone
Send message
Joined: 7 Sep 14
Posts: 2
Credit: 208,527
RAC: 0
Message 23264 - Posted: 8 Nov 2016, 21:55:09 UTC

Getting the same error for nearly every workunit I've received as of recent. I don't have much in terms of solutions but I'm going to see what resetting the project will do.

If the problems persist, I will simply suspend this project until the matter is resolved.

Profile valterc
Send message
Joined: 21 Sep 09
Posts: 39
Credit: 14,500,072,950
RAC: 15,460,656
Message 23671 - Posted: 31 Dec 2016, 10:51:54 UTC - in response to Message 23264.

I'm also started to get memory errors (see here: https://boinc.thesonntags.com/collatz/results.php?hostid=145423&offset=0&show_names=0&state=6&appid=). Does anyone know what may be the problem?

Tackleway
Send message
Joined: 29 Sep 13
Posts: 53
Credit: 1,737,506,463
RAC: 1,755,061
Message 23673 - Posted: 31 Dec 2016, 23:25:30 UTC
Last modified: 31 Dec 2016, 23:27:05 UTC

I'm not sure if this will help you but recently I've found that 'Suspending' all tasks before shutting down / restarting
usually prevents running tasks from crashing out, and 'Restoring' tasks after the restart has completed its activities.

Hope this is of use but anyway, a Happy New Year to all crunchers and admins.
____________

Profile step2000
Avatar
Send message
Joined: 1 Aug 13
Posts: 96
Credit: 1,475,954,492
RAC: 1,944,984
Message 23674 - Posted: 1 Jan 2017, 16:02:23 UTC

It could be two things. You could need to repair/install the C runtime again. You should also run a quick repair on your hard drive (SCAN - Check Disk). Also what is your hardware GPU or CPU details will help direct a better solution.

Profile valterc
Send message
Joined: 21 Sep 09
Posts: 39
Credit: 14,500,072,950
RAC: 15,460,656
Message 23675 - Posted: 2 Jan 2017, 10:28:07 UTC - in response to Message 23674.

It could be two things. You could need to repair/install the C runtime again. You should also run a quick repair on your hard drive (SCAN - Check Disk). Also what is your hardware GPU or CPU details will help direct a better solution.

Yep, just a week ago I started noticing random hard disk failures. I was able to do a sector by sector copy of the faulty hard disk to another one, skipping a lot of bad sectors (the original disk is no more able to pass the DST short test, it's definitely broken). Anyways, windows went up, after the copy, sfc /scannow was able to fix a lot of errors expect one that I had to manually fix. Now things are getting better, no errors until now, so probably your hint was the right one. (Just to be sure I will reinstall the C runtime and some other software too). The host is a I7-4930k with two r9-290X gpus.


Post to thread

Message boards : Number crunching : Out of Memory errors


Main page · Your account · Message boards


Copyright © 2018 Jon Sonntag; All rights reserved.