Use at your own risk

[AF>Le_Pommier] Jerome_C2005

Send message
Joined: 27 Jun 12
Posts: 11
Credit: 66,668,399
RAC: 4,056
Message 523 - Posted: 17 Jun 2018, 21:03:30 UTC

Maybe that's why this topic is called "Use at your own risk"?
ID: 523 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Michael

Send message
Joined: 31 Oct 10
Posts: 16
Credit: 80,971,858
RAC: 60,090
Message 524 - Posted: 18 Jun 2018, 21:37:12 UTC - in response to Message 523.  

You think!
ID: 524 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Slicker
Project administrator

Send message
Joined: 11 Jun 09
Posts: 50
Credit: 780,094,359
RAC: 513,731
Message 525 - Posted: 19 Jun 2018, 3:07:39 UTC

I've had to comment out a bunch more of the creditnew code. The validation logic is utterly stupid when a project doesn't use FLOPS, since all the estimates (and also some of the work fetch and validation logic) compare the FLOPS measured on the device against the estimated FLOPS for the workunit. When a project uses IOPS instead of FLOPS, those estimates can be off by an order of magnitude. When that happens, BOINC thinks you are cheating (an outlier), so it won't grant credit even if the result was valid. Yes, it was logging that the result was valid, then granting 0.0 credit, and then changing the state back to inconclusive. I guess logic works differently on the West Coast than on the rest of the planet, as I would expect that a valid result _should_ actually get credit. I commented out all the "outlier" code, so it should work now. Let me know if it doesn't.
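
To illustrate the failure mode described above, here is a minimal sketch of a ratio-based outlier test. It is not the actual creditnew code: the function name, the `tolerance` factor of 10, and the sample numbers are all assumptions, chosen only to show how an estimate based on the wrong kind of operation count can push a perfectly valid result outside the accepted range.

```python
def is_runtime_outlier(estimated_ops, device_ops_per_sec, elapsed_seconds, tolerance=10.0):
    """Return True when the actual runtime differs from the predicted runtime
    by more than a factor of `tolerance` in either direction.

    Hypothetical illustration only; not taken from the BOINC source.
    """
    predicted_seconds = estimated_ops / device_ops_per_sec
    ratio = elapsed_seconds / predicted_seconds
    return ratio > tolerance or ratio < 1.0 / tolerance


# Assumed numbers: the workunit estimate and the device benchmark are both
# expressed in FLOPS, but the application does integer work.
estimated_ops = 1.0e13     # estimate attached to the workunit
benchmark_flops = 2.0e9    # floating-point speed measured on the device
predicted = estimated_ops / benchmark_flops  # predicted runtime in seconds

# The integer-only application finishes far sooner than the FLOPS-based
# prediction, so the valid result is flagged as an outlier.
flagged = is_runtime_outlier(estimated_ops, benchmark_flops, elapsed_seconds=300.0)

# A runtime close to the prediction is accepted.
accepted = not is_runtime_outlier(estimated_ops, benchmark_flops, elapsed_seconds=4000.0)
```

With a check of this shape, nothing about the result itself is inspected. A mismatch between the unit of the estimate and the unit of the real work is enough to trip it.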

Credit has been granted to as many valid yet inconclusive results as I could find. It was a royal PITA to find the issue, identify the improper logic (especially since it is in code that they don't want projects to change), and then search the validator log files containing the 60k results returned per day to find the 20 (that's 20, not 20k) that came from CPUs on average per day so I could manually grant the credit to them. I could only go back a few weeks since I don't archive the log files. Once someone else completes the workunit, it is no longer on the server, and the only history stored is about the person whose data was valid.
ID: 525 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Slicker
Project administrator

Send message
Joined: 11 Jun 09
Posts: 50
Credit: 780,094,359
RAC: 513,731
Message 526 - Posted: 19 Jun 2018, 3:22:27 UTC - in response to Message 468.  

> 05/06/2018 18:42:54 | collatz | Tasks for CPU are available, but your preferences are set to not accept them
> 05/06/2018 18:42:54 | collatz | Tasks for NVIDIA GPU are available, but your preferences are set to not accept them
> 05/06/2018 18:42:54 | collatz | Tasks for Intel GPU are available, but your preferences are set to not accept them
> 05/06/2018 18:42:54 | collatz | New computer location: work
>
> Notice the work location is for the PC with the ATI 7790 GPU.
> The projects\boinc.thesonntags.com_collatz dir continues to stay empty.


Enable sched_op_debug to get more detailed info. You will get more detail about it requesting work which may explain why it isn't getting work.

Have you installed the AMD OpenCL drivers? The drivers installed by Windows will most likely be missing the OpenCL components.

If you still can't figure it out, what is the Host ID? (I'm not going to waste time wading through hidden hosts and searching the log files for every computer you own.)
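
For anyone unsure how to switch the flag on: the client reads its log flags from a `cc_config.xml` file in the BOINC data directory. The sketch below builds the text of such a file. It assumes the usual `<cc_config>` / `<log_flags>` layout; the helper name `build_cc_config` is made up for this example.

```python
import xml.etree.ElementTree as ET


def build_cc_config(flags):
    """Build the text of a cc_config.xml that switches on the given log flags.

    Hypothetical helper for illustration; it only produces the XML text.
    """
    root = ET.Element("cc_config")
    log_flags = ET.SubElement(root, "log_flags")
    for name in flags:
        # Each flag is an element whose text is 1 (on) or 0 (off).
        ET.SubElement(log_flags, name).text = "1"
    return ET.tostring(root, encoding="unicode")


config_text = build_cc_config(["sched_op_debug"])
print(config_text)
```

Saving that text as `cc_config.xml` in the data directory reported on the "Data directory" line of the client's startup log, then restarting the client, should turn on the extra `[sched_op]` lines. If a `cc_config.xml` already exists there, add the flag to its `<log_flags>` section by hand rather than overwriting the file.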
ID: 526 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Alessandro Freda

Send message
Joined: 23 Oct 09
Posts: 4
Credit: 730,800,060
RAC: 56
Message 537 - Posted: 23 Jun 2018, 15:01:12 UTC - in response to Message 526.  

> > 05/06/2018 18:42:54 | collatz | Tasks for CPU are available, but your preferences are set to not accept them
> > 05/06/2018 18:42:54 | collatz | Tasks for NVIDIA GPU are available, but your preferences are set to not accept them
> > 05/06/2018 18:42:54 | collatz | Tasks for Intel GPU are available, but your preferences are set to not accept them
> > 05/06/2018 18:42:54 | collatz | New computer location: work
> >
> > Notice the work location is for the PC with the ATI 7790 GPU.
> > The projects\boinc.thesonntags.com_collatz dir continues to stay empty.
>
> Enable sched_op_debug to get more detailed info. You will get more detail about it requesting work which may explain why it isn't getting work.
>
> Have you installed the AMD OpenCL drivers? The drivers installed by Windows will most likely be missing the OpenCL components.
>
> If you still can't figure it out, what is the Host ID? (I'm not going to waste time wading through hidden hosts and searching the log files for every computer you own.)



Yes, I have the same setup on my host that was working fine before the project stopped:
the same AMD OpenCL drivers and MS Visual C++ xxxx Redistributable packages.
Is an update necessary?

My host ID is 832917

23/06/2018 16:43:29 | | Starting BOINC client version 7.10.2 for windows_x86_64
23/06/2018 16:43:29 | | log flags: file_xfer, sched_ops, task, sched_op_debug
23/06/2018 16:43:29 | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2g zlib/1.2.8
23/06/2018 16:43:29 | | Data directory: C:\Program Files\BOINCdata

23/06/2018 16:43:29 | | OpenCL: AMD/ATI GPU 0: AMD Radeon HD 7700 Series (driver version 2117.13 (VM), device version OpenCL 2.0 AMD-APP (2117.13), 1024MB, 1024MB available, 1926 GFLOPS peak)

23/06/2018 16:44:53 | | Fetching configuration file from https://boinc.thesonntags.com/collatz/get_project_config.php
23/06/2018 16:45:17 | collatz | sched RPC pending: Project initialization
23/06/2018 16:45:17 | collatz | [sched_op] Starting scheduler request
23/06/2018 16:45:17 | collatz | [sched_op] Fetching master file
23/06/2018 16:45:19 | collatz | [sched_op] Got master file; parsing
23/06/2018 16:45:19 | collatz | [sched_op] Found 1 scheduler URLs in master file
23/06/2018 16:45:19 | collatz | Master file download succeeded
23/06/2018 16:45:25 | collatz | sched RPC pending: Project initialization
23/06/2018 16:45:25 | collatz | [sched_op] Starting scheduler request
23/06/2018 16:45:25 | collatz | Sending scheduler request: Project initialization.
23/06/2018 16:45:25 | collatz | Requesting new tasks for CPU and AMD/ATI GPU
23/06/2018 16:45:25 | collatz | [sched_op] CPU work request: 1.00 seconds; 0.00 devices
23/06/2018 16:45:25 | collatz | [sched_op] AMD/ATI GPU work request: 1.00 seconds; 0.00 devices
23/06/2018 16:45:27 | collatz | Scheduler request completed: got 0 new tasks
23/06/2018 16:45:27 | collatz | [sched_op] Server version 711
23/06/2018 16:45:27 | collatz | No tasks sent
23/06/2018 16:45:27 | collatz | Tasks for CPU are available, but your preferences are set to not accept them
23/06/2018 16:45:27 | collatz | Tasks for NVIDIA GPU are available, but your preferences are set to not accept them
23/06/2018 16:45:27 | collatz | Tasks for Intel GPU are available, but your preferences are set to not accept them
23/06/2018 16:45:27 | collatz | Project requested delay of 121 seconds
23/06/2018 16:45:27 | collatz | New computer location: work
23/06/2018 16:45:27 | collatz | [sched_op] Deferring communication for 00:02:01
23/06/2018 16:45:27 | collatz | [sched_op] Reason: requested by project
23/06/2018 16:47:32 | collatz | [sched_op] Starting scheduler request
23/06/2018 16:47:32 | collatz | Sending scheduler request: To fetch work.
23/06/2018 16:47:32 | collatz | Requesting new tasks for AMD/ATI GPU
23/06/2018 16:47:32 | collatz | [sched_op] CPU work request: 0.00 seconds; 0.00 devices
23/06/2018 16:47:32 | collatz | [sched_op] AMD/ATI GPU work request: 864000.00 seconds; 1.00 devices
23/06/2018 16:47:34 | collatz | Scheduler request completed: got 0 new tasks
23/06/2018 16:47:34 | collatz | [sched_op] Server version 711
23/06/2018 16:47:34 | collatz | Project requested delay of 121 seconds
23/06/2018 16:47:34 | collatz | [sched_op] Deferring communication for 00:02:01
23/06/2018 16:47:34 | collatz | [sched_op] Reason: requested by project

23/06/2018 16:51:19 | collatz | update requested by user
23/06/2018 16:51:21 | collatz | sched RPC pending: Requested by user
23/06/2018 16:51:21 | collatz | [sched_op] Starting scheduler request
23/06/2018 16:51:21 | collatz | Sending scheduler request: Requested by user.
23/06/2018 16:51:21 | collatz | Requesting new tasks for AMD/ATI GPU
23/06/2018 16:51:21 | collatz | [sched_op] CPU work request: 0.00 seconds; 0.00 devices
23/06/2018 16:51:21 | collatz | [sched_op] AMD/ATI GPU work request: 864000.00 seconds; 1.00 devices
23/06/2018 16:51:23 | collatz | Scheduler request completed: got 0 new tasks
23/06/2018 16:51:23 | collatz | [sched_op] Server version 711
23/06/2018 16:51:23 | collatz | Project requested delay of 121 seconds
23/06/2018 16:51:23 | collatz | [sched_op] Deferring communication for 00:02:01
23/06/2018 16:51:23 | collatz | [sched_op] Reason: requested by project
ID: 537 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Slicker
Project administrator

Send message
Joined: 11 Jun 09
Posts: 50
Credit: 780,094,359
RAC: 513,731
Message 542 - Posted: 24 Jun 2018, 20:52:06 UTC - in response to Message 537.  

I added the opencl_ati_gpu plan class specifications for both the i686 and x64 versions for Linux, just in case ati_opencl was the cause. In theory, any project can make up any plan class it wants. I'm not so sure that works in reality. I had the plan classes listed as opencl_amd and those weren't working for Windows apps. So, let me know if that solves the issue. If not, send me a private message with the host ID so I can set the BOINC scheduler to log debug information. Then, after you do an update, I can check the server log.

But I hope the plan class change will fix the issue. The previous server version I was using didn't use the plan_class_spec.xml file, and I just coded the plan class info in C++. This is supposed to be easier and not require coding, but it sure seems like it's more work! (That, or I code faster than I write valid XML.)
ID: 542 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Slicker
Project administrator

Send message
Joined: 11 Jun 09
Posts: 50
Credit: 780,094,359
RAC: 513,731
Message 543 - Posted: 24 Jun 2018, 20:54:25 UTC - in response to Message 524.  

> You think!


Yep, I do! Thanks for being so patient (cough, cough). ;-)
ID: 543 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Christian

Send message
Joined: 9 Oct 11
Posts: 1
Credit: 65,467,558
RAC: 352
Message 550 - Posted: 25 Jun 2018, 19:36:03 UTC
Last modified: 25 Jun 2018, 19:36:40 UTC

I use Ubuntu 18.04 with BOINC 7.9.3.

I can't get WUs for my NVIDIA GeForce GTX 1050.

Crunching SETI on the GPU works fine.

My UserID is: 33848

    Mo 25 Jun 2018 21:24:22 CEST | | Starting BOINC client version 7.9.3 for x86_64-pc-linux-gnu
    Mo 25 Jun 2018 21:24:22 CEST | | log flags: file_xfer, sched_ops, task, sched_op_debug
    Mo 25 Jun 2018 21:24:22 CEST | | Libraries: libcurl/7.58.0 OpenSSL/1.1.0g zlib/1.2.11 libidn2/2.0.4 libpsl/0.19.1 (+libidn2/2.0.4) nghttp2/1.30.0 librtmp/2.3
    Mo 25 Jun 2018 21:24:22 CEST | | Data directory: /var/lib/boinc-client
    Mo 25 Jun 2018 21:24:22 CEST | | CUDA: NVIDIA GPU 0: GeForce GTX 1050 (driver version 390.48, CUDA version 9.1, compute capability 6.1, 1997MB, 1698MB available, 1862 GFLOPS peak)
    Mo 25 Jun 2018 21:24:22 CEST | | [libc detection] gathered: 2.27, Ubuntu GLIBC 2.27-3ubuntu1
    Mo 25 Jun 2018 21:24:22 CEST | | Host name: XXX
    Mo 25 Jun 2018 21:24:22 CEST | | Processor: 16 AuthenticAMD AMD Ryzen 7 1700 Eight-Core Processor [Family 23 Model 1 Stepping 1]
    Mo 25 Jun 2018 21:24:22 CEST | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb hw_pstate sme ssbd vmmcall fsgsbase bmi1 avx2 smep bmi2 rdseed adx smap clflushopt sha_ni xsaveopt xsavec xgetbv1 xsaves clzero irperf xsaveerptr arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif overflow_recov succor smca
    Mo 25 Jun 2018 21:24:22 CEST | | OS: Linux Ubuntu: Ubuntu 18.04 LTS [4.15.0-23-generic|libc 2.27 (Ubuntu GLIBC 2.27-3ubuntu1)]
    Mo 25 Jun 2018 21:24:22 CEST | | Memory: 15.68 GB physical, 2.00 GB virtual
    Mo 25 Jun 2018 21:24:22 CEST | | Disk: 915.40 GB total, 659.34 GB free
    Mo 25 Jun 2018 21:24:22 CEST | | Local time is UTC +2 hours
    Mo 25 Jun 2018 21:24:22 CEST | | Config: GUI RPCs allowed from:
    Mo 25 Jun 2018 21:24:22 CEST | collatz | URL https://boinc.thesonntags.com/collatz/; Computer ID 834229; resource share 100

    Mo 25 Jun 2018 21:26:30 CEST | collatz | update requested by user
    Mo 25 Jun 2018 21:26:32 CEST | collatz | sched RPC pending: Requested by user
    Mo 25 Jun 2018 21:26:32 CEST | collatz | [sched_op] Starting scheduler request
    Mo 25 Jun 2018 21:26:32 CEST | collatz | Sending scheduler request: Requested by user.
    Mo 25 Jun 2018 21:26:32 CEST | collatz | Requesting new tasks for NVIDIA GPU
    Mo 25 Jun 2018 21:26:32 CEST | collatz | [sched_op] CPU work request: 0.00 seconds; 0.00 devices
    Mo 25 Jun 2018 21:26:32 CEST | collatz | [sched_op] NVIDIA GPU work request: 777600.00 seconds; 1.00 devices
    Mo 25 Jun 2018 21:26:34 CEST | collatz | Scheduler request completed: got 0 new tasks
    Mo 25 Jun 2018 21:26:34 CEST | collatz | [sched_op] Server version 711
    Mo 25 Jun 2018 21:26:34 CEST | collatz | Project requested delay of 121 seconds
    Mo 25 Jun 2018 21:26:34 CEST | collatz | [sched_op] Deferring communication for 00:02:01
    Mo 25 Jun 2018 21:26:34 CEST | collatz | [sched_op] Reason: requested by project
    Mo 25 Jun 2018 21:28:40 CEST | collatz | [sched_op] Starting scheduler request
    Mo 25 Jun 2018 21:28:40 CEST | collatz | Sending scheduler request: To fetch work.
    Mo 25 Jun 2018 21:28:40 CEST | collatz | Requesting new tasks for NVIDIA GPU
    Mo 25 Jun 2018 21:28:40 CEST | collatz | [sched_op] CPU work request: 0.00 seconds; 0.00 devices
    Mo 25 Jun 2018 21:28:40 CEST | collatz | [sched_op] NVIDIA GPU work request: 777600.00 seconds; 1.00 devices
    Mo 25 Jun 2018 21:28:42 CEST | collatz | Scheduler request completed: got 0 new tasks
    Mo 25 Jun 2018 21:28:42 CEST | collatz | [sched_op] Server version 711
    Mo 25 Jun 2018 21:28:42 CEST | collatz | Project requested delay of 121 seconds
    Mo 25 Jun 2018 21:28:42 CEST | collatz | [sched_op] Deferring communication for 00:02:01
    Mo 25 Jun 2018 21:28:42 CEST | collatz | [sched_op] Reason: requested by project



What can I do?

ID: 550 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Slicker
Project administrator

Send message
Joined: 11 Jun 09
Posts: 50
Credit: 780,094,359
RAC: 513,731
Message 559 - Posted: 26 Jun 2018, 14:16:04 UTC - in response to Message 550.  

> CUDA: NVIDIA GPU 0: GeForce GTX 1050 (driver version 390.48, CUDA version 9.1, compute capability 6.1, 1997MB, 1698MB available, 1862 GFLOPS peak)

Collatz only has OpenCL apps, not CUDA, and from the description above, there's no OpenCL installed. Check out https://wiki.tiker.net/OpenCLHowTo
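
The same check can be run against any startup log. The sketch below scans for the coprocessor detection lines the client prints (`CUDA: ...` and `OpenCL: ...`, as seen in the logs earlier in this thread) and lists GPUs for which no OpenCL line appears. The regular expression is an assumption based only on those sample lines, and the function name is made up for this example.

```python
import re

# Matches detection lines such as
# "... | | CUDA: NVIDIA GPU 0: GeForce GTX 1050 (...)".
# Pattern inferred from the sample logs in this thread, not from BOINC docs.
GPU_LINE = re.compile(r"\|\s*(CUDA|OpenCL): (.+? GPU \d+): ")


def detected_gpu_apis(log_lines):
    """Map each GPU named in the startup log to the set of APIs reported for it."""
    apis = {}
    for line in log_lines:
        match = GPU_LINE.search(line)
        if match:
            api, gpu = match.groups()
            apis.setdefault(gpu, set()).add(api)
    return apis


startup_log = [
    "Mo 25 Jun 2018 21:24:22 CEST | | CUDA: NVIDIA GPU 0: GeForce GTX 1050 "
    "(driver version 390.48, CUDA version 9.1, compute capability 6.1, "
    "1997MB, 1698MB available, 1862 GFLOPS peak)",
    "23/06/2018 16:43:29 | | OpenCL: AMD/ATI GPU 0: AMD Radeon HD 7700 Series "
    "(driver version 2117.13 (VM), device version OpenCL 2.0 AMD-APP (2117.13), "
    "1024MB, 1024MB available, 1926 GFLOPS peak)",
]

apis = detected_gpu_apis(startup_log)
# GPUs that an OpenCL-only project cannot use as reported.
missing_opencl = [gpu for gpu, found in apis.items() if "OpenCL" not in found]
print(missing_opencl)
```

The two sample lines come from different hosts in this thread and are combined here only to exercise both cases.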
ID: 559 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Slicker
Project administrator

Send message
Joined: 11 Jun 09
Posts: 50
Credit: 780,094,359
RAC: 513,731
Message 560 - Posted: 26 Jun 2018, 14:38:54 UTC

There's no joy in Mudville!

The previous fix for the CPU credits wasn't working. Or rather, it didn't fix the problem (and by fix I mean remove the stupid BOINC code that assumes all projects use floating point arithmetic). So, I commented out several hundred more lines of creditnew madness in both the validator.cpp and credit.cpp BOINC source code and then recompiled the server daemons.

I also manually changed the credits for the 204 WUs that were valid but not granted any credit.

If you run into a problem, please provide the host ID and result ID, as that makes it a lot easier to track down the problem. (Thanks, Conan, for providing that info, which led to this latest fix.)
ID: 560 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 21 Aug 09
Posts: 36
Credit: 16,656,887,437
RAC: 17,178,820
Message 580 - Posted: 4 Jul 2018, 1:41:18 UTC

Validator and assimilator are offline.
ID: 580 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 21 Aug 09
Posts: 36
Credit: 16,656,887,437
RAC: 17,178,820
Message 581 - Posted: 4 Jul 2018, 2:55:17 UTC - in response to Message 580.  

Computing status
Work
Tasks ready to send 799
Tasks in progress 149608
***Workunits waiting for validation 14886***
Workunits waiting for assimilation 19
Workunits waiting for file deletion 0
Tasks waiting for file deletion 0
Transitioner backlog (hours) 0.00
ID: 581 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 21 Aug 09
Posts: 36
Credit: 16,656,887,437
RAC: 17,178,820
Message 582 - Posted: 4 Jul 2018, 4:55:59 UTC - in response to Message 581.  

Update:

Tasks ready to send 800
Tasks in progress 151204
**Workunits waiting for validation 19536**
Workunits waiting for assimilation 13
Workunits waiting for file deletion 0
Tasks waiting for file deletion 0
Transitioner backlog (hours) 0.00
ID: 582 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 21 Aug 09
Posts: 36
Credit: 16,656,887,437
RAC: 17,178,820
Message 586 - Posted: 4 Jul 2018, 16:21:02 UTC - in response to Message 582.  

Morning update:

Tasks ready to send 799
Tasks in progress 150896
****Workunits waiting for validation 42725****
Workunits waiting for assimilation 15
ID: 586 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 21 Aug 09
Posts: 36
Credit: 16,656,887,437
RAC: 17,178,820
Message 588 - Posted: 4 Jul 2018, 17:02:46 UTC - in response to Message 586.  

Slicker, it would seem that a server reboot might be needed to restart the validator and the assimilator.

Both of them display a status of not running and have been in that state for 12 hours or so.

Tasks ready to send 770
Tasks in progress 150977
Workunits waiting for validation 44650
Workunits waiting for assimilation 16
ID: 588 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Brent

Send message
Joined: 25 Jun 14
Posts: 14
Credit: 214,320,756
RAC: 116,898
Message 589 - Posted: 4 Jul 2018, 17:25:29 UTC

Apparently, Slicker is gone for the 4th of July holiday, so we are all out of luck until he returns. Hope we don't lose any more credits!
Brent
ID: 589 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 21 Aug 09
Posts: 36
Credit: 16,656,887,437
RAC: 17,178,820
Message 596 - Posted: 4 Jul 2018, 21:51:08 UTC - in response to Message 589.  

Right -- that's my guess as well -- I restarted my Moo and GPUGrid streams in the interim.
ID: 596 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Slicker
Project administrator

Send message
Joined: 11 Jun 09
Posts: 50
Credit: 780,094,359
RAC: 513,731
Message 600 - Posted: 5 Jul 2018, 1:31:45 UTC

The issue is that someone is trying to hack the output, and their crap is causing the validator to crash. I'm trying to add code to filter out their invalid results so it will stop crashing on every bad WU returned.
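
The general idea of such a filter can be sketched as follows. This is not the Collatz validator, whose code and output format are not shown in this thread. The example assumes a made-up result format of whitespace-separated non-negative integers, and both function names are hypothetical. The point is only that malformed input gets rejected before any comparison runs, instead of crashing the process.

```python
def parse_result(text):
    """Parse a result, assumed here to be whitespace-separated non-negative
    integers. Return the list of values, or None if anything is malformed."""
    values = []
    for token in text.split():
        # Reject signs, letters, and non-ASCII digits rather than raising.
        if not (token.isascii() and token.isdigit()):
            return None
        values.append(int(token))
    # An empty result is treated as malformed too.
    return values or None


def results_match(text_a, text_b):
    """Compare two results; malformed input is marked invalid, never compared."""
    a = parse_result(text_a)
    b = parse_result(text_b)
    if a is None or b is None:
        return False
    return a == b


good = results_match("1 2 3", "1 2 3")
tampered = results_match("1 2 3", "1 x 3")
```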
ID: 600 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Steve Dodd

Send message
Joined: 30 Jul 09
Posts: 29
Credit: 6,668,785,889
RAC: 27,727,337
Message 601 - Posted: 5 Jul 2018, 3:12:04 UTC

Your filtering efforts seem to have had the unfortunate side effect of invalidating all my "pending validation" WUs :(. Here's hoping you're successful and the side effects are temporary :). Lovely way to spend the 4th, Jon. Again, I can't thank you enough for the effort and support you offer to the BOINC community!
ID: 601 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
williebthere

Send message
Joined: 30 Mar 13
Posts: 3
Credit: 148,858,012
RAC: 1,671,043
Message 602 - Posted: 5 Jul 2018, 3:15:43 UTC

Bummer. It seems like a lot of WUs that were ready to be validated were lost to invalid. Going to do something else until I know this has been fixed...
ID: 602 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote


©2018 Jon Sonntag; All rights reserved