BSOD Win7 ntoskrnl.exe

Erwin0265

Honorable
Feb 22, 2015
4
0
10,510
I have recently started getting BSOD's every time I try to render an image in the landscaping software, Terragen 3.
After a bit of reading and downloading, I used BSOD_Windows7_Vista_jcgriff2.exe to collect all of my crash dumps together for someone who knows what they are doing to look at.
I tried out BlueScreenView and most had the ntoskrnl.exe listed as the culprit (although most of my Terragen-related crashes list nothing in the bottom pane of BlueScreenView when set to "Only drivers found in stack").
I have uploaded all of the crash dumps to Sendspace (https://www.sendspace.com/file/fgtdtr).
This PC was rebuilt last November (2014) so many components are new and it would be good to have a good idea as to the problem before driving 50 plus km to have it looked at.........
Thank you in advance for any help you may be able to provide.
Erwin

Win7 Pro
Intel Core i7-4790K CPU & 4.00GHz
64-bit
32GB RAM
 
remove TuneUpUtilitiesDriver64.sys and see if you still get the bugcheck. I think the 2012 version of this utility incorrectly overclocks your new (2014) low voltage CPU.

Also, run this https://downloadcenter.intel.com/Detail_Desc.aspx?DwnldID=20775
this will update the intel windows microcode driver for your CPU. You have a 2010 version.

you might also run the generic intel driver installer, you had a 2010 version of another intel file,i think it is in the intel chipset driver update but I would also run this to make sure: http://www.intel.com/p/en_US/support/detect

--------------------------
I looked at another bugcheck 0x124 that had a uptime of almost 7 days
I would suspect this driver for that failure:
C:\Program Files (x86)\AVG\AVG PC TuneUp\TuneUpUtilitiesDriver64.sys Fri May 25 03:28:19 2012

also, this driver is pretty old:
C:\Program Files (x86)\Common Files\Realtime Soft\UltraMonMirrorDrv\x64\UltraMonUtility.sys Thu Nov 13 17:10:30 2008

notes:
-BIOS is one BIOS version out of date
- memory running at 1333MHz CMZ16GX3M2A1600C9 is rated at 1600Mhz

I will take a quick look at the other bugchecks
---------------
first bugcheck 0x124 WHEA_UNCORRECTABLE_ERROR
was a CPU internal error on processor 0 memory bank 4
the system up time was 3 seconds so I would suspect the
CPU power fluctuated low enough to reset the CPU and that your
power supply allowed the CPU to start up again before the power was ok.
(cheap power supply/motherboard combination will not provide the proper power_OK signal)

I would be looking to why the CPU reset in the first place.
-----------------
machine info:
BIOS Version F5
BIOS Release Date 05/31/2014
Manufacturer Gigabyte Technology Co., Ltd.
Product Name Z97X-Gaming GT
Processor Version Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz
Processor Voltage 8ch - 1.2V
External Clock 100MHz
Max Speed 3800MHz
Current Speed 4000MHz




 

Erwin0265

Honorable
Feb 22, 2015
4
0
10,510
@ Johnbl;
Hi,
Thanks for the super-fast response!

I uninstalled AVG PC TuneUp 2014; you'd think the product would be up to date given the fact that it's meant to be the 2014 version....
I don't know how much real tuning it did; it made a lot of fuss over how many fixes it made and what programs it had stopped from running unnecessarily in the background.
I like the concept of the program but would be interested in your thoughts; is it something worthwhile (for the graphic artist more interested in spending time creating art on the PC than tweaking the PC) or something gimmicky?
I also uninstalled UltraMon; it was an out of date trial and I went with Display Fusion (I run 3 monitors on my graphics rig).

I installed the chipset update and the Intel Driver Update Utility; this provided me with a graphics update (Iris & HD Graphics driver v 15.36.14.64.4080; the installed version was v 10.18.10.3540.
However, every time I tried to get the utility to download the update, it crashed! So I downloaded manually........
I HAVE YET TO INSTALL THIS DRIVER - I THOUGHT I'D WAIT ON FURTHER ADVICE BEFORE MAKING TOO MANY CHANGES IN ONE HIT....


OK, I'm going to illustrate how little I know when it comes to the technical aspect of computers by asking some very (probably) stupid questions:_


"remove TuneUpUtilitiesDriver64.sys"

I assume that this would have been removed when I uninstalled AVG PC TuneUp?

"notes:
-BIOS is one BIOS version out of date
- memory running at 1333MHz CMZ16GX3M2A1600C9 is rated at 1600Mhz"

In one BIOS; is there more than one BIOS on my machine?
Is the memory running slower than it should (1333MHz opposed to 1600MHz)?

"I would be looking to why the CPU reset in the first place."

So this was the very first crash that you are referring to?
I think that may have been when it was in the shop getting a new MB, CPU & RAM - all needed to be replaced on a 4 year old PC!!

After all of this; I tried to do the same render; before I got about 7 - 9 minutes into the render before a crash; this time it managed 57 seconds!
So there was an effect; just not the one we were hoping for..........
It's just weird; I have been rendering far more complex and larger scenes than this one and had no issues (some renders have taken 30 plus hours).
This render was simply a test render to see how things were shaping up - still, the render still does max out the CPU no matter the size or complexity of the render.............

I have uploaded the minidump to Sendspace (https://www.sendspace.com/file/0h6q5n)
BTW; is there an easy way to attach the dump file to a post?
Thanks for all of your help thus far; I truly appreciate it.
 
basically, your CPU reset due to a low power condition, then it tried to start up again and the power was not stable so it got a error inside the CPU.

The bugcheck is a side effect of a power supply/ motherboard combination that is allowing the CPU to startup while the power is not stable. (cheap power supply will hard code the power_ok signal to true to cut costs)

you still have to figure out why the power fluctuated. Most often if you were running a 3D video app it will be caused by the GPU not getting proper power from its external power connections. You need to make sure they are connected well, and the power supply is not underrated. Also a GPU will pull more power as it heats up so make sure the fans are running and run them at top speed and make sure to blow out dust and confirmed they are not blocked.
If the GPU does not get proper power it will take it from the motherboard slot and it can only get 75 watts from that slot. If it pulls too much power it will cause the CPU to reset. The CPU has a reset pin, then the voltage drops to a certain point it will reset the CPU and your system will start to reboot. The power supply is supposed to introduce a delay after the reset but before the CPU starts up again (to make sure the power is stable).

so you have two problems: the power fluctuated for some reason, and the power supply allowed the CPU to start when the power was not stable.
- also, plug your power into a wall outlet, sometimes you get strange effects from the circuits in a power strip.


the new minidump shows
BugCheck 124, {0, fffffa801a602038, 0, 0}

this was called by the CPU because of a fatal error
Error : Internal timer (Proc 0 Bank 4)

your System Uptime: 0 days 0:00:03.106





 

Erwin0265

Honorable
Feb 22, 2015
4
0
10,510
So I am a bit perplexed as to what to do.
I run a UPS as we have power outages on a semi regular basis.
The UPS is supposed to maintain a constant power supply; I don't want to plug the PC into a power point directly as I will end up with constant data loss due to power outages.
To be clear; I have lived here for 20 plus years and the power supply has always been like this; it's due to above ground power lines in a very forested area - but I have never had my PC behave like this before.
You state that I have a cheap power supply (?); it's a Cooler Master brand - I believe that is a fairly decent brand...

I checked the temperatures the day I made my first post (ie. this one); the only thing that struck me was that one graphics card was running at 48 C whilst the other one was running at 58 C.
It is summer here (Australia) and we have had a few hot days but the PC is in an air conditioned room.....

To be honest, I'm not sure that I can resolve the issues you are telling me of as I don't have sufficient knowledge..
You are dealing with a person who knows enough about computers to help out most of his family, but when dealing with stuff like this I am totally out of my depth..............
I will blow out the PC tomorrow and make a visual inspection as to the amount of dust that has accumulated (we live on a dirt road so dust build up tends to happen quite quickly).
My previous set up had a different MB, CPU and RAM but all other components are the same and it ran almost continuously for 4 years with no BSOD at all, so I'm tending to think that dust accumulation probably won't be the issue - please tell me if my assumption is/can be dead wrong..............
Fans; how can I run them at any speed other than what they are running at? I have no fan speed control that I am aware of...
Other than blowing out the dust, I don't know how to deal with the other things you have mentioned: Other than plugging the PC into a wall outlet (which,I think can be lower in voltage than through a UPS - please correct me if I am wrong), what else can I do?

FYI, the set up is still well within warranty; is this something I should return it for and have them sort it out?
I mean, they would have the right equipment to test everything fully(?)
I am not feeling that confident in what I am doing and worry that I may end up voiding the warranty(?)
NOTE: everything I am writing here now are my thoughts; I respect your opinion and thoughts on this; what do you think would be the best thing to do; continue troubleshooting as we have been doing or take it back for service?
Again, thank you very much for your help and your thoughts.

Erwin