Hello again folks. Sorry for the delay in the response to both Specialk90 and Sub mesa. I basically wanted information with teeth to it before posting back to the forum.
First of all, let me give you the update. I have completely ditched ICH10R RAID drivers after the last crash and decided to resort to good ol fashioned troubleshooting by eliminating one variable at a time. So I have loaded all the Fail Safe params in the BIOS (running F9 revision) and switched all the drivers to standard IDE not AHCI. I have dropped two drives out of the setup keeping a total of 5 x 500GB drives and 1 DVD Writer (all SATA).
The drives were setup as independent drives with no fault tolerance. I completed a fresh install of Windows XP on the first drive, installed all sound/lan drivers, and completed all the house keeping items of a fresh install. Then I put the machine to the test in the 24x7 with moderate read/write requests.
To my amazement, the machine started BSODS with the IRQL memory errors! Puzzled about this, i disconnected four drives and kept only one drive which has the OS. Then ran the machine again and was further puzzled by another BSOD within a few hours with the typical PAGE_FAULT_IN_NON_PAGED_AREA errors. Then it dawned on me what the last two posters have highlighted, the memory could be the culprit. I yanked one of my kid's DDR2 2GB 800 MHz RAM Chips out of his computer and into mine. I took out the two Corsair 2048MB 1066 MHz chips out of the server. I also reconnected all five drives back into service. Then booted the machine, it has been running non-stop for the past 4 or 5 days!
I feel that the memory is 80% of the problem right now but the acid test is a full 1-2 weeks of 24x7 operation without a failure. So I will keep you posted with what happens. Needless to say, I have no RAID setup whatsoever right now, but frankly I am really happy that I have a working system. Let's hope that it passes the two weeks mark and then I have to figure out where to go from there.
SpecialK on your question regarding cooling - I am using a hefty Gigabyte case with 2-front side fans and 2 back side fans. I paid a little extra cash at time of purchase to make sure I have plenty of fans in the machine. I am also using an Epsilon600W power supply with its own fan, so the system runs fairly cool across the board.
Sub mesa on your two questions, i believe that the answer now is related to the memory. The system seems to be fairly cool and I will double check the temperature of those chipsets over the next two weeks and let you know.
I am definitely interested in your suggestion about the dedicated RaidZ and ZFS and would appreciate any insight. I must say that having a file server that doubles up as a workstation is really nice. The file server provides media storage for the home network meanwhile its downloading files from the Internet in the background.
Thanks for all the help guys - it has really been crucial in isolating the problem and giving me hope when it was needed the most.
First of all, let me give you the update. I have completely ditched ICH10R RAID drivers after the last crash and decided to resort to good ol fashioned troubleshooting by eliminating one variable at a time. So I have loaded all the Fail Safe params in the BIOS (running F9 revision) and switched all the drivers to standard IDE not AHCI. I have dropped two drives out of the setup keeping a total of 5 x 500GB drives and 1 DVD Writer (all SATA).
The drives were setup as independent drives with no fault tolerance. I completed a fresh install of Windows XP on the first drive, installed all sound/lan drivers, and completed all the house keeping items of a fresh install. Then I put the machine to the test in the 24x7 with moderate read/write requests.
To my amazement, the machine started BSODS with the IRQL memory errors! Puzzled about this, i disconnected four drives and kept only one drive which has the OS. Then ran the machine again and was further puzzled by another BSOD within a few hours with the typical PAGE_FAULT_IN_NON_PAGED_AREA errors. Then it dawned on me what the last two posters have highlighted, the memory could be the culprit. I yanked one of my kid's DDR2 2GB 800 MHz RAM Chips out of his computer and into mine. I took out the two Corsair 2048MB 1066 MHz chips out of the server. I also reconnected all five drives back into service. Then booted the machine, it has been running non-stop for the past 4 or 5 days!
I feel that the memory is 80% of the problem right now but the acid test is a full 1-2 weeks of 24x7 operation without a failure. So I will keep you posted with what happens. Needless to say, I have no RAID setup whatsoever right now, but frankly I am really happy that I have a working system. Let's hope that it passes the two weeks mark and then I have to figure out where to go from there.
SpecialK on your question regarding cooling - I am using a hefty Gigabyte case with 2-front side fans and 2 back side fans. I paid a little extra cash at time of purchase to make sure I have plenty of fans in the machine. I am also using an Epsilon600W power supply with its own fan, so the system runs fairly cool across the board.
Sub mesa on your two questions, i believe that the answer now is related to the memory. The system seems to be fairly cool and I will double check the temperature of those chipsets over the next two weeks and let you know.
I am definitely interested in your suggestion about the dedicated RaidZ and ZFS and would appreciate any insight. I must say that having a file server that doubles up as a workstation is really nice. The file server provides media storage for the home network meanwhile its downloading files from the Internet in the background.
Thanks for all the help guys - it has really been crucial in isolating the problem and giving me hope when it was needed the most.