Cloud Strifes :
Thanks !
Coincidentally, the CRC error occurred on the same day (I think) that the temperature started to get high.
Here the HD TUNE info:
The EXACT answer you are looking for is your SSD
MIGHT be failing. Keep monitoring it, aslong as the error count does not keep increasing your in the clear.
Most UDMA CRC errors are caused by the number of errors in data transfer via the interface cable as determined by ICRC (Interface Cyclic Redundancy Check) and typically involve an issue with the drive, SATA cable or the Motherboards hard drive Controller.. These errors aren't generally something to worry about in terms of an absolute count as errors may have occurred on day 1 and never re-occurred.
You should worry when it occasionally increases.
The CRC errors are communication errors, this means that it has to do with your connections.
A Cyclic Redundancy Check error(CRC) is caused when data on the hard-drive or DVD becomes corrupt. to explain further, when the drive send data to the MOBO through your sata, it send it in blocks. on the other side of this is a system that checks for the intergrity of the information transferred and
can detect up to a 99.998% error rate.
These errors can often be caused by
faulty connections,
bad sata ports, bad sata cords, dirty contacts on the SSD, Broken Cables, and the biggest one, Power failure of the SSD while it is writing a large file will cause it to become corrupt therefore not sending all the information it was supposed to and showing up as an error.
The CRC error is ONLY an attention, a minor warning, it is nothing to worry about UNLESS you notice the errors begin to keep increasing, then there is a problem with your SSD and its ability to communicate through the sata.
If it does not go up and seems to just go away, then you are okay. It happens to multiple people from time to time, there is no true need to worry about it unless it increases, otherwise your good to go.
If it does keep getting worse and increasing then it means its time to get a new SSD.
It gets very complicated and there is a lot of scientific explanation behind it.
For the exact reasons on the temp and error, I have figured this out for you. The SSD you have uses whats called NAND Flash Memory to store and retain data, this depends on the temperature which the NAND flash is subjected to during writing, and between the time the data is written to the time the data is read. The higher the temperature that the NAND flash experiences, the greater the acceleration of charge de-trapping mechanisms that could lead to random data bit failures(A CRC Error). The endurance of NAND Chips andthe rate of wear-out of NAND cells is affected by temperature at the time of programming and erasing NAND.
At the end of life of a NAND cell, when the device has been cycled through the maxium number of program-erase (endurance) cycles as specified by the manufacturer, data loss can occur if the NAND is stored or read over extended periods of time at high temperature, The temperature will slowly increase as the NAND cell AND is subject to two competing factors relative to temperature. At high temperature, programming and erasing a NAND cell is relatively less stressfull to its structure, but data retention of a NAND cell suffers. At low temperature, data retention of the NAND cell is enhanced but the relative stress to the cell structure due to program and erase operations increases. So in simpler terms, your SSD has many NAND cells, like memory blocks, these NAND cells are constantly Going through a complex write/erase which I will further explain in detail, incase anyone does not fully understand quite yet.
The NAND cells store memory and delete memory on a constant basis, this action slowly wears down the NAND cells, as the NAND cells are slowed and are beginning to have trouble they heat up due to the electrical current passed through a NAND cell to clear and transmit data, if it becomes a BAD cell then it will heat up because the voltage and energy put into it is being essentially halted and trying to be recovered by the SSD, by this time its usually too late as the NAND cell has been exposed to higher temps, higher temps lead to damage to the NAND structure and its functions, making it increase in temp and suffer in data retention rates.
Eventually in a SSD every single NAND cell can and will go bad, this is the downfall to SSDS sure they are amazing, fast and great storage, but they are very complicated and do not have very long life cycles. Especially considering the life cycle that it lives has to do mostly with the temperature and voltage applied to NAND cells, aswell as the usage of repeated erasing/writing to the cell.
NOW FOR YOUR ANSWER .
Your SSD has a NAND cell that has gone bad. The repeated electron tunneling mechanism from writing and erasing NAND cells causes the buildup of charge traps in the tunnel oxide layer. Some of the traps are deep, and eventually accumulate to the point where the tunnel oxide becomes conductive without application of an input voltage, and the cell is no longer capable of storing charge. At this point, programming the NAND will result in program failures and the entire block must be marked “bad” by the SSD.
SIDE NOTE FOR THOSE INTERESTED
This is exactly how the NAND cell processes memory, I will include a picture for better reference aswell to help other understand.
1.The floating-gate transistor is the building block of all flash technology:
In the floating-gate transistor, an insulating oxide layer resides between the floating gate and the substrate. When voltage above the cell threshold, (Vt), is applied to the top gate, the transistor is “turned on” and conducts a current. To prevent the transistor from conducting current, electrons are forced through the thin oxide layer by the application of high voltage applied to the top gate. This is the process of writing (or programming) the NAND cell. To erase a cell, the substrate well is raised to a high voltage forcing the electrons back through the oxide layer from the floating gate into the substrate. To read a NAND cell, a voltage above Vt is applied to the top gate and the current flowing in the transistor sensed by a sense amplifier, which gives information about the amount of charge stored in the cell. De-trapping of stored charge is accelerated by exposure to high temperature, and the temperature that the NAND flash is subjected to is a
critical factor as it is explained by the Arrhenius equation(very long and complex for most without general knowledge of electricity to understand.)
It is important to note that the temperature and duration to which the NAND flash is subjected to after programming is the most critical part in determining the acceleration factor. The acceleration factor for a NAND relative to a temperature of 55c is shown in the table below. For example, for NAND devices specified with 1 year of data retention, storing at 85c will accelerate the charge de-trapping mechanism by 26 times when compared to storing at 55c
The data shows that NAND programmed to its endurance limit at -40C will have a higher RBER than NAND programmed at 25C, and higher than that of NAND programmed at 85C. The operation of programming NAND at low temperature increases the rate of degradation of the cell oxide layer relative to programming at higher temperature.
---What is a RBER? It is the residual bit error rate (RBER) is a receive quality metric in digital transmission, one of several used to quantify the accuracy of the received data.------
Also It should be noted that at the end of NAND’s rated endurance, the NAND device is usually
not in jeopardy of immediate failure. NAND manufacturer’s endurance ratings are typically specified to ensure that the number of bad blocks that occur over time will be within a predictable percentage limit and that the NAND will be able to retain data for 1 year at 55C in accordance with JESD47H.01. Beyond the endurance limit, blocks may become bad at a faster rate and the data retention capabilities of the drive become diminished. The impact to reliability of the drive is then dependent upon the
media management capabilities of the drive controller.
LASTLY
NAND is subject to two competing factors relative to temperature. At high temperature, programming and erasing a NAND cell is relatively less stressful to its structure, but data retention of a NAND cell suffers. At low temperature, data retention of the NAND cell is enhanced but the relative stress to the cell structure due to program and erase operations increases.
The effects of temperature apply in varying degrees to all NAND devices from all NAND vendors. Industrial temperature rated NAND devices from the manufacturer are tested for functionality at temperatures of -40˚C and +85˚C (depending on the NAND manufacturer’s specification), but that does not give these parts higher endurance or greater immunity to the effect of charge de-trapping. Sensitivities to these factors appear to be increasing as NAND process geometries shrink, but the actual endurance and data retention characteristics of a particular NAND device will vary between NAND manufacturers, process materials, and geometries.
Therefore, it behooves the SSD vendor to fully understand the physical characteristics of the NAND chosen for various applications, and to specify SSD products with the different endurance and data retention characteristics seen across the industrial temperature range. At the same time, it is also important for users of NAND flash based SSDs to understand the relationship between temperature, data retention, and endurance; and how their usage models will affect the long term reliability of the SSD.
The
best way to optimize the data retention of a NAND-based SSD is to l
imit the temperature at which the NAND flash is stored. When the drive has reached or is approaching its end of life, limiting the time of exposure to high temperature will also help extend the data retention
-taken from the science behind creation of flash memory and SSDs aswell as how they work/function, Multiple books on SSD and NAND memory aswell as the treasure troves of pages explaining why hard drives do this. I did not come up with ALL of this material on my own, Only partial amounts, Thank you for reading and I hope this helps you understand SSDs, how they work, why yours is having problems, and the reasoning behind temp increases in SSDs and Decreases in performance over time.
some SSDs dont even make it a year, It all depends on how you treat it and how cold you keep it. The hotter it gets the faster it dies. The reason yours is hot is because one of the NAND cells has gone bad, leading to voltage resistance and the inability to hold a charge, resulting in heat, this damaged NAND chip will slowly and possibly corrupt nearby chips and eventually will destroy the SSD.
Your Disk is fine as it is, if the errors increase, its time to get a new one or strap a aftermarket cooler onto your SSD to help reduce the temps and increase retentions.
The reasons your benchs fluxuate is because all the NAND cells are different, and lets say its benching all of them, when it reaches the bad cell, its going to drop the score immensly as it trys to read corrupted data no longer obtainable.
your welcome
My main site used for sourcing
Another site for sources
Information on CRC Errors
The rest is personal knowledge.
I hope any of you enjoy this and it helps you to better understand partially if not fully grasp the way a SSD functions and works.
TROUBLESHOOTING
You can test this aswell by plugging in aniother drive into the same slot that you had your SSD in, run the same test, if you have a CRC error, then you know its not the drive, its the sata or drive controller.
Try replacing the cable with a decent one, and to do a final test, take the drive with a new sata and plug it into a different motherboard, open it run the test, if its still 1, then you are 100% fine. if you use another random drive with the same sata on a new mobo, if it develops a CRC error then you know its the cable, if while using the SSD on a different motherboard with the same cable, then you know its a SSD Issue.
In the end it all comes down to communication of data between the mobo and the harddrive, that is what causes CRC errors and there is not much you can do about it.
I recommend replacing the SATA with a new 6gb one, and possibly even changing the sata port currently used for it.
Also remember, a power failure can ultimately cause this simple error.
Your temperature may be increased as well because when you try to read data from the SSD, if it is corrupt it will have great difficulty doing it increasing the temp, and then that same corrupted data that it tried to read will be passed through the sata to the mobo, where it receives corrupt data causing an error in communication.
You can attempt to try the SSD in another computer and look for the same error. if not then you know its with the connection between your SSD and MOBO sata plugs. Make sure the sata plug is clean and not damaged.
Make sure you are using a 6gb SATA and have enough and the correct amount of power going to it. Your sata cables should be tight, fit snuggly and should not fold over or bend, A lose connection with the sata can cause this issue.
Honestly the last thing I wanted to be know for was spreading "Misinformation" so I took a few hours to put together this complex guide that I created using many different resources, sources, sites, and my own personal knowledge. I hope that in the future this can ease the minds of others with the same issue and explain to those interested in the science behind what makes your PC truly drive. This should answer OP's question as well. I tried to make it as thorough and descriptive as possible so most everyone can read and understand/learn something new.