To: Tenchusatsu who wrote (43156 ) 5/30/2000 1:38:00 AM From: Bilow Read Replies (2) | Respond to of 93625
Hi Tenchusatsu; "- A common piece of anti-Rambus FUD is echoed yet again in this article: "RDRAM is rather more susceptible to EMI (electromagnetic interference) than SDRAM, and it is often advised to enable ECC (Error Correcting Code) especially with the faster PC800 RIMMs." I don't know of any engineer that would try and mask reliability problems with ECC. That's absolutely ludicrous. It's like driving a car more recklessly just because the car is equipped with airbags. " The truth is a lot more subtle than this. If it were the case that EMI induced memory failures were bit independent, and that they had a sufficiently low rate, then you could reduce that rate considerably by going to ECC. While this will not entirely eliminate the problem, as it would still be possible for two bits to err on the same 64-bit word, it is possible that the the error rate would be moved from an unacceptably high rate to an acceptably low rate. Everything in engineering has a limit, and error rates are no different. As an example, suppose that the bit error rate is independent and is at a rate of one error every 10^15 bits. This would mean that the computer would have a memory failure every few weeks or so, depending on how fast memory was accessed. This would be an unacceptably high failure rate. But if ECC could then take that rate to one error every 10^30 bits, then it is highly likely that no user of any such computer ever built would ever see an error, even assuming that they ship 100 million of them, and that they each compute for 100 years. Metastability is the classic example of where engineers ship products with a non-zero chance of failure. As was mentioned on the thread, the i820 and VIA chipsets are "asynchrnonous". With such interfaces, metastabilities are inevitable. (You get these when, in this case, you transfer data from one clock domain to another, asynchronous one.) Engineers essentially eliminate the problem by adding delays to the system, delays that are long enough to force the probability that a metastability will cause a system error to be extremely tiny. In fact, every time you type a character on a keyboard, there is a non-zero chance that you will, through a (highly improbable) cascading metastability cause your computer to err. The subject is quite fascinating, here's a link to a typical industry article giving metastability calculations. The fact is that in this imperfect world of mucus, blood and feces, (as opposed to God's world of infinite perfection) engineers do drive faster (but not necessarily recklessly) when they have airbags:Metastability Considerations January 1997Metastability is unavoidable in asynchronous systems. However, using the formulas and test measurements supplied here, designers can calculate the probability of failure. Design techniques for minimizing metastability are also provided. xilinx.com -- Carl