To: Petz who wrote (5244 ) 8/16/2000 1:54:39 PM From: pgerassi Read Replies (1) | Respond to of 275872 Dear John: Re: Best Case? If all the program fits into the L1 that is the best case for the P4 over the P3, not the worst case. In business benchmarks, the bottlenecks are outside the CPU. That is why a large CPU horsepower increase does not change the benchmarks by that much. This means that even a 2G P4 will not have much higher a score than a 1G P3 and the breakeven will occur at a higher ratio than 1.4. The best programs for the P4 are simulations and games where the data quantities are large and the programs are small so that the trace cache rarely misses. If the trace cache does miss, whenever there is a call or branch will cause a 8 cycle wait while the trace cache begins to fill with decoded instructions. This would cause even a larger ratio. Assuming that SSE2 execution is not bungled and the FSB can keep the CPU fed, the trace cache almost never missing is where the breakeven might get down to 1 to 1. Unfortunately, this is also the type of applications that the Duron, Tbird, Mustang, and especially the Sledgehammer excel. Do you think that P4 could stay with a same clocked Tbird? A Mustang? It would not even be a chance P4 could stay with Sledgehammer at same clock, possibly even >2 times clock. What do you think that the minimum clock ratio must be for P4 against Duron, Tbird, Mustang, and the "Hammer" series to be saleable? IMHO the real reason for P4 delays is that they are having trouble getting the clock high enough to make up for the disadvantage of the lower IPC. They keep trying small changes (more silicon respins) to better balance the pipe stages and hope to find that sweet point that overall performance is best (hopefully over the profitably breakeven point). Pete