To: Cirruslvr who wrote (120399 ) 7/17/2000 1:56:20 AM From: Charles R Respond to of 1571373 Cirruslvr, < amd.com > Nice link. Thanks for posting. <Potential answers:> I am no Scumbria but see my comments. <1. Prior to the L2 cache going on die, the P6 was ALWAYS underfed, but is a VERY efficient design whose full performance wasn't attainable until the cache went on die.> There is no question that 32-kb L1 cache is a bottleneck for standard Windows x86 kind of code. <2. Athlon's HUGE L1 cache SIGNIFICANTLY reduces the need for L2 cache access, and therefore doesn't benefit as much as PIII from faster L2 cache.> That is a given. <3. AMD's interpretation of "no significant impact" is significantly different from mine.> Unless there is a stated number who knows what "no significant" means. <4. There is a bottleneck somewhere within the Athlon.> Yes. It is the extra latency on the L1. That is by design. Nowadays architectures are tuned for MHz. I continue to be amused by the relative performance discussions of PIII and Athlon on a clock-per-clock basis. Due to the increasingly longer pipelines, the IPC for typical application code has nowhere to go but down barring some heretofore unknown breakthrough. The game is about MHz and Athlon is a winner by long shot. PIII can't keep up with notched gates, die shrinks, tight layouts, massive amount of speed path work, etc. <5. The P6 is just THAT good.> See above. On integer code, IPC improvements will be minimal. Athlon FPU is clearly better architected and there should be significant delta there with equivalent platforms and compilers. <What do you think after reading AMD's explanation?> No new ground broken. Improved understanding of L1 miss penalty when the victim buffer is full. Chuck