SI
SI
discoversearch

We've detected that you're using an ad content blocking browser plug-in or feature. Ads provide a critical source of revenue to the continued operation of Silicon Investor.  We ask that you disable ad blocking while on Silicon Investor in the best interests of our community.  If you are not using an ad blocker but are still receiving this message, make sure your browser's tracking protection is set to the 'standard' level.
Technology Stocks : Advanced Micro Devices - Moderated (AMD) -- Ignore unavailable to you. Want to Upgrade?


To: pgerassi who wrote (53366)8/31/2001 11:33:29 AM
From: jcholewaRead Replies (1) | Respond to of 275872
 
> Athlon is the only x86 CPU IIRC which can decode 3
> x86 instructions per cycle

counterargument:
azillionmonkeys.com
"So when everything is working well, the P-II can take 3 simple x86 instructions and turn them into 3 micro-ops on every clock. But, as can be plainly seen in their comments, they have a bizzare problem: they can only read two physical input register operands per clock (rename registers are not constrained by this condition.) This means scheduling becomes very complicated. Registers to be read for multiple purposes will not cost very much, and data dependencies don't suffer from any more clocks than expected, however the very typical trick of spreading calculations over several registers (used especially in loop unrolling) will upper bound the pipeline to two micro-ops per clock because of a physical register read bottleneck."



To: pgerassi who wrote (53366)8/31/2001 8:47:46 PM
From: Dan3Respond to of 275872
 
Re: Athlon is the only x86 CPU IIRC which can decode 3 x86 instructions per cycle. The P3 could do 2 and the P4 can only do 1

P4 can only decode one instruction every other cycle - you're giving it twice the credit it deserves. It runs at half speed every time it hits new code (the trace cache helps it sometimes, but not always).