SI
SI
discoversearch

We've detected that you're using an ad content blocking browser plug-in or feature. Ads provide a critical source of revenue to the continued operation of Silicon Investor.  We ask that you disable ad blocking while on Silicon Investor in the best interests of our community.  If you are not using an ad blocker but are still receiving this message, make sure your browser's tracking protection is set to the 'standard' level.
Technology Stocks : Intel Corporation (INTC)
INTC 36.32-0.2%3:50 PM EST

 Public ReplyPrvt ReplyMark as Last ReadFilePrevious 10Next 10PreviousNext  
To: Tenchusatsu who wrote (177960)5/13/2004 6:02:42 PM
From: TigerPaw  Read Replies (1) of 186894
 
If every local access to the L2 cache has to go out onto the bus for cache coherency purposes,....

I believe that is the case (with current OS) if there is more than one CPU chip. The OS just doesn't know if memory is shared or not without checking each access.

A lone dual-core chip could avoid this bottleneck, but as soon as you try to expand you hit this coherency check slowdown.

then there is no need for the MESI cache protocol or for writeback caches in the first place. This question has been settled way back in the early-90s when WB caches were invented in the first place.

The strategies to avoid this endless checking is rather limited. The OS uses processor affinity, which means that once a task runs on a certain CPU the OS gives preference to running on the same CPU after a stall so that the L2 cache is still associated with the same data. That way coherency may have to be checked, but the data doesn't always have to be swapped. Critical tasks can be locked to a specific CPU. Otherwise it just runs the task when it is ready on the CPU that is least busy at the time.

The external bus issue was not much of an issue when all memory references went through an external northbridge anyway. It has a proportionally greater effect on performance now.

The advantage AMD has for dual core is as much the on-board memory controller as the hypertransport. Actually AMD has a bottleneck of it's own when the memory accessed by one CPU chipset is physically attached to the memory controller on another CPU chipset. The OS doesn't do much to try and avoid that performance penalty either.
TP
Report TOU ViolationShare This Post
 Public ReplyPrvt ReplyMark as Last ReadFilePrevious 10Next 10PreviousNext