SI
SI
discoversearch

We've detected that you're using an ad content blocking browser plug-in or feature. Ads provide a critical source of revenue to the continued operation of Silicon Investor.  We ask that you disable ad blocking while on Silicon Investor in the best interests of our community.  If you are not using an ad blocker but are still receiving this message, make sure your browser's tracking protection is set to the 'standard' level.
Technology Stocks : Intel Corporation (INTC)
INTC 35.75+3.6%3:59 PM EST

 Public ReplyPrvt ReplyMark as Last ReadFilePrevious 10Next 10PreviousNext  
To: Tenchusatsu who wrote (109122)8/31/2000 4:51:04 PM
From: Rob Young  Read Replies (1) of 186894
 
Tench,

Good overview but to add one wrinkle here that you
overlooked. You are mostly touching on the on-chip
memory switch. Couple that with the directory based
cache that allows a very low latency L2 hit to bypass
a global switch (if you will) but route directly
from L2 to L2 via the mesh as you describe. Why 16
as a choice? Most hops are 2. When you add up the
latency for those 2 hops you are still doing much
better than an L3 hit and of course main memory. So
in a sense you would have a system with 24 MB of L2
per CPU. A stretch? Somewhat ... but not much of one.

Go out and read the earlier referenced papers or
some detail from MPR:

alphapowered.com

Slide 14 and others show the P2P nature

Regarding interconnect... not sure.. they can and will
go higher. Rumor has it Marvel is 64 CPUs (solid rumor)
but that puts remote L2 several hops away.. still better
than main memory though !! ;-)

Rob
Report TOU ViolationShare This Post
 Public ReplyPrvt ReplyMark as Last ReadFilePrevious 10Next 10PreviousNext