| View previous topic :: View next topic |
| Author |
Message |
dougSF30
Joined: 18 Sep 2007 Posts: 52
|
Posted: Mon Dec 17, 2007 8:58 am Post subject: |
|
|
| mas wrote: |
LOL, they aren't die photos you clown ! |
Did I say that link was to the die photo, mas? Idiot.
|
|
| Back to top |
|
 |
mas
Joined: 31 Jul 2007 Posts: 74
|
Posted: Mon Dec 17, 2007 9:03 am Post subject: |
|
|
| dougSF30 wrote: | | mas wrote: |
LOL, they aren't die photos you clown ! |
Did I say that link was to the die photo, mas? Idiot. |
Yeah Cretin, you did implicitly as it directly followed otherwise what was the point of the link Twit.
The die photo also appeared to confirm this.
http://pc.watch.impress.co.jp/docs/2007/1128/kaigai403_05.pdf
|
|
| Back to top |
|
 |
who?
Joined: 01 Sep 2007 Posts: 464
|
Posted: Mon Dec 17, 2007 9:38 am Post subject: |
|
|
|
|
|
| Back to top |
|
 |
Eric Bron
Joined: 31 Aug 2007 Posts: 125 Location: Switzerland
|
Posted: Mon Dec 17, 2007 9:51 am Post subject: |
|
|
| dkanter wrote: |
Nehalem is a 4 issue core, with a 3 level cache hierarchy.
DK |
do you have an official source to share for the "3 level cache hierarchy" ?
|
|
| Back to top |
|
 |
jack
Joined: 27 Jun 2007 Posts: 284
|
Posted: Mon Dec 17, 2007 11:01 am Post subject: |
|
|
| mas wrote: | | 3.2-3.4 GHz is clearly just a matter of time. |
Technically, this will probably be correct: if/when Shanghai is delayed or has trouble with the scaling, AMD may introduce 3.2GHz quad core 65nm Barcelona in 2009. Just like they introduced 3.2GHz K8 in 2007.
|
|
| Back to top |
|
 |
Uffe Merrild
Joined: 27 Jun 2007 Posts: 107 Location: Silkeborg, Denmark
|
Posted: Mon Dec 17, 2007 1:07 pm Post subject: |
|
|
This thread summarized:
LOL AMD
LOL STOCK
LOL BUY
LOL SELL
LOL ASS
What happened to you guys??
|
|
| Back to top |
|
 |
j3pflynn
Joined: 28 Jul 2007 Posts: 15
|
Posted: Mon Dec 17, 2007 1:15 pm Post subject: |
|
|
| DUCK of DEATH wrote: |
I suppose it is a bit like pulling the wings off a retarded fly. LOL!!! |
One of the first indicators of a serial killer - abuse of animals.
And no, I'm not a PETA nut(unless it's People Eating Tasty Animals), but abusing them ain't right.
|
|
| Back to top |
|
 |
P4man
Joined: 26 Jun 2007 Posts: 419
|
Posted: Mon Dec 17, 2007 1:40 pm Post subject: |
|
|
| Uffe Merrild wrote: |
What happened to you guys?? |
I don't know, but i do know what *will* happen if this is repeated once more: a triple ban.
Mas, Doug and Duck consider this your very last warning.
|
|
| Back to top |
|
 |
caboosemoose
Joined: 17 Dec 2007 Posts: 4
|
Posted: Mon Dec 17, 2007 2:23 pm Post subject: |
|
|
| dougSF30 wrote: |
When you spoke to the Nehalem architect, did he mention the cache structure by chance? My strong impression is that with the possible exception of the 8-core MP part, the "last level shared cache" is an L2, not an L3 as was assumed by most people prior to the die photo. Intel was always careful to say only "multi"-level, and "last level shared".
Here's a quote about Nehalem performance:
http://www.theinquirer.net/gb/inquirer/news/2007/10/16/idf-taipei-nehalem-real-big
So, how much will the Nehalem be faster than the already oh-so-fast Penryn, focusing specifically on the core IPC single thread performance? I asked Kirk Skaugen, Intel's Digital Enterprise Group VP and GM of Server Platforms Group.
Kirk is always friendly, even to the point of having had a fun chat during the previous Taipei IDF on how great it would have been if Nvidia actually did come out with a dual socket Xeon Nforce SLI chipset - which it didn't up to now - so I expected a fun answer this time too.
Well, no fun here - he didn't want to comment on the CPU core-specific performance expectations beyond the well known integrated memory controllers and interconnects. But he did say that the CPU core performance jump from the same process Core 2 (Penryn) to Nehalem would be higher than the jump for Netburst to Core 2 itself. |
Sorry, I didn't ask him about the cache structure, but he probably wouldn't have said much, he was obviously only will to talk very broad brush.
We'll see. Personally I'm not expecting Nehalem to be all that huge a jumpm in IPC. Netburst vs Conroe was HUGE for IPC. I just can't see that happening, except perhaps for some specific bandwidth hungry server work loads.
| Paul DeMone wrote: |
Given Core 2 Duo is a four issue core the odds are you don't
understand very well. |
Given that I even mentioned that Core and Nehalem are any "issue" at all the odds are that I made a silly mistake / typo - it's not a complicated concept, three vs four issue - which is in fact what happened, so if you can all find it in your cold, poisoned hearts to forgive me, that would be swell.
One other comment I can make is that the Nehalem architect was very quick to say how wonderful he thought the improvements in IPC were for Penryn over Conroe. It was probably just a political gesture, and perhaps said to make him and his team look good when Nehalem arrives and blows away Penryn. But honestly, in the vast majority of instances Penryn is a pretty modst improvement IPC-wise. I am aware that Penryn is a tock and not a tick, so let's not argue about that. Process obviosuly another matter.
|
|
| Back to top |
|
 |
Gabriele Svelto
Joined: 27 Jun 2007 Posts: 263 Location: Milano, Italy
|
Posted: Mon Dec 17, 2007 2:55 pm Post subject: |
|
|
| caboosemoose wrote: | | We'll see. Personally I'm not expecting Nehalem to be all that huge a jumpm in IPC. Netburst vs Conroe was HUGE for IPC. I just can't see that happening, except perhaps for some specific bandwidth hungry server work loads. |
I wouldn't discount Nehalem improvements coming from the IMC as just something which will improve 'bandwidth' sensitive workloads. That's because even on non-bandwidth intensive workloads the C2x hardware prefetchers are perfectly able to generate a significant amount of traffic and reduce apparent memory latency. However on quad-core models this doesn't reap as many benefits as it does on dual-core variants and this is a visible effect on benchmarks with well behaved memory patterns or significant bandwidth demands were turning-off prefetching usually leads to higher performance.
Quad-core Nehalem variants will have both significantly lower latency and per-core bandwidth than C2Q which will in turn allow for even more aggressive prefetching strategies. This could provide a significant boost even on integer applications - especially the ones with larger working sets - and will have the benefit of providing this performance with less tuning compared to C2Q.
|
|
| Back to top |
|
 |
dougSF30
Joined: 18 Sep 2007 Posts: 52
|
Posted: Mon Dec 17, 2007 5:35 pm Post subject: |
|
|
| mas wrote: | | dougSF30 wrote: | | mas wrote: |
LOL, they aren't die photos you clown ! |
Did I say that link was to the die photo, mas? Idiot. |
Yeah Cretin, you did implicitly as it directly followed otherwise what was the point of the link Twit.
The die photo also appeared to confirm this.
http://pc.watch.impress.co.jp/docs/2007/1128/kaigai403_05.pdf |
The point of the link was (obviously) to provide a reference to the different models of Nehalem, as it is only the EX part that appears to have 3 levels of cache indicated.
|
|
| Back to top |
|
 |
dougSF30
Joined: 18 Sep 2007 Posts: 52
|
Posted: Mon Dec 17, 2007 6:33 pm Post subject: |
|
|
Besides the die photo, and speculation on a couple sites, another reason I suspect Nehalem does not simply have a shared L3 like Barcelona for all parts in the range is the way Intel has been so deliberately vague about it.
It seems to me there'd be little reason to do that with a "standard" L1/L2 shared L3 structure. But if they have "leveraged SmartCache" in an innovative manner to be able to have a large 4-way shared L2, well, that WOULD be something to keep quiet about for a longer time.
|
|
| Back to top |
|
 |
mas
Joined: 31 Jul 2007 Posts: 74
|
Posted: Mon Dec 17, 2007 10:08 pm Post subject: |
|
|
| dougSF30 wrote: | | mas wrote: | | dougSF30 wrote: | | mas wrote: |
LOL, they aren't die photos you clown ! |
Did I say that link was to the die photo, mas? Idiot. |
Yeah Cretin, you did implicitly as it directly followed otherwise what was the point of the link Twit.
The die photo also appeared to confirm this.
http://pc.watch.impress.co.jp/docs/2007/1128/kaigai403_05.pdf |
The point of the link was (obviously) to provide a reference to the different models of Nehalem, as it is only the EX part that appears to have 3 levels of cache indicated. |
What you are seeing there with the EX part is probably an L4, with the L1 and L2 being considered part of the exclusive core cache.
|
|
| Back to top |
|
 |
mas
Joined: 31 Jul 2007 Posts: 74
|
Posted: Mon Dec 17, 2007 10:19 pm Post subject: |
|
|
| dougSF30 wrote: | Besides the die photo, and speculation on a couple sites, another reason I suspect Nehalem does not simply have a shared L3 like Barcelona for all parts in the range is the way Intel has been so deliberately vague about it.
It seems to me there'd be little reason to do that with a "standard" L1/L2 shared L3 structure. But if they have "leveraged SmartCache" in an innovative manner to be able to have a large 4-way shared L2, well, that WOULD be something to keep quiet about for a longer time. |
The chip experts are going for a 512KB L2 and 8MB L3
http://aceshardware.freeforums.org/viewtopic.php?t=141
David also has direct access to Intel architects, I think you are fighting a heavy tide here ;-).
|
|
| Back to top |
|
 |
dougSF30
Joined: 18 Sep 2007 Posts: 52
|
Posted: Mon Dec 17, 2007 11:02 pm Post subject: |
|
|
| mas wrote: | | dougSF30 wrote: | Besides the die photo, and speculation on a couple sites, another reason I suspect Nehalem does not simply have a shared L3 like Barcelona for all parts in the range is the way Intel has been so deliberately vague about it.
It seems to me there'd be little reason to do that with a "standard" L1/L2 shared L3 structure. But if they have "leveraged SmartCache" in an innovative manner to be able to have a large 4-way shared L2, well, that WOULD be something to keep quiet about for a longer time. |
The chip experts are going for a 512KB L2 and 8MB L3
http://aceshardware.freeforums.org/viewtopic.php?t=141
David also has direct access to Intel architects, I think you are fighting a heavy tide here ;-). |
Hans' analysis is not convincing at all. (I believe he's got some other errors there, in addition to the cache analysis) Still waiting for David to reply. I'm sticking with my position, as it stands:
L3 in the EX, but shared 8MB L2 in the rest.
|
|
| Back to top |
|
 |
|