Welcome to the MacNN Forums.

If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.

You are here: MacNN Forums > Community > Team MacNN > S@H: 4x vs 8x Mac Pro Performance

S@H: 4x vs 8x Mac Pro Performance
Thread Tools
Senior User
Join Date: Jun 2006
Location: Dublin, CA
Status: Offline
Reply With Quote
Mar 6, 2007, 10:53 PM
 
1) E.T from Tellus, dual quad (@2.33ghz) + 2gb RAM + alexkan's "Core 2-optimized v8-prerelease-nographics". RAC ~7200:
http://www.boincstats.com/stats/boin...amp;id=2019771

2) Bad to the bone, dual dual (@ 3.0ghz, I assume) + 2gb RAM + alexkan's "Intel, Core 2-optimized v8-prerelease-nographics". RAC ~5000.
http://www.boincstats.com/stats/boin...amp;id=1720553

Everything is identical except #1 has double the cores, and #2 is faster. So what's the deal here? I did not expect 2x with #1 obviously. But I did expect better than only a 44% increase.

If I understand correctly, most of the work stays between the CPU and the L2. If so, especially with the 8 core, I am wondering if lack of CPU affinity is making things worse for the 8-way? Each pair of cores share an L2. So with #1, you have only a 1 in 4 chance of having the data in the right L2. But with #1, a 1 in 2 chance.

Or do I just have over inflated expectations here?
     
Administrator
Join Date: May 2000
Location: California
Status: Offline
Reply With Quote
Mar 7, 2007, 02:20 AM
 
If you correct for the MHz diff, the picture looks much more normal.

1) E.T. from Tellus: (8-core) 2.33 GHz = 7200 RAC. -> 3.0 GHz would equal 9270 RAC

2) Bad to the bone: (4-core) 3.0 GHz = 5000 RAC

85% improvement when MHz is normalized.
     
Forum Regular
Join Date: Aug 2005
Location: Cupertino, CA
Status: Offline
Reply With Quote
Mar 9, 2007, 02:34 AM
 
Originally Posted by zombie67 View Post
Everything is identical except #1 has double the cores, and #2 is faster. So what's the deal here? I did not expect 2x with #1 obviously. But I did expect better than only a 44% increase.

If I understand correctly, most of the work stays between the CPU and the L2. If so, especially with the 8 core, I am wondering if lack of CPU affinity is making things worse for the 8-way? Each pair of cores share an L2. So with #1, you have only a 1 in 4 chance of having the data in the right L2. But with #1, a 1 in 2 chance.

Or do I just have over inflated expectations here?
While a good portion of the work that SETI does on a single WU does a good job of fitting in the L2 cache, there is also a sizable portion of the WU that doesn't fit as well, and which tends to bottleneck on the RAM's ability to keep the cores fed with data. These parts of the computation tend to be responsible for SETI's less-than-perfect scaling with the number of cores, and the poor performance of Mac Pros with only two installed DIMMs.
     
   
Thread Tools
Forum Links
Forum Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Top
Privacy Policy
All times are GMT -5. The time now is 11:35 PM.
All contents of these forums © 1995-2011 MacNN. All rights reserved.
Branding + Design: www.gesamtbild.com
vBulletin v.3.8.7 © 2000-2011, Jelsoft Enterprises Ltd., Content Relevant URLs by vBSEO 3.3.2