<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>insideHPC &#187; HPCAnswers</title>
	<atom:link href="http://insidehpc.com/category/hpcanswers/feed/" rel="self" type="application/rss+xml" />
	<link>http://insidehpc.com</link>
	<description>HPC news for supercomputing professionals</description>
	<lastBuildDate>Fri, 25 May 2012 16:15:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.1</generator>
		<item>
		<title>New Whitepaper: Introduction to Intel OpenCL Tools</title>
		<link>http://insidehpc.com/2011/07/05/new-whitepaper-introduction-to-intel-opencl-tools/</link>
		<comments>http://insidehpc.com/2011/07/05/new-whitepaper-introduction-to-intel-opencl-tools/#comments</comments>
		<pubDate>Tue, 05 Jul 2011 07:15:32 +0000</pubDate>
		<dc:creator>RichB</dc:creator>
				<category><![CDATA[HPC]]></category>
		<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/?p=21246</guid>
		<description><![CDATA[Clipped from: software.intel.com (share this clip) &#160; A new whitepaper by Intel&#8217;s Vinay Awasthi describes the status of the company&#8217;s OpenCL implementation and available tools for developers using the Intel OpenCL SDK. The Intel implementation is the only implementation at the moment that implements out of order queues. Intel&#8217;s implementation also allows multiple work-items per workgroup [...]]]></description>
			<content:encoded><![CDATA[<div class="clply_clip" style="margin: 5px auto 0 auto; clear: both; width: 450px;"><a href="http://s.tt/12Mz2"><img style="border: none; background: none;" src="http://i.curate.us/img/9f49eac4c942de0cf6053c81991868da?offset=0&amp;size=450&amp;stamp=1309810413&amp;bg=ffffff" alt="" /></a><br />
<span class="clply_caption" style="display: block; font-size: 10px; font-family: sans-serif; text-align: center;">Clipped from: <a href="http://s.tt/12Mz2">software.intel.com</a> (<a class="clply_share_link" href="http://curate.us/12Mz2+">share this clip</a>)</span></div>
<p>&nbsp;</p>
<p>A new <a href="http://software.intel.com/en-us/articles/introduction-to-intel-opencl-tools/?cid=sw:Intro_OpenCL_Tools_Blog_JK10">whitepaper</a> by Intel&#8217;s Vinay Awasthi describes the status of the company&#8217;s OpenCL implementation and available tools for developers using the Intel OpenCL SDK.</p>
<blockquote><p>The Intel implementation is the only implementation at the moment that implements out of order queues. Intel&#8217;s implementation also allows multiple work-items per workgroup for CPUs. There is also preview support for device fission extension (not fully validated). We will cover the benefit of such options later in this whitepaper. With this implementation, you will also receive OpenCL offline compiler. This compiler will let you observe assembly instructions and intermediate representation (IR) of your OpenCL kernels instantly without having to plug them into a program or using any APIs to get IR. Developers can use this tool to also compile kernels for correctness.</p></blockquote>
<p>Read the <a href="http://software.intel.com/en-us/articles/introduction-to-intel-opencl-tools/?cid=sw:Intro_OpenCL_Tools_Blog_JK10">Full Story</a>.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=21246&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2011/05/22/whitepaper-from-cuda-to-opencl-towards-a-performance-portable-solution-for-multi-platform-gpu-programming/' rel='bookmark' title='Permanent Link: Whitepaper: From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming'>Whitepaper: From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming</a></li><li><a href='http://insidehpc.com/2010/12/13/kanters-introduction-to-opencl/' rel='bookmark' title='Permanent Link: Kanter&#8217;s Introduction to OpenCL'>Kanter&#8217;s Introduction to OpenCL</a></li><li><a href='http://insidehpc.com/2012/01/06/amds-opencl-the-good-the-bad-and-the-ugly/' rel='bookmark' title='Permanent Link: AMD&#8217;s OpenCL &#8211; The Good, the Bad, and the Ugly'>AMD&#8217;s OpenCL &#8211; The Good, the Bad, and the Ugly</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2011/07/05/new-whitepaper-introduction-to-intel-opencl-tools/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>HPC enables discovery of new blood pressure drug</title>
		<link>http://insidehpc.com/2008/05/05/hpc-enables-discovery-of-new-blood-pressure-drug/</link>
		<comments>http://insidehpc.com/2008/05/05/hpc-enables-discovery-of-new-blood-pressure-drug/#comments</comments>
		<pubDate>Tue, 06 May 2008 02:58:35 +0000</pubDate>
		<dc:creator>John West</dc:creator>
				<category><![CDATA[HPC]]></category>
		<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2008/05/05/hpc-enables-discovery-of-new-blood-pressure-drug/</guid>
		<description><![CDATA[Also at the new HPCwire today, news that U of Florida researchers used HPC to find a new drug that lowers blood pressure and prevents heart and kidney damage, at least in rats. More research in humans coming. Researchers used one of the world’s most powerful supercomputers to process 140,000 prospective drug compounds in a [...]]]></description>
			<content:encoded><![CDATA[<p>Also at the new HPCwire today, <a href="http://www.hpcwire.com/offthewire/Supercomputer_Points_Researchers_to_New_Blood_Pressure_Drug.html">news</a> that U of Florida researchers used HPC to find a new drug that lowers blood pressure and prevents heart and kidney damage, at least in rats. More research in humans coming.</p>
<blockquote><p>Researchers used one of the world’s most powerful supercomputers to process 140,000 prospective drug compounds in a matter of weeks. The computer predicted which molecules would be most likely to enhance the activity of ACE2, rotating them in thousands of different orientations to see how they would bind to certain pockets on the enzyme’s surface.</p>
<p>&#8230;After hitting on the “lead” compound, UF researchers then tested it in hypertensive rats that had developed fibrosis of the heart and kidney. The animals received the drug for two weeks. Tissue samples from treated animals revealed a significant decrease in fibrosis of the heart, kidney and blood vessels, said Ostrov, who described the findings as “striking and reproducible.”</p></blockquote>
<p>And the good news on this one keeps coming, apparently</p>
<blockquote><p>&#8230;Early results also show the compound inhibits inflammation, which has significant implications for a number of human diseases, including autoimmune diseases such as type 1 diabetes and rheumatoid arthritis as well as other diseases involving fibrosis, such as Alzheimer’s, Ostrov said.</p></blockquote>
<p>As the old SNL sketch goes, &#8220;it&#8217;s a non-dairy whipped topping AND a floor wax.&#8221; Anyway, good stuff.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=1713&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2012/05/24/drug-discovery-through-molecular-matchmaking-at-tacc/' rel='bookmark' title='Permanent Link: Drug Discovery through Molecular Matchmaking at TACC'>Drug Discovery through Molecular Matchmaking at TACC</a></li><li><a href='http://insidehpc.com/2007/05/21/in-silico-drug-discovery/' rel='bookmark' title='Permanent Link: in silico Drug Discovery'>in silico Drug Discovery</a></li><li><a href='http://insidehpc.com/2010/04/15/hpc-facilitates-discovery-in-drug-design-and-genomics/' rel='bookmark' title='Permanent Link: HPC facilitates discovery in drug design and genomics'>HPC facilitates discovery in drug design and genomics</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2008/05/05/hpc-enables-discovery-of-new-blood-pressure-drug/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>InsideTrack: Letter from SGI to LNXI customers</title>
		<link>http://insidehpc.com/2008/02/19/insidetrack-letter-from-sgi-to-lnxi-customers/</link>
		<comments>http://insidehpc.com/2008/02/19/insidetrack-letter-from-sgi-to-lnxi-customers/#comments</comments>
		<pubDate>Tue, 19 Feb 2008 23:08:16 +0000</pubDate>
		<dc:creator>John West</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>
		<category><![CDATA[InsideTrack]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2008/02/19/insidetrack-letter-from-sgi-to-lnxi-customers/</guid>
		<description><![CDATA[The letter linked below was sent from SGI to Linux Networx customers, and the InsideTrack secured a copy through its vast network of industry insiders. The letter, from SGI Global Services VP Bob Pette, outlines what LNXI customers can expect. A few highlights of interest. First, SGI&#8217;s take on what happened Today SGI announced the [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://insidehpc.com/images/sgi.gif" style="padding-top: 5px; float: right" />The letter linked below was sent from SGI to Linux Networx customers, and the InsideTrack secured a copy through its vast network of industry insiders.</p>
<p>The letter, from SGI Global Services VP Bob Pette, outlines what LNXI customers can expect. A few highlights of interest.</p>
<p>First, SGI&#8217;s take on what happened</p>
<blockquote><p>Today SGI announced the purchase of certain assets of Linux Networx, Inc.. In conjunction with this transaction SGI has offered employment to a number of Linux Networx employees in the Engineering, Sales and Services areas and acquired LNXI&#8217;s spare parts inventory.</p></blockquote>
<p>And then what it means for LNXI customers (note the honesty):</p>
<blockquote><p>SGI did not acquire Linux Networx’s service contracts and as such, does not have a service contract in place with you.  &#8230;During this period of transition, please continue to place your LNXI service requests as before by calling 1-800-459-7138 or online at http://support.linuxnetworx.com.</p></blockquote>
<p>Very refreshing. <a href="http://insidehpc.com/images/02192008/LNXISGILetter.pdf">Here is the whole letter</a> as a PDF.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=1471&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2008/03/19/additional-support-options-for-your-lnxi/' rel='bookmark' title='Permanent Link: Additional Support Options for Your LNXI'>Additional Support Options for Your LNXI</a></li><li><a href='http://insidehpc.com/2008/04/29/lnxi-holds-a-fire-sale/' rel='bookmark' title='Permanent Link: LNXI holds a fire sale'>LNXI holds a fire sale</a></li><li><a href='http://insidehpc.com/2007/11/20/lnxi-announces-solution-center-for-mid-market-customers/' rel='bookmark' title='Permanent Link: LNXI announces Solution Center for mid market customers'>LNXI announces Solution Center for mid market customers</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2008/02/19/insidetrack-letter-from-sgi-to-lnxi-customers/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Article: File Systems for HPC Clusters</title>
		<link>http://insidehpc.com/2007/10/15/article-file-systems-for-hpc-clusters/</link>
		<comments>http://insidehpc.com/2007/10/15/article-file-systems-for-hpc-clusters/#comments</comments>
		<pubDate>Mon, 15 Oct 2007 17:45:45 +0000</pubDate>
		<dc:creator>John Leidel</dc:creator>
				<category><![CDATA[Enterprise HPC]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2007/10/15/article-file-systems-for-hpc-clusters/</guid>
		<description><![CDATA[Jeffrey B. Layton has written yet another interesting article for Linux Magazine.  This time, he&#8217;s put together an overview for those interested in implementing a parallel file system on a cluster. If you have an interest in pulling together your own cluster, or maybe you just want to understand more about cluster technology, it’s necessary [...]]]></description>
			<content:encoded><![CDATA[<p>Jeffrey B. Layton has written yet another interesting article for Linux Magazine.  This time, he&#8217;s put together an overview for those interested in implementing a parallel file system on a cluster.</p>
<blockquote><p>If you have an interest in pulling together your own cluster, or maybe you just want to understand more about cluster technology, it’s necessary to grok the differences between clusters and standard systems.</p></blockquote>
<p>Read the full article <a href="http://www.linux-mag.com/launchpad/business-class-hpc/build/4169" title="linux-mag">here</a>.  [free registration required]</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=977&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2008/09/01/what-can-nature-teach-us-about-really-big-clusters/' rel='bookmark' title='Permanent Link: What can nature teach us about really big clusters?'>What can nature teach us about really big clusters?</a></li><li><a href='http://insidehpc.com/2009/03/05/article-explores-answers-for-challenges-faced-by-virtualization-in-hpc/' rel='bookmark' title='Permanent Link: Article explores answers for challenges faced by virtualization in HPC'>Article explores answers for challenges faced by virtualization in HPC</a></li><li><a href='http://insidehpc.com/2008/08/18/linux-mag-on-the-value-of-cluster-standards/' rel='bookmark' title='Permanent Link: Linux Mag on the value of cluster standards'>Linux Mag on the value of cluster standards</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2007/10/15/article-file-systems-for-hpc-clusters/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Will SaaS work in HPC?</title>
		<link>http://insidehpc.com/2007/02/12/will-saas-work-in-hpc/</link>
		<comments>http://insidehpc.com/2007/02/12/will-saas-work-in-hpc/#comments</comments>
		<pubDate>Mon, 12 Feb 2007 21:54:57 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2007/02/12/will-saas-work-in-hpc/</guid>
		<description><![CDATA[Software-as-a-Service is useful for products that require network connectivity, such as email and instant messaging; just witness the popularity of Gmail and Meebo. Among enterprise customers, ERP/CRM applications hold some degree of promise, like Salesforce.com. So then the question is whether technical computing customers could use SaaS. The best example I can think of is [...]]]></description>
			<content:encoded><![CDATA[<p>Software-as-a-Service is useful for products that require network connectivity, such as email and instant messaging; just witness the popularity of <a href="http://www.gmail.com/">Gmail</a> and <a href="http://www.meebo.com/">Meebo</a>. Among enterprise customers, ERP/CRM applications hold some degree of promise, like <a href="http://www.salesforce.com/">Salesforce.com</a>. So then the question is whether technical computing customers could use SaaS.</p>
<p>The best example I can think of is in offloading hefty workloads to managed servers. This isn&#8217;t grid computing as traditionally known, but rather an application that can be called on-demand to perform a very specific task. I believe that in the future, it may be possible to farm heavy number-crunching from Excel or MATLAB to another company&#8217;s server on the fly. I&#8217;m just waiting for Microsoft to introduce &#8220;Excel Services Live.&#8221; You heard it here first.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=268&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2007/03/21/new-saas-built-on-amazons-ec2/' rel='bookmark' title='Permanent Link: New SaaS built on Amazon&#8217;s EC2'>New SaaS built on Amazon&#8217;s EC2</a></li><li><a href='http://insidehpc.com/2008/10/20/gridswork-this-week-in-france/' rel='bookmark' title='Permanent Link: GRIDS@Work this week in France'>GRIDS@Work this week in France</a></li><li><a href='http://insidehpc.com/2010/05/24/grid-dynamics-announces-partnership-with-microsoft/' rel='bookmark' title='Permanent Link: Grid Dynamics Announces Partnership with Microsoft'>Grid Dynamics Announces Partnership with Microsoft</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2007/02/12/will-saas-work-in-hpc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is the benefit of domain-specific languages?</title>
		<link>http://insidehpc.com/2007/02/10/what-is-the-benefit-of-domain-specific-languages/</link>
		<comments>http://insidehpc.com/2007/02/10/what-is-the-benefit-of-domain-specific-languages/#comments</comments>
		<pubDate>Sun, 11 Feb 2007 00:41:37 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2007/02/10/what-is-the-benefit-of-domain-specific-languages/</guid>
		<description><![CDATA[For starters, domain-specific languages make users more productive than general-purpose languages and give them more flexibility than a simple GUI. Consider what SQL gives to database managers, or Excel to finance professionals. And while C++ has features like polymorphism and operator overloading that allow for &#8220;syntactic sugar&#8221; in mathematics libraries, most engineers will prefer MATLAB [...]]]></description>
			<content:encoded><![CDATA[<p>For starters, domain-specific languages make users more productive than general-purpose languages and give them more flexibility than a simple GUI. Consider what SQL gives to database managers, or Excel to finance professionals. And while C++ has features like polymorphism and operator overloading that allow for &#8220;syntactic sugar&#8221; in mathematics libraries, most engineers will prefer MATLAB because, if for no other reason, it&#8217;s interactive.</p>
<p>But these languages have an added bonus that the HPC community should now take seriously: because they are limited, domain-specific language are easier to optimize. While <a href="http://www.aspeed.com/">ACCELLERANT</a> tries to parallelize any and all code, <a href="http://www.interactivesupercomputing.com/">Star-P</a> sticks to just matrix operations. After all, why should a non-programmer bother with stream computing, electronic systems-level design, and partitioned global address spaces when all he wants is to crunch numbers faster?</p>
<p>Before anyone decries dynamically typed languages for their perceived low performance, just keep in mind that many popular websites (massively distributed computing infrastructures) are actually programmed in ASP or PHP. Given proper optimization, a domain-specific language can be fast for both the computer and the user. So here&#8217;s to new languages for professionals.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=267&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2007/04/12/computerworld-looks-at-3-new-languages-for-hpc/' rel='bookmark' title='Permanent Link: Computerworld looks at 3 new languages for HPC'>Computerworld looks at 3 new languages for HPC</a></li><li><a href='http://insidehpc.com/2009/06/08/do-we-need-new-languages-for-parallel-processing/' rel='bookmark' title='Permanent Link: Do we need new languages for parallel processing?'>Do we need new languages for parallel processing?</a></li><li><a href='http://insidehpc.com/2010/10/12/review-introduction-to-concurrency-in-programming-languages/' rel='bookmark' title='Permanent Link: Review: Introduction to Concurrency in Programming Languages'>Review: Introduction to Concurrency in Programming Languages</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2007/02/10/what-is-the-benefit-of-domain-specific-languages/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is the best way to keep up on HPC news?</title>
		<link>http://insidehpc.com/2007/01/26/what-is-the-best-way-to-keep-up-on-hpc-news/</link>
		<comments>http://insidehpc.com/2007/01/26/what-is-the-best-way-to-keep-up-on-hpc-news/#comments</comments>
		<pubDate>Fri, 26 Jan 2007 20:35:35 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2007/01/26/what-is-the-best-way-to-keep-up-on-hpc-news/</guid>
		<description><![CDATA[Staying informed in this market can be difficult given our niche position. However, there are a few sources that anyone in this field should most definitely be familiar with. First and foremost are the conferences, namely Supercomputing (SC). Held annually in the US, this monster get-together showcases all of the latest in research and development, [...]]]></description>
			<content:encoded><![CDATA[<p>Staying informed in this market can be difficult given our niche position. However, there are a few sources that anyone in this field should most definitely be familiar with.</p>
<p>First and foremost are the conferences, namely <a href="http://www.supercomp.org/">Supercomputing</a> (SC). Held annually in the US, this monster get-together showcases all of the latest in research and development, plus offers a number of tutorials for emerging technology. A week here is equivalent to a semester in grad school. A distant second in this category is the <a href="http://www.supercomp.de/">International Supercomputer Conference</a> (ISC) held annually in Germany.</p>
<p>Among online sources, the best for original articles is <a href="http://www.hpcwire.com/">HPCwire</a>, whom I&#8217;ve written for. As for news snippets, John E. West&#8217;s <a href="http://www.insidehpc.com/">InsideHPC</a> is a daily source. Coincidentally, John is also a regular contributor to HPCwire.</p>
<p>For the broader technology market, there are always <a href="http://slashdot.org/">Slashdot</a>, <a href="http://www.dzone.com/">Dzone</a>, and <a href="http://www.theregister.co.uk/">The Register</a>. These occasionally have articles that may be of interest to HPC practitioners.</p>
<p>Those are the major news and information sources. As mentioned before, a surprisingly bad source is <a href="http://en.wikipedia.org/wiki/Main_Page">Wikipedia</a>. I had thought about the effort to create a &#8220;wikiHPC&#8221; to act as an online Hennessy and Patterson, but then I realized that we already have Wikipedia and so could probably just add to that. Grad students should feel free to copy and paste the factual background material of their thesis.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=266&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2010/11/30/nasa-to-hold-news-conference-on-astrobiology-discovery/' rel='bookmark' title='Permanent Link: NASA to Hold News Conference on Astrobiology Discovery'>NASA to Hold News Conference on Astrobiology Discovery</a></li><li><a href='http://insidehpc.com/2007/04/03/linux-networx-under-new-leadership/' rel='bookmark' title='Permanent Link: Linux Networx under new leadership'>Linux Networx under new leadership</a></li><li><a href='http://insidehpc.com/2006/12/28/whats-all-this-then/' rel='bookmark' title='Permanent Link: What&#8217;s all this, then?'>What&#8217;s all this, then?</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2007/01/26/what-is-the-best-way-to-keep-up-on-hpc-news/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is Duff&#8217;s Device?</title>
		<link>http://insidehpc.com/2007/01/03/what-is-duffs-device/</link>
		<comments>http://insidehpc.com/2007/01/03/what-is-duffs-device/#comments</comments>
		<pubDate>Wed, 03 Jan 2007 21:56:46 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2007/01/03/what-is-duffs-device/</guid>
		<description><![CDATA[Duff&#8217;s Device is a loop-optimization technique for C code that relies on macros to unroll a repetitive task. The primary benefit of loop unrolling is reduce branching, which is one of the single most expensive operations in computing. While some branching is necessary for the cache, too much branching will actually break the memory hierarchy, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://systemcoder.blogspot.com/2007/01/tight-loop-optimization-in-c.html">Duff&#8217;s Device</a> is a loop-optimization technique for C code that relies on macros to unroll a repetitive task. The primary benefit of loop unrolling is reduce branching, which is one of the single most expensive operations in computing. While some branching is necessary for the cache, too much branching will actually break the memory hierarchy, in addition to the pipeline. Programmers who require extreme performance would do well to learn a number of best-practice loop optimizations. Duff&#8217;s Device is one of them.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=265&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2006/08/27/how-do-we-improve-locality-of-reference/' rel='bookmark' title='Permanent Link: How do we improve locality of reference?'>How do we improve locality of reference?</a></li><li><a href='http://insidehpc.com/2006/09/02/what-is-software-pipelining/' rel='bookmark' title='Permanent Link: What is software pipelining?'>What is software pipelining?</a></li><li><a href='http://insidehpc.com/2011/07/19/optimizing-parallel-memory/' rel='bookmark' title='Permanent Link: Optimizing Parallel Memory'>Optimizing Parallel Memory</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2007/01/03/what-is-duffs-device/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is Parallel Knoppix?</title>
		<link>http://insidehpc.com/2006/12/05/what-is-parallel-knoppix/</link>
		<comments>http://insidehpc.com/2006/12/05/what-is-parallel-knoppix/#comments</comments>
		<pubDate>Tue, 05 Dec 2006 20:31:42 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/12/05/what-is-parallel-knoppix/</guid>
		<description><![CDATA[Have you ever been in a position where you needed to run an MPI application a few times, but not enough times to justify buying your own cluster? Do you have access to a few PCs, but can&#8217;t or don&#8217;t want to install any software such as Condor on them? Then maybe you could use [...]]]></description>
			<content:encoded><![CDATA[<p>Have you ever been in a position where you needed to run an MPI application a few times, but not enough times to justify buying your own cluster? Do you have access to a few PCs, but can&#8217;t or don&#8217;t want to install any software such as Condor on them? Then maybe you could use <a href="http://www.parallelknoppix.net/">Parallel Knoppix</a>.</p>
<p>Parallel Knoppix is a bootable CD for running MPI applications on a network of workstations. It&#8217;s a Linux distribution that executes the common steps for determining hardware and configuring devices. As of this writing, there is no 64-bit version of it, though that may change in the future. The disc image can be downloaded from the project&#8217;s website, or may be purchased from <a href="http://www.linuxcd.org/view_item.php?&#038;id_version=2023">LinuxCD.org</a>.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=264&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2012/05/02/intel-advisor-xe-tool-to-turn-your-serial-code-into-parallel/' rel='bookmark' title='Permanent Link: Intel Advisor XE Tool to Turn Your Serial Code into Parallel'>Intel Advisor XE Tool to Turn Your Serial Code into Parallel</a></li><li><a href='http://insidehpc.com/2006/03/21/what-is-data-parallel-programming/' rel='bookmark' title='Permanent Link: What is data-parallel programming?'>What is data-parallel programming?</a></li><li><a href='http://insidehpc.com/2012/05/16/video-the-future-is-parallel-and-the-future-of-parallel-is-declarative/' rel='bookmark' title='Permanent Link: Video: The Future is Parallel, and the Future of Parallel is Declarative'>Video: The Future is Parallel, and the Future of Parallel is Declarative</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/12/05/what-is-parallel-knoppix/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is Terracotta?</title>
		<link>http://insidehpc.com/2006/12/04/what-is-terracotta/</link>
		<comments>http://insidehpc.com/2006/12/04/what-is-terracotta/#comments</comments>
		<pubDate>Mon, 04 Dec 2006 22:48:51 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/12/04/what-is-terracotta/</guid>
		<description><![CDATA[Terracotta is an open source distributed shared object facility for Java, which allows multithreaded applications to run on clusters with minimal changes. It works with existing application servers and other web platforms, which makes distributing application loads across multiple nodes (JVMs) straightforward. It performs thread synchronization and even thread migration transparently for the user. In [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.terracotta.org/">Terracotta</a> is an open source distributed shared object facility for Java, which allows multithreaded applications to run on clusters with minimal changes. It works with existing application servers and other web platforms, which makes distributing application loads across multiple nodes (JVMs) straightforward. It performs thread synchronization and even thread migration transparently for the user.</p>
<p>In addition to the runtime facilities, Terracotta provides a declarative approach to clustered software. That is, the programmer merely annotates which data members are shared. Likewise, the user may specify which methods contain critical sections, thereby creating a monitor.</p>
<p>The system architecture relies on a central server that stores the state of shared objects. Client nodes (JVMs) receive updates for objects currently in memory; thus, any data transfers occur only at the object level. For fault tolerance, the server itself may be clustered with one live and others in standby.</p>
<p>The <a href="http://www.terracottatech.com/">company</a> behind Terracotta has an open source business model that sells support contracts for enterprise customers.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=263&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2008/12/08/four-paths-to-parallelism-with-java/' rel='bookmark' title='Permanent Link: Four paths to parallelism with Java'>Four paths to parallelism with Java</a></li><li><a href='http://insidehpc.com/2009/09/10/dell-makes-the-case-for-iscsi-storage-in-hpc/' rel='bookmark' title='Permanent Link: Dell makes the case for iSCSI storage in HPC'>Dell makes the case for iSCSI storage in HPC</a></li><li><a href='http://insidehpc.com/2012/04/12/for-performance-keep-file-counts-small/' rel='bookmark' title='Permanent Link: For Performance, Keep File Counts Small'>For Performance, Keep File Counts Small</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/12/04/what-is-terracotta/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is CPUShare?</title>
		<link>http://insidehpc.com/2006/12/03/what-is-cpushare/</link>
		<comments>http://insidehpc.com/2006/12/03/what-is-cpushare/#comments</comments>
		<pubDate>Sun, 03 Dec 2006 18:26:52 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/12/03/what-is-cpushare/</guid>
		<description><![CDATA[CPUShare is a grid computing initiative that pays its participants for providing idle processing time. Unlike BOINC, the provider is selling his time rather than donating it. While there is no word on the actual revenue a seller could reasonably expect to earn, anyone considering this program should consider the cost of electricity for running [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cpushare.com/">CPUShare</a> is a grid computing initiative that pays its participants for providing idle processing time. Unlike BOINC, the provider is selling his time rather than donating it. While there is no word on the actual revenue a seller could reasonably expect to earn, anyone considering this program should consider the cost of electricity for running the software before picturing profits.</p>
<p>It seems like this system is more aimed for buyers in that they can order CPU time without paying for a cluster. However, the buyer must port his code to CPUShare&#8217;s platform. Given the time and money required to use this system, a user may be better served by purchasing an accelerator and porting his software to that, especially since grid computing only works in scenarios where there is lots of computation and little need for synchronizing communication.</p>
<p>As a word of advice for sellers who are contemplating any shared computing program, please anticipate the wear-and-tear that can occur against the disk drive. One work around for this is the create a RAM disk.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=262&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2008/11/26/microsoft-opens-hpc-competency-center-in-russia/' rel='bookmark' title='Permanent Link: Microsoft opens HPC competency center in Russia'>Microsoft opens HPC competency center in Russia</a></li><li><a href='http://insidehpc.com/2009/09/17/paratools-signs-agreement-to-distribute-hmpp-in-north-america/' rel='bookmark' title='Permanent Link: ParaTools signs agreement to distribute HMPP in North America'>ParaTools signs agreement to distribute HMPP in North America</a></li><li><a href='http://insidehpc.com/2007/11/06/rapidmind-revs-multicore-development-platform/' rel='bookmark' title='Permanent Link: RapidMind revs multicore development platform'>RapidMind revs multicore development platform</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/12/03/what-is-cpushare/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>When will Ethernet be able to compete directly with InfiniBand&#8217;s latency?</title>
		<link>http://insidehpc.com/2006/11/28/when-will-ethernet-be-able-to-compete-directly-with-infinibands-latency/</link>
		<comments>http://insidehpc.com/2006/11/28/when-will-ethernet-be-able-to-compete-directly-with-infinibands-latency/#comments</comments>
		<pubDate>Tue, 28 Nov 2006 18:07:41 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/11/28/when-will-ethernet-be-able-to-compete-directly-with-infinibands-latency/</guid>
		<description><![CDATA[I received this question in reference to an article from a few months ago. My paper was about functionality instead of mere performance, though my comments regarding RDMA-based overhead should hint at how poor InfiniBand is for some applications. Many of the benchmarks out there assume that the memory region is being reused and that [...]]]></description>
			<content:encoded><![CDATA[<p>I received this question in reference to an <a href="http://www.hpcwire.com/hpc/951282.html">article</a> from a few months ago. My paper was about functionality instead of mere performance, though my comments regarding RDMA-based overhead should hint at how poor InfiniBand is for some applications. Many of the benchmarks out there assume that the memory region is being reused and that the protection tags can be cached, which isn&#8217;t the case when there are numerous communication partners in the system.</p>
<p>As for 10 Gig E, vendors typically offload TCP onto the card, which takes care of most issues when communicating over the Internet Protocol. The real question is whether 10 Gig E can match InfiniBand for IP-based communication. I believe it already can.</p>
<p>It is certainly possible to tweak an IB app to run faster by using uDAPL in place of Sockets, provided there are few communication partners. Oracle RAC does this by restricting communication to selected pre-determined pairs; that is, there is no free-for-all that one typically finds in open client / server architectures.</p>
<p>Most customers would be served equally well with Ethernet. The reason I&#8217;m pushing that network is that it is much more commodity than InfiniBand. And indeed, we now see that vendors are pushing a hybrid solution, such as iWARP, Myri-10G, and QsTenG. That is, vendors with experience in high-performance computing are building on Ethernet and pushing it for enterprise markets, in addition to their traditional technical markets. The overall goal isn&#8217;t performance (though they certainly are achieving that) but rather price.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=261&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2011/12/27/slidecast-low-ethernet-mpi-latency-over-linux-vfio/' rel='bookmark' title='Permanent Link: Slidecast: Low Ethernet MPI Latency Over Linux VFIO'>Slidecast: Low Ethernet MPI Latency Over Linux VFIO</a></li><li><a href='http://insidehpc.com/2006/03/02/what-is-infiniband/' rel='bookmark' title='Permanent Link: What is InfiniBand?'>What is InfiniBand?</a></li><li><a href='http://insidehpc.com/2010/04/19/infiniband-trade-association-announces-rdma-over-converged-ethernet/' rel='bookmark' title='Permanent Link: InfiniBand Trade Association announces RDMA over Converged Ethernet'>InfiniBand Trade Association announces RDMA over Converged Ethernet</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/11/28/when-will-ethernet-be-able-to-compete-directly-with-infinibands-latency/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is the difference between AMD&#8217;s Stream Processor and NVIDIA&#8217;s GeForce 8800? (Or, is Cray&#8217;s strategy the right one after all?)</title>
		<link>http://insidehpc.com/2006/11/14/what-is-the-difference-between-amds-stream-processor-and-nvidias-geforce-8800-or-is-crays-strategy-the-right-one-after-all/</link>
		<comments>http://insidehpc.com/2006/11/14/what-is-the-difference-between-amds-stream-processor-and-nvidias-geforce-8800-or-is-crays-strategy-the-right-one-after-all/#comments</comments>
		<pubDate>Tue, 14 Nov 2006 22:04:12 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/11/14/what-is-the-difference-between-amds-stream-processor-and-nvidias-geforce-8800-or-is-crays-strategy-the-right-one-after-all/</guid>
		<description><![CDATA[AMD has announced a Stream Processor that comes from its recent acquisition of ATI. The processor is currently available on a PCI Express board and is provided with one gigabyte of dedicated memory. It also comes with the Close to Metal (CTM) interface for software developers. CTM is the target of stream programming platforms such [...]]]></description>
			<content:encoded><![CDATA[<p>AMD has announced a <a href="http://home.businesswire.com/portal/site/google/index.jsp?ndmViewId=news_view&#038;newsId=20061113006468&#038;newsLang=en">Stream Processor</a> that comes from its recent acquisition of ATI. The processor is currently available on a PCI Express board and is provided with one gigabyte of dedicated memory. It also comes with the <a href="http://home.businesswire.com/portal/site/google/index.jsp?ndmViewId=news_view&#038;newsId=20061113006448&#038;newsLang=en">Close to Metal</a> (CTM) interface for software developers. CTM is the target of <a href="http://insidehpc.com/2006/09/21/what-is-stream-programming/">stream programming</a> platforms such as PeakStream and RapidMind, though its open nature allows it be targeted by in-house developers.</p>
<p>The Stream Processor is different from the <a href="http://insidehpc.com/2006/11/10/what-is-cuda/">CUDA</a> technology in  the GeForce 8800 in that the latter has cooperating cores and can therefore run multithreaded applications without stream programming. That is, AMD&#8217;s approach is a vector processor—SIMD—whereas NVIDIA&#8217;s approach is a multithreaded processor—MIMD. (To be precise, a stream processor applies a &#8220;kernel&#8221; of related instructions stored in a cache, whereas a vector processor applies a single instruction stored in a register; for our discussion, the difference is minimal.) This SIMD vs. MIMD divide also appears when comparing ClearSpeed and the Cell BE.</p>
<p>It is interesting to note that the offer of vector processors and multithreaded processors matches Cray&#8217;s <a href="http://www.cray.com/about_cray/vision.html">adaptive supercomputing</a> strategy. (Cray also offers FPGAs, which have been the focus of Celoxica and DRC.) And the CPU behind all of this is the x86; AMD&#8217;s offerings are currently being favored over Intel because of the <a href="http://insidehpc.com/2006/11/03/how-can-we-overcome-bus-saturation-in-multi-core-systems/">direct connect architecture</a>.</p>
<p>Cray might have the satisfaction of being right, but they still need to worry about market penetration before the smugness settles in. The other vendors have the benefit of commoditization, which is the exact force that removed Sun from being the leader in enterprise computing. Third-party OEMs have already announced the inclusion of the Stream Processor at Supercomputing this week. Can Cray keep up with that amount of volume?</p>
<p>One interesting side note I&#8217;d like to close with: while contemplating the SIMD and MIMD issues, I realized that the x86 vendors already have a watered-down version of both of these, namely SSE and multi-core architectures. It appears that Flynn&#8217;s taxonomy still rings true today; everyone is rushing to add these components to CPUs, either on-chip or along-side.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=260&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2007/01/10/the-amd-stream-processor/' rel='bookmark' title='Permanent Link: The AMD Stream Processor'>The AMD Stream Processor</a></li><li><a href='http://insidehpc.com/2006/09/21/what-is-stream-programming/' rel='bookmark' title='Permanent Link: What is stream programming?'>What is stream programming?</a></li><li><a href='http://insidehpc.com/2007/02/13/stream-processors-decloaks/' rel='bookmark' title='Permanent Link: Stream Processors decloaks'>Stream Processors decloaks</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/11/14/what-is-the-difference-between-amds-stream-processor-and-nvidias-geforce-8800-or-is-crays-strategy-the-right-one-after-all/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is CUDA?</title>
		<link>http://insidehpc.com/2006/11/10/what-is-cuda/</link>
		<comments>http://insidehpc.com/2006/11/10/what-is-cuda/#comments</comments>
		<pubDate>Fri, 10 Nov 2006 17:29:59 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/11/10/what-is-cuda/</guid>
		<description><![CDATA[CUDA (compute unified device architecture) is NVIDIA&#8217;s GPU architecture featured in the GeForce 8800. Positioning itself as a new means for general purpose computing with GPUs, CUDA provides 128 cooperating cores. Because the cores can communicate with each other, the GPU can run multithreaded applications without the need for stream computing. Along with this innovation, [...]]]></description>
			<content:encoded><![CDATA[<p>CUDA (compute unified device architecture) is NVIDIA&#8217;s <a href="http://www.nvidia.com/object/IO_37226.html">GPU architecture</a> featured in the GeForce 8800. Positioning itself as a new means for general purpose computing with GPUs, CUDA provides 128 cooperating cores. Because the cores can communicate with each other, the GPU can run multithreaded applications without the need for <a href="http://insidehpc.com/2006/09/21/what-is-stream-programming/">stream computing</a>. Along with this innovation, NVIDIA has released a <a href="http://developer.nvidia.com/object/cuda.html">software development kit</a> that includes a standard C compiler as well as an optimized BLAS library. CUDA may indeed be the final piece needed to make <a href="http://insidehpc.com/2006/10/26/are-gpus-the-next-wave-in-hpc/">GPUs the next wave in HPC</a>.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=259&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2007/07/14/nvidia-releases-cuda-10/' rel='bookmark' title='Permanent Link: NVIDIA releases CUDA 1.0'>NVIDIA releases CUDA 1.0</a></li><li><a href='http://insidehpc.com/2007/10/26/nvidia-releases-cuda-rocks-roll/' rel='bookmark' title='Permanent Link: Nvidia Releases CUDA Rocks Roll'>Nvidia Releases CUDA Rocks Roll</a></li><li><a href='http://insidehpc.com/2009/06/23/pgi-and-nvidia-team-up-on-new-cuda-compiler/' rel='bookmark' title='Permanent Link: PGI and NVIDIA team up on new CUDA compiler'>PGI and NVIDIA team up on new CUDA compiler</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/11/10/what-is-cuda/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How can we overcome bus saturation in multi-core systems?</title>
		<link>http://insidehpc.com/2006/11/03/how-can-we-overcome-bus-saturation-in-multi-core-systems/</link>
		<comments>http://insidehpc.com/2006/11/03/how-can-we-overcome-bus-saturation-in-multi-core-systems/#comments</comments>
		<pubDate>Fri, 03 Nov 2006 16:14:52 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/11/03/how-can-we-overcome-bus-saturation-in-multi-core-systems/</guid>
		<description><![CDATA[Multi-core systems, in combination with specialized co-processors for hefty tasks, are hailed as the future of high-performance computing. In a bus-based architecture, the environment is an SMP in which all of the memory is accessible by all of the processors in the same amount of time. This setup works well for a few cores, but [...]]]></description>
			<content:encoded><![CDATA[<p>Multi-core systems, in combination with specialized co-processors for hefty tasks, are hailed as the future of high-performance computing. In a bus-based architecture, the environment is an SMP in which all of the memory is accessible by all of the processors in the same amount of time. This setup works well for a few cores, but has tremendous trouble for the dozens of cores promised in the future. The resource contention in an SMP is not a new issue; the solution of yesterday is the same for today: NUMA.</p>
<p>In a NUMA architecture, memory regions are aligned with processors, so that some memory accesses take longer than other memory accesses. Of course this setup brings other headaches, such as cache coherence (which really needs to be performed directly in hardware for performance reasons) and data partitioning choices (so that most accesses are for local memory rather than remote). These downsides are usually accepted simply because NUMA is the only way to achieve scalability in systems with many multiple processors, and now many multiple cores.</p>
<p>This is a key difference between AMD&#8217;s and Intel&#8217;s respective strategies. AMD has embraced the NUMA architecture and is proceeding with HyperTransport. Intel may do something similar in the future, but for now is sticking with SMP by using PCI. Because of AMD&#8217;s approach, there are some startups that are creating Opteron computers that rely heavily on HyperTransport. (<a href="http://www.fabric7.com/">Fabric7</a>, <a href="http://www.pantasys.com/">PANTA Systems</a>, and <a href="http://www.liquidcomputing.com/">Liquid Computing</a> also share the fact that they embrace virtualization, which is another blog post altogether.)</p>
<p>So the answer for dealing with bus saturation is to not have a bus at all. That is, multi-core systems require a direct connect architecture. The original vision of InfiniBand was to achieve this, though the bloated spec and the delayed product launches quickly dashed the Trade Association&#8217;s plans for world domination. Perhaps HyperTransport and other less ambitious technologies will be the saviour for multi-core computers.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=258&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2011/07/19/clustermonkey-benchmarks-multi-core-processors-for-hpc/' rel='bookmark' title='Permanent Link: ClusterMonkey Benchmarks Multi-Core Processors For HPC'>ClusterMonkey Benchmarks Multi-Core Processors For HPC</a></li><li><a href='http://insidehpc.com/2012/02/28/multi-core-arm-gets-opencl-compiler-from-the-portland-group/' rel='bookmark' title='Permanent Link: Multi-core ARM Gets OpenCL Compiler from The Portland Group'>Multi-core ARM Gets OpenCL Compiler from The Portland Group</a></li><li><a href='http://insidehpc.com/2008/09/03/nsf-sponsors-multi-core-research/' rel='bookmark' title='Permanent Link: NSF Sponsors Multi-Core Research [UPDATED]'>NSF Sponsors Multi-Core Research [UPDATED]</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/11/03/how-can-we-overcome-bus-saturation-in-multi-core-systems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How are FPGAs programmed?</title>
		<link>http://insidehpc.com/2006/10/27/how-are-fpgas-programmed/</link>
		<comments>http://insidehpc.com/2006/10/27/how-are-fpgas-programmed/#comments</comments>
		<pubDate>Fri, 27 Oct 2006 14:51:30 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/10/27/how-are-fpgas-programmed/</guid>
		<description><![CDATA[As mentioned previously, the greatest hurdle to FPGA adoption is the developer&#8217;s perception of usability. Usually a computer engineer must design the hardware via a description language such as Verilog or VHDL. This process involves defining the transfer of data between registers, which is a distinct departure in the practices most software engineers use. A [...]]]></description>
			<content:encoded><![CDATA[<p>As <a href="http://insidehpc.com/2006/10/26/are-gpus-the-next-wave-in-hpc/">mentioned previously</a>, the greatest hurdle to FPGA adoption is the developer&#8217;s perception of usability. Usually a computer engineer must design the hardware via a description language such as Verilog or VHDL. This process involves defining the transfer of data between registers, which is a distinct departure in the practices most software engineers use. A newer approach is to model the behaviour of the entire system via SystemC, which permits a higher level of abstraction. The C-based approach may make FPGAs more accessible by users.</p>
<p>An Oxford spinout known as <a href="http://www.celoxica.com/">Celoxica</a> has a design suite called <a href="http://www.celoxica.com/products/dk/default.asp">DK</a>, which permits the user to write software in Handle-C. This language is a subset of C with extensions to describe parallelism. DK can generate VHDL or Verilog from the user&#8217;s Handle-C code, thereby making FPGAs just as useable as CPUs. These kinds of tools may come to be essential for further FPGA adoption.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=257&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2011/06/07/colfax-to-boost-financial-trading-performance-with-accelize-fpgas/' rel='bookmark' title='Permanent Link: Colfax to Boost Financial Trading Performance with Accelize FPGAs'>Colfax to Boost Financial Trading Performance with Accelize FPGAs</a></li><li><a href='http://insidehpc.com/2007/09/18/more-fpgas-from-idf/' rel='bookmark' title='Permanent Link: More FPGAs from IDF'>More FPGAs from IDF</a></li><li><a href='http://insidehpc.com/2008/06/02/appro-set-to-demonstrate-xtremedata-integrated-fpgas/' rel='bookmark' title='Permanent Link: Appro Set to Demonstrate XtremeData Integrated FPGAs'>Appro Set to Demonstrate XtremeData Integrated FPGAs</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/10/27/how-are-fpgas-programmed/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Are GPUs the next wave in HPC?</title>
		<link>http://insidehpc.com/2006/10/26/are-gpus-the-next-wave-in-hpc/</link>
		<comments>http://insidehpc.com/2006/10/26/are-gpus-the-next-wave-in-hpc/#comments</comments>
		<pubDate>Thu, 26 Oct 2006 17:20:33 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/10/26/are-gpus-the-next-wave-in-hpc/</guid>
		<description><![CDATA[AMD&#8217;s recent purchase of ATI was accompanied by an announcement that AMD will introduce &#8220;Fusion,&#8221; a combination CPU and GPU intended for general-purpose computing. This is on the heals of the work from PeakStream and RapidMind in the arena of stream programming, which attempts make software development on GPUs easier for non-graphics applications. It certainly [...]]]></description>
			<content:encoded><![CDATA[<p>AMD&#8217;s recent purchase of ATI was accompanied by an <a href="http://arstechnica.com/news.ars/post/20061025-8070.html">announcement</a> that AMD will introduce &#8220;Fusion,&#8221; a combination CPU and GPU intended for general-purpose computing. This is on the heals of the work from PeakStream and RapidMind in the arena of <a href="http://insidehpc.com/2006/09/21/what-is-stream-programming/">stream programming</a>, which attempts make software development on GPUs easier for non-graphics applications. It certainly appears that GPUs are leading the wave in vector processing, which is of course complimentary to multi-core architectures.</p>
<p>For a while it looked like FPGAs would be the major source of hardware acceleration. Indeed, AMD&#8217;s Torrenza initiative is very attractive to vendors like DRC, whose solutions permit Xilinix chips to communicate with the CPU via HyperTransport. However, programming here is different because designers must use a hardware-description language rather than merely port their existing applications. Such a constraint will put off a large number of potential customers. I believe that FPGAs will be relegated to hardware creators who want to test their designs; I do not see FPGAs as the future of HPC.</p>
<p>The Cell BE is another competitor in the accelerator space. IBM is using its own chip as a co-processor for the <a href="http://insidehpc.com/2006/09/10/what-is-roadrunner/">Roadrunner</a> computer. Mercury, which makes a Cell-based 1U, has the MultiCore Plus SDK for programming these processors. I believe the Cell&#8217;s adoption in non-IBM systems (aside from gaming, etc) will be about as wide-spread as Itanium&#8217;s.</p>
<p>The only competitor left in this space is <a href="http://insidehpc.com/2006/07/05/what-is-clearspeed/">ClearSpeed.</a> The Advance requires significantly less energy that vanilla GPUs and is a true vector processor (unlike SSE, etc). My only reservation here is that Advance is being produced by a start-up.</p>
<p>As much as I&#8217;d prefer to see someone like ClearSpeed succeed over GPU-based general-purpose computing, I&#8217;ve seen enough in this industry to understand commoditization, volume, and market penetration. I believe a more likely scenario is that CPU + GPU will indeed become standard in blade-based clusters aimed at technical computing applications. Perhaps Advance will have its own niche, but even Quadrics and Myricom are introducing Ethernet-based high-performance networks as part of their survival strategy. Maybe Advance can target Torrenza and <a href="http://insidehpc.com/2006/09/29/what-is-geneseo/">Geneseo</a>.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=256&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2006/12/28/gpus-and-hpc/' rel='bookmark' title='Permanent Link: GPUs and HPC'>GPUs and HPC</a></li><li><a href='http://insidehpc.com/2007/01/25/conference-on-cells-gpus-and-fpgas-in-hpc/' rel='bookmark' title='Permanent Link: Conference on Cells, GPUs and FPGAs in HPC'>Conference on Cells, GPUs and FPGAs in HPC</a></li><li><a href='http://insidehpc.com/2006/07/05/what-is-clearspeed/' rel='bookmark' title='Permanent Link: What is ClearSpeed?'>What is ClearSpeed?</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/10/26/are-gpus-the-next-wave-in-hpc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is Marlet?</title>
		<link>http://insidehpc.com/2006/10/12/what-is-marlet/</link>
		<comments>http://insidehpc.com/2006/10/12/what-is-marlet/#comments</comments>
		<pubDate>Fri, 13 Oct 2006 00:03:35 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/10/12/what-is-marlet/</guid>
		<description><![CDATA[When computing with really large data sets, such as in the earth or life sciences, it is usually easier to pass the function rather than the data. Marlet is a work-flow language for distributed data analysis; it is based on the principles of functional programming and allows the user to operate while abstracting the underlying [...]]]></description>
			<content:encoded><![CDATA[<p>When computing with really large data sets, such as in the earth or life sciences, it is usually easier to pass the <em>function</em> rather than the <em>data</em>. <a href="http://www.allhands.org.uk/2006/proceedings/papers/595.pdf">Marlet</a> is a work-flow language for <a href="http://www.gridtoday.com/grid/959014.html">distributed data analysis</a>; it is based on the principles of functional programming and allows the user to operate while abstracting the underlying system. The user provides abstract functions that are converted to concrete functions at runtime when concrete data is available.</p>
<p>While similar in spirit to Google&#8217;s MapReduce, Martlet is more general in that it does not require a specific programming methodology. And despite the fact that the research into Martlet was originally geared towards grid computing, it is feasible that it could be applied to other interests in web services or possibly even large corporate data centers.</p>
<p>As a personal note, Martlet was created by <a href="http://web.comlab.ox.ac.uk/oucl/people/daniel.goodman.html">Daniel Goodman</a>, my old officemate at Oxford. He explains that the name for the project comes from the type of bird featured on the crest at <a href="http://www.worc.ox.ac.uk/">Worcester College</a>.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=255&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2006/03/02/what-is-infiniband/' rel='bookmark' title='Permanent Link: What is InfiniBand?'>What is InfiniBand?</a></li><li><a href='http://insidehpc.com/2009/06/26/new-gpu-ified-vector-signal-and-image-processing-library/' rel='bookmark' title='Permanent Link: New GPU-ified Vector, Signal and Image Processing Library'>New GPU-ified Vector, Signal and Image Processing Library</a></li><li><a href='http://insidehpc.com/2006/04/03/what-is-mpi/' rel='bookmark' title='Permanent Link: What is MPI?'>What is MPI?</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/10/12/what-is-marlet/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is Geneseo?</title>
		<link>http://insidehpc.com/2006/09/29/what-is-geneseo/</link>
		<comments>http://insidehpc.com/2006/09/29/what-is-geneseo/#comments</comments>
		<pubDate>Fri, 29 Sep 2006 07:50:58 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/09/29/what-is-geneseo/</guid>
		<description><![CDATA[Geneseo is the codename for an enhancement to PCI Express. Lead by Intel and IBM, this technology will make it easier for co-processor vendors to produce accelerator cards. Among other improvements, Geneseo includes new semantics that reduce the overhead of signaling and synchronization, which makes more efficient interaction between the application software and the accelerator [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.hpcwire.com/hpc/937831.html">Geneseo</a> is the codename for an enhancement to PCI Express. Lead by Intel and IBM, this technology will make it easier for co-processor vendors to produce accelerator cards. Among other improvements, Geneseo includes new <a href="http://www.hpcwire.com/hpc/937980.html">semantics</a> that reduce the overhead of signaling and synchronization, which makes more efficient interaction between the application software and the accelerator chip.</p>
<p>Intel&#8217;s work on Geneseo appears to take aim at blunting AMD&#8217;s work on <a href="http://www.hpcwire.com/hpc/917955.html">Torrenza</a>, which allows accelerators to plug into AMD processor sockets and communicate with the CPU via <a href="http://insidehpc.com/2006/03/07/what-is-hypertransport/">HyperTransport</a>. The concept behind Torrenza is already being used by <a href="http://www.drccomputer.com/">DRC</a>.</p>
<p>It is important to note that Geneseo and Torrenza represent two completely different means of aiding co-processors. The former is an enhancement to the bus architecture, whereas the latter leverages the direct connect architecture. AMD of course still has the HTX expansion slot, but this has not been as widely used as PCI Express. And Intel still has its work on CSI, which should debut any decade now.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=254&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2006/10/26/are-gpus-the-next-wave-in-hpc/' rel='bookmark' title='Permanent Link: Are GPUs the next wave in HPC?'>Are GPUs the next wave in HPC?</a></li><li><a href='http://insidehpc.com/2007/06/19/src-reconfigurable-processors-come-to-amd/' rel='bookmark' title='Permanent Link: SRC reconfigurable processors come to AMD'>SRC reconfigurable processors come to AMD</a></li><li><a href='http://insidehpc.com/2007/09/18/amds-pimps-torrenza/' rel='bookmark' title='Permanent Link: AMD&#8217;s pimps Torrenza'>AMD&#8217;s pimps Torrenza</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/09/29/what-is-geneseo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is stream programming?</title>
		<link>http://insidehpc.com/2006/09/21/what-is-stream-programming/</link>
		<comments>http://insidehpc.com/2006/09/21/what-is-stream-programming/#comments</comments>
		<pubDate>Thu, 21 Sep 2006 19:37:16 +0000</pubDate>
		<dc:creator>Christopher C. Aycock</dc:creator>
				<category><![CDATA[HPCAnswers]]></category>

		<guid isPermaLink="false">http://insidehpc.com/2006/09/21/what-is-stream-programming/</guid>
		<description><![CDATA[A stream is an array whose elements can be operated on in parallel, similar to SIMD computing. In stream programming, data is gathered from memory into a stream, operated on in the stream, and then scattered from the stream back into memory. Memory latency is thus minimized by accessing the data in chunks, similar in [...]]]></description>
			<content:encoded><![CDATA[<p>A stream is an array whose elements can be operated on in parallel, similar to SIMD computing. In stream programming, data is gathered from memory into a stream, operated on in the stream, and then scattered from the stream back into memory. Memory latency is thus minimized by accessing the data in chunks, similar in effect to caching.</p>
<p>This style of computing was popularized by Stanford&#8217;s <a href="http://merrimac.stanford.edu/">Merrimac</a> project, which featured the Brook programming language (an extension to C, actually). At least two start-ups have come from this work: <a href="http://www.streamprocessors.com/">Stream Processors</a>, which will develop signal processors, and <a href="http://www.peakstreaminc.com/">PeakStream</a>, which has just released a software engineering tool known as &#8220;Platform&#8221; intended to simplify development on <a href="http://insidehpc.com/2006/04/25/what-is-a-co-processor/">co-processors</a>.</p>
<p>PeakStream&#8217;s Platform is a combination virtual machine (no kernel modification) and library (a standard C++ API). The VM includes a scheduler that directs operations to the best runtime system, whether it be a CPU, a GPU, or even the <a href="http://insidehpc.com/2006/05/26/do-video-game-consoles-make-good-cluster-nodes/">Cell</a> processor. The library should be easy for anyone familiar with products like MATLAB. Ultimately the goal with Platform is that technical computing customers will obtain much better performance in so-called &#8220;heterogeneous&#8221; systems.</p>
<p>In a way, the combination of PeakStream + GPU resembles <a href="http://insidehpc.com/2006/07/05/what-is-clearspeed/">ClearSpeed&#8217;</a>s own approach, though Advance uses standard BLAS rather than a proprietary library. It is interesting to note that programming solutions for both co-processors and multicore CPUs have appeared recently. Intel is now offering their <a href="http://www.intel.com/cd/software/products/asmo-na/eng/294797.htm">Threading Building Blocks</a> library and Mercury has released their <a href="http://www.mc.com/cell/">MultiCore Plus SDK</a>. The only thing left is better vendor-supported tools for <a href="http://www.highproductivity.org/">distributed-memory programming</a>.</p>
<p>Update: To be fair to competitors, <a href="http://www.rapidmind.net/">RapidMind</a> is a commercial distribution of <a href="http://www.libsh.org/">Sh</a>. RapidMind / Sh is similar to PeakStream / Brook as both use stream programming to target CPU, GPU, and Cell. Pick whichever you prefer.</p>
<img src="http://insidehpc.com/?ak_action=api_record_view&id=253&type=feed" alt="" />

<p>Related posts:<ul><li><a href='http://insidehpc.com/2007/11/06/rapidmind-revs-multicore-development-platform/' rel='bookmark' title='Permanent Link: RapidMind revs multicore development platform'>RapidMind revs multicore development platform</a></li><li><a href='http://insidehpc.com/2006/11/14/what-is-the-difference-between-amds-stream-processor-and-nvidias-geforce-8800-or-is-crays-strategy-the-right-one-after-all/' rel='bookmark' title='Permanent Link: What is the difference between AMD&#8217;s Stream Processor and NVIDIA&#8217;s GeForce 8800? (Or, is Cray&#8217;s strategy the right one after all?)'>What is the difference between AMD&#8217;s Stream Processor and NVIDIA&#8217;s GeForce 8800? (Or, is Cray&#8217;s strategy the right one after all?)</a></li><li><a href='http://insidehpc.com/2009/12/23/amd-releases-new-stream-sdk-support-for-opencl-1-0/' rel='bookmark' title='Permanent Link: AMD releases new Stream SDK, support for OpenCL 1.0'>AMD releases new Stream SDK, support for OpenCL 1.0</a></li></ul></p>]]></content:encoded>
			<wfw:commentRss>http://insidehpc.com/2006/09/21/what-is-stream-programming/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

