<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Isaías A. Comprés Ureña</title>
	<atom:link href="http://www.isaiascompres.com/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://www.isaiascompres.com</link>
	<description>Parallel programming and algorithm performance.</description>
	<lastBuildDate>Thu, 21 Mar 2013 17:21:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>4th. MARC Symposium and ARCS Conference</title>
		<link>http://www.isaiascompres.com/?p=6</link>
		<comments>http://www.isaiascompres.com/?p=6#comments</comments>
		<pubDate>Sat, 03 Mar 2012 00:00:53 +0000</pubDate>
		<dc:creator>compres</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.isaiascompres.com/?p=6</guid>
		<description><![CDATA[Although the fourth Intel Many-core Applications Research Community (MARC) symposium and the 2012&#8242;s Architecture of Computer Systems (ARCS) conference took place a few months ago already, I will take the time to summarize the topics that grabbed most of my &#8230; <a href="http://www.isaiascompres.com/?p=6">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Although the <a href="http://www.dcl.hpi.uni-potsdam.de/research/scc/marc/" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://www.dcl.hpi.uni-potsdam.de']);" target="_blank">fourth Intel Many-core Applications Research Community (MARC) symposium</a> and the <a href="http://www.arcs2012.tum.de/" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://www.arcs2012.tum.de']);" target="_blank">2012&#8242;s Architecture of Computer Systems (ARCS) conference</a> took place a few months ago already, I will take the time to summarize the topics that grabbed most of my attention.</p>
<p>The 4th. MARC symposium took place in the Hasso Plattner Institute at University of Potsdam, in Germany.  I anticipate that most of the submissions to MARC symposia in the near future will be about work on the Xeon Phi (the commercial product based on the MIC architecture); in this occasion, all presentations were related to the Single-chip Cloud Computer (SCC).</p>
<p>Although I could not attend the first day of the event due to schedule conflicts, I managed to make it for the second day (barely on time for my presentation) and went through the <a href="http://www.hpi.uni-potsdam.de/fileadmin/hpi/Forschung/Publikationen/Technische_Berichte/tbhpi55.pdf" onclick="javascript:_gaq.push(['_trackEvent','download','http://www.hpi.uni-potsdam.de/fileadmin/hpi/Forschung/Publikationen/Technische_Berichte/tbhpi55.pdf']);" target="_blank">proceedings</a> afterwards.  In summary, plenty of proof was given (as in previous symposia) that both distributed shared memory and message passing programming models can be supported on SCC-like systems.  The runtime systems and library stacks are not necessarily stable enough for production environments, but a lot has been learned from the efforts of the MARC, about the viability of non-coherent systems for the future.  In that sense, I consider the <a href="http://communities.intel.com/community/marc/symposiums" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://communities.intel.com']);" target="_blank">MARC symposia</a> a success for Intel and all researchers involved, and I look forward to future interesting and sometimes unusual solutions presented there.</p>
<p>My presentation at the MARC was about the<a href="http://communities.intel.com/docs/DOC-19246" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://communities.intel.com']);" target="_blank"> improved communication protocol</a> for message passing that was developed at Intel for the RCKMPI2 library.  The aim of the first protocol, released with RCKMPI, was to eliminate the overhead related to TCP/IP on the SCC; this was achieved by communicating through the on die SRAM directly, effectively bypassing the TCP/IP stack.  In the second iteration, the efficiency of message passing for larger MPI jobs was significantly improved, and the design was made flexible enough to allow the use of the Message Passing Buffer (MPB) for multiple types of communication: MPI collectives, MPI point-to-point, process management and inter-core signaling.  In our submission, we talk about the changes and how MPI-2 dynamic processes (basically MPI_Comm_spawn) were supported.  Support for dynamic processes was required for our work on the <a href="http://invasic.informatik.uni-erlangen.de/en/index.php" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://invasic.informatik.uni-erlangen.de']);" target="_blank">invasive programming model</a>, presented in the ARCS 2012 conference.</p>
<p>The ARCS conference was a bit larger event than the MARC symposium.  A lot of topics were discussed; as expected, my attention was caught mostly by presentations about future many-core programming models and observed trends.  In particular, the keynote by <a href="http://www.arcs2012.tum.de/ARCS_Learning_from_Exprimental_Silicon.pdf" onclick="javascript:_gaq.push(['_trackEvent','download','http://www.arcs2012.tum.de/ARCS_Learning_from_Exprimental_Silicon.pdf']);" target="_blank">Sebastian Steibl from Intel Labs</a>, got most of my attention.  He discussed recent developments from researchers at Intel, that resulted in a very low power IA32 core.  What is interesting is that to achieve this low power operation, near threshold voltage (NTV) circuit design was used for the cores.  I had read about it before in <a href="http://www.theregister.co.uk/2012/02/19/intel_isscc_ntv_digital_radio/" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://www.theregister.co.uk']);" target="_blank">The Register</a> and <a href="http://www.anandtech.com/show/5555/intel-at-isscc-12-more-research-into-near-threshold-voltage" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://www.anandtech.com']);" target="_blank">Anandtech</a> websites, from their <a href="http://isscc.org/" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://isscc.org']);" target="_blank">2012&#8242;s International Solid-State Circuits Conference (ISSCC) coverage</a>.  What I had not considered was that threshold voltage is quite sensible to variations related to the manufacturing process.  The consequence is that even in what could be considered a homogeneous many-core architecture, in order to operate at maximum efficiency, the chip may need to be handled as a heterogeneous architecture due to the varying frequencies at which each core reaches its peak efficiency.  Imagine a 256 core CPU with clusters of cores that operate at different frequencies.  Utilizing such a CPU efficiently with today&#8217;s programming models will be challenging, to say the least.</p>
<p>As mentioned earlier, <a href="http://www.springerlink.com/content/372237075231j449/" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://www.springerlink.com']);" target="_blank">our contribution to the ARCS</a> conference was in relation to the <a href="http://www.invasic.de" onclick="javascript:_gaq.push(['_trackEvent','outbound-article','http://www.invasic.de']);" target="_blank">InvasIC protect</a>, in this case with MPI on the SCC.  We went to great length to reduce the latency of process management operations for the SCC, so that applications can re-adjust their process counts at runtime with as low as possible overhead.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.isaiascompres.com/?feed=rss2&#038;p=6</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
