Warning: include(header_insidesc09.php) [function.include]: failed to open stream: No such file or directory in /home/bruekner/public_html/wp-content/themes/insideHPC4_0/category-46.php on line 1

Warning: include(header_insidesc09.php) [function.include]: failed to open stream: No such file or directory in /home/bruekner/public_html/wp-content/themes/insideHPC4_0/category-46.php on line 1

Warning: include() [function.include]: Failed opening 'header_insidesc09.php' for inclusion (include_path='.:/usr/lib/php:/usr/local/lib/php') in /home/bruekner/public_html/wp-content/themes/insideHPC4_0/category-46.php on line 1

Jaguar pulls ahead of RoadRunner for #1 slot on Top500, visualization revisited

Well, its time again for the semi-annual HPC community feats of strength with the release of the fall Top500. You can of course find the list itself here, and the release here, but here are a few things I took away.

Cray’s Opteron-based XT5 system at ORNL took the top slot away from LANL’s IBM RoadRunner system following an upgrade from quad- to six-core, and we now have a 2 PF (theoretical) system on the list

Top500 list logoJaguar, which is located at the Department of Energy’s Oak Ridge Leadership Computing Facility and was upgraded earlier this year, posted a 1.75 petaflop/s performance speed running the Linpack benchmark. Jaguar roared ahead with new processors bringing the theoretical peak capability to 2.3 petaflop/s and nearly a quarter of a million cores.

When the Roadrunner system at Los Alamos first appeared at the top of the June 2008 TOP500 list, it was the world’s first petaflop/s supercomputer. This time around, Roadrunner recorded a performance of 1.04 petaflops, dropping from 1.105 petaflop/s in June 2009 due to a repartitioning of the system. In both November 2008 and June 2009, Jaguar came close but couldn’t dislodge Roadrunner from the top slot.

There’s actually quite a big gap between number one and number two now. Cray and IBM hold slots 1-4, but China now shows up in slot 5 — the highest ranking ever for a system in that country.

Rounding out the top 5 positions is the new Tianhe-1 (meaning River in Sky) system installed at the National Super Computer Center in Tianjin, China and to be used to address research problems in petroleum exploration and the simulation of large aircraft designs. The highest ranked Chinese system ever, Tianhe-1 is a hybrid design with Intel Xeon processors and AMD GPUs used as accelerators. Each node consists of two AMD GPUs attached to two Intel Xeon processors.

And, in terms of chips, 402 systems on Top500 list feature Intel processors, including three in top 10. Intel has come a long way since they were nearly forced off the list in 1999 (when they had only 1% of the Top500).

I don’t typically spill a lot of ink on the Top500 list because so many other people do; if you are hungry for more analysis, Timothy Prickett Morgan at The Register does a particularly good job this year.

Top500 visHere is the bubble chart (click for larger image) from the latest list showing the distribution of Rpeak GFLOPS by country (you can interact with the visualization in realtime at the ManyEyes site). Qualitatively, not much change from the June visualization; the world’s FLOPS are still highly concentrated in just a few countries, nearly all of which are north of the equator (New Zealand, Australia, and South Africa being notable exceptions).

Also posted in Events, HPC, National and Legislative Action, SC09 | Leave a comment

Supermicro brings its first-to-market attitude to SC09

Supermicro logoOne of the companies that is really interesting to keep tabs on in the HPC community is Super Micro Computer, Inc.. You’ll find the company’s motherboards, blades, and custom kit in supercomputers from many of the tier 1 and tier 2 manufacturers in HPC, and they are often the premier launch partner when the chip manufacturers announce new chips. insideHPC talked with Don Clegg, the VP of Supermicro, ahead of the conference next week to find out what the company will be showing, and to get a peek at some of the kit that will be showing up the next generation of supers.

Early to market with Fiorano

Supermicro is platform agnostic, applying their engineering and packaging skills to whatever hardware the company’s customers want to build into systems. In terms of platforms of interest to HPC, Supermicro already has solutions built around Intel and AMD chips, as well as NVIDIA’s GPUs, and those offerings continue to grow. During SC09 the company will be showing hardware featuring AMD’s Fiorano chipset, the first native chipset for the Opteron, and the first time gen 2 PCI-e has been available to Opteron users. This chipset supports today’s Socket F systems, but it will also support the future Magny-Cours DDR3-based chips which use Socket G34, so this solution positions Supermicro for the next announcement from AMD. Supermicro is one of the only companies will a Fiorano offering this year: most everyone else is waiting until next year. This early adoption was key in getting Supermicro’s gear into the new 4,320 core supercomputer (with quad Istanbul sockets per blade) headed for PRACE, where the PCI Express 2.0 features will allow that cluster to support data communications over QDR InfiniBand.

2U Twin, the TwinBlade, and partners make the world go ‘round

Supermicro’s 2U Twin will feature prominently in what gets shown at the show. The 2U Twin, featured in this press release, puts two hot-plug dual-processor (DP) server nodes and redundant power into a single enclosure for high availability. One advantage of this form factor over the more traditional 1U twin is the ability to stuff more gear in with the processors, such as additional hard drives. The 2U Twin is part of the Twin line of products that Supermicro co-developed with Intel. The full line includes 1U, the new 2U Twin, and the 2U Twin2 (four nodes).

New at the show from Supermicro this year will be a new TwinBlade, designed for the power conscious, that doubles the number of compute blades per enclosure. Clegg says that their engineers spent a lot of time getting the thermals and power right to be able to move from 10 blades per 7U to 20 blades, and is also doubling their 14 blade density to 28 blades. These configurations will start shipping at the end of this month.

To highlight just how many people use Supermicro’s gear, the company will be featuring partners in their booth running end-user applications throughout the show. Look for folks like Intel, NVIDIA and the Green500 (showing off a GPU-driven demo), LSI, Atipa, 3Leaf, Bright Computing, and others.

What does Supermicro get out of SC?

A question I had was why Supermicro comes to SC at all, given that their customers are the businesses that build supercomputers for users. Clegg explained that part of it is just to meet with those businesses on common ground. “But our business core is to be first to market, and we have to be first with a product that users want.” Not getting it right the first time would leave a big opening for the second to market to fix Supermicro’s misses and take over the market. To make sure that they continue to get it right, they spend a lot of time talking directly to end users about how they use today’s hardware, and what they want tomorrow.

Also posted in Events, Featured Stories, HPC Hardware, SC09 | 1 Comment

Climate Modeling Research at Argonne National Laboratory

Argonne National Laboratory is one of the U.S. Department of Energy’s oldest and largest national laboratories for science and engineering research. Managed by UChicago Argonne, LLC, for the U.S. Department of Energy’s Office of Science, Argonne supports over 200 research projects in the areas of energy, biological and environmental systems, and national security and operates major experimental and computational facilities for the nation.

“Understanding climate change is an interdisciplinary effort that combines many Argonne research areas,” observes Rick Stevens, associate laboratory director of Computing, Environment, and Life Sciences (CELS) at Argonne. “We are creating the scientific understanding and computational tools required first to understand and then to respond to global climate change at a regional level.”

Climate modeling research at Argonne has three main thrusts. Continue reading »

Also posted in Applied HPC, Featured Stories, SC09 | 2 Comments

Attending SC09? Special software promo for insideHPC readers only

What’s better than free HPC news? Free HPC stuff, of course, and we’re pleased to be able to get you guys and gals the hookup from EM Photonics.

The CULA team at EM Photonics (the CUDA-accelerated LAPACK solution that we’ve covered before) will be giving away 5 copies of CULA Premium ($395 value) during SC09 from Tuesday, Nov. 17th through Thursday, Nov. 19th.  CULA Premium is loaded with a growing list of CUDA-accelerated LAPACK functions in single, double, single-complex and double-complex precisions (see current list at www.culatools.com/versions/premium) and it also comes with a 1-year ticket support and software updates.

How to get your free copy of CULA Premium: Stop by EM Photonics’ booth #3091 (same hall as the NVIDIA booth) and tell one of the CULA Engineers on duty that you are an insideHPC subscriber!  Only 5 copies will be given away on a first come first serve basis!

(Curious to see how fast CULA is, try the free version at www.culatools.com/versions/basic.)

Also posted in Events, SC09, Tools | 1 Comment

State of the Art Climate Modeling Research at NERSC

The National Energy Research Scientific Computing Center (NERSC) is the scientific computing facility for the Office of Science in the U.S. Department of Energy. As one of the largest facilities in the world devoted to providing computational resources and expertise for basic research, NERSC is a world leader in accelerating scientific discovery through computation. NERSC is located at Lawrence Berkeley National Laboratory in Berkeley, California and currently provides supercomputing resources to more than 3,000 users at universities and national labs. Access to NERSC systems is awarded based on scientific need and computational requirements of a particular problem.

NERSC has long been meeting the complex demands of the climate modeling community, hosting a variety of projects involving researchers from DOE laboratories, universities, and other climate modeling laboratories such as the National Center for Atmospheric Research (NCAR), Scripps Institution of Oceanography, and the Geophysical Fluid Dynamics Laboratory. These projects have spanned a wide range of activities that include medium- and high-resolution production global climate simulations, ocean modeling, validation of climate models, comparisons of various climate and weather models, creation and testing of climate codes, and reconstruction of 3-D weather data for the last 150 years from sparse 2-D observational data.

NERSC Director Kathy Yelick is particularly proud of the services that NERSC has offered climate modeling researchers and of the importance of the work to the public at large.

Yelick states, “Simulations are essential to understanding global climate change and supercomputers are vital because of the complexity of the computations. The 2007 Nobel Peace prize that was awarded to the Intergovernmental Panel on Climate Change (IPCC) is recognition of the importance of this work.” Many of the simulations for the IPCC Fourth Assessment Report were carried out at NERSC and researchers are planning runs at NERSC in support of the upcoming Fifth Assessment Report.

Continue reading »

Also posted in Applied HPC, Featured Stories, SC09 | Leave a comment

SC09 Student Cluster Competition: Go Green!

SC09 logoThose of you that have been faithful readers of insideHPC for at least a year might remember our coverage of last year’s SC08 Cluster Challenge. Well, we’re at it again! This year’s challenge has a new name, new theme and a few new twists to the rules.

First, the competitors. Who’s permitted to participate? You won’t find doctoral candidates in the mix. The rules state that teams shall consist of up to six students and a supervisor. Students are classified as those who have not been granted a degree from four-year college or university. This leaves the door open for — you guessed it — high school students. Supervisors are not permitted to provide anything but pizza, snacks and soda: No Technical Assistance.

This year’s SC09 Student Cluster Competition is built around a “Go Green!” theme, tying it in with this year’s show. Just like the previous competition, this year’s rules have capped the overall power requirements of each team’s gear to a pair of 120-volt, 20-amp circuits. Each circuit will have a soft limit of 13 amps. Penalties will be assessed if a respective team trips an alarm on the metered power circuits. Each team’s hardware, along with the metered power units, must fit into a single rack.

The respective teams are not, however, responsible for providing their own hardware. Each team partners with one or more vendors in order to provide the necessary compute and networking gear for the competition. Vendor partners also have the option of providing training and other financial support to assist the teams along the way. The student/vendor teams are permitted to use any hardware platforms, operating systems and software stacks they want as long as they meet the power and space requirements. So, who are the teams?

  • University of Colorado: Captain: Doug Smith; Vendor: Aspen Systems
  • Purdue University: Captain: Preston Smith; Vendor: HP
  • Stony Brook University: Captain: Xaingman Jiao; Vendor: Dell/AMD/Mellanox
  • Arizona State University: Captain: Earl Duque; Vendor: Microsoft/IBM

Now that we have our teams and our gear, lets compute something. What “something?” This year, the workloads have been split into two categories: benchmark runs and application runs. Results of each of the two categories will be announced separately throughout the week of the show. Here is the flavor of HPC goodness for this year’s competition:

  • HPCC (benchmark category): The ubiquitous High Performance Computing Challenge benchmark will be used for the, well, benchmark category. The seven-suite-slurry of synthetic HPC wonderment will be utilized to stress multiple technical aspects of the respective platforms.
  • NWChem (application category): NWChem is a computational chemistry code developed by the nice folks in the Molecular Sciences Group at Pacific Northwest National Laboratory (PNNL). NWChem provides methods to compute the properties of molecular and periodic systems using standard quantum mechanical descriptions of the electronic wave function or density. It also has the capability to perform classical molecular dynamics and free energy simulations.
  • Chombo (application category): Chombo provides a set of tools for implementing finite difference methods for the solution of partial differential equations on block-structured adaptively refined rectangular grids
  • WRF (application category): The Weather Research and Forecasting Model is a next-generation mesoscale numerical weather prediction system designed to serve both operational forecasting and atmospheric research needs.
  • VisIT (application category): VisIt is a free interactive parallel visualization and graphical analysis tool for viewing scientific data on Unix and PC platforms.

This year’s competition includes new participants, new student teams and new vendors. All told, it will be both challenging and rewarding for both the participants and audience.

Also posted in Featured Stories, Green HPC, SC09 | 4 Comments

Moving beyond Pax MPI into the Exascale

The execution model is the machine

As part of our series with some of the big thinkers in HPC today, insideHPC talked with Thomas Sterling (the Arnaud and Edwards Professor of Computer Science at Louisiana State University) about execution models and their role in the next generation of exascale computing.

Thomas SterlingAs Sterling says in his talks, the execution model is the machine: not as a virtual machine, but as the vertical cross-cutting model that binds all layers in to a single operational domain. We routinely interact with aspects of the execution model, ways of looking at the system that embody some subset of fundamental characteristics of a system’s execution model. MPI, for example, isn’t an execution model, it is an API that lets developers create applications that cooperate on a single task in the Communicating Sequential Processes execution model. Likewise with the operating systems and architectures that all manifest other aspects of the CSP model into which most of our computing falls today.

In many respects the execution model is an exercise in formalism, particularly when one looks back and realizes that through all the previous phases of computation our models have adapted to the hardware that designers presented us with, which was itself a reaction to the available technology. We’ve gotten along just fine without a lot of upfront work on models, so why bother now? Continue reading »

Also posted in Computing Research, Featured Stories, SC09 | Leave a comment

UCAR’s Community Climate System Model and Research

UCAR logoThe University Corporation for Atmospheric Research (UCAR) was created in the late 1950s by faculty from 14 leading universities to support and nourish the atmospheric sciences. These visionaries recognized the need for community observational and computational facilities and a world-class research staff, which together would allow the community to carry out complex, long-term scientific programs beyond the reach of individual universities. In partnership with the National Science Foundation (NSF), they established the National Center for Atmospheric Research (NCAR). Since its inception UCAR has managed NCAR, on behalf of NSF, to address pressing scientific and societal needs involving the atmosphere and its interactions with the oceans, land, and Sun—what is now called Earth system science. UCAR now comprises 73 member universities, 21 affiliates and 48 international affiliates.

The Community Climate System Model (CCSM)

UCAR developed and maintains the Community Climate System Model (CCSM), a global climate model developed with funding from the National Science Foundation, Department of Energy (DOE), and NASA. The CCSM provides physical/dynamical/biogeochemical model components of the climate system. Specifically, the components include an atmospheric model (Community Atmosphere Model), a land-surface model (Community Land Model), an ocean model (Parallel Ocean Program), and a sea ice model (Community Sea Ice Model). The CCSM model is available online to anyone in the world. There are various versions of the Climate Model that run on major systems like Cray, IBM, Linux, NEC, Unix but the model developed to run on supercomputer systems.

Dr. Warren Washington, UCAR Senior Scientist, indicates that Community Climate System Model (CCSM) includes a variety of working groups and many university and DOE Laboratory scientists work on the model. There is large CCSM conference each year where as many as 350 people attend from universities, national labs, and government groups. The working groups also meet as needed each year.

“Clearly we have made great progress on climate modeling in the last four decades. This is the time scale that I became involved…starting in 1964. In the early days, the models were quite simple and did not include the complex set of processes that are in present day models. I believe the models are producing simulations that include most of the major features of the climate system such change of seasons, monsoons, jet stream structures, major regional temperature and precipitation patterns and storm systems in both the tropics and higher latitudes. They also the important El Nino and La Nina patterns. This was not the case in the earlier versions of the models…substantial progress has been made in the science of climate modeling” states Washington.

Continue reading »

Also posted in Applied HPC, Featured Stories, SC09 | 1 Comment

Ed Seidel on the state of HPC software

…and how the NSF plans to help catalyze change.

In late September the Chronicle of Higher Education attended the 20th anniversary celebration of the Coalition for Academic Scientific Computation (CASC), and filed a story called Supercomputers Often Run Outdated Software, which included this quote among several others that I took the Chronicle to task for in this posting

Supercomputers keep breaking records for processing speed, but software to operate them has not kept up with that increasingly zippy hardware. The often-rickety supercomputing computer code is becoming an obstacle to making better weather models, medical simulations, and other applications of high-performance computers, said experts at a conference here Wednesday on the future of academic supercomputing.

Ed SeidelAs I said in my response, the ideas of the speakers at the event are sound — for example, Ed Seidel, director of the National Science Foundation’s Office of Cyberinfrastructure, said that legacy supercomputing codes for large scale science are going to have to be “retooled or rethought” to take full advantage of the latest supercomputers. Right on.

But the article around the quotes painted a broad picture of “rickety software” written in languages that are no longer “stylish,” such as (brace yourself) Fortran. I argued that the article is inaccurate and unhelpful in broader context of the absolute criticality of HPC to national and international research agendas.

Following the Chronicle article and my response to it, Ed Seidel agreed to talk with us in detail about his comments, the meeting, and how the NSF is positioning itself to spearhead change in a new generation of scientific software.

Seidel described an NSF that is acutely aware of the central role that computing plays in the process of scientific discovery. As a reflection of the cross-cutting impact of computing on all the disciplines that the NSF supports, Seidel told me the agency supports an Advisory Committee on Cyberinfrastructure (ACCI) comprised of representatives from all the discipline areas of the Foundation. The committee is organized into six task forces

  • Grand Challenge Communities and Virtual Organizations, aimed at helping disparate science communities create products that integrate with each other to solve a total problem (hurricane satellite observations input to atmospheric models whose output goes into wave models, and so on)
  • HPC and Advanced Computing, which covers all aspects of the computing pyramid, not just apex systems
  • Software
  • Data and Visualization, which considers tools, algorithms, and policies for data-driven scientific applications
  • Learning and Workforce Development
  • Campus Bridging, which considers ways to connect researchers on university campuses to remote computing resources

Together these task forces will spend the next 18-24 months conducting a series of workshops and gathering ideas and recommendations before submitting final reports on their focus areas. These reports will be gathered together and serve as input for the NSF’s strategic plan for cyberinfrastructure, which should be complete within the next 3 years.

A point that Seidel was at pains to emphasize is that the Office of Cyberinfrastructure isn’t just about apex computing hardware. This isn’t a recent shift in focus for the agency when it comes to computing — Revolutionizing Science and Engineering Through Cyberinfrastructure, a report published by the NSF in 2003 (also known as the Atkins Report after its chair, Dan Atkins), connected all of these components together into a complete cyberinfrastructure to support science discovery. But the dramatic drop in the costs/FLOPS that our community has enjoyed over the past five to ten years has certainly made it more feasible to equitably distribute available funds among all components, rather than just focusing on the hardware. “The investments are out of balance in terms of hardware and software,” says Seidel as he considers how cyberinfrastructure has been funded. “Hardware is very tangible, and is just easier to fund.”

He notes that there have been 100s of millions of dollars invested in hardware in the recent past, without anything close to that going into software. And hardware certainly used to be a bigger proportion of the challenge than it is going to be in the near future. But as we are prepare for the exascale, software has become a first-class concern on hardware architectures that will support billion-way parallelism.

“Software is really becoming the broader language of science,” says Seidel. “Even broader than mathematics, but we don’t really know how to fund it.” He notes that we have decades of experience funding hardware, and we now have a culture that knows how build and run very large scale datacenters. By contrast, software efforts to date have been very individual, and “there is less and less efficiency in that model” he says. “Software needs to be treated like a first-class citizen. So much is riding on the software side that it is really time to rethink how we build, fund, and maintain it.”

Seidel identifies issues like reproducibility, and bringing software engineering disciplines out of business applications and into scientific software as key issues. Work is also needed on abstraction layers and documentation, and researchers need to be taught as students how to contribute to existing codes.

The NSF at SC09

NSF logoInterested in learning more about what the NSF is doing in software and more? Of course NSF-funded researchers will be contributing to the SC09 program throughout the week, but you’ll also want to consider attending some of the Birds-of-a-Feather session they are organizing this year (all of these are on Tuesday)

  • NSF Strategic Plan for a Comprehensive National CyberInfrastructure (details)
  • Accelerating Discovery in Science and Engineering through Petascale Simulations and Analysis: The NSF PetaApps Program (details)
  • NSF High End Computing University Research Activity (HECURA) (details)

“As we look forward to architectures of the near future,” argues Seidel, “the number of cores in these systems will surpass the number of transistors on the Motorola 68000 processor. Parallelism at that scale completely changes the game. There is a lot of work to be done.”

Also posted in Computing Research, Featured Stories, HPC Software, SC09 | 4 Comments

Rick Stevens talks about the future of supercomputing

I first met Rick Stevens at SC96, although I doubt he’d have cause to remember that meeting. He was a judge in the category of the HPC Challenge that I had entered (heterogeneous computing). We only spent a few minutes together as judge and contestant before he moved on to the other teams, but his comments and questions were perceptive; the meeting made an impression on me.

Rick StevensToday Rick Stevens is the Associate Director for Computing, Environment, and Life Sciences at the DOE’s Argonne National Laboratory. He is also a professor of computer science at the University of Chicago, a senior fellow of the Argonne/University of Chicago Computation Institute, and he heads the Argonne/Chicago Futures Lab (the group that developed the Access Grid collaboration system). And that just scratches the surface (click through for a more detailed look at his bio).

He is widely engaged in thinking about, developing, and teaching a host of technologies, from computer architecture and parallel computing to collaboration technology and virtual reality. All of which come together in his passion for reaching the next milestone in computing history: the exaflops computer.

Argonne is gearing up for their next generation system from IBM. They aren’t ready to talk too much about that system just yet (look for more news closer to SC09), but Stevens would say that they expect the system to be in the 10-20 PFLOPS range. ANL has about half a PFLOPS now, and Stevens says they are looking to this new system as a stepping stone into the exascale regime. “At that size we are within a factor of 50 or 100 of a PFLOPS, and the systems start to look like what an exascale system will look like.”

Mind the gap

Stevens is well aware of the challenges in software and architecture that stand between him and a useful PFLOPS, and he is involved in leading a variety of efforts to help bridge that gap. For example, ANL is building a consortium around their new machine to address the algorithmic and application challenges of scaling out applications into the petascale. This effort builds on the success ANL had with a similar consortium they built around their early Blue Gene in 2004. Stevens says the new consortium will address “how we get application developers and the broader academic community to have momentum addressing the software challenges in exascale,” using the new IBM system as a testbed.

Exascale systems, which Stevens is hoping we’ll attain by the end of the next decade, are likely to have on the order of billion-way concurrency, with a factor of 1,000 or so in the nodes themselves, and about 1,000,000 nodes per system. According to Stevens the recently announced Hybrid Multicore Consortium headed up by Oak Ridge will focus within a node, and the ANL consortium will be looking specifically at how to build applications that can use 1,000,000 nodes, attacking the problem from both the algorithmic and application technology perspective.

Scale invariance

How do we get there? Clearly, a new way of thinking about algorithms is needed in which there aren’t any inherent bottlenecks to scaling arbitrarily large (or small, for that matter) — Stevens likes the term “scale invariant algorithms” for this approach. “The revolution that I’d like to see is that thinking about problems at scale is easier than thinking about them sequentially,” he says.

One approach that may hold promise is to reformulate problems from the perspective of a single entity — a grid point, molecule, or whatever. “We see this kind of design with the 1014 entities in the human body, and that works just fine.” It occurs to me while he’s making this point that programming this way actually has a deep physical analog. One of the side effects of Einstein’s general theory of relativity was that people were able to demonstrate that physics (like politics) is local. Bodies don’t need universal knowledge of all the masses in their vicinity in order to know what kind of path to travel: they simply follow the straightest possible path in their curved space, like the ant walking on an apple who traces out a geodesic without doing anything other than putting one foot straight in front of the other. It took physicists a long time to get to this kind of elegance, and it is both satisfying and humbling to see its potential in computing as well and to wonder where else we’ve missed the boat in our modeling of the real world.

Facing a constrained reality

Developing programs at the entity level may also help address the expected challenges of hardware and software failure in exascale systems. Stevens draws a parallel with current systems that work this way today, like Google and the Internet. “The Internet is always up as a whole,” he says, “but parts of it are always down. Our current hardware model is crystalline, everything has to work all the time in order for anything to work. We need to move from this way of thinking to a more biological approach.”

Part of the design for this way of building at scale may include over-provisioning at many different levels, and with many different kinds of resources. “Systems may have all kinds of resources that programmers can choose to use in their applications,” he notes, “but they can’t use them all at once. Our current notions of efficiency may not make sense when we have to optimize our resource use to stay within a fixed power budget.” This is a different way of thinking than most of us have about our computers today, but it is the mode of thinking that dominates the natural world. “For example, we see this in the plant world,” Stevens explains. “Plants have much more potential for growth than they ever achieve because they have a fixed energy budget available to them that they have to allocate among all the processes of life.”

Wither MPI?

“We’ll definitely have MPI, at least in the early exascale systems,” Stevens answers in response to my question about how we’ll program the very large-scale systems he spends so much of his time thinking about. “A new programming model takes about a decade to take hold, so whatever we have today is what we are going to have for these first exascale systems.” Is this a problem? “The jury is still somewhat out, but I don’t believe that the barrier to achieving exascale computing is the programming model.”

The rest of the picture

Stevens is part of several other efforts that, when taken together, form a holistic approach to reasoning about the design of exascale systems. ANL and Stevens are part of the broader community effort to look at the system software needed for exascale computers, the International Exascale Software Project (the website for this group is quite nice, by the way). The IESP has held two international meetings already, with a third planned for Japan on Oct 19-21. Where the ANL-based consortium is focused on applications, the IESP is more focused on system tools, operating systems, compilers, and the like, Stevens explains. The IESP is interesting because it is a grassroots attempt to coordinate the research efforts of organizations around the globe so that we can all get to effective exascale computing faster than if any one nation tried to do all of the research on its own.

Stevens is also part of a cross-lab effort in the DOE to build the science case for exascale — what good will all of this computing power do once we finally have it in place? Stevens co-chairs this effort with Andy White, which is aimed at building the science case and producing a technical roadmap for the DOE. Over the last year the group has held a series of workshops, called the Scientific Grand Challenges Workshop Series, with practitioner communities in national security, biology, basic energy science, climate science, high energy physics, and other areas central to the DOE mission (for a full list see the web site). Each of the meetings will ultimately result in a report that documents the impact of large scale computational modeling and simulation in the domain area. The presentations and materials from the workshops are already available online, and the reports will be available to the public once they are complete.

Getting there sooner rather than later

When you look at all of these efforts, it is clear that there is a lot of federal government leadership going in to meeting the exascale goal by 2020. Is it necessary? “The vendor community won’t get to exascale without significant federal investment by 2020,” says Stevens. “It will take at least 20 years for industry to get to the exascale on its own.” Why does that matter? Not because we just want to cross an arbitrary computational boundary by some arbitrary date. “Computing at that scale will give us a new way of thinking about doing science,” he says, “and that’s what I find so exciting about this work.”

Also posted in Computing Research, Featured Stories, HPC, HPC People, SC09 | 2 Comments

The U.S. Department of Energy’s Role in Climate Research

Climate research looks at weather patterns and evaluates the interaction of elements such as atmosphere, land surface, ocean and sea ice systems. Climate modeling research uses sophisticated mathematical algorithms and complex calculations on high-performance supercomputers to gain a better understanding of weather patterns and to predict changes in the Earth’s climate.

The U.S. Department of Energy (DOE) is at the forefront of climate research and climate modeling. DOE has a Climate Modeling Program whose mission is to improve climate change projections using state-of-the-science coupled climate and earth system models, on time scales of decades to centuries and spatial scales of global to regional. DOE plays a vital and unique role in the climate modeling enterprise in the U.S., primarily through two offices within DOE’s Office of Science:  the Office of Biological and Environmental Research (BER) as well as the Office of Advanced Scientific Computing Research (ASCR).

Climate Modeling at DOE

The climate modeling program in the DOE Office of Science sponsors projects that develop, test, and apply state-of-the-science coupled climate and earth system models, based on theoretical climate change science foundations. According to Anjuli Bamzai DOE Program Manager, Climate Change Prediction Program, Climate and Environmental Sciences Division, “In order to enable sound decision-making on issues pertaining to future energy use and technology options, credible high-resolution climate change simulations are required at a regional scale. To achieve such high-resolution simulations, both the accuracy and throughput need to be dramatically increased; thus the climate modeling activity takes advantage of emerging high performance computing (HPC) and information technologies. An example of climate modeling using HPC and supercomputers can be found in the DOE Leadership-class Computing Facility.”

Why Climate Modeling is Important

Continue reading »

Also posted in Applied HPC, Featured Stories, HPTC, SC09 | Leave a comment

Opportunities and challenges in HPC: a conversation with Stan Ahalt

Note: this article is part of the inside SC09 series of features we are running in the weeks before SC09 this November. In this series of articles we are talking with some of our community’s leading thinkers about what they are doing today, where they are headed tomorrow, and the outlook for HPC. It is our hope that this series will help inform your own view of HPC’s future, and make your time at SC09 more productive and rewarding.

Stan AhaltStan Ahalt is still excited about what he does, a fact that still surprises (and inspires) me every time we talk. This energy probably explains a good bit of the success he has enjoyed in his career. Now the head of the Renaissance Computing Institute (RENCI) in North Carolina, Ahalt was until recently executive director of the Ohio Supercomputer Center, and before that a professor in the department of electrical and computer engineering at The Ohio State University for 22 years. While at OSC he started the Blue Collar Computing initiative, a national program aimed at bringing HPC to nontraditional users in business and industry, and OSCnet, a high-speed research network for K-12 schools, higher education and economic development.

On the national stage he is the chair of the Coalition for Academic Scientific Computing, co-chair of the Ohio Broadband Council, extramural member of the National Cancer Institute’s Advanced Biomedical Computing Center’s Oversight Committee, and member of the Council on Competitiveness High Performance Computing Advisory Committee.

Looking ahead at RENCI

When I spoke to him in late September, he had just moved in to his offices at RENCI. RENCI is facing some tough times these days: a 35% budget cut was announced just days before he took the job, a result of a decline in state revenues resulting from the tough economic climate.

The $3.8M cut (the media was inaccurately reporting $11M for a while) forced the elimination of 23 jobs, but Ahalt was aware of the possibility of cuts before he accepted the position, and the challenge hasn’t slowed him down. “RENCI is an assemblage of remarkable resources,” he says. “And we have had consistent state support even though the budget was cut. This was just a reaction to the current financial reality; lots of budgets got cut.” He describes RENCI as having incredible opportunities for the future, all driven by “the incredibly talented people at RENCI.” But his resources are now less than they once were. His reaction? Focus.

Ahalt says that RENCI is going to focus on two key areas of its mission. The first, environmental planning and emergency preparedness, is an area that Ahalt sees as already world class at the Institute. This work includes topics as diverse as storm surge analysis, fire prediction, flood plain studies, and many others. The second area, health care and medicine, is a growth opportunity for RENCI, and capitalizes upon the wealth of research being done by its neighbors, including Duke and UNC Chapel Hill.

Our biggest challenge? Software.

Our conversation about the key challenges and opportunities facing our community was broad. Our leading challenge? Software. “Since 1999,” he says, “all of the PITAC and PCAST reports have emphasized software investment needs. But during that time software funding has steadily declined relative to hardware funding.” Where are the main opportunities here? He points out that ISV codes still don’t scale well past a small number of processors, and we have substantial needs in middleware and “ease of use” systems like portals. According to him the consensus at the recent CASC meeting was that large-scale application codes will probably need domain-specific languages. “DOD and DOE are pretty effective today,” he says, “but small and medium-sized companies need help. We need to push HPC into new places.”

What about software needs specific to exascale systems? Interestingly, he shares the circumspect attitude Justin Rattner had when I talked to him last week about this same issue. In particular, he is impressed at the current performance of the non-application suite of tools that power today’s petascale systems. “We are having problems,” says Ahalt, “but not nearly as many as we thought we’d have.”

Workforce of tomorrow

Stan AhaltAnother key area of concern for Ahalt is HPC workforce development. But not at the high end: “We are doing OK on staff to take care of the very large scale HPC installations,” he says. “But we need to focus on building a large, well-trained community of users for the ‘everyday’ HPC. This is a national competitive issue. We lead in cycles, but we don’t have the minds to use them.”

This has been a topic of conversation for years now, and I was curious to know what approach he thought would work, and why we haven’t already made more progress. My computational engineering degree — still a new discipline in the early 90s — was supposed to have addressed some of these problems, but it hasn’t. “We probably need a national level effort,” he says. “We know how to fund hardware, but we haven’t really tried hard enough yet to pull people into computational science.”

Data — lots and lots of data

A third area in which he sees both challenges and opportunities for our community is in finding, storing, labeling, and exploring the hundreds of exabytes of data modern society produces each year. My own recent experience as a data center manager is that much of the science and engineering community isn’t “managing” their data at all. They simply write it all, every byte they generate, to vast archival storage systems. But without the tools to label, search, and dig into their growing archives, the reality is that most of what they write is never accessed again.

As Ahalt sees it, this free ride has to come to an end. “I think we are fooled by our personal experiences with our laptops and desktop computers that says that we can always buy another terabyte to store the extra data,” he says. “Computer cycles are a mechanism to get more actionable information than we started with,” and engineers and scientists who create massive amounts of data need to seriously examine when the whole dataset needs to be saved, when only parts of it should be stored, and when it is simply more cost effective to recalculate if data are needed again in the future.

He does see some hope that other factors in the computing community may drive the changes in user behavior needed to begin managing the data deluge. “Data movement has always been a problem, but as some classes of users shift to cloud computing they will be dealing directly with data cost and retention issues. This could be a good thing,” finally exposing the implicit costs of large scale data storage and management.

Hosted computing

Hosted computing is most commonly lumped in with the term “cloud computing” these days, but the idea is old. Some types of capacity-oriented HPC jobs will run quite satisfactorily on clouds that are designed specifically for this kind of work (like Penguin’s POD offering; there is already mounting evidence of the inappropriateness of general and highly-virtualized cloud solutions like Amazon’s EC2 for HPC work).

In terms of this capacity work Ahalt feels that “a dramatic percentage of this type of HPC will fall naturally into a cloud computing model.” This is not to say that everything will migrate to the cloud: “We will still need [NSF] Track 1 and Track 2 machines. But what about a ‘Track 3?’ There isn’t one, but we still need more workhorse machines.”

The greater good

As Stan Ahalt gets ready to lead RENCI into the next phase of its life, he is also preparing to help lead the HPC community through these, and other, challenges. I asked him to describe his role at RENCI in this context. For the state of North Carolina, Ahalt is the go-to guy for HPC. But as RENCI’s new director he is focused on helping the Institute meet its mission of making a difference for society through technology. That will mean wrangling desktops, GPUs, clusters, supercomputers — and many other technologies — into the service of the greater good.

Also posted in Events, Featured Stories, HPC People, SC09 | 1 Comment

Interview With An HPC Pioneer

From the first teraflop to the 3D Internet: an exclusive profile with Intel CTO Justin Rattner

Intel CTO Justin RattnerIntel’s CTO Justin Rattner will be giving the opening address at SC09 in Portland next month, and he’ll be talking about the 3D internet and his own view that this could be an inflection point — where HPC “goes consumer.” But Rattner’s appearance at SC this year isn’t just about the 3D Internet, and his selection wasn’t just about picking one of the most influential suits in the tech world. As one of the pioneers of the modern era of parallel supercomputing, he is an ideal choice as the person to deliver the opening address for SC09.

Rattner’s career in HPC goes all the way back to the introduction — and eventual proof — of the idea that supercomputers should be built out of thousands of smaller processors rather than a few very powerful processors. Rattner led the collaboration that built the Touchstone Delta, and then the machine that was the first in the world to top one trillion floating point operations per second on the Linpack: ASCI Red. In this exclusive insideHPC feature interview, we talk with Rattner about his career to get a view of where HPC has come from, and where it is going next. Continue reading »

Also posted in Computing Research, Events, Featured Stories, HPC, HPC People, SC09 | 7 Comments

Al Gore to keynote SC09: an exclusive interview with SC09 chair Wilf Pinfold

SC09, the HPC community’s largest technical conference, trade show, and annual homecoming, has announced that Al Gore (yes, the Nobel Prize winning, Academy Award winning, US Vice President-ing, Albert A. Gore, Jr.) will keynote the conference this fall. And you thought that Michael Dell drew a big crowd last year. I talked with SC09’s General Chair, Dr. Wilfred Pinfold, about the selection of Gore and how he fits into this year’s conference.

“We try to pick speakers every year that will stimulate people’s thinking in a number of areas,” explains Pinfold. Previous keynote speakers have included entrepreneurs Bill Gates and Michael Dell, biologist J. Craig Venter, and inventor Ray Kurzweil.

This year the conference is focused around the theme of Computing for a Changing World, with program elements that highlight the supercomputing community’s contribution to the search for new forms of energy, understanding weather and climate change, and the technologies that will help our society build a more sustainable future. The conference has even adopted a sustainable philosophy for its own operations this year, and is looking for ways to make the event more eco-friendly. Continue reading »

Also posted in Events, Featured Stories, SC09 | 14 Comments

Warning: include(sidebar_insidesc09.php) [function.include]: failed to open stream: No such file or directory in /home/bruekner/public_html/wp-content/themes/insideHPC4_0/category-46.php on line 46

Warning: include(sidebar_insidesc09.php) [function.include]: failed to open stream: No such file or directory in /home/bruekner/public_html/wp-content/themes/insideHPC4_0/category-46.php on line 46

Warning: include() [function.include]: Failed opening 'sidebar_insidesc09.php' for inclusion (include_path='.:/usr/lib/php:/usr/local/lib/php') in /home/bruekner/public_html/wp-content/themes/insideHPC4_0/category-46.php on line 46

insideHPC.com is a production of insideHPC, LLC. © 2006-2011 Sitemap