Entries filed under “Featured HPC Rock Star”

The latest installment in insideHPC’s exclusive look at the people changing the face of HPC.

Announcing Our Newest Rock Star of HPC: David Bader

At insideHPC, we are pleased to announce that David Bader is our latest Rock Star of HPC.

While HPC tends to focus on compute-intensive problems, Big Data challenges require novel architectures for data-intensive computing. My group has been the first to parallelize and implement large-scale graph theoretic algorithms, which are quite a challenge because of the irregular memory accesses, little computation to overlap with these memory references, and fine grain synchronization. In the past several years, our research has enabled social scientists to analyze some of the largest social networks, detecting communities, finding the proverbial “needle in the haystack”, and “connecting the dots” by identifying central actors hidden in these networks. As you know, data and social media are now torrential streams of information that may provide valuable information to make decisions related to business intelligence, market analysis, and social trends.

Read the Full Story (PDF) or download here if your IT Crowd blocks Dropbox.

Also posted in Rock Stars of HPC | Leave a comment

Rock Stars of HPC: Thomas Schulthess

Our Rock Stars of HPC gallery is growing as we look to a new generation of heterogeneous computing. And when the opportunity came to us to name our first European Rock Star of HPC, one name kept coming up: Thomas Schulthess:

Thomas Schulthess is the director of the Swiss National Supercomputing Centre (CSCS) at Manno. He studied physics and earned his Ph.D. degree at ETH Zurich. As CSCS director, he will also be professor of computational physics at ETH.. He worked for twelve years at the Oak Ridge National Laboratory (ORNL) in Tennessee, a leading supercomputing and research in the US. Since 2002, he led “Computational Materials Science Group” with 30 co-workers.

Thomas Schulthess studied physics at ETH Zurich and earned his doctorate in 1994 with a thesis on metal alloys based on experimental data and supercomputing simulations. He subsequently continued his research activity in the US and published around seventy research papers in the best journals of his field. His present research interests are in the focused on the magnetic properties of metallic nano-particles (nano-magnetism). Using high-performance computing, he is studying the magnetic structures of metal alloys. Of particular interest are his studies on the giant magnetoresistance. He is also a two-time winner of the Gordon Bell Award.

insideHPC: You were schooled as a physicist. What got you interested in high performance computing?

Thomas Schulthess: Physics being the mother of modern science, it is not at all surprising that many researchers in this field are interested in high performance computing – I am no exception. In my particular domain, condensed matter physics and material science, we have a canonical model (the many-body Schrödinger equation) that suffers from the curse of dimensionality. We are therefore constantly looking for better algorithms and more powerful computers to solve the particular problems we are investigating. This is how I got interested in ever more powerful supercomputers, and we had a wave of machines developed at ORNL that helped us tremendously. But when you look around, haven’t most serious players in HPC been trained as physicists? One could even argue that physicists involved in the Manhattan Project started HPC.

insideHPC: You have been involved in so many milestone HPC activities in this community – what would you call out as one or two of
the high points of your career – some of the things of which you are most proud?

Thomas Schulthess: The end-station for computational nanoscience we developed at the Center for Nanophase Materials Sciences (CNMS) at ORNL. We invested heavily in application and algorithm development, and now we have some of the best performing codes on petascale systems that are productive research tools in the user program of the nanocenter. Others have adopted the concept, e.g. the simulation labs in Jülich. In Switzerland we are developing version 2 of this concept, where we are pushing the HPC application development out into the research groups and communities that develop the models and application codes. The response from the application community that is now taking charge in 12 projects makes me confident about the sustained use of supercomputers as scientific instruments.

insideHPC: As Director of the Swiss National Supercomputing Centre you must have extensive administrative responsibilities. Do you still write code?

Thomas Schulthess: The responsibilities are of course much higher than in my previous job, but I have very competent staff to manage operations and I work for an institution with a lean administration that entrusts researchers with the leadership of projects. This means I have to find time to remain active in research and train graduate students. I am expected to develop the user community and the supercomputing strategy that meets their research needs. You have to be an active researcher to be credible for this job, that’s just the way science works in Switzerland. I don’t write big codes myself anymore, but I still lead teams who do – such codes have to be implemented by professionals who are fully committed to the job.

insideHPC: What are your thoughts on how we can attract the next generation of HPC professionals into the community – and provide them with the experience-based training that they will need to be successful?

Thomas Schulthess: We have to focus on the science and engineering problems we solve, discoveries we facilitate, and technologies created with HPC. We have to push productive HPC and maintain a high standard. This will make HPC interesting and attract bright young people to the field. At the same time we have to introduce HPC training into the computational science education at universities. HPC must become part of undergraduate and graduate curricula, rather than being limited to training courses given by computer centers when researchers need access to systems. Creating highly efficient and scalable simulations requires considering HPC from the very beginning of the thought process. This has to be reflected in education.

insideHPC: You were recently quoted as saying: ”Given the remarkable interest in GPU technology from the Swiss computational
science community, it is essential that CSCS adopt this technology into its high-end production systems soon.” Why is it essential for an institution like yours to adopt GPUs in a big way?

Thomas Schulthess: Application developers in Switzerland and elsewhere in Europe are rapidly adopting this technology. Supercomputing has to respond to this trend! Since CSCS has established a record in early adoption of new technologies in high-end computing systems, there are high expectations for us to look at GPU technology. At the same time, it is clear that we will only introduce GPU technology into our main production line of systems, if it can be used productively at scale. This is not yet the case, but I’m quite certain it will happen within a year or two.

insideHPC: What is your favorite way to spend time when you’re not working?

Thomas Schulthess: I spend all my time away from work with my family. We have two growing teenagers that are harder and harder to keep up with. I’m an outdoor person, I love skiing, hiking, sailing and more.

insideHPC: What motivates you? What is your passion?

Thomas Schulthess: Science! I am a physicist, a researcher, like many of my colleagues I love to create machines or systems that allow us to do new experiments and look at nature in ways we have not been able to before. In recent years I had a lot of fun collaborating with peers from other domains to push the envelope with simulation based science in areas outside my own.

insideHPC: You received the Gordon Bell Award for for attaining the fastest performance ever in a scientific supercomputing simulation of superconductors. Was this the same record-breaking code that recently scaled to 1.84 Petaflops on the Tianhe-1A system in China?

Thomas Schulthess: No, the runs on Tianhe-1A were done with a classical molecular dynamics code. DCA++, with which we set the 1.9 Petaflops record in 2009, uses totally different quantum Monte Carlo algorithms today. The efficiency and scalability of the new code is probably higher today, but most importantly, the new algorithms allow us to reach a level of precision not possible with the implementation used in 2008/9 and time to solution has been improved dramatically. In my field, algorithms are still improving faster than computer architectures do. This is why we have to introduce the knowledge about architecture into the communities that develop algorithms.

Also posted in HPC, Rock Stars of HPC | 1 Comment

Rock Stars of HPC: Dona Crawford

In this special feature written by Mike Bernhardt from The Exascale Report, we honor Dona Crawford, the first woman to grace the ranks of our Rock Stars of HPC.


I first met Dona Crawford at SC’95 in San Diego when she was the conference Deputy Program Chair and the HPC Challenge Co-chair.  Two years later, Dona was one of the most visible leaders in the HPC community as the General Chair for SC’97 in San Jose.

I have worked with many top corporate and agency executives during my 23 years in the HPC community, and I have met very few community leaders with the spirit, enthusiasm, and love of life that we see in Dona Crawford.  From her days as one of the original leaders of the Accelerated Strategic Computing Initiative (ASCI) program, a national effort dating back to the early 90s, to her current position as Associate Director for Computation at Lawrence Livermore National Laboratory (LLNL) where she is responsible for a staff of roughly 900, she has built a tremendous following of loyal employees and close friends.  I have heard numerous colleagues refer to Dona as a true leader who inspires and motivates with vision and passion.  She is admired by her employees and peers, respected by her colleagues, and loved by her friends.

It is indeed a great pleasure to acknowledge and introduce you to Dona Crawford – a true Rock Star of HPC.

INSIDEHPC: You have such a rich history in this community and have been involved in so many milestone activities – what would you call out as one or two of the high points of your career – some of the things of which you are most proud?

There are a few career milestones that come to mind.

Dona Crawford (left) and her sister Gail (right) having just finished running/walking Bay to Breakers

Dona Crawford (left) and her sister Gail (right) having just finished running/walking Bay to Breakers

I was one of the original leaders of the Accelerated Strategic Computing Initiative (ASCI), a national effort dating back to the early 90s to provide—at that time—teraFLOPS of computing and the associated environment for nuclear weapons scientists to use computer simulations instead of conducting underground nuclear tests to certify the safety, security and reliability of the stockpile. ASCI (now known as ASC—Advanced Simulation and Computing) signified a paradigm shift in science from test-based to modeling- and simulation-based validation.

In the early days of ASCI, few computing experts believed the program would be able to take high-performance computing from 50 gigaFLOP/s to 100 teraFLOP/s in 10 years. Nobody had broken the one-teraflop barrier at the time, so it was quite a tall order. I would walk into a room with colleagues from other institutions and many would scoff at me. It was uncomfortable, but it turns out there’s something to be said for believing in a “wild” idea and fighting for its success.

ASCI is the result of a very large team of dedicated people from DOE Headquarters, industry, academia, and the labs. ASCI brought together the computing expertise of Livermore, Los Alamos, and Sandia national laboratories and established the framework for advancing computing to where it is today at the labs, with each of the national labs working in partnership with industry to pursue different hardware approaches and applications software. ASCI changed the way the world thinks about computing, and I’m proud of the small role I had in its inception. ASC today continues to push the boundaries of computational science, demanding ever more capability in computing hardware for predictive science.

As part of ASCI, I helped establish the Academic Strategic Alliances Program (ASAP). The idea was to have leading-edge universities work on large, complex, multi-disciplinary problems to validate our simulation-based approach. The Alliances were extremely successful at accelerating new developments in simulation science and high-performance technologies for computer modeling. This type of working relationship is good for the discipline and for the HPC community, and it continues today.

From the beginning, it was clear a critical component of the initiative would be making the supercomputing resources easy to access and use from remote locations. I was part of the team that created the DisCom2 (Distance and Distributed Computing and Communication) strategy, which blended two strategic thrusts. Distance Computing extended high-performance computing to remote sites, while Distributed Computing developed an enterprise-wide integrated supercomputing environment capable of supporting DOE’s science and engineering requirements. DisCom2 took advantage of the ongoing revolution in commodity- and cluster-based high-performance computing, as well as adopting and expanding on the open software approach to cluster computing. I was very interested in seeing DisCom2 come to fruition since I was located at the smaller of the two Sandia National Laboratories’ locations at the time and was leading the network research and development activities.

In 1993, I co-founded InfoTEST (née the National Information Infrastructure Testbed), which was a partnership between academia, private industry, and the national laboratories that was a precursor to the World Wide Web and Internet commerce. I was working at Sandia-Livermore and knew that to be efficient and productive, we had to have a way to access the big computers and computing resources at the other labs (Sandia/New Mexico, Livermore, and Los Alamos). For InfoTEST, I led a group that tied together distributed computing resources throughout the country and then demonstrated the feasibility and effectiveness of national-level access. This development resulted in performance data and practical experience that was critical to the establishment of the Internet. This was also one of the early efforts in network-based computing, which proved the viability of the concept for what would become known as grid computing.

I’ve also had several “firsts” as a woman in computing and management. For instance, I noticed I’m the first insideHPC female “rock star.” I was also the first mid-level and the first top-level technical female manager at Sandia. In the early 80s, I was the first technical female staff member to reduce my workweek to spend more time with my two babies at home. I worked four days a week, and even though I was still putting in 40 hours, it was considered part time. My managers were initially reticent — there was an underlying fear that all mothers would exercise this “reduced workweek” option if it were available and the workforce would therefore be reduced and part of it become less productive. Of course, none of those things happened. I only worked that schedule for four months, but I really treasured having an extra day at home with my children.

I think we’ve made progress in gender equality in computing. To be perfectly honest, I never felt there was a “glass ceiling” that I needed to push through. I just did what I was good at, worked hard, and was rewarded. But it does not go unnoticed that I am still the only woman in the room in many meetings.

INSIDEHPC: What are your thoughts on how we can attract the next generation of HPC professionals into the community – and provide them with the experience-based training that they will need to be successful?

I think it’s a combination of marketing and education. You have to put resources— time and money—into stoking the pipeline, and you need to find a way to communicate the exciting parts of what you’re doing in a way that connects with aspiring young scientists.

It’s never too early to start talking to kids and encouraging their curiosity about science and computing. There’s a great program I’m involved with called Expanding Your Horizons that encourages young women to consider careers in science, technology, engineering, and math. More of these organized efforts directed at young people are needed.

Lawrence Livermore organizes and funds an excellent summer scholar program and postdoctoral research program that is well known in academia. Our students and postdocs interact with world-renowned scientists on new areas of research and are given access to some of the most advanced computing facilities in the world. If you take the time to show the next generation a path—one that is exciting, meaningful, and has staying power—a good number of them will follow it.


INSIDEHPC: What motivates you?  What is your passion?

Hands down, my passion is helping people. Luckily, in my job I get to do that in various ways, not the least of which is the fact that computing technology can transform the way we live and help improve our relationship with mother earth.

I’m passionate about sustainability. Clean air, clean water and low-carbon emitting, sustainable energy are goals of the highest order. The computing community has a tremendous capability at its disposal. We can design a model that reflects the entire earth system, not just its individual parts. And we can present the system in a manner that is understandable and even compelling to the general public. Because the changes to the earth are playing out over decades, it’s hard to comprehend and convey the need for individual and collective change today. We as a nation can’t implement the sort of changes necessary to achieve a sustainable world if we as citizens do not clearly understand the problem.

Science diplomacy is another topic I’m passionate about. I just returned from an eye-opening trip to Saudi Arabia on behalf of CRDF Global. CRDF’s objective is to advance global peace and prosperity through international scientific and technological collaboration. In my opinion, nothing but good can come from nurturing a spirit of science and technology cooperation, supporting opportunities to strengthen research and education in universities abroad, and providing critical benefits to the global community. When researchers interact on objective topics, each subconsciously learns to understand how the other feels about subjective topics. Understanding one another at different levels is what helps promote peace. The bonus is that we can help improve our global standard of living through science and technology.

INSIDEHPC: What “non-HPC” hobbies or activities do you have?  If you ever really have ‘time off’ – how do you spend it?

There is nothing I’d rather be doing than spending time with my family. I have two very successful children who have always been my top priority as well as my greatest joy and source of pride. I also love to cook for and spend time with my extended family, plus I have many good, long-lasting friendships.

Additionally, I enjoy traveling and meeting people from other cultures. In those situations, I try to be respectful of their social mores.

Because I like people, I guess I have a bit of a reputation of trying to get people to “cut loose.” The way I do that is by cutting loose myself. I won’t reveal my secrets—you’ll have to ask others in the community about some of my antics.


INSIDEHPC: Approximately how many conferences do you attend each year?  What would you say is your percentage of travel?

I’m selective about the conferences I attend, but I never miss the SC (Supercomputing) conference.  I’ve been involved as an organizer in some way or another since 1991 and I was the general chair in 1997.  I also try to attend the International Supercomputing Conference (ISC) and the Salishan Conference on High-Speed Computing.

Travel is a big part of my job. I travel about 25% of the time. I’ve been to Washington, D.C. on 11 separate trips so far in 2010. I serve on advisory committees for the National Research Council, the National Science Foundation, and the Council on Competitiveness, and I’m on the board of directors for CRDF Global. I also serve on a number of industrial advisory committees and academic or laboratory review committees. These opportunities all require a commitment to travel.

My current travel schedule is paltry compared to 20 years ago. During the early ASCI years, when I worked at Sandia-Livermore, I was traveling 48–50 weeks a year. I was leading Sandia’s effort to consolidate its Livermore computing operations with its other location in New Mexico, so I commuted by plane every week between Livermore and Albuquerque, with frequent side trips to Washington, D.C. thrown in. For 10 years, I lived mostly in hotels. I was on a first name basis with the hotel and car rental people.

INSIDEHPC: How do you keep up with what’s going on in the community and what do you use as your own “HPC Crystal Ball?”

Mark Seager is my crystal ball.

It’s easy for me to keep up with what’s going on in HPC because I have a staff of 900 absolutely brilliant people moving in many different directions, and I get all that information filtered back to me.

INSIDEHPC: What do you see as the most exciting possibility of what we can hope to accomplish over the next 5-10 years through the innovative applications of HPC?

I’m not sure it’ll happen in 10 years, but there will come a day when all the various tools and technology platforms available—our iPads, cell phones, supercomputers, televisions—will merge into one big knowledge ecosystem. Technology will change our existence in ways that we can’t foresee today. That is exciting. With technology comes knowledge, knowledge breaks down fear, and fear is what causes trouble in the world. I think the most exciting possibility is that technology will help humans become more unified.

INSIDEHPC: What are your thoughts on HPC’s ability to address what many are referring to as “the missing middle” which I loosely interpret as a broad spectrum of small and mid-size businesses. (Is HPC starting to reach a larger audience of people who previously did not have access to it?)

The first thing we have to do is understand the problems that small and mid-size businesses are interested in solving, and then figure out practical ways HPC can address their needs. The next step is forming partnerships to help overcome some of the barriers: make the infrastructure affordable and the codes easy to use. With our history of building out the technology focused on solving specific problems, we can and should provide a bridge to companies looking to do the same. The national labs will always need to push the tip of the pyramid in HPC, but if we don’t help build out the base, there is a danger the tip will topple . Many of the labs, Livermore among them, are working with a wider variety of industrial partners, one at a time, to do just that.

INSIDEHPC: What do you see as the single biggest challenge we face over the next 5-10 years?

Power. We could easily go about making the next big supercomputer by stringing a bunch of components together, but if the resulting machine requires 100MW of power to operate, that’s just not a realistic option. We have to innovate machines that require substantially less power.

INSIDEHPC: Any closing thoughts you would like to share with the HPC community?

From left, Karli, Donas son Josh, Dona, Donas daughter Julia, and Mike

From left, Karli, Dona's son Josh, Dona, Dona's daughter Julia, and Mike

It has never just been about creating the computer with the most superlatives attached to it; it’s the discoveries the machines make possible. Supercomputers have become the backbone of science and technology, and the simulations performed on them will enable virtually every scientific field for decades to come.

For instance, climate modelers have said they need exascale computing capabilities to achieve high-resolution coupled earth system models at 1-km resolution. Having more knowledge about climate change and its effects earlier by even a few years may well be worth a billion dollars.

China’s new supercomputer recently took the world performance lead, and the country’s government and scientists should be applauded for this remarkable achievement. Because I work at a national security laboratory, I tend to think about future challenges in terms of national security. High performance computing and simulation are essential both for national security and for industrial competitiveness in the world economy. China’s major investments in HPC show that they recognize this truth and are willing and able to focus money, energy, and creativity in this direction. There are other similarly focused efforts in Russia, Europe, and Japan.

Technology is at an inflection point where the laws of physics are dictating that we do things differently. The underlying technologies exploited this past decade to build ever-faster supercomputers are facing the end of an era of “easy” gains. Technology will change and everything that technology touches will change.

If we cede our leadership on the hardware side, it’s very likely we’ll also eventually cede our leadership in software, components, and other critical technologies that support cost-effective and powerful server and PC markets all the way down to the cell phone. There is no doubt that these technologies provide distinct advantages to the sponsoring nation.

As a nation, we’ve given a lot away; let’s at least keep our innovation. If we’re going to continue to use HPC as an economic engine for competitiveness in the global marketplace, we need focused and consistent investments in advanced computing technology.

Also posted in Featured Stories, HPC, HPC People, Rock Stars of HPC | 6 Comments

Rock Stars of HPC: Steve Wallach

In the realm of Rock Stars, there are One-hit Wonders, Divas, Boy Bands, American Idols, Crazy Hearts, and Legends. Our insideHPC Rock Stars are clearly an elite group of industry luminaries and thought leaders, but even among this group, few have attained the legendary status of this month’s insideHPC Rock Star.

The old timers in the community of course know Steve Wallach. As co-founder of Convex Computer Corporation, he was well respected throughout the computational science and the investment communities. His technical leadership was chronicled in Tracy Kidder’s Pulitzer Prize winning book, “The Soul of a New Machine.” Convex was eventually acquired by Hewlett-Packard, and Steve took on the role as Chief Technology Officer of HP’s Enterprise Systems Group.

Most recently, Wallach has once again drawn the spotlight as the Chief Scientist, Co-Founder, and Director of Convey Computer Corporation.
Wallach has 33 patents and is a member of the National Academy of Engineering, an IEEE Fellow, and was a founding member of the Presidential Information Technology Advisory Committee. He is the 2008 recipient of IEEE’s prestigious Seymour Cray Award.

It is with great pleasure that we present to you the newest Rock Star of HPC, Steve Wallach.

insideHPC: You have such a rich history in this community and have been involved in so many milestone activities – what would you call out as one or two of the high points of your career – some of the things of which you are most proud?

Steve Wallach: One high point in my career was starting Convex Computer, a company known for its “easy to use, affordable supercomputing” technology. The tagline when Convex started was: “A minicomputer version of a Cray, but program like a VAX.” At the time, very few people believed in the “program like a VAX” part. Today, the compiler technology that Convex developed, with the help of the late Ken Kennedy of Rice University, is considered standard.

The second high point is being able to give something back. I was a founding member of PITAC (Presidential Information Technology Advisory Committee). We helped to increase NSF budgets by hundreds of millions of dollars. Also, I’ve served on various government studies on high-performance computing. I spend lots of time in the greater DC area. One associate even went so far as to suggest that I get an apartment in DC, so it would be easier for me. I politely declined.

insideHPC: What are your thoughts on how we can attract the next generation of HPC professionals into the community – and provide them with the experience-based training that they will need to be successful.

Steve Wallach: That’s a tough one. Perhaps we should follow the lead of Apple’s App Store (or Android): Easily available and easy to use.

First of all, we need to increase the productivity of programmers. This generally means system manufactures need to employ a higher level of co-design (designing hardware and software together). I believe that, as part of all major RFPs, there should be a section on programmer productivity. There’s nothing like losing a bid because of a lack of a productive software environment to result in changes. But innovations in hardware tend to come first. Then, the software is shoe-horned in to get things to work. I refer to this as “Pornographic Programming:” You cannot define it, but you know it when you see it.

insideHPC: What motivates you? What is your passion?

Steve Wallach: When someone says “that cannot be done,” my juices flow. Too many very smart people do not get a chance to really do their thing. Startups are the major places where innovation and risks take place. My latest passion is to figure out how to use Google calendar.

insideHPC: Are there any people who have been an influence on you during your years in this community?

Convey Computer Co-founders Steve Wallach, Bruce Toal and Tony Brewer

Convey Computer Co-founders Steve Wallach, Bruce Toal and Tony Brewer

Steve Wallach: There are three people who have had great influence on me. One person was Ken Kennedy. The high-performance computing community lost a giant and I lost a great friend. He convinced me that a compiler could be developed that could take VAX serial code and produce vector code. And he was correct. When I came up with the idea for Convey Computer, the first person I asked for advice was Ken. I flew down to Houston and spent four hours going over the concept. At the end, he said, “This can be done, but you need a world-class compiler team.” I responded, “I know where they are.” “Where?” Ken asked. I responded: “Right where I left them.”

Another influential person was Alan Deerfield of Raytheon. Alan was a pioneer in the design of DoD specific signal processors. I learned all about: FFTs, Radar Range Gating, Kalman filtering, etc. I worked for Alan for five years in the early ‘70s. He taught me and showed me that small teams of highly motivated engineers are the most fun and accomplish the most. But you have to work hard. When motivated, working hard just comes naturally.

Lastly, Tom West of Data General showed me how to manage a group of highly motivated engineers and how to shelter the team from corporate politics. He was great at moving new products out the door, too. Tom definitely sets the gold standard for how to manage engineers. Plus, he taught me how to use high-dollar words such as “quintessential” and “canard.”

insideHPC: What “non-HPC” hobbies or activities do you have? If you ever really have ‘time off’ – how do you spend it?

Steve Wallach: I like to work out a lot – it’s kind of like training for work and keeps me mentally sharp. When I do have the time, I go to the horse track with my best buddy. I am trying to get back to my college “skill level” when shooting pool. When I can run a rack, again, I will be happy.

Also, I now have a granddaughter. Any opportunity to play with her is number one.

insideHPC: Approximately how many conferences do you attend each year? What would you say is your percentage of travel?

Steve Wallach: Let’s put it this way, in 2011, I will pass the eight-million-mile mark on American Airlines. Of course, that is not real miles traveled – that’s perhaps closer to 3.5 million real miles. And this ignores the miles on Southwest and various European and Asian airlines. My guess is that I attend, on the average, one conference a month.

insideHPC: How do you keep up with what’s going on in the community and what do you use as your own “HPC Crystal Ball?”

Steve Wallach: Attending conferences is certainly one way to see what is happening in the community. Prior to Convey, I was a contractor/consultant to Los Alamos for almost 10 years. That certainly helped me keep up. Also, I still perform due diligence for venture capitalists. Every once in a while there is a HPC type of deal, but those deals are relatively rare.

As previously described, I try to search out the HARD PROBLEMS. That is my crystal ball. And solving these problems may involve all types of technologies, hardware, software, and algorithms. I read a lot. When I find an interesting paper, I often send an email to the authors and begin a dialogue. I never know when I will use that body of knowledge.

insideHPC: What do you see as the most exciting possibility of what we can hope to accomplish over the next 5-10 years through the application of HPC.

Steve Wallach: I believe that HPC coupled with bioinformatics will lead to new ways to deal with all types of medical issues. We are already beginning to see some results. I hope one day, as described in episodes of Star Trek, we will genetically sequence a virus, take this sequence, model the behavior under certain conditions, and then synthesize a drug that hunts down the virus to destroy it.

insideHPC: What are your thoughts on HPC addressing what many are referring to as “the missing middle” which I loosely interpret as a broad spectrum of small and mid-size businesses.

Again, as I mentioned, we need an HPC App store. Any application that will aid the small and mid-size business has to be easy to use and affordable. One would also expect these applications to be available in the cloud. That is already happening. However, the user interface to these applications is different for each cloud. That slows widespread adoption.

insideHPC: What do you see as the single biggest challenge we face over the next 5-10 years?

Steve Wallach: Well, the march toward exascale computing is upon us. The consensus is that getting to exascale will NOT be as straightforward as getting to petascale. I have some thoughts on this and will be presenting those thoughts at several SC10 panels. Clearly exascale will be discussed and debated.

But the absolute biggest challenge is finding a way to get more HPC performance for less watts into our data centers. Sure, that’s related to exascale, but it goes way beyond that. If we’re ever going to solve what we used to call the “grand challenge problems,” we need some way to overcome the laws of physics that we’re facing today. By that I mean general-purpose processors just can’t get much faster because they can’t get any hotter. Today if our HPC users say “we need more performance,” we just add another 30 kilowatt rack on the datacenter floor in an attempt satisfy them.

The only way to do that is with heterogeneous computing, or to be more specific, with application-specific hardware. That means designing instruction sets that are absolutely specific to a particular application or class of applications. Which is why technology like FPGAs and GPGPUs are such a hot topic today—we’re all looking down the road and saying, “Where is this performance going to come from?”

And another thing, related to that, is that we do not need new languages. We need extensions to existing languages (like Fortran and C) that reflect the changes in computer architecture. In general, these are extensions that reflect the memory hierarchy within a node and the hierarchy among nodes. Please do not interpret this to mean I am against language research. I believe the results of this research can be reflected within current languages.

I do have one hot button in this area. I believe that MATLAB (MathWorks) is the easiest to use and the most productive HPC language. I use it all the time on my laptop. Users who are willing to accept a two-to-four reduction in performance (relative to Fortran or C) can gain an order of magnitude more user productivity. So, a great example of application-specific computing would be to build MATLAB machines to eliminate this imbalance.

In my humble opinion, cloud computing is time-sharing in a contemporary architecture. There are many models of cloud computing, thus there is no simple answer to cloud computing delivering real value. For many users and companies, having a shared resource, not having to deal with system administration and facilities is a very big win. In this area, cloud computing provides real value. Additionally, having access to additional resources for spikes in computing needs is a big win.

Also posted in Events, HPC People, Rock Stars of HPC | 2 Comments

Rock Stars of HPC: Bill Kramer

Bill Kramer has spent his career finding, catalyzing, and managing change in HPC. Early in his career he helped field the first, production Unix-based supercomputer, and he has continued to work to design and commission some of the most innovative and successful computers of the past twenty years: during his career he has fielded twenty large supers, 7 of which have been in the top 5 of the Top500. Kramer’s career choices have always drawn him to our community’s leading organizations, places that were changing something fundamental about what it means to be a supercomputer center. But he isn’t about change just for the sake of change: for Kramer it is a way to make sure that he stays fresh, and does the best job he can for the people he is leading, and for the people who use his systems.




He is the kind of leader that the HPC community, and just about everyone else, needs more of: someone focused on service to a community he believes in and on getting the job done for the benefit of all.


Bill KramerToday Bill Kramer is the deputy project director and co-principal investigator for the Blue Waters project at the National Center for Supercomputing Applications (NCSA), at the University of Illinois in Urbana-Champaign. This is ground zero for the first sustained PFLOPS (10+ PFLOPS peak) supercomputing center dedicated to diverse science and engineering; but it’s not really about the computer. Over the past several years Bill and his team have been focused on building the facility and designing a system that, when finally turned on next year, will probably be the largest system for open science in the world. But if you’ve been following what the Blue Waters team has been doing you’ll see that they have taken a radically different approach to the launch of this capability into the community.

Getting the system fielded is only the beginning of their efforts, not the end. The really innovative things that the Blue Waters team are can be seen in their focus on training potential users, evangelizing the machine and its capabilities, and reaching out to new disciplines that should be able to benefit from the capability. In short, they are building a community around the resource: a community of users, architects, administrators, and developers that will work together and support one another once the machine is launched to, hopefully, conduct research that will change the world.

Kramer storing a technology time capsule following S06This is the perfect place for Bill Kramer. In talking with Kramer about his accomplishments, it is clear that he is one of those people who have driven their career paths with a guided purpose. As he describes it, the common thread across all of the places he’s been in his career is that they were all setting the pace for HPC at the time.

William T. C. Kramer, PhD, started his career at the University of Delaware supporting code development for the college of engineering. He helped develop applications and visualize datasets for the college’s various research projects on systems like DEC’s PDP-10 and VAX. Some of this work was on the systems side, working on device drivers and components of the operating system. This was the early days of Unix, and U Delaware was one of the early sites on the ARPANet. This put Kramer in a position to be hands deep in the Unix kernel, making systems work with the new TCP/IP. From there he moved on to systems management, getting exposure to both the human and technical issues in running large systems for scientific users.

After a while at Delaware, Kramer started sending blind resumes out to NASA centers. “I always thought NASA was cool,” Kramer says. NASA Ames was about to field a Cray-2, the first production Unix-based supercomputer in the world, and they needed someone who knew how to run a multi-user computer system and someone with the system experience to make it all work. This was the first of the moves Kramer made into an organization undergoing change. “NASA was building a supercomputing center from the ground up, and it was a very exciting time both in terms of the organization and the technology,” he says.

In fielding the Cray-2, Kramer helped finalize several pieces of software that would eventually become staples in HPC centers around the world, from UNICOS to NQS. Eventually he moved from system engineering to development and then leadership as Ames continued to field supercomputers from Cray (the site actually tried to install an ETA-10, but ended up refusing to accept the system because it never worked). They also started experimenting with MPPs, including TMC and Intel systems, and an early IBM SP. He remembers that one of the big debates they had during his time running the high speed processing group was whether or not to allow interactive editing on the Cray. His position — in favor of interactive editing — eventually won the day, but not for the reasons you’d expect. “We argued that it made more sense in terms of the demand on system resources for users to be able to make small edits to files directly on the Crays, instead of incurring all the overhead of transferring the complete file off and then back on to the system for a small change.”

Kramer was then recruited to NERSC, another organization in the midst of tremendous change. They had just moved from Livermore to Berkeley, and they had set out to become a different kind of supercomputing center. “NERSC was focused on big science — results — rather than on just having lots of users,” he says, and that was a difference that attracted him. NERSC was also one of the first organizations to commit 100% of their production resources (“in with both feet” is how he puts it) to MPP systems in a time when vector was the norm. While at NERSC Kramer contributed to the evolution of the Cray T3E, ultimately becoming Deputy Division Director as he fielded IBM SPs and, most recently, the Cray XT4 known as Franklin, before moving on to NCSA to run the Blue Waters project.

Gates and KramerThroughout all of these very challenging assignments, Kramer has remained dedicated to volunteer service. “These are very symbiotic commitments,” he says. “Certainly the organizations benefit, and I enjoy giving back to the community. But volunteer assignments are a great way to refresh my point of view and to develop new skills that, sometimes, end up helping out professionally.” Kramer says that a lot of what he has learned about managing people has come from experience in volunteer organizations. Over the years he has served in SCUBA organizations and volunteered in schools and community theaters. He also helped start the tutorials effort and graphics special interest group of the Digital User’s Group, and has been active in SIGGRAPH. But people are probably most familiar with his service to the SC conference series, which included a year as General Chair of the Conference in 2005 when he hosted Microsoft Chairman Bill Gates on the stage in Seattle.

“I try very hard to make sure I don’t get staid in my ideas. Volunteering is a great way to learn about yourself, and find new things you like to do that challenge you.”

Kramer is at the point in his career where he has the perspective to identify, and to be proud of, a few key accomplishments. His list is interesting as much for the kinds of items it contains as for the specific items themselves: facilitating the first ab initio turbulence simulation at NASA Ames, and supporting the efforts to return to flight after the Challenger disaster, the first FAA certification of an aircraft change based solely on computation, and discoveries in the search for dark matter.

What is special about this list is that Kramer doesn’t include any of the contributions he made to machines, only to discoveries the machines made possible. Unlike many managers of supercomputing centers, including myself, Kramer has managed to stay connected to the work his machines make possible. “I have always tried to make sure I kept one technical activity to keep me connected to the work that supercomputers make possible.” This, he says, reminds him why he came to supercomputing in the first place, and makes him a better center manager.

In researching this article with Bill’s colleagues and co-workers, I continually received anecdotes of his “boundless energy” and “deep commitment” along with adjectives like “focused”, “tireless”, and “dedicated.” But how does he describe his own contribution? “I think the most value I bring is in making large, complex systems work well so that people can get something done with them.”

And that, in the end, is what an HPC rock star does.


Also posted in Featured Stories, HPC People, Rock Stars of HPC | 5 Comments

Advertisement

Spectra Logig Ad

Video Archive

insideHPC.com is a production of insideHPC, LLC. © 2006-2013 Sitemap