Entries filed under “Compute”

News related to the processors used in HPC gear.

Russia Turns to HPC to Revive National Defense Industry

This week Russian HPC vendor T-Platforms announced it has launched the first phase of a new data center for the “Burevestnik” Central Research Institute designed for modeling samples of artillery weapons with the use of HPC technology. The institute will upgrade to a T-Platforms hybrid supercomputer with 50 Terflops of compute power powered by GPUS, multi-core X86 processors, Panasas storage, and integrated 10GbE technology.

Our Institute makes technically sophisticated products which will reliably serve our forces for many decades in the harshest conditions,” said Georgy I. Zakamennyh, DPhil, Professor, General Director of CNII “Burevestnik” JSC, and Russian Federation Artillery Armament Chief Designer. “Research and development work to design new types of weapons requires costly full-scale tests,” he added. “Over the last decade, CNII “Burevestnik” has created an integrated information framework providing automation of all business processes and information support for the full product life cycle. The use of information systems for enterprise resource planning and product information management has greatly improved the productivity and output.”

Read the Full Story.

Also posted in GPUs, HPC, HPC Hardware, New Installations | Leave a comment

Webinar: Maximizing Performance with AMD’s “Bulldozer” Architecture – April 19

Admin Magazine, ScaleMP, and HP are hosting a webinar on Maximizing AMD Bulldozer Application Performance on April 19, 2012 at 2:00 PM EDT.

Reaching new levels of performance with the AMD Opteron™ 6200 Series processor technology on today’s applications requires an understanding of the underlying architecture and scalable system design. This webinar targets application developers and system administrators who want a deeper understanding of AMD architecture as it relates to HPC workloads. Our discussion will include what you should know; from the operating system, to tools and compilers, as well as software visible features only available on AMD technology. We’ll also describe how vSMP Foundation™ (vSMP) from ScaleMP creates larger memory SMPs from HP ProLiant systems, to scale beyond traditional server designs in a technical computing environment, as we present “Maximizing performance with AMD’s “Bulldozer” Architecture”

Speakers:
Richard Meyer, Sr. Design Engineer – AMD
Alanna Dwyer, HPC Marketing Manager- HP
Benzi Galili, COO, ScaleMP

Register now.

Also posted in Events, HPC, HPC Hardware | Leave a comment

Colfax Goes Cluster Ready with Sandy Bridge and Bright Computing

Colfax International is one of the first vendors out of the gate to have certified Intel Cluster Ready systems based on the new Sandy Bridge Xeon E5 processors. To get customers up and running quickly, the company announced that it will include a free trial of Bright Cluster Manager on its new Intel Xeon E5 processor-based HPC clusters.

Our partnership with Bright Computing is a key step in expanding our HPC product offerings,” said Gautam Shah, CEO of Colfax International. “This partnership provides our customers with powerful, compute-ready HPC systems. Colfax customers can get to work immediately, without the complexity and time-consuming task of setting up and testing their clusters.

Read the Full Story.

Also posted in Collaborations, HPC, HPC Hardware, HPC Software, System Management | Leave a comment

Tuesday Webinar: OpenACC – for Cray Supercomputers

On Tuesday, April 10, Cray and Nvidia will host a webinar on OpenACC support in Cray’s current and future programming environments.

The OpenACC programming model uses directives and compiler analysis to compile your existing C, C++ and Fortran codes for accelerator based supercomputers; this often allows you to maintain a single source version, since ignoring the directives will compile the same program for the X64 CPU, preserving your software development investment.

The webinar starts at 9:00am PDT and will be presented by Dr. James C. Beyer from Cray’s Programming Environment Optimization Group. Register now.

Also posted in Events, GPUs, HPC, HPC Hardware, HPC Software | Leave a comment

Jülich Orders Eurotech Super for DEEP Exascale Research

Germany’s Jülich center has ordered a 1.2 million euro Aurora supercomputer from Eurotech for use in the EU-funded project DEEP (Dynamical Exascale Entry Platform).

The DEEP consortium, led by Forschungszentrum Jülich, proposes to develop a novel, Exascale-enabling supercomputing architecture that takes the concept of compute acceleration to a new level: instead of adding accelerator cards to Cluster nodes, an accelerator Cluster, called Booster, will complement a conventional HPC system and increase its compute performance. While the Cluster is an off-the-shelf component, the Booster will be designed and built by the DEEP project partners using cutting edge technology. Together with a software stack focused on meeting Exascale requirements, comprising adapted programming models, libraries and performance tools, the DEEP architecture will enable unprecedented scalability. In the DEEP consortium Eurotech will be the responsible to assemble and design the cabinets, racks and blades of the system.

The HPC 10-10 Aurora supercomputer is based on the Sandy Bridge Intel Xeon 2600 series processors with 32 Gb per node of memory and a 3D Torus driven by Extoll software. Read the Full Story.

Also posted in Collaborations, Exascale, HPC, HPC Hardware, Network | Leave a comment

Accelerate Your Science on Titan: INCITE Call for Proposals

 
Oak Ridge National Labs has opened up the INCITE Call for Proposals, which allows researchers to submit projects to run on the new GPU-powered “Titan” supercomputer.

Expected to reach 20 petaflops when deployed, Titan will be a ground-breaking new tool for scientists to leverage the massive parallel computing power of GPUs for research and discovery.

The call for proposals is open to researchers from academia, government labs, and industry worldwide. To help you get started drafting a proposal, a “how-to” webinar will be held on Tuesday, April 24.

The submission form is now available.

Also posted in GPUs, HPC, HPC Hardware | Leave a comment

Fujitsu Wins Hybrid Super Deal in Japan – Sparc, x86, and GPUs

By Timothy Prickett Morgan • Get more from this author

Japanese IT conglomerate Fujitsu has been pushing hard to peddle its Sparc and x86 supercomputer clusters to take on IBM, Cray, Silicon Graphics, and others, and not surprisingly Fujitsu it has scored some big deals in the home country. The company just took down a big hybrid Sparc-x86 cluster deal at Kyushu University, which is the dominant educational institution in the southern part of Japan.

Like the 1.13 petaflops “Oakleaf-FX” machine that was just fired up this week at the University of Tokyo, the new machine going into Kyushu University will be compatible with the 10.5 petaflops massively parallel Sparc64 machine known as K that was built by the Japanese government and that has been the most powerful machine in the world for the past year. And like the University of Tokyo, Kyushu University was also a user of IBM’s parallel clusters, in this case a Power6-based system based on Power6 processors that is resold by Hitachi as the SR16000 rated at 25.3 teraflops.

Kyushu University also had a 64-engine VPP5000/64 vector supercomputer made by Fujitsu rated at a comparatively tiny 563 gigaflops and an old PrimeQuest Itanium cluster rated at 10.9 teraflops. The largest machine in its data center is a cluster of Primergy RX200 S6 rack servers with 4,704 Xeon cores that is rated at 50.2 teraflops. Add it all up and this is no big deal in today’s petaflops era.

But Japan, like the United States, China, and Europe, is on an upgrade wave in the HPC data center, and the hybrid Kyushu University machine, which has not yet been nicknamed, will weigh in at a combined 691.7 teraflops across the Sparc and Xeon nodes used for computation, and that is perfectly respectable.

On the Sparc side of the system, Kyushu University is getting eight racks of the PrimeHPC FX10 system, which is a modified version of the K super that uses a 16-core Sparc64-IXfx processor instead of the older eight-core Sparc64-VIIIfx used in the K machine. The Sparc side of the machine will have 768 server nodes with a total of 24TB of main memory and will of course use the “Tofu” 6D mesh/torus interconnect to link those nodes together.

The PrimeHPC FX10 setup will be back-ended with 44 of Fujitsu’s Eternus storage arrays with a total of 576TB of capacity and running the Fujitsu Exabyte File System (FEFS), a variant of the Lustre cluster file system that was designed to span 100,000 nodes. The machine is front end-end by 22 Primergy x86 servers that are used as log-in and management nodes. This part of the hybrid machine will have a peak theoretical performance of 181.8 teraflops.

The bulk of the raw math in the Kyushu University system comes from a cluster of Fujitsu density-optimized CX1000 machines, which were first announced two years ago with the Xeon 5600s. The CX1000 machines were updated in March with CX400 enclosures sporting Intel’s new Xeon E5-2600 processors, and the system has 1,476 two-socket nodes that together have 184.5TB of main memory and deliver 510.1 teraflops of peak performance on double-precision math. This machine is also using FEFS on a cluster of 74 Eternus storage units with a total of 4PB of capacity and is front-ended with 44 Primergy rack servers used to manage access to the cluster.

The whole shebang will presumably run Linux – Kyushu University didn’t say – but there is obviously the option of running Solaris on the Sparc side and Windows HPC Server 2008 on the x86 side.

Later this year, when the machine has been up and running for a bit, Kyushu University will be slapping some GPU coprocessors into the box to boost its performance, very likely through the petaflops barrier and possibly as high as many petaflops. If the university waits for the “Kepler” Tesla GPUs from Nvidia – and those CX400 enclosures have room – then Kyushu University could be sitting near the top of the Top 500 supercomputer rankings by the fall, instead of near the bottom as it is with its current Primergy cluster. ®

This article originally appeared in The Register. It appears here in its entirety as part of a cross-publishing agreement.

Also posted in GPUs, HPC, HPC Hardware, New Installations | Leave a comment

Petaflop Fujitsu Sparc Super Delivers Extreme Efficiency at University of Tokyo

Today Fujutisu announced that the new Oakleaf-FX supercomputer has begun opeation at the University of Tokyo’s Information Technology Center. Based on the SPARC64 IXfx processor (an enhanced version of the SPARC processor used in the #1 K Supercomputer), the 1.13 Petaflop system achieves high energy efficiency with just 1.4 MW overall system power consumption.

The PRIMEHPC FX10 supercomputer system will contribute to advancements in various types of research and development activities by users from both academia and industry,” said Kengo Nakajima, Director, Supercomputing Division, Information Technology Center, The University of Tokyo. “Oakleaf-FX will be used for the HPC education program in the Graduate School of the University of Tokyo for future computational scientists. Oakleaf-FX will be operated so that priority is given to larger-scale jobs. Furthermore, a new project, called “Large-Scale HPC Challenge” is also starting. Members of a group whose proposal has been accepted can occupy all computing nodes (4,800 nodes, 1.13 PFLOPS) of the FX10 system for 24 hours. One proposal is selected per month.”

Read the Full Story.

Also posted in HPC, HPC Hardware, New Installations | Leave a comment

OSU Grad Student Explores Strengths, Challenges of Cray, IBM, and Nvidia

When it comes to benchmarks, your performance mileage may vary. Now an Ohio State University researcher has established some side-by-side performance comparisons that surveying the wide range of parallel system architectures offered in the supercomputer market, .

We explore the parallelization of the subset-sum problem on three contemporary but very different architectures, a 128-processor Cray massively multithreaded machine, a 16-processor IBM shared memory machine, and a 240-core NVIDIA graphics processing unit,” said Bokhari. “These experiments highlighted the strengths and weaknesses of these architectures in the context of a well-defined combinatorial problem.”

Read the Full Story.

Also posted in GPUs, HPC, HPC Hardware | Leave a comment

HPC Startup Silicon Wolves Moves on up to SUNY Incubator

Today the SUNY Fredonia Technology Incubator announced that the Silicon Wolves Computing Society (SWCS) will be joining their incubator program. SWCS is a consumer-friendly high-performance computing system developer and manufacturer of the most advanced workstations and computer gaming solutions on the market. The high-tech start-up company recently relocated to the Incubator from Anaheim, Calif.

We are thrilled to be in Dunkirk and affiliated with SUNY Fredonia through the Technology Incubator,” said Ryan Wolf, President of SWCS LLC. “This is a great community, and a renowned university and we are excited about our opportunities for growth.”

SWCS is a computing solutions manufacturer and integrator that conduct research and development in highly specialized computer workstations, desktops, laptops and servers, and in particular, develop and configure dedicated computing solutions for Cloud, Virtualization, HPC and Reconfigurable Computing environments.

The Incubator is all about attracting companies which are poised for substantial growth, and helping them through the process while providing them access to the resources that a nearly 6,000-student institution like SUNY Fredonia can provide,” said Robert Fritzinger, Director of the Technology Incubator. “I see a strong fit between Silicon Wolves Computing Society, the SUNY Fredonia campus, and the Incubator and I am anticipating a successful partnership.”

Read the Full Story.

Also posted in Accelerators, HPC, HPC Hardware, inside Startups | Leave a comment

Video: Accelerating HPC with Intel Many-Core Technologies

In this video, Intel’s Herbert Cornelius presents: Many-Core Technologies: Towards general purpose, high-performance/throughput and energy-efficient IA/x86 computing.  Download the slides (PDF).

Recorded at the HPC Advisory Council Switzerland Workshop on March 14, 2012.

Also posted in Accelerators, Events, HPC, HPC Advisory Council Workshop, HPC Hardware | 2 Comments

Video: Cray and the Changing Landscape of High Performance Computing

In this video, Peter Ungaro, president and CEO of Cray Inc. presents: The Changing Landscape of High Performance Computing at NCSA on March 14. This talk was co-sponsored by NCSA and the Illinois Parallel Computing Institute.

A Tip of the Hat goes to HPC Guru for pointing us to this story.

Also posted in GPUs, HPC, HPC Hardware | Leave a comment

Dell Taps Mellanox to Accelerate Blade Performance

This week Mellanox announced that Dell is using the company’s 10/40 Gigabit Ethernet and FDR InfiniBand solutions to up performance and flexibility of M620 blade servers.

Mellanox solutions are the core interconnect across data centers integrating servers, storage systems and users with industry leading performance. With today’s announcement, customers can order new Dell M620 blade servers equipped with Mellanox FDR 56Gb/s InfiniBand switch blades and ConnectX®-3 PCIe 3.0 FDR 56Gb/s InfiniBand and 10/40GbE mezzanine adapters. The new Dell PowerEdge blade server and Dell M1000e blade chassis have been designed to take full advantage of Mellanox’s FDR 56Gb/s InfiniBand and 40GbE PCIe 3.0-compliant adapters, which can offer up to double the I/O throughput versus older generation technologies.

Read the Full Story.

Also posted in HPC, HPC Hardware, Network | Leave a comment

Interview: Dr. Boku Looks Forward to HPC in Asia Day at ISC’12

The second annual HPC in Asia conference will take place in Hamburg on June 17 in association with ISC’12. To learn more, I caught up with Dr. Taisuke Boku, Professor Department of Computer Science Graduate School of Systems and Information Engineering University of Tsukuba and the Chair of HPC in Asia Steering Committee.

insideHPC: The first HPC in Asia Workshop was held as part of ISC’11 and drew 90 attendees from Asia, Europe and the U.S. As the Steering Committee Chair for the HPC in Asia Day, what would you say are the main objectives for the conference?

Dr. Taisuke BOKU: The main purpose of the workshop is to share the information on HPC activities in Asian countries with European and U.S. researchers and vendors. Last year was really great for Asian HPC because Japan’s K Computer was ranked as #1 in TOP500 list twice on June and November as the follower to Tienha-1A of China got #1 position on November 2010. These machines were very attractive topics for people around the world and it was so good opportunity to share application activities. I like to continue this movement to announce the Asian HPC activity to the world.

The second purpose is to provide a good opportunity to promote the activities between Asian countries. Of course it has been continued in many conferences within Asia but the higher stage greatly stimulates the researchers especially for young ones to collaborate within/outside Asia. In this year, we will start the poster session for any HPC research and activity in Asia in this workshop, and that is exactly for this purpose.

insideHPC: The new HA-PACS supercomputer at the University of Tsukuba is described as a “demonstration system for parallel computing with Tightly Coupled Accelerators.” What’s new and different with this generation of the PACS machine vs. its predecessor?

BOKU: Basically, the largest difference of HA-PACS from its predecessors is that this is the first PACS system introducing accelerated computing technology. So far, PACS system employed ordinary scalar processors except QCDPAX of which computation node is also equipped with an ASIC for a small vector processing feature. This time, we introduced GPGPU technology as much as we can in the meaning of very compact and dense implementation of multiple GPUs per node. As shown in the question, it is not just a large scale GPU cluster but we will develop our original technology on this machine which is called TCA (Tightly Coupled Accelerators) to enable real GPU-to-GPU communication over node which is impossible today. For this purpose, we have been developing a new chip based on FPGA to use PCI-Express as the communication link between any combination of CPU-GPU, CPU-CPU or GPU-GPU for internal/external.

insideHPC: How much of its computational capability is derived from GPUs?

BOKU: HA-PACS consists of two parts. The primary part is called “HA-PACS Base Cluster” which is now running from February 2012 only with commodity parts as a large scale compact and dense GPU cluster. It has 268 nodes and each node is equipped with two sockets of Intel E5 (SandyBridge-EP) and four of NVIDIA M2090 GPUs. Each E5 CPU has 8 cores to support AVX SIMD instruction with 2.6 GHz of frequency and achieves 166.4 GFLOPS of peak performance. Each M2090 has 512 of CUDA cores to provide 665 GFLOPS (double precision) of peak performance. In total the performance of each node is 332.8 GFLOPS (CPU) + 2660 GFLOPS (GPU) = 2993 GFLOPS in total, very close to 3 TFLOPS/node. As the system, HA-PACS Base Cluster achieves 802 TFLOPS of peak performance. Base Cluster is mainly used for the development of large scale applications and algorithms on various domain sciences in our center.

We will also develop “HA-APCS TCA System” equipped with our TCA technology for experimental platform of this new technology. It will be deployed one year later (March 2013) with approximately 64 nodes with enhanced GPUs which will provides more than 300 TFLOPS of additional performance to its Base Cluster.

In conclusion, we will have more than 1 PFLOPS of peak performance when the TCA System is added to the Base Cluster.

insideHPC: Does the Hybrid architecture give you advantages in terms of power efficiency?

BOKU: Yes, of course. If we just look at Linpack benchmark, we can estimate that the sustained performance of CPU is approximately 85% of peak while GPU achieves approximately 50% of peak. Each E5 CPU of HA-PACS consumes 115W at peak while M2090 consumes 225W. In summary, the power efficiency of CPU and GPU for Linpack is 1.45 GFLOPS/W and 1.48 GFLOPS/W, respectively, so they are very close. But on actual applications, CPUS performance will be largely degraded while we can keep relatively high performance if we carefully select the target and apply an appropriate coding. GPU, of course, is not a magical device to accelerate any application, but for some fitting applications, we can exploit a great performance supported by an array of simple computation cores and high bandwidth of GDR memory, so the sustained power efficiency will be much largely different, I believe.

insideHPC: Is the core mission of HA-PACS tied to Exascale research?

BOKU: Yes, the basic purpose of our research on HA-PACS is focusing on Exascale both on application and system. While HA-PACS with Base Cluster and TCA System achives just around 1 PFLOPS, we will develop a new algorithm and basic code toward Exascale computing based on accelerated computing technology. TCA is a system side research to enable the direct communication among accelerators to apply accelerators to large scale systems with much smaller latency than today’s technology, which will be important to achieve high performance in strong scaling. We think there will be some limit of weak scaling in Exascale era due to memory capacity shortage, computation time to solution, high probability of partial system failure, etc. So the capability for strong scaling where the latency issue is more serious than today is quite important.

Please remind that we don’t think that GPU is not the final solution for Exascale. What we are aiming on HA-PACS research is not “GPU Computing” but “Accelerated Computing.”

insideHPC: What do you think makes ISC’12 an attractive place to meet for HPC researchers?

BOKU: I think this workshop can be the hottest place for information sharing and research promotion for Asian HPC activities. So far, we have been having such an opportunity in U.S in SC series of conference, but I felt we need something more especially for Europe.

ISC is for all over the world of course, but it is a special occasion to contact with a large number of potential researchers in Europe. Through this workshop, we like to make any opportunity for sharing everything between Asia and Europe as well as U.S. on all issues on HPC technology and applications.

Registration for HPC in Asia Day is now open.

Also posted in Accelerators, Events, Exascale, GPUs, HPC, HPC Hardware, ISC12 | Leave a comment

Eurotech Rolls Out Sandy Bridge supercomputer

Eurotech has launched the Aurora HPC 10-10 supercomputer based on the Intel Sandy Bridge processing platform. Optimized for high-density, the HPC 10-10 can thermally manage the top-of-the-lineXeon E5 series processors (the E5-2687W at 3.1 Ghz) in a  standard rack  to deliver 100 Tflops of performance at 110 Kw of peak consumption.

The HPC 10-10 inherits the efficient power conversion of the AU 5600 and allow data center PUEs as low as 1.05. This means great energy savings and together with a very respectable 900 Mflops/W this means also the Aurora high performance computer is one of most efficient and greenest systems in the market.

Read the Full Story.

Also posted in HPC, HPC Hardware | Leave a comment

Advertisement

ClusterStor Ad

View All Videos

insideHPC.com is a production of insideHPC, LLC. © 2006-2011 Sitemap