Entries filed under “System Management”

News related to batch schedulers, installers, and operating systems

Video: Managing and Monitoring a Scalable Lustre Infrastructure

In this video from the Lustre User Group 2013, Makia Minich from Xyratex presents: Managing and Monitoring a Scalable Lustre InfrastructureDownload the slides (PDF) or check out our LUG 2013 Video Gallery.

During his talk, Makia mentions an excellent presentation from John West entitled What’s Missing from HPC.

Also posted in Events, HPC, HPC Hardware, HPC Software, LUG, Lustre, Storage, Video | Leave a comment

Video: Blue Waters and Resource Management – Now and in the Future

In this video from Moabcon 2013, Bill Kramer from NCSA presents: Blue Waters and Resource Management – Now and in the Future.

View the Slides on Slideshare or check out the Moabcon 2013 Video Gallery.

Also posted in Compute, Events, HPC, HPC Hardware, Moab, Moab.con, Video | Leave a comment

Video: Productive Parallel Programming for Intel Xeon Phi Coprocessors

In this video from Moabcon 2013Bill Magro from Intel presents: Productive Parallel Programming for Intel Xeon Phi Coprocessors.

View the Slides on Slideshare or check out the Moabcon 2013 Video Gallery.

Also posted in Co-processors, Events, HPC, HPC Hardware, Moab.con, Video | Leave a comment

Video: Using GUI’s to Open Up Your HPC Environment

In this video from Moabcon 2013, Wil Wellington from Adaptive Computing presents: Using GUI’s to Open Up Your HPC Environment.

In this session we will demonstrate how to start migrating away from using the CLI to using GUI’s via MWS to interact with Moab. We will desmontrate how easy it is to make the jump and open up your HPC environment for more projects and departments in your organisation, as well as making life easier for your System Administrators.”

View the Slides on Slideshare or check out the Moabcon 2013 Video Gallery.

Also posted in Events, HPC, HPC Software, Moab, Moab.con, Video | Leave a comment

Univa Grid Engine Steps Up to Intel Xeon Phi with Version 8.1.4

Today Univa released the latest version of Univa Grid Engine. With cross-platform support, Release 8.1.4. of Univa Grid Engine includes a number of customer-driven enhancements:

  • Improved Load collection tool for Intel Xeon Phi coprocessors
  • Extended memory usage metrics for Multi-Threaded applications
  • Scheduler performance enhancements ensuring maximum number of jobs running in the cluster while improving system responsiveness
  • Interactive Univa Grid Engine jobs can now set their memory affinity

Our latest Univa Grid Engine version 8.1.4 has been completely customer driven and is the largest update of the last 10 months,” said Fritz Ferstl, CTO Univa Corporation and father of Grid Engine. “We are leading the industry right now in converged infrastructures supporting Big Data and Big Compute, and our customers rely on Univa Grid Engine to manage mission-critical applications – so we make sure to always stay close to them in order to support their needs.”

Read the Full Story or check out our recent podcast interview with Fritz Ferstl on the Grid Engine State of the Union.

In related news, Univa is partnering with us at insideHPC on a Technical Computing User Survey. Please participate. We’ll share our results and you’ll get a chance to win valuable prizes.

Take the Survey.

Also posted in Grid Engine, HPC, HPC Software | Leave a comment

Altair and HP White Paper: An Integrated Approach to Workload and Cluster Management

Altair’s PBS Professional® is now fully integrated with HP’s Insight Cluster Management Utility (CMU), automating 80% or more of administrator’s cluster and workload management tasks.

Setting up a cluster can be challenging for a system administrator. The burden of installing a complete software stack across hundreds or thousands of compute nodes, combined with the pressure to have the system ready for production quickly (sometimes in days or hours), results in a highly complex project that can be daunting to even the most seasoned administrators.

The white paper walks through the integration between PBS Professional and Insight CMU, explains how the concept of a “connector” unites these two tools to simplify cluster setup and job execution, and provides instructions for performing key PBS Professional tasks within Insight CMU.

Key benefits to administrators/users include:

  • Improved visibility into utilization and performance metrics
  • Improved access to job data (e.g. jobs on cluster, reservations on cluster…)
  • Simplified management (admins can easily perform numerous tasks without needing to open a terminal window)

Download the white paper and learn more how the CMU PBS Professional Connector provides seamless integration between PBS Professional and CMU, automating common tasks to simplify cluster setup and job execution for users and administrators.

Also posted in HPC, HPC Software | Leave a comment

Video: Management of large-scale GPU Clusters

In this video from the HPC Advisory Council Switzerland Conference, Axel Koehler from Nvidia presents: Management of large-scale GPU Clusters. Download the slides (PDF).

Also posted in Events, GPUs, HPC, HPC Advisory Council Workshop, HPC Hardware, Video | Leave a comment

Adaptive Computing’s MoabCon 2013 Coming to Park City

Today Adaptive Computing announced that its annual user conference, MoabCon 2013, will be held April 8-11 at the Hyatt Escala in Park City, Utah. Sponsored by Intel, HP, and IBM, the conference will bring together Adaptive Computing customers, partners and industry experts for four days of talks, technical sessions, and networking events.

MoabCon brings together the industry’s best minds to discuss how to optimize HPC and cloud resource management and utilization,” said Rob Clyde, CEO of Adaptive Computing. “In addition to discovering the myriad ways Moab solves today’s complex management problems, attendees will have the unique opportunity to interact with industry experts and discuss the latest issues and trends.”

MoabCon will feature end-user speakers from LBNL, NERSC, ORNL, NCSA, and the University of Tennessee at Knoxville. Read the Full Story and Register Now.

Also posted in Events, HPC, HPC Software, Moab.con | Leave a comment

Video: Introduction to SLURM at CSCS

In this video, Neil Stringfellow from CSCS presents an overview of the SLURM workload manager. This course organized by Swiss National Supercomputing Centre in Lugano, Switzerland.

In related news, HPC Advisory Council Switzerland Conference returns to Lugano March 13-15, 2013.

Also posted in HPC, HPC Software, Video | Leave a comment

Altair Simplifies Workload Management for HP Insight CMU

Today Altair announced the CMU PBS Professional Connector, which is designed to simplify workload management for HP Insight CMU users. Using the Connector, administrators can automate most common tasks for managing a PBS Professional cluster via CMU.

With the CMU PBS Professional Connector, cluster administrators have a much easier way to monitor and manage cluster nodes and jobs,” said Bill Nitzberg, chief technology officer for PBS Works at Altair. This important integration automates the most common cluster administration tasks so system administrators can spend as much time as possible on higher-value aspects of managing their HPC resources.”

Read the Full Story.

Also posted in HPC, HPC Software | Leave a comment

Provisioning Tips for Cluster Software Packages

Over at HPC Admin, Douglas Eadline writes that adding and removing software from a running cluster is not as difficult as it used to be.

Regardless of the provisioning system, the goal is to make changes without having to reboot nodes. Not all changes can be made without booting nodes (i.e., changing the underlying provisioning); however, many application packages can be added or removed without too much trouble if some simple steps are taken.

Read the Full Story.


Also posted in HPC, HPC Software, Tools | 1 Comment

Slidecast: Grid Engine for ARMv7 Opens the Door for Energy Efficient Datacenters of the Future

In this slidecast, Gary Tyreman from Univa discusses the new Univa Grid Engine for ARMv7 Release.

Driven by the demand for new datacenter services to support mobile and cloud computing, ARM will continue to gain in-roads into the datacenter server market because of the low-power and energy efficient design of SOC’s based on ARM’s technology”, said Karl Freund, VP Marketing at Calxeda. “As enterprises shift towards highly scalable solutions such as Calxeda, a key enabling technology is intelligent workload management – and we have partnered with Univa to provide our customers with a great solution.”

Read the Full StoryDownload the MP3 * Download the SlidesSubscribe on iTunes * If Dropbox is blocked, download audio from Google Drive.

Also posted in HPC, HPC Software, Podcast, Video | Leave a comment

Univa Brings Enterprise Workload Management to ARM-based Servers

Today Univa announced that its Univa Grid Engine software will support the ARM processor architecture. Now available as Beta Release, Univa Grid Engine ARM Beta enables ARM to perform tasks far more quickly and efficiently.

Large enterprise users have become increasingly interested in using ARM-based chips in the data center as the focus shifts from performance to making their servers more energy efficient,” said Fritz Ferstl, Univa CTO. “ARM-based servers support multiple use cases in the modern data center as part of a larger trend toward matching the server hardware to the workload. Univa Grid Engine, the leading intelligent workload management solution in the enterprise, excels at automating the placement of dynamic workloads across all architectures from Intel x86_64, Intel Xeon Phi, NVIDIA GPGPUs and now ARM-based resources.”

Read the Full Story.

Also posted in Green HPC, HPC, HPC Software | Leave a comment

Altair PBS Works to Power Minerva Cluster University of Nottingham

Today Altair announced that the University of Nottingham has selected Altair’s PBS Works suite to expand the capabilities of its campus-wide HPC service based on Minerva, the university’s latest generation HPC cluster.

Working in partnership with ClusterVision, Altair will provide PBS Professional, Compute Manager and PBS Analytics to transform the 45 Tflop cluster into a robust and easily usable HPC service for the cluster’s hundreds of users. Altair was chosen over competitors because of its portal offerings, strong feature set and highly customizable framework. The company’s proven technology leadership and flexible approach were also key factors in the selection.

Read the Full Story.

Also posted in HPC, New Installations | Leave a comment

Interview: Managing HLRN Cray Cascade Systems with Moab

This week Adaptive Computing announced that the HLRN Consortium in Germany will be using Moab to manage its new Cray Cascade supercomputers. To learn more I caught up with Wolfgang Dreyer, Lee Carter, and Chad Harrington from Adaptive.

insideHPC: This win at HLRN in Germany is part of a ongoing relationship. How long has the HRLN consortium been using Moab?

Wolfgang Dreyer

Wolfgang Dreyer: HLRN is a long time customer of Adaptive Computing. HLRN was looking during the tender to available competitive solutions but having experience with Moab and Cray offering the Moab solution as part of their offering made a strong partnership a winning team-play.

Lee Carter

Lee Carter: HLRN first became an Adaptive (Moab) customer back in 2008 when they purchased their existing SGI environment – the system the new Cray hardware will be replacing.

insideHPC: What do you think makes Moab the resource management tool of choice for HLRN?

Wolfgang Dreyer: HLRN is using a wide variety of Moab modules already. Grid functionality and Accounting Manager are two of them as well as HLRN is keen to use and adopt the newly developed Power functionality on an optimized CRAY environment.

Lee Carter: In addition, maximum, business-aligned utilization of their systems is important. Our alignment of budget allocations to utilization using Moab Accounting Manager is key. Power-aware workload management is very much an additional value-add capability we will be jointly exploring and weaving into their day-to-day operations in collaboration with Cray and HLRN going forward.

insideHPC: The Cray “Cascade” systems at HLRN represent the state-of-the-art in clustering technology from the company and even features its own version of Linux. How closely do you work with Cray to ensure that Moab can optimize management of HPC resources?

Chad Harrington

Chad Harrington: Adaptive has long worked closely with Cray, since we have many common customers. Cray makes many of the world’s largest systems and Moab is particularly well suited for very large systems and workloads. As a result, Adaptive and Cray work together to ensure that Moab can take best advantage of Cray’s unique architecture and capabilities.

Lee Carter: Cray Cascade systems have a special interconnect technology invented by Cray. Moab is aware of this interconnect structure and can place Jobs depending on the JOB Specification on specific blades taking interconnect hops to account or depending on cache and memory availability. These are only a small fraction of parameters Moab can handle on Cray systems.

insideHPC: HLRN does a wide variety of research spanning from bio-informatics, chemistry, climate and ocean modeling, engineering, environmental research, and fluid dynamics to physics. With such a diverse workload, how do you ensure that the systems don’t get bogged down and are kept busy?

Wolfgang Dryer: Moab HPC Suite, Enterprise Edition has features that have long since been used by HLRN. One feature is Grid Option, which helps to ensure that both clusters are load balanced. Load Balancing two remote locations has special challenges as you must take into account time delays and the communication with an independent cluster having it´s own job responsibilities that change each time.

The second feature is the accounting manager which is integrated into Enterprise Edition. This can administrate accounts for different research groups, which get “Fair share usage” of the cluster. Fair-Share can be based on money, compute time or other parameters available in accounting manager.

Moab policies ensure that the cluster is always used in optimized utilization even if a group of researchers do not have jobs to run at a certain time. In this case Jobs with low priority can run even when they would normally run later (also known as backfill). The policy engine ensures that backfill jobs get low priority or suspended when a new high priority job is expected to run.

Also posted in HPC, HPC Software, New Installations | Leave a comment

Advertisement


Video Archive

insideHPC.com is a production of insideHPC, LLC. © 2006-2013 Sitemap