Entries filed under “NVIDIA GPU Technology Conference”

Posts, stories, and announcements from the NVIDIA GPU Technology Conference

Who’s Who Headed to GPU Tech Conference

nvidiaNVIDIA announced the first set of speakers, tutors and presenters for this year’s GPU Technology Conference [GTC].  GTC 2010, the follow on to inaugural event in 2009, will take place September 20-23 at the San Jose Convention Center.

Among those speaking, holding tutorials or conducting technology previews are computing thought leaders Satoshi Matsuoka, a supercomputing expert from Tokyo Institute of Technology; Pat Hanrahan, a computer graphics pioneer at Stanford University; and Hanspeter Pfister, a leader in scientific computing at Harvard University.

The speaker lineup will also include:

  • Ross Walker, computational biologist from the University of California San Diego and the San Diego Supercomputing Center
  • Vijay Pande, computational biologist of Stanford
  • Homer Pien, medical-imaging expert at Massachusetts General Hospital and Harvard Medical School
  • Wei Ge, multi-scale particle simulation expert at Chinese Academy of Sciences
  • Timothy Warburton, Rice University specialist in computational and applied mathematics

Speakers from industry, research and vendors will include representatives from:

  • Adobe
  • Agilent Systems
  • Beckman Coulter
  • CSIRO
  • Dolby Laboratories
  • GE Intelligent Platforms
  • Georgia Tech
  • Microsoft
  • Siemens Medical
  • University of Tennessee
  • Wolfram Research

For those interested in submitting a talk for GTC 2010, the call for submissions is open until June 1, so hurry hurry hurry.

GTC attendees are going to be amazed at the breakthrough applications made possible by GPU technology. This year’s sessions will reflect revolutionary work done over the past year by some of the world’s leading industry, research and academic authorities,” said Bill Dally, NVIDIA Chief Scientist.

insideHPC had the pleasure of attending last year’s GTC.  It was one of the best technical conferences I have ever attended.  For more info and registration details, check out the GTC 2010 website here.

Also posted in Events, HPC | Leave a comment

Call for submissions out for NVIDIA’s GPU Technology Conference

The call is out on the interwebs for the NVIDIA’s second (annual, I guess) GPU Technology Conference. This year’s GTC will be held Sept. 20 to Sept. 23 at the San Jose Convention Center in San Jose, California.

According to an email sent out today

nVidia logoDevelopers, researchers, scientists and entrepreneurs are cordially invited to submit proposals on topics related to the burgeoning GPU ecosystem.

Building on the format of last year’s inaugural GTC, three summits will be held under one roof:

  • GPU Developers Summit
    Session Topics deadline: June 1, 2010
  • Emerging Companies Summit
    “CEO on Stage” Nominations deadline: August 1, 2010
  • NVIDIA Research Summit
    Research Posters deadline: August 15, 2010

More information on submissions at the website.

Also posted in Events | Leave a comment

NVIDIA Posts Recordings of GTC Keynotes and General Sessions

nvidiaNVIDIA has posted a growing series of videos from the various keynote speeches and general sessions of the GPU Tech Conference back earlier this month. Of special note is Jen-Hsun Huang’s keynote speech and the Breakthrough’s in High Performance Computing general session.  The list of recorded sessions being posted continues to grow, so check back often.

You can see the full list of recorded GTC goodness here.

Also posted in Events, GPUs, HPC Hardware, Visualization | Leave a comment

NVIDIA GPU Technology Conference keynotes available online

Just over the email transom, news that NVIDIA has posted videos of the 5 keynote sessions online, include CEO Jen-Hsun Huang’s keynote where he announced NVIDIA’s next generation Fermi architecture.

  • Opening Keynote with Jen-Hsun Huang
  • General Session: Important Trends in Visual Computing
  • General Session: Breakthroughs in High Performance Computing
  • Day 2 Keynote: Hanspeter Pfister, Professor, Harvard University
  • Day 3 Keynote: Richard Kerris, CTO, Lucasfilm

You can watch the videos here.

Also posted in Events | 4 Comments

NVIDIA GTC Debrief

For those of you following this week’s coverage of the NVIDIA GPU Technology Conference, I wanted to take a quick minute and reflect on the previous three days.  As I sit here and contemplate (over a tasty Hefeweizen) how to summarize the events, one word seems to bubble up in my head: electric.

I’ve been attending technical conferences from coast to coast for a number of years and never have I experienced the electricity that I felt here.  The hallways were abuzz with all sorts of burgeoning ideas on how to retask the GPU with new work.  The hallways were lined with posters detailing the work of countless educational and research institutions from all over the world.  I admit, I often miss much of the poster content from Supercomputing.  I found myself stopping mid-stride to analyze an overwhelming number of them here at GTC.

I had the pleasure of dining with several NVIDIA folks yesterday for lunch.  They were extremely pleased with the response from all the startups, researchers and participants whom submitted papers and projects for the conference.  Never have I seen the (often fickle) members of the HPC community latch on to a technology so fast and with such ferocity.  All in all, its pretty amazing.

The aforementioned electricity of the event was deeply embedded not only in the participants, but in NVIDIA as well.  The NVIDIA folks delivered their presentations, press releases and product announcements, then sat back and genuinely listened to their users.  In my opinion, the event was an overall success.  I learned a lot, reconnected with friends and listened to some wonderful presentations.

Also posted in Events, GPUs, HPC Hardware | 4 Comments

Bloomberg Runs Bond Pricing on GPUs

Right alongside this week’s GPU Tech Conference, Wall Street Tech has an interesting article detailing how Bloomberg has converted their bond pricing infrastructure and applications to utilize the power of GPUs.  Every night, Bloomberg calculates 1.3 million hard-to-price asset-backed securities.  These calculations, single-factor Stochastic models, were originally run on a Linux cluster.

These models are ideal for doing things in parallel, and we did parallelize them over traditional x86 Linux computers,” says CTO Shawn Edwards.

However, as customer demand increased, the overall scalability of the pure Linux cluster did not lend itself well to realistically doing the job.  One of the core software architects on Edwards’ team suggested using GPUs to solve the scalability issue.  The core Stochastic models exhibited in their applications lend themselves well to parallel computing on GPUs.

It turned out that in order to compute everything within that eight-hour window, we would need to go from 800 cores to 8,000 cores,” Edwards. “That’s a lot of servers, about 1,000. We could do it, but it doesn’t scale very well. If we wanted to use it for other ideas, we were faced with having to pile on more and more computers. That’s when the idea came in for GPU computing.”

The newly minted GPU method went live in 2009.  Rather than running on 1,000 traditional server nodes, the cluster shrank to 48 server/gpu pairs.  Whats more amazing is the fact that they achieved an 800% performance increase.

Overall, we’ve achieved an 800% performance increase,” Edwards says. “What used to take sixteen hours we’re computing in two hours.”

We’ve all seen the various performance numbers regarding GPU-based speedup.  Outside of pure speedup, also consider the power and cooling costs that Bloomberg is saving by migrating to a more consolidated architecture.  For more info on, read the full article here.

Also posted in Applied HPC, Events, GPUs, HPC Hardware | Leave a comment

NVIDIA Emerging Companies Summit

nvidiaFor those following our coverage of the NVIDIA GPU Tech Conference this week, I decided to take a slightly different route in covering the fringe topic of emerging companies surrounding the recent GPU computing explosion.  For the second year in a row, NVIDIA is hosting an emerging companies summit alongside their technical conference.  (Last year it was held alongside NVISION).

This year, the keynote for the event was delivered by Jeff Herbst, VP of Business Development for NVIDIA.  Prior to his keynote presentation, I didn’t realize how seriously NVIDIA considered their role of igniting the innovative fires surrounding their core technology.  Jeff coined their motto in business development as “Supporting the Ecosystem.”  This is a very interesting perspective on the part of NVIDIA.  This implies that their relationship with OEMs and end users is symbiotic in nature (as it should be).  NVIDIA launched a GPU Ventures project in March of this year in order to become even more pervasive within their user community.

I was able to speak with the CEO of Milabra, Samuel Cox, yesterday evening.  Milabra is one of sixty such organizations that has aligned its core technology very closely with the NVIDIA core architecture.  Ever heard of Milabra?  Me neither!  They are doing for images what Google AdWords has done for text.  They’re essentially using NVIDIA GPU technology to classify and extract rich sets of information from images on the web.  Once they classify this image data, they deliver point-based ads based on the resulting info.  They’re already cataloging and mining info from billions (yes, billions with a ‘b’) of images with only 200 milliseconds of latency (from image delivery to ad response).  They do so via their growing NVIDIA Tesla S1070 farm.  Not HPC, but very cool stuff!

This fundamental relationship is quite different than other vendor organizations.  I’ve seen all to many integrators and vendors that take a very NIH (not invented here) perspective on technical relationship development.  Kudos to the folks at NVIDIA for taking this approach.

Also posted in Business of HPC, Collaborations, Events, HPC | Leave a comment

OSC Offers Remote Viz Using NVIDIA Quadroplex

The Ohio Supercomputing Center (OSC) has announced that they have taken delivery of several NVIDIA QuadroPlex 2200 S4′s as supplied by PNY and JRTi.  These new devices will allow OSC users, researchers and partners to interactively visualize their data from their classrooms and labs.

OSC chose the NVIDIA Quadro Plex solution for their optimized support of CG, GLSL, CUDA and OpenCL”, said Don Stredney, Director, OSC Interface lab. This will allow OSC to further explore both interactive volume rendering of extremely large data sets, through CUDA programming, and additional uses in remote visualization. Now, numerous faculty, staff, and students will have access to the unique architectural environment to expand current course work as well as research in the areas of extremely large scale image processing, molecular dynamics, data mining, scientific and information visualization.

For more info on the new install, check out the OSC website here.

Also posted in Events, GPUs, HPC Hardware, New Installations | 1 Comment

NVIDIA introduces IDE for CPU+GPU code development

Nexus IDEDuring his keynote yesterday at NVIDIA’s GPU Technology Conference, CEO Jen-Hsun Huang announced a new development environment (called Nexus) that integrates into MS Visual Studio 2008. We mentioned this in our features on NVIDIA’s big news, the next generation Fermi architecture, but the company has also issued a separate release.

Nexus is composed of three components:

  • The Nexus Debugger is a source code debugger for GPU source code, such as CUDA C, HLSL and DirectCompute. It supports source breakpoints, data breakpoints and direct GPU memory inspection. All debugging is performed directly on the hardware.
  • The Nexus Analyzer is a system-wide performance tool for viewing GPU events (kernels, API calls, memory transfers) and CPU events (core allocation, threads and process events and waits)—all on a single, correlated timeline.
  • The Nexus Graphics Inspector provides developers the ability to debug and profile frames rendered using APIs such as Direct3D. Developers can use the Graphics Inspector™ to scrub through draw calls, look at any textures, vertex buffers, and API state in the entire frame.

Two versions of Nexus will be available: Nexus Standard and Nexus Professional. Nexus Standard is available at no cost to developers, and includes basic source debugging and profiling functionality that matches existing functionality with PerfHUD. Nexus Professional includes advanced debugging features, full system event tracing, and premium support; it will retail for $349.

A beta will be available for download on Oct 15, and you can find more information at www.nvidia.com/nexus.

Also posted in Events, GPUs, HPC Hardware, Tools | 1 Comment

EM Photonics Annoucnces GA Release of CULA Math Lib

image002EM Photonics, today, announced the general availability of its CULA accelerated computing math library.  This release includes NVIDIA GPU-accelerated versions of many commonly used LAPACK linear algebra routines.  I had a chance to sit down with EM Photonics CEO Eric Kelmelis here at the NVIDIA GPU Technology Conference.

Applications ranging from video games, to medical imaging, to scientific computing have come to depend on the superior processing capabilities of GPUs.  By every measure, this trend is rapidly growing and impacting more and more markets,” said Eric Kelmelis, CEO of EM Photonics. “To bridge the current gap between what GPUs can offer and how they can be used to accelerate applications, we have developed CULA in close association with NVIDIA.  A broad range of users took advantage of our beta release over the last few months and achieved 5-10x performance gains over CPU implementations.”

You may not hear the words “EM Photonics” or “CULA” often, but you will soon.  They are doing for LAPACK what NVIDIA has already done for BLAS.  NVIDIA’s new Fermi hardware is indeed impressive.  Billions of transistors are begging to be raked over the proverbial coals.  However, without a reasonable programming interface available to port existing codes to the new platform, one can argue the cart before the horse.

EM Photonics has been running silent and deep for the last several years in the various GPU efforts.  They’ve landed various DARPA and DoD grants/contracts.  Most recently, NASA Ames had them work the GPU magic on several production routines.

The CULA linear algebra library enables developers for a wide range of technical computing applications including computational fluid dynamics, electronic design automation, finite element analysis, and electromagnetic simulations, to take advantage of the performance boost of the GPU”, said Andy Keane, General Manager for the Tesla high-performance computing group at NVIDIA.   “With this release, EM Photonics is making a meaningful addition to the NVIDIA CUDA eco-system by providing a mature, complete math library”.

CULA is currently available in three forms: Basic, Premium and Commerical.  CULA basic is FREE and it comes with six common LAPACK routines.  CULA Premium lists at $395 and contains a significantly larger number of routines.  Keep and eye on these folks.  ORNL is buying a 20PF GPU-accelerated system that needs solvers and EM Photonics has an early lead.

For more info, check out their website here.

Also posted in Events, HPC Software | 2 Comments

First Public Photos of NVIDIA Fermi

insideHPC was lucky enough to get a few shots of the new Fermi silicon as it will be eventually packaged for workstation [internal card] consumption.

 


Also posted in Events, HPC | Leave a comment

ORNL Bites GPU Bug

Oak Ridge National Laboratory has officially bitten the GPU bug.  Following John’s overview of the new Fermi chip, we find Dr. Jeff Nichols, computational director at ORNL, standing foot to foot with NVIDIA’s CEO.

This is BIG news.  ORNL is interested in very large, leadership class supercomputing platforms directed towards ingesting the workloads of a many scientists.  Hence, they depend on production computing.  Their support of GPU computing has the ability to launch the technology into the next dimension of computing.

Nichols also mentioned that ORNL is spearheading what he calls the Hybrid Multicore consortium.  Long story short, NVIDIA’s new Fermi GPUs have the backing of one of the largest collection of computational talent in the world.

Also posted in Events | Leave a comment

NVIDIA’s next generation GPU architecture has a lot for HPC to love

And it all gets sealed with an engagement to ORNL

nVidia logoToday NVIDIA’s CEO Jen-Hsun Huang delivered the keynote address at the GPU Technology Conference in San Jose, and gave a big fat kiss to folks looking to get extra performance for their computations out of NVIDIA’s GPUs. Huang announced Fermi, NVIDIA’s third generation GPU architecture, which NVIDIA is pitching as “the soul of a supercomputer in a GPU.” There is a lot that is new here, so let’s start at the beginning and break it down.

2x the cores

NVIDIA’s goal with Fermi is to bring more users and more applications to GPU computing, and many of the changes they’ve made this time are aimed at enlarging the domains in which GPUs make sense. The new design has 512 cores, and NVIDIA has engineered a design with 8x the double precision performance of the last generation. This is more than double the core count in about 12 months, leading us to get pretty excited about the idea of >1024 threads by 2011.

Fermi’s double precision performance now runs at about 50% of the single precision performance (makes sense, it takes 2 32-bit lines to move the data around), a ratio that is 5 times better than the last generation GPU could manage. The accuracy of double precision computations has also been improved with support for the new IEEE 754-2008 floating-point standard, and a fused multiply-add instruction. Fermi can issue 512 FMAs per clock in single precision, or 256 FMAs per clock in double precision mode.

Real cache, more RAM, and ECC (finally)

Also targeted at winning over new users is the introduction of a cached memory hierarchy, the first NVIDIA GPU to do so. The new design features a dedicated 64KB L1 cache per Streaming Multiprocessor (GPU cores are organized hierarchically into “Streaming Multiprocessors,” or SMs; 32 cores form an SM, and there are 16 SMs on a board), and a 768KB L2 cache shared among all SMs. NVIDIA calls this the “Parallel DataCache Hierarchy,” and Sumit Gupta, senior manager in the Tesla GPU Computing group, says that this feature is very important not only to sparse matrix and physics calculations (for gaming), but also for traditional graphics applications like ray tracing. Application engineers should now see a much more familiar programming environment when porting code from CPUs.

Speaking of memory, there’s more of it — a lot more. The previous GT200 architecture had a 4GB physical limit on the amount of memory that could be connected to the cards. With Fermi NVIDIA’s engineers have moved to 64-bit memory addressing, and the architecture will now theoretically support up to 1 TB of RAM on a single card. It’s pretty unlikely that there will be a standard SKU with that much memory, but certainly cards with 6 GB or more of memory should be economically viable. Fermi also supports the faster GDDR5 memory interface (the GT200 generation supported GDDR3 memory) and ECC, addressing a big concern that users had for the reliability of data. According to Gupta this is the first GPU to support ECC for the RAM itself — ATI’s GPUs use GDDR5 and offer data link protection for errors that could occur during data transfer, but that is part of the DDR5 standard (and Fermi has this as well). All the major internal memories are ECC protected, including the register file and both the L1 and L2 caches. GDDR5 is theoretically about twice the bandwidth of GDDR3, but the actual speeds and feeds will vary with the specific products built on the Fermi architecture. I would also expect GDDR5 to add a premium to the price, at least at first.

Hardware thread scheduling and concurrent kernel execution

Concurrent task scheduling in FermiA big step forward in improving the application sweet spot for GPUs is the change in the way threads are scheduled. NVIDIA’s GigaThread Hardware Thread Scheduler (HTS) handles all of the task scheduling for developers who are now free to just throw tasks at the GPU without having to worry (as much) about packing the tasks together to efficiently manage the resource. The old architecture executed individual tasks one at a time (see image to the right) and had a relatively slow context switch, so developers had to be sure they sent large pieces of work to the GPU in order to get anything like good performance. The new HTS supports concurrent kernel execution and context switching that’s about an order of magnitude faster than before, so now much smaller units of work can be sent to the GPU. If you think of work kept on the CPU as the “serial fraction,” then these two changes should help move the Amdahl performance limit for GPU-accelerated applications further to the right.

If you are worried about keeping the CUDA kernels busy while you are doing all this concurrent kernel execution (and you should be worried about that), then you’ll be happy about the new twin DMA engines in Fermi. With the single DMA of the GT200, developers could overlap communication and computation in one direction, for example writing a result back to the CPU while they computed a new result. But with dual DMA, applications can write to and read from the CPU while computing, allowing for full overlap of communications and computation, and again helping to expand the set of applications that can potentially benefit from a GPU assist.

NVIDIA adds IDE, C++, and (thank goodness) print debugging

But what about the software that makes it all work? NVIDIA announced earlier this week that a new FORTRAN compiler for CUDA had entered beta, a big deal for the scientific computing community. With Fermi NVIDIA is adding full support for C++ to the already existing support for C, including the hardware needed to support exception handling and virtual functions. Fermi will also support system calls for the first time on the GPU, so that file streaming will no longer have to be mediated by the CPU and printf debugging is at long last possible (yeah, I know, it’s not exactly modern, but we all still do it). When you step back and look at it, NVIDIA has done a lot to make their general purpose GPU pretty general purpose. With support for C, FORTRAN, C++, Python, Java, OpenCL, OpenGL, DirectCompute, and DirectX 11 there is something for just about everyone here.

NVIDIA is also announcing a new Integrated Development Environment that will be available this week, called Nexus. Nexus integrates into MS Visual Studio 2008 and, according to Gupta, has been designed to speed up CPU+GPU computing. Two versions of Nexus will be available: Nexus Standard and Nexus Professional. Nexus Standard is available at no cost to developers, and includes basic source debugging and profiling functionality that matches existing functionality with PerfHUD. Nexus Professional includes advanced debugging features, full system event tracing, and premium support; it will retail for $349.

A vote of confidence: plans for a big super at ORNL

Today’s announcement by Huang focuses on the architecture, not the specific products. Those won’t start showing up until “sometime next year” according to Gupta. While the company declined to be more specific with us on release dates, there are at least some folks out there that know a little bit more about the schedule and are pretty excited about the technology. Today we also learned that Oak Ridge National Laboratory is planning to build a supercomputer based on Fermi products. From a statement released by the company during the announcement

“This would be the first co-processing architecture that Oak Ridge has deployed for open science, and we are extremely excited about the opportunities it creates to solve huge scientific challenges,” Nichols said. “With the help of NVIDIA technology, Oak Ridge proposes to create a computing platform that will deliver exascale computing within ten years.”

That’s a pretty awesome announcement for a design that isn’t even a product yet.

Also posted in Events, Featured Stories, GPUs, HPC Hardware | 8 Comments

Huang Broadcasting LIVE in HD Stereo

Huang, the CEO of NVIDIA, is delivering his keynote live via HD Stereo.  I’m live here at the conference with my 3D goggles and I’d like to say that it looks amazing!

Kudos to Elemental for helping to make this happen!

Also posted in Events | Leave a comment

NVIDIA GTC Kickoff

NVIDIA Air Gun

NVIDIA Air Gun

The NVIDIA GPU Tech Conference has officially kicked off.  The first keynote speaker, NVIDIA CEO Jen-Hsun Huang, will begin in t-minus one hour.  insideHPC will be providing live coverage!

In the mean time, I wanted to give our readers a little taste of the typical NVIDIA pizzaz when it comes to conference trends.  What better to greet its visitors than a giant compressed air gun at the door?  Wicked cool.

Also posted in Events | 2 Comments

Advertisement

ClusterStor Ad

Video Archive

insideHPC.com is a production of insideHPC, LLC. © 2006-2013 Sitemap