This October, TACC will host a hands-on workshop on Leveraging High Performance Computing Resources for Managing Large Datasets. The event will take place Oct. 27-30 in Washington, DC, in conjunction with the IEEE Big Data Conference.
The possibility of making unprecedented discoveries through data-driven and data-intensive research depends on the existence of reusable, sustainable, accessible, organized, and documented Big Data collections. For this reason, data management has taken center stage in research practices across diverse domains. Although it is clear that scalable resources and data management workflows are required to produce curated collections in a timely manner, significant knowledge barriers stand in the way of using such high-end resources. This hands-on workshop will introduce the audience to scalable High Performance Computing and high-end storage resources, and will provide experience in setting up data management workflows on these resources.
The main target audience of this workshop is researchers, data curators, data managers, librarians, and archivists who are in charge of lifecycle management, long-term preservation, and access to data. The key topics that will be covered during the workshop are: TACC resources, data transfer, data integrity, metadata extraction, file migration, quality control, data classification and reorganization, and data preservation.