Argonne’s Data Science Program Doubles Down with New Projects

Today Argonne announced that the ALCF Data Science Program (ADSP) has awarded computing time to four new projects, bringing the total number of ADSP projects for 2017-2018 to eight. All four of the program’s inaugural projects were also renewed.

The new project award recipients include an industry-based deep learning project; a national laboratory-based cosmology workflow project; and two university-based projects: one that uses machine-learning for materials discovery, and a deep-learning computer science project.

The current projects have made great progress in just the past year alone:

  • Jacqueline M. Cole (University of Cambridge) has used ADSP resources to build a large database of organic molecule candidates with desirable structural and electronic qualities by data mining materials’ physical and chemical properties from 300,000 published research articles as part of her Data-Driven Molecular Engineering of Solar-Powered Windows project.
  • Doga Gursoy (Argonne National Laboratory), with his Large-Scale Computing and Visualization on the Connectomes of the Brain project, has streamlined the reconstruction of mice brains utilizing novel imaging and analytical tools to image at the level of individual cells and blood vessels.
  • Fabien Delalondre (École polytechnique fédérale de Lausanne) has been focused on code development and infrastructure setup for his Leveraging Non-volatile Memory, Big Data, and Distributed Workflow Technology to Leap Forward Brain Modeling project.
  • J.Taylor Childers from Argonne’s project on Advancing the Scalability of LHC Workflows to Enable Discoveries at the Energy Frontier has deployed Athena, the ATLAS simulation and analysis framework, on ALCF’s Theta supercomputer; and are working to deploy HTCondor-CE, a “gateway” software tool developed by the Open Science Grid to authorize remote users and provide a resource provisioning service.

New ADSP Projects include:

  • Enabling Multiscale Physics for Industrial Design Using Deep Learning Networks: Rathakrishnan Bhaskaran, GE Global Research. This project will leverage machine learning and large datasets generated by wall-resolved large-eddy simulations to develop data-driven turbulence models with improved predictive accuracy. The team will apply this approach to turbomachinery such as a wind turbine airfoil, demonstrating the impact that deep learning can have on industrial design processes for applications in power generation, aerospace, and other fields.
  • Realistic Simulations of the LSST Survey at Scale: Katrin Heitmann, Argonne National Laboratory. To help prepare for the massive amount of data that will be generated by the Large Synoptic Survey Telescope (LSST), this project aims to develop a comprehensive workflow starting from a structure formation simulation to the creation of sky maps with realistic galaxies. The team’s work will lead to one of the largest, most detailed synthetic sky maps ever created, as well as an end-to-end pipeline for LSST data processing and analysis on ALCF supercomputers.
  • Massive Hyperparameter Searches on Deep Neural Networks Using Leadership Systems: Pierre Baldi, University of California, Irvine. This project will run massive hyperparameter searches on deep neural networks to investigate the fundamentals of deep learning algorithms, helping to improve the use of deep learning on leadership computing systems. The research team also seeks to advance high-energy physics research by applying deep learning methods to improve the detection of exotic particles at CERN’s Large Hadron Collider.
  • Constructing and Navigating Polymorphic Landscapes of Molecular Crystals: Alexandre Tkatchenko, University of Luxembourg. This project seeks to combine state-of-the-art atomistic quantum simulations and data science methods to enable accurate predictions of novel molecular crystals for alternative energy materials, disease-curing pharmaceuticals, and molecular electronics. The generated data (structures, energies, and properties) and the associated big-data analytics tools developed during this project have the potential to enable major breakthroughs in computational materials discovery.

Sign up for our insideHPC Newsletter