Atmospheric data analysis with Dask and Xarray

Date and Time: 
Thursday 2018 Apr 5th
CG Auditorium
Kevin Paul and Joe Hamman

Dask is a Python-based "flexible parallel computing library for analytic computing." Dask provides a simple interface to parallel data analysis tasks without requiring parallel computing knowledge from the user. Xarray is a Python package that provides an in-memory data model (and operations on that data) consistent with the Common Data Model used by NetCDF. Together, Dask and Xarray provide a parallel-computing platform ideal for analysis of Earth System Science data. Users interact with Xarray data objects without needing to think about the parallel operations being performed by Dask "under the hood." In essence, Dask and Xarray provide a platform that can scale a user's analysis from their laptop to a high-performance computing (HPC) center or the cloud, without needing to change a single line of code!

Speaker Description: 

Dr. Kevin Paul is a scientist at the National Center for Atmospheric Research and an expert on parallel computation and workflow optimization.

Dr. Joe Hamman is a post-doctoral fellow with the National Center for Atmospheric Research and one of the lead authors of Xarray.

Event Category: