conference-talk

Building a Distributed Oceanography Match-up Service (DOMS) to pair field observation and satellite data

Date and Time: 
Monday, April 4th, 2016
Location: 
Center Green
Speaker: 
Zaihua Ji

Geoscience applications increasingly rely on the integration and collocation of data in the form of in-situ field observations with data in the form of satellite observations and global models. Both types of data reside in scattered repositories for both historical and economic (too large to replicate) reasons. Finding all possible data match-ups between distributed data repositories is a fundamental challenge for geoscience work such as satellite calibration and validation (Cal/Val).

Speaker Description: 

Zaihua Ji is Senior Software Engineer in the Data Support Section of CISL at NCAR.

Event Category:

Transmission Distribution Systems Hub

Date and Time: 
Monday, April 4th, 2016
Location: 
Center Green
Speaker: 
Monte Lunacek

The electric grid is a complex network that delivers power from many, distributed sources to commercial and residential consumers. Researchers are able to understand the impact of changes to this system through modeling and simulation.  This is important because several technologies that are growing in residential use, such as solar, electric vehicles, and smart home appliances, impact the load placed on the grid in different ways. New and different retail market structures also impact how electricity is consumed.

Speaker Description: 

Monte Lunacek is member of the Modeling & Simulation Group at NREL since 2014. Before, he was an HPC Application Specialist in the Research Computing group at the University of Colorado.  Monte received his PhD in Computer Science from Colorado State University.

Event Category:

Apache Spark for scientific data at scale

Date and Time: 
Monday, April 4th, 2016
Location: 
Center Green
Speaker: 
Neal McBurnett

Apache Spark is a modern open source cluster computing platform. It is helping data scientists analyze and explore large datasets more effectively than ever before, in terms of both software development productivity and efficient use of hardware, scaling from on-premises clusters to on-demand cloud computing.

Speaker Description: 

Neal McBurnett is a consultant in Boulder Colorado. Since his career as a Distinguished Member of Technical Staff at Bell Labs, working on tools for software development, security and open source web collaboration, he has taught Artificial Intelligence at CU and worked as a techincal content developer at Databricks for courses on Apache Spark, including two massive online courses on Spark in 2015.

Event Category:

Using open source tools to reduce large data sets in a distributed environment.

Date and Time: 
Monday, April 4th, 2016
Location: 
Center Green
Speaker: 
Scott Collis

Radar data is complex and with the archive of NOAA NEXRAD data approaching the petabyte range, managing and extracting geophysical insight from this data is truly challenging. This presentation will discuss an approach using Python, specifically using resources available through project Jupyter, to solve pleasantly parallel problems. This is especially pertinent given the increasing prevalence of distributed computing resources.

Speaker Description: 

Scott Collis is radar metereologist at Argonne National Lab.

Event Category:

On the potential of Big Data capabilities for the validation of a Weather Forecasting System

Date and Time: 
Monday, April 4th, 2016
Location: 
Center Green
Speaker: 
Giuseppe Iannitto
A key component of the EO projects is the validation of the EO data products through a Ground Truth Validation. In the validation process data can be collected from various ground-based sources and sensors (in situ measurements, instruments, crowd-sourcing, open source platform), then quality-controlled, and finally compared with the satellite products in order to get validated retrievals.
Speaker Description: 
Currently 3rd year PhD student at Tor Vergata University (Rome) for the geoinformation department, studying on big data “A Map-Reduce approach for management and exploitation of Earth Observation Big Data”
Senior IT Consultant/Manager, Mr. Iannitto worked as Technical Director for an IT Consulting organization, with over 10 years of hands on system integrations, management and software development experience in the Information and Communication Technology Industry.
Ten years experienced as system integrator: responsible of system integration and acceptance test on a Europe-wide project of a Combat Management System for the Italian and French Navy.
His expertise as project manager encompasses any project phase from pre-project sales, strategic planning, project planning and project control, as well as post-project evaluation with the client sponsor to assure satisfaction and to identify any process improvements that can be made to the client.
Also, proficient in most client server platforms and numerous programming languages. Very good analytical problem-solving ability with excellent verbal and written communication skills.

Event Category:

What Climate Science Knows About Big Data

Date and Time: 
Monday, April 4th, 2016
Location: 
Center Green
Speaker: 
Seth McGinnis

Big Data refers to any data set that's too big to be handled by conventional tools and techniques. This criterion is relative, context-dependent, and changes over time. It is often taken to mean data with high storage volume on the terabyte to petabyte scale (currently), but there are other ways for data to be Big, and an array of different approaches are needed to wrestle such data sets into tractability.

Speaker Description: 

Seth McGinnis is an Associate Scientist IV in the Institute for Mathematics Applied to Geosciences (IMAGe) at NCAR. As the Data Manager and User Community Manager for NARCCAP, the North American Regional Climate Change Assessment Program, he makes the output from climate models usable by and available to people who need information about climate change. His research focuses on bias correction, interpolation, and other issues affecting the practical use of model output by non-specialists.

Event Category:

Big Data from CESM - Past, Present and the near Future

Date and Time: 
Monday, April 4th, 2016
Location: 
Center Green
Speaker: 
Gary Strand

The Community Earth System Model (CESM) has been in the realm of "big data" since before it became a buzzword. I'll talk about past experiences with big data (primarily CMIP5), and the lessons learned from those experiences. Those lessons, which have informed and improved big data practices within CESM since that time, will be discussed as well as the preparations being made for the upcoming CMIP6. I'll also mention the various aspects of good data management and how the CESM project is addressing them.

Speaker Description: 

Gary Strand is a software engineer who has become the *de facto* data manager and sometime data scientists for the CESM project.

Event Category:

Welcoming remarks

Date and Time: 
2015 April 13 @ 9:15am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Jim Hurrel
Speaker Description: 

Jim Hurrell is the Director of NCAR.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Project Jupyter: a language-independent architecture for open computing and data science

Date and Time: 
2015 April 13 @ 9:30am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Fernando Perez

IPython began its life in Boulder in 2001, as an environment for interactive scientific computing and data analysis, motivated by my needs as a physics graduate student. Over the years, it evolved into one of the main elements of the collaboratively developed ecosystem of open source tools for science in Python. In recent years, IPython has evolved into Project Jupyter: an architecture that takes the foundations of IPython and extends them to any programming language. Jupyter offers interactive terminals as well as a popular web-based notebook environment.

Speaker Description: 

Fernando Pérez (@fperez_org) is a staff scientist at Lawrence Berkeley National Laboratory and a founding investigator of the Berkeley Institute for Data Science, created in 2013. He received a PhD in particle physics, followed by postdoctoral research in applied mathematics, developing numerical algorithms. Today, his research focuses on creating tools for modern computational research and data science across domain disciplines, with an emphasis on high-level languages, literate computing and reproducible research. He created IPython while a graduate student in 2001 and continues to lead it as it evolves into the Jupyter Project, now as a collaborative effort with a talented team that does all the hard work. He regularly lectures about scientific computing and data science, and is a member of the Python Software Foundation as well as a founding member of the Numfocus Foundation. He is the recipient of the 2012 Award for the Advancement of Free Software from the Free Software Foundation.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Open Space

Date and Time: 
2015 April 15 @ 3pm
Location: 
FL2 Cafeteria Atrium
Speaker: 
n/a

Open Space is an approach that enables groups of any size to address complex, important issues and achieve meaningful results quickly. In Open Space meetings and events, participants create and manage their own agenda of parallel working sessions around a central theme of strategic importance. Conference participants will have the opportunity to apply Open Space technique to self-organize and discuss relevant topics with other meeting attendees. A space to write down ideas, questions, and discussion topics will be available throughout the first two days of the conference.

Speaker Description: 

Facilitator:
Nathan Wilhelmi is a software engineer at the National Center for Atmospheric Research where he leads the Software and Web Engineering Group. Prior to working at NCAR Wilhelmi worked in software development across a range of private sector domains. His primary focus is in the area of application development using Agile project management and engineering practices.

Event Category:

Pages

Subscribe to conference-talk