conference-talk

Profiling Python code to improve memory usage and execution time

Date and Time: 
2015 April 14 @ 3:00pm
Location: 
FL2-1022 Large Auditorium
Speaker: 
Jonathan Helmus

Python is an excellent language for rapid prototyping of algorithms and programs in order to determine the feasibility of a task, but oftentimes fails to meet the run time and memory usage requirements needed for deployment to production.  This presentation will discuss tools within the Python ecosystem for profiling Python code to identify memory and run time hot spots. Techniques will be presented which can be used to improve the performance of Python code by utilizing the information provided by these tools.

Speaker Description: 

Jonathan Helmus is a scientist and advanced algorithms engineer at Argonne National Laboratory where he develops software for the Atmospheric Radiation Measurement (ARM) climate research facility. He is the lead developer of Py-ART, an open source toolkit for analysis of weather radar data in Python as well as having contributed to a number of other Scientific Python modules. Jonathan completed a postdoc at the University of Connecticut Health Center after receiving his Ph.D. in Chemical Physics from The Ohio State University. 

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

KGEN: Fortran Kernel Generator

Date and Time: 
2015 April 14 @ 2:00pm
Location: 
FL2-1022 Large Auditorium
Speaker: 
Youngsung Kim

There are cases that we want to extract a part of Fortran codes from a software application as a stand-alone executable. For example, when a programmer debugs a large software application such as CESM of NCAR, he/she needs to run whole CESM program up to the source line to debug. If we can take the part of codes that are only relevant to the debugging, it would remove time to run CESM and time to wait in queuing system.

Speaker Description: 

After an undergraduate degree in Electronic Engineering at Dankook University of South Korea, Youngsung Kim has worked in mobile telecommunication industry for 13 years mostly as a software developer. In 2010, he returned back to school at University of Utah and majored in Scientific Computing. During the study, he participated WRF climate simulation project and brain image matching project along with taking core courses including numerical methods and parallel computing. After graduation with master's degree from Univ. of Utah, he joined NCAR and has been working on accelerator technologies until now.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Software Deployment in the Field (Technical Debt and Data-Ops)

Date and Time: 
2015 April 14 @ 11:30am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Gary Granger

In the Earth Observing Laboratory at NCAR, we develop and deploy many different instruments, for platforms ranging from miniature sondes to radars and aircraft, to areas all over the world to observe all kinds of phenomena. Every field deployment is in fact a custom and very complicated configuration of systems, software, instruments, and data streams. Besides the stories about writing software in some interesting situations, field projects present some (exciting) challenges for software engineering.

Speaker Description: 

Gary Granger received a Bachelor of Science degree in Computer Engineering from Virginia Tech, then began working for the Atmospheric Technology Division at NCAR in 1992. Over the years he has worked in several software development areas related to field deployment and instrument development, including field operations, visualization, and wind profiling radars. Currently he works in the Software Systems Group of the Earth Observing Laboratory, developing software in C++ and Python for the Integrated Sounding System, and developing LabVIEW software for spectrometers. He also advocates good software engineering practices in EOL and supports related infrastructure, such as subversion, build frameworks, and continuous integration servers.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Docker for Scientific Applications

Date and Time: 
2015 April 14 @ 10:00am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Joe Stubbs

Container technology, and in particular Docker, has revolutionized distributed systems in a very short time. At the Texas Advanced Computing Center, we see the potential for Docker to have an enormous impact on scientific computing as well. Containers enable developers to distribute their applications with all necessary dependencies included in a single file. As a result, the software is immensely more portable and provides far greater reproducibility of results. Additionally, the barrier to entry is greatly reduced, as installation becomes as simple as downloading a file.

Speaker Description: 

Joe Stubbs earned a PhD in Mathematics from the University of Michigan. Since then he has been at the University of Texas where he has focused on building infrastructure software in various contexts. He is currently a research scientist at TACC where he primarily works on the Agave "science as a service" platform, enabling the next generation of science gateways to harness petascale HPC over the web.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Composing and deploying a cluster of Docker containers

Date and Time: 
2015 April 14 @ 9:30am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Walter Moreira

Containers, and in particular Docker, are quickly transforming how we think about software architecture. Despite its popularity, there are two big problems that have not been fully solved yet: composability, and multi-node deployment of containers. Many products from big companies are trying to address them, but there is not a clear leader yet.

Speaker Description: 

Walter Moreira received his PhD in Mathematics from Texas A&M University. He previously worked in the HET Dark Energy Experiment at the McDonald Observatory, building the control system for a large telescope. He is currently working as a research engineer at TACC, concentrating in distributed systems. His main focus is building a federated data architecture for the Arabidopsis Information Portal.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Utilizing Scientific Python Tools for the Application of Data Science Techniques to High Impact Weather Prediction

Date and Time: 
2015 April 13 @ 3:00pm
Location: 
FL2-1022 Large Auditorium
Speaker: 
David Gagne

The developments and optimizations provided by Python’s scientific libraries have enabled the development of real-time high-resolution forecast post-processing systems primarily in Python. Numpy, Scipy, Matplotlib, and a set of newer scientific libraries have made this development possible. The Pandas library introduced efficient ways to load, analyze, manipulate, and merge large datasets. Scikit-Image provides a diverse array of image processing tools, which are useful for filtering and extracting information from gridded data.

Speaker Description: 

David John Gagne is a doctoral candidate in meteorology at the University of Oklahoma and a visiting graduate research assistant with the NCAR Research Applications Lab. His main research interests involve the application of machine learning techniques to numerical weather models and observations in order to improve the prediction of high impact weather. He has developed frameworks for improving the prediction of hail, solar energy, wind energy, heavy rain, aircraft turbulence, and tornadoes. He is an active Python developer and has contributed to packages for weather data visualization, forecast verification, and gridded forecast correction.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Python for Wind Energy

Date and Time: 
2015 April 13 @ 2:00pm
Location: 
FL2-1022 Large Auditorium
Speaker: 
Katherine Dykes

Authors: Katherine Dykes, Peter Graf, Rick Damiani, George Scott, Andrew Ning, Yi Guo, Taylor Parsons, Ryan King, and Paul Veers

Speaker Description: 

The NREL NWTC wind energy systems engineering initiative has developed a Python-based analysis platform to leverage its research capabilities toward integrating wind energy engineering and cost models across wind plants. This Wind-Plant Integrated System Design & Engineering Model (WISDEM™) platform captures the important interactions between various subsystems to achieve a better understanding of how to improve system-level performance and achieve system-level cost reductions. This work illustrates a few case studies with WISDEM that focus on the design and analysis of wind turbines and plants at different system levels.

Event Category:

Processing of ensemble simulations in a Python framework: PyEnsembles

Date and Time: 
2015 April 13 @ 11:00am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Ernesto Munoz

Authors: Ernesto Munoz, Emily Becker, Sheri Mickelson  

Speaker Description: 

Dr. Ernesto Munoz is Associate Scientist in NCAR's Climate and Global Dynamics Division. His current focus is on the development of applications for the analysis of ocean biogeochemistry (in collaboration with Dr. Keith Lindsay). Ernesto was awarded a Ph.D. degree in Atmospheric and Oceanic Sciences from the University of Maryland at College Park. After graduation, he completed a Postdoctoral appointment at NOAA's Cooperative Institute for Marine and Atmospheric Studies.

Event Category:

Python Base Scientific Data Analysis Web Application

Date and Time: 
2015 April 13 @ 10:00am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Hannah Keller

Authors: Seogmin Choi, Robert Crimi, Connor Guerrieri, Bo Han, Hannah Keller, Hannah Thomas

Speaker Description: 

Hannah Keller is a senior computer science major at the University of Colorado Boulder and is working on a team of 6 students under the guidance a group of developers at NCAR to create a scientific data analysis web application. Hannah is the team lead for the project, which is a Senior Software Engineering Project as part of the university's degree program. Currently, Hannah also works as a software developer associate at Alteryx, Inc., a strategic analytics software provider, where she works on the core engine team and was integral in developing Alteryx social media integration capabilities. Previously, she attended the University of British Columbia studying Environmental Engineering where she worked as a research assistant to an Environmental Fluid Mechanics Research Associate and co-authored a water quality report for the British Columbia Hydro and Power Authority.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Light-weight Parallel Python Tools Within the CESM Workflow

Date and Time: 
2015 April 13 @ 9:30am
Location: 
FL2-1022 Large Auditorium
Speaker: 
Sheri Mickelson and Alice Bertini

Authors: Sheri Mickelson, Kevin Paul, Alice Bertini, Dave Brown, John Dennis

Speaker Description: 

Sheri Mickelson is a software engineer in the Application Scalability and Performance Group and the CESM Software Engineering Group at NCAR. Her main interest is in application performance. Currently, she is helping lead the effort to improve the performance of various CESM post-processing tools in order to get ready for CMIP6.

Alice Bertini is a software engineer at NCAR in the CESM Software Engineering Group (CSEG). She is the CESM release coordinator, database programmer and administrator for the CSEG databases, and python programmer integrating light-weight parallel python tools developed by NCAR CISL ASAP group into the CESM workflow post-processing tools.

Event Category:

Video recorded: 

If you use a non-flash enabled device, you may download the video here

Pages

Subscribe to conference-talk