IS&T RCS Tutorial - Python with Dask (Hands-on)

Dask is an open source Python library for parallel computing. This helps to scale Python code to large scale problems, including ones where the quantity of data is much greater than the amount of computer memory on hand. It provides a convenient way to adapt existing programs based around libraries such as Pandas and Numpy to run in parallel. This tutorial will cover using Dask to scale up Pandas Dataframes, numpy array processing, parallelizing custom Python code, and scalable file processing.

When 12:30 pm to 2:30 pm on Friday, June 14, 2024
Location Biological Science Center, 2 Cummington Mall, Room 107