IS&T RCS Tutorial - Python with Dask (Hands-on)

  • Starts: 12:30 pm on Friday, June 14, 2024
  • Ends: 2:30 pm on Friday, June 14, 2024
Dask is an open source Python library for parallel computing. This helps to scale Python code to large scale problems, including ones where the quantity of data is much greater than the amount of computer memory on hand. It provides a convenient way to adapt existing programs based around libraries such as Pandas and Numpy to run in parallel. This tutorial will cover using Dask to scale up Pandas Dataframes, numpy array processing, parallelizing custom Python code, and scalable file processing.
Brian Gregor
Biological Science Center, 2 Cummington Mall, Room 107

Back to Calendar