Lecture 8: Data manipulation (with pandas)
In this lecture, we introduce pandas, a data analysis library for Python. pandas is used extensively for analysing and manipulating data, and is designed to be open source, powerful, flexible and easy to use. It is widely used in both academic and commercial settings.
OUTLINE:
DataFrames - dictionaries on steroids
Viewing data
Selection - selecting particular elements of a DataFrame
More operations on a DataFrame’s rows/columns/entries
Grouping
Visualisation of DataFrames
Importing and exporting files/spreadsheets
Summary Exercises
Please note: because these notebooks deal with data manipulation, you will need to import the data files into your Colab environment (see the video in week 4). Please download the 3 datasets from Moodle. Each notebook identifies which datasets it requires.
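As a rough preview of the topics in the outline, here is a minimal sketch on a small in-memory table, so it needs none of the Moodle datasets. The column names and values are invented purely for illustration and do not come from the course notebooks, which may approach each step differently.

```python
import io

import pandas as pd

# Hypothetical toy data, invented for illustration only
data = {
    "name": ["Ada", "Ben", "Cat", "Dan"],
    "group": ["A", "B", "A", "B"],
    "score": [72, 65, 88, 91],
}

# DataFrames: built from a dictionary, whose keys become the column names
df = pd.DataFrame(data)

# Viewing data: the first two rows, and summary statistics of numeric columns
first_rows = df.head(2)
summary = df.describe()

# Selection: a single column, and the rows that satisfy a condition
scores = df["score"]
high = df[df["score"] > 70]

# Grouping: mean score within each group
group_means = df.groupby("group")["score"].mean()

# Exporting and importing: write to CSV text, then read it back.
# A file path could be used instead of the in-memory text buffer.
csv_text = df.to_csv(index=False)
restored = pd.read_csv(io.StringIO(csv_text))

print(first_rows)
print(high)
print(group_means)
```

Each of these steps is covered in more detail in the corresponding part of the lecture.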
Weekly tasks
- Introduction
- Introduction - SOLVED
- Part 1: data frames (extra powerful dictionaries)
- Part 1: data frames (extra powerful dictionaries) - SOLVED
- Part 2: viewing data
- Part 2: viewing data - SOLVED
- Part 3: selecting elements from a data frame
- Part 3: selecting elements from a data frame - SOLVED
- Part 4: More operations on DataFrames
- Part 4: More operations on DataFrames - SOLVED
- Parts 5 and 6: grouping and visualising data
- Parts 5 and 6: grouping and visualising data - SOLVED
- Part 7: Importing and exporting files / spreadsheets
- Part 7: Importing and exporting files / spreadsheets - SOLVED
- Part 8: Summary exercises
- Part 8: Summary exercises - SOLVED