This module introduces students to the basic principles of data science, importing data, data transformation, cleaning and imputation, and visualisation for exploratory analysis.
-
Importing and preparing data:
-
Data science life-cycle and the CRISP-DM process;
-
Developing data understanding and analysis pipelines;
-
Data file types and importation strategies;
-
Variable types, formatting, labelling;
-
-
Data imputation
-
Imputation of a mean;
-
Random imputation;
-
Imputation by a model;
-
Effect on modelling;
-
-
Data transformation, wrangling and joins
-
Data filtering and selection;
-
Rotations;
-
Relational data and data joins;
-
-
Data visualisation techniques
-
Visualisation theory;
-
Exploratory vs Explanatory visualisations; infographics, art;
-
Techniques for trends, comparisons, relationships
-
Use of encodings, scaling, annotations, labelling, colour;
-
A mixture of lectures, practical computing laboratory classes and tutorials. Programming will be taught in the computer laboratory, and with supplemental lectures. The module will use the computer laboratory throughout the syllabus to achieve as much as possible subject matter interaction.
Module Content & Assessment | |
---|---|
Assessment Breakdown | % |
Other Assessment(s) | 100 |