51ºÚÁϳԹÏÍø

Internal

CSMBD21 - Big Data and Cloud Computing

51ºÚÁϳԹÏÍø

CSMBD21-Big Data and Cloud Computing

Module Provider: School of Mathematical, Physical and Computational Sciences
Number of credits: 20 [10 ECTS credits]
Level:7
Terms in which taught: Spring term module
Pre-requisites:
Non-modular pre-requisites:
Co-requisites:
Modules excluded:
Current from: 2021/2

Module Convenor: Prof Atta Badii
Email: atta.badii@reading.ac.uk

Type of module:

Summary module description:

This module covers the topic of big data and advanced computing.


Aims:

The massively increased uptake of computing, with devices at all scales of operation, has driven the development of large-scale distributed systems capable of meeting the demands for handling scalable parallel data analysis and processing and supporting execution of analytical algorithms on computer clusters such as Hadoop. This module aims to introduce the concepts and design principles for big data analytics and advanced network-centric computing platforms.



This module also encourages students to develop a set of professional skills, such as software development documentation, technical reporting writing, and project management.


Assessable learning outcomes:

It is expected that students will be able to:




  • Identify and describe the challenges of analysing big data and appraise relevant algorithms, tools and techniques to tackle such challenges;

  • Analyse complex data in structured, semi-structured and/or unstructured format and adopt/adapt analytics techniques to tackle the problems and evaluate the solutions;

  • Validate and redefine solutions for analytics problems, so that they could be applied to new but similar problems;

  • Understand the rationale for the design choices to be made in building live web-scale service-oriented architectures;

  • Acquire an integrated perspective on data processing in cloud computing platforms;

  • Address socio-legal, security, privacy and trust issues involved in operating and using cloud services;

  • Apply the cloud computing skills for data management and distributed and parallel data process ing.


Additional outcomes:

It is expected that students will also be able to:




  • Recognise real world applications of big data analytics and also demonstrate how to deploy and evaluate data mining applications for big data on computer clusters;

  • Become familiar with the potential applications of cloud computing.


Outline content:

The contents are organised in two parts:



Part 1




  • Introduction to big data analytics principles and challenges;

  • Techniques and tools for large data set analysis, including unstructured data analysis;

  • Algorithms and tools for the analysis of fast streaming real time data;

  • Techniques for building recommender systems.



Part 2




  • Introduction to distributed and parallel computing; Cloud Computing (IaaS, PaaS, SaaS, AI-as-a-S);

  • Framework design of large-scale distributed systems to support web-scale service-oriented architectures;

  • Security and privacy protection challenges in Cloud Computing;

  • Cloud Computing middleware, Map/Reduce; RESTful systems;

  • Cloud computing design features, such as consistency, availability and partition toler ance in distributed Information Systems, consistent Hashing, parallelism and computational efficiency;

  • Trustless Distributed Ledger Technologies and applications.



51ºÚÁϳԹÏÍø List: selected essential text:




  • Data Mining, Concepts and Techniques, (Second Edition) Jiawei Han, Micheline Kamber Morgan Kaufmann Publishers, March 2006. ISBN: 978-1-55860-901-3

  • Mahout in Action Sean Owen, Robin A nil, Ted Dunning, and Ellen Friedman ISBN 9781935182689


Brief description of teaching and learning methods:

The module comprises lectures, practical sessions, and a project-based assignment.


Contact hours:
Ìý Autumn Spring Summer
Lectures 20
Practicals classes and workshops 10
Guided independent study: Ìý Ìý Ìý
Ìý Ìý Wider reading (independent) 20
Ìý Ìý Wider reading (directed) 20
Ìý Ìý Exam revision/preparation 20
Ìý Ìý Advance preparation for classes 20
Ìý Ìý Preparation for tutorials 20
Ìý Ìý Preparation of practical report 30
Ìý Ìý Essay preparation 30
Ìý Ìý Reflection 10
Ìý Ìý Ìý Ìý
Total hours by term 0 200 0
Ìý Ìý Ìý Ìý
Total hours for module 200

Summative Assessment Methods:
Method Percentage
Written exam 50
Set exercise 50

Summative assessment- Examinations:

One 2 hour examine paper in May/June.


Summative assessment- Coursework and in-class tests:

One piece of project-based assignment.


Formative assessment methods:

Penalties for late submission:

The below information applies to students on taught programmes except those on Postgraduate Flexible programmes. Penalties for late submission, and the associated procedures, which apply to Postgraduate Flexible programmes are specified in the policy 􀀓Penalties for late submission for Postgraduate Flexible programmes􀀔, which can be found here: