Description
The module builds on software engineering skills and extends them to the design of data architectures and pipelines for managing, processing and analysing large data sets.
Areas that may be addressed include:
- Scaling principles including horizontal and vertical scaling
- Distributed Computing including scalability, reliability and achieving consensus
- DataOps, DevOps and Data Lineage
- Batch and Streaming data architectures
- Constructing Data Pipelines for Machine Learning
- Large scale data storage strategies involving various Database Types, Data Warehouses and Data Lakes
- Designing Data Flows including Message Queues, APIs and Web Services
- Distributed Architectures, both centralised and decentralised
Module deliveries for 2024/25 academic year
Last updated
This module description was last updated on 19th August 2024.