Advanced Data Management
Run by School of Computer Science and Electronic Engineering
20 Credits or 10 ECTS Credits
Semester 1 & 2
Overall aims and purpose
This modules aims to:
- provide exposure to the key concpets surrounding large scale data storage, management, queries and analysis.
- provide practical instruction in storing large-to-big data sets.
- expose students to core Big Data concepts, such as the 4 Vs.
- explore the OSEMN lifecycle/process.
- illustrate why traditional storage mechanisms are inappropriate for analysis workloads.
Indicative content includes:
- Data Marts and Warehouses and how they manage data collections.
- Big Data and the 4 Vs (Volume, Variety, Velocity and Veracity).
- Data Quality, the meaning and methods to raise it.
- Data Mining, extracting meaning from data.
- Analysis Tools and processes used to describe and evaluate data collections.
Equivalent to the range 60%-69%. Is able to analyse a task or problem to decide which aspects of theory and knowledge to apply. Solutions are of a workable quality, demonstrating understanding of underlying principles. Major themes can be linked appropriately but may not be able to extend this to individual aspects. Outputs are readily understood, with an appropriate structure but may lack sophistication.
Equivalent to 40%. Uses key areas of theory or knowledge to meet the Learning Outcomes of the module. Is able to formulate an appropriate solution to accurately solve tasks and questions. Can identify individual aspects, but lacks an awareness of links between them and the wider contexts. Outputs can be understood, but lack structure and/or coherence.
Equivalent to the range 70%+. Assemble critically evaluated, relevent areas of knowledge and theory to constuct professional-level solutions to tasks and questions presented. Is able to cross-link themes and aspects to draw considered conclusions. Presents outputs in a cohesive, accurate, and efficient manner.
Critically discuss the various methods for storage large or Big Data data sets.
Construct an appropriate data warehouse for a given data set.
Demonstrate skills to suitably sanitise a given data set.
Explain the definitions and examples of the various types of analytical data.
|COURSEWORK||Identify and Sanitise Exercise||
Identification of Facts, Dimensions and modality of a data-set and recode/sanitise exercise.
|COURSEWORK||Big Data Analysis with Spark||
Analysis of a Big Data data-set using Apache Spark
|COURSEWORK||Construct and Document a Data Warehouse||
Build, document, and populate a data warehouse for an appropriate data-set.
Teaching and Learning Strategy
Traditional lecture (2 hrs x 24 weeks).
Practical laboratories (S1 - 1 hrs x 12; S2 - 2 hrs x 12)
Private study including completing assignments.
- Literacy - Proficiency in reading and writing through a variety of media
- Numeracy - Proficiency in using numbers at appropriate levels of accuracy
- Computer Literacy - Proficiency in using a varied range of computer software
- Self-Management - Able to work unsupervised in an efficient, punctual and structured manner. To examine the outcomes of tasks and events, and judge levels of quality and importance
- Information retrieval - Able to access different and multiple sources of information
- Critical analysis & Problem Solving - Able to deconstruct and analyse problems or complex situations. To find solutions to problems through analyses and exploration of all possibilities using appropriate methods, rescources and creativity.
Subject specific skills
- Knowledge and understanding of facts, concepts, principles & theories
- Use of such knowledge in modelling and design
- Problem solving strategies
- Deploy theory in design, implementation and evaluation of systems
- Deploy tools effectively
- Development of general transferable skills
- Deploy systems to meet business goals
- Specify, deploy, verify and maintain information systems
- System Design
- Knowledge and/or understanding of appropriate scientific and engineering principles
- Knowledge and understanding of computational modelling
- Specify, deploy, verify and maintain computer-based systems
- Principles of appropriate supporting engineering and scientific disciplines
Talis Reading listhttp://readinglists.bangor.ac.uk/modules/ice-3601.html
- Star Schema The Complete Reference, Christopher Adamson, ISBN 9780071744324.
- Getting Started with Data Science: Making Sense of Data with Analytics: Making Sense of Data with Analytics, Murtaza Haider, ISBN 9780133991024.
- The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling (3rd Ed.), Ralph Kimball, ISBN 9781118530801.
Courses including this module
Compulsory in courses:
- H116: BSc Applied Data Science (Degree Apprenticeship) year 3 (BSC/ADS)
- H113: BSc Data Science and Machine Learning year 3 (BSC/DSML)
- H114: BSc Data Science and Visualisation year 3 (BSC/DSV)