Advanced Data Management
Rhedir gan School of Computer Science and Electronic Engineering
20 Credyd neu 10 Credyd ECTS
Semester 1 a 2
This modules aims to:
- provide exposure to the key concpets surrounding large scale data storage, management, queries and analysis.
- provide practical instruction in storing large-to-big data sets.
- expose students to core Big Data concepts, such as the 4 Vs.
- explore the OSEMN lifecycle/process.
- illustrate why traditional storage mechanisms are inappropriate for analysis workloads.
Indicative content includes:
- Data Marts and Warehouses and how they manage data collections.
- Big Data and the 4 Vs (Volume, Variety, Velocity and Veracity).
- Data Quality, the meaning and methods to raise it.
- Data Mining, extracting meaning from data.
- Analysis Tools and processes used to describe and evaluate data collections.
Equivalent to the range 60%-69%. Is able to analyse a task or problem to decide which aspects of theory and knowledge to apply. Solutions are of a workable quality, demonstrating understanding of underlying principles. Major themes can be linked appropriately but may not be able to extend this to individual aspects. Outputs are readily understood, with an appropriate structure but may lack sophistication.
Equivalent to 40%. Uses key areas of theory or knowledge to meet the Learning Outcomes of the module. Is able to formulate an appropriate solution to accurately solve tasks and questions. Can identify individual aspects, but lacks an awareness of links between them and the wider contexts. Outputs can be understood, but lack structure and/or coherence.
Equivalent to the range 70%+. Assemble critically evaluated, relevent areas of knowledge and theory to constuct professional-level solutions to tasks and questions presented. Is able to cross-link themes and aspects to draw considered conclusions. Presents outputs in a cohesive, accurate, and efficient manner.
Critically discuss the various methods for storage large or Big Data data sets.
Construct an appropriate data warehouse for a given data set.
Demonstrate skills to suitably sanitise a given data set.
Explain the definitions and examples of the various types of analytical data.
|GWAITH CWRS||Identify and Sanitise Exercise||
Identification of Facts, Dimensions and modality of a data-set and recode/sanitise exercise.
|GWAITH CWRS||Big Data Analysis with Spark||
Analysis of a Big Data data-set using Apache Spark
|GWAITH CWRS||Construct and Document a Data Warehouse||
Build, document, and populate a data warehouse for an appropriate data-set.
Strategaeth addysgu a dysgu
Traditional lecture (2 hrs x 24 weeks).
Practical laboratories (S1 - 1 hrs x 12; S2 - 2 hrs x 12)
Private study including completing assignments.
- Llythrennedd - Medrusrwydd mewn darllen ac ysgrifennu drwy amrywiaeth o gyfryngau
- Rhifedd - Medrusrwydd wrth ddefnyddio rhifau ar lefelau priodol o gywirdeb
- Defnyddio cyfrifiaduron - Medrusrwydd wrth ddefnyddio ystod o feddalwedd cyfrifiadurol
- Hunanreolaeth - Gallu gweithio mewn ffordd effeithlon, prydlon a threfnus. Gallu edrych ar ganlyniadau tasgau a digwyddiadau, a barnu lefelau o ansawdd a phwysigrwydd
- Adalw gwybodaeth - Gallu mynd at wahanol ac amrywiol ffynonellau gwybodaeth
- Dadansoddi Beirniadol & Datrys Problem - Gallu dadelfennu a dadansoddi problemau neu sefyllfaoedd cymhleth. Gallu canfod atebion i broblemau drwy ddadansoddiadau ac archwilio posibiliadau
Sgiliau pwnc penodol
- Knowledge and understanding of facts, concepts, principles & theories
- Use of such knowledge in modelling and design
- Problem solving strategies
- Deploy theory in design, implementation and evaluation of systems
- Deploy tools effectively
- Development of general transferable skills
- Deploy systems to meet business goals
- Specify, deploy, verify and maintain information systems
- System Design
- Knowledge and/or understanding of appropriate scientific and engineering principles
- Knowledge and understanding of computational modelling
- Specify, deploy, verify and maintain computer-based systems
- Principles of appropriate supporting engineering and scientific disciplines
Rhestrau Darllen Bangor (Talis)http://readinglists.bangor.ac.uk/modules/ice-3601.html
- Star Schema The Complete Reference, Christopher Adamson, ISBN 9780071744324.
- Getting Started with Data Science: Making Sense of Data with Analytics: Making Sense of Data with Analytics, Murtaza Haider, ISBN 9780133991024.
- The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling (3rd Ed.), Ralph Kimball, ISBN 9781118530801.
Cyrsiau sy’n cynnwys y modiwl hwn
Gorfodol mewn cyrsiau:
- H116: BSc Applied Data Science (Degree Apprenticeship) year 3 (BSC/ADS)
- H113: BSc Data Science and Machine Learning year 3 (BSC/DSML)
- H114: BSc Data Science and Visualisation year 3 (BSC/DSV)