CSC1033: Information Storage and Retrieval
- Offered for Year: 2019/20
- Module Leader(s): Dr Marie Devlin
- Lecturer: Dr Dan Nesbitt, Dr John Colquhoun
- Owning School: Computing
- Teaching Location: Newcastle City Campus
|Semester 1 Credit Value:||10|
|Semester 2 Credit Value:||10|
This module will provide students with an understanding of information storage and retrieval. This relates to all forms of data, including text and multimedia (image, video and audio) stored on and consumed from the web, amongst other sources. The module covers fundamental techniques and strategies of information storage and retrieval used in a variety of online applications such as web-search engines, document matching systems and business storage and analytics.
Outline Of Syllabus
• Retrieval, browsing, user information needs, and other core concerns.
• Notions of structured, unstructured and semi-structured data
• Relational databases, SQL
• Exemplar applications including business data collections and website design.
• Data models and query languages
• Metadata and semantics, faceted classifications, and other “linked data” issues
• Image and video features and classifications that enable access to other media types
• Data standards and governance
• Query expansion and its relationship with the Semantic Web
• Spiders/crawlers, stopwords and keywords, indexing and stemming
• Exemplar applications including publishing multimedia data archives, web-based search engines
• Information models, databases and data normalization for transactional systems (OLTP)
• Data de-normalization, data marts / data warehouses, star and snowflake schemas, and cubes as support for analytical systems (OLAP)
• Exemplar applications, including organisation-wide analytics, e-commerce
• The challenges presented by “Big Data”
• NoSQL and Cloud Computing for distributed and scalable treatment of “Big Data”.
• Overview of IoT and edge computing
• Exemplar applications such as real-time data processing, smart video stream surveillance, urban observatory data publishing.
|Category||Activity||Number||Length||Student Hours||Comment|
|Guided Independent Study||Assessment preparation and completion||24||1:00||24:00||Lecture follow-up|
|Scheduled Learning And Teaching Activities||Lecture||24||2:00||48:00||Lectures|
|Guided Independent Study||Assessment preparation and completion||24||1:00||24:00||Skills practice: practical follow-up and technique practice/tool use|
|Guided Independent Study||Assessment preparation and completion||44||0:30||22:00||Revision for exam|
|Guided Independent Study||Assessment preparation and completion||1||2:00||2:00||Examination|
|Scheduled Learning And Teaching Activities||Practical||24||1:00||24:00||Computer classroom|
|Guided Independent Study||Independent study||56||1:00||56:00||Background reading|
Teaching Rationale And Relationship
Techniques and theory are presented in lectures. Classroom-based practical sessions provide experience of designing and building database applications.
This is a very practical subject, so it is important that the learning materials are supported by hands-on opportunities, both in the practical classes and in the related Programming Portfolio modules.
The format of resits will be determined by the Board of Examiners.
Assessment Rationale And Relationship
In the written examination, the questions will assess fundamental knowledge and understanding of the theory and application of database design and usage.
Practical exercises are set during the practical lab classes.