INFORMATION RETRIEVAL - 2017/8

Module code: COM2034

Module provider

Computer Science

Module Leader

GILLAM L Dr (Computer Sci)

Number of Credits

15

ECTS Credits

15

Framework

FHEQ Level 5

JACS code

I100

Module cap (Maximum number of students)

N/A

Module Availability

Semester 1

Overall student workload

Independent Study Hours: 120

Lecture Hours: 24

Laboratory Hours: 11

Assessment pattern

Assessment type    Unit of assessment    Weighting
Coursework         COURSEWORK            20%
Examination        EXAM 2 HOURS          80%

Alternative Assessment

N/A

Prerequisites / Co-requisites

N/A

Module overview

This module will provide students with an understanding of information retrieval. This relates to multimedia data (principally text, but also image, video and audio) stored for, presented on, and consumed from, the web amongst other sources. The module covers fundamental techniques and strategies of information retrieval used in a variety of online applications such as web-search engines, document matching systems, and business storage and analytics.

Module aims

Help students to gain an understanding of the current study of information retrieval

Provide practical understanding of how data are represented for storage, analysis and use in particular applications.

Learning outcomes

Learning outcome (Attributes Developed)
Explain theories behind search and assess the impacts on search performance inherent in variations in their construction (KC)
Elaborate a range of techniques for analysing, modelling, and retrieving text documents (KCT)
Contrast different kinds of applications, and their integration, in satisfying specific user information needs (KCT)
Elaborate, contrast and evaluate information models that support efficient storage, retrieval and browsing, in a variety of applications (KC)
Contrast the need for efficiency of data storage with the needs of batch access to large datasets (KCT)
Apply appropriate, standard, metadata sets and semantics to ensure effective data storage and curation (KP)
Identify the important features for storage, retrieval and browsing of non-textual data (KCT)

Attributes Developed

C - Cognitive/analytical

K - Subject knowledge

T - Transferable skills

P - Professional/Practical skills

Module content

·         Retrieval, browsing, user information needs, and other core concerns.

·         Notions of structured, unstructured and semi-structured data

·         A generic architecture for information retrieval

·         Spiders/crawlers, stopwords and keywords, indexing and stemming.

·         Boolean retrieval, ranked retrieval, and vector spaces

·         Query expansion and its relationship with the Semantic Web.

·         Assessing relevance - precision and recall

·         Metadata and semantics, faceted classifications, and other “linked data” issues.

·         Information models, databases and data normalization for transactional systems (OLTP)

·         Data de-normalization, data marts / data warehouses, star and snowflake schemas, and cubes in support of analytical systems (OLAP) and Business Intelligence.

·         The challenges presented by “Big Data”

·         NoSQL and Cloud Computing for distributed and scalable treatment of “Big Data”.

·         Image and video features and classifications that enable access to other media types

·         Exemplar applications, including web-based search engines, organisation-wide archives, business data collections, and media collections. 

Methods of Teaching / Learning

The learning and teaching strategy is designed to:

Develop an understanding of the principles and role of information retrieval and closely related applications

The learning and teaching methods include:

·         Lectures, including case studies

·         Occasional set reading

·         In-class discussions

·         In-class and out-of-class exercises

·         Lab sessions

·         Coursework.

Assessment Strategy

The assessment strategy is designed to provide students with the opportunity to demonstrate:

Explaining theories behind search and assessing the impacts on search performance inherent in variations in their construction.

Elaborating a range of techniques for analysing, modelling, and retrieving text documents.

Contrasting different kinds of applications, and their integration, in satisfying specific user information needs.

Elaborating, contrasting and evaluating information models that support efficient storage, retrieval and browsing, in a variety of applications.

Contrasting the need for efficiency of data storage with the needs of batch access to large datasets.

Applying appropriate, standard, metadata sets and semantics to ensure effective data storage and curation.

Identifying the important features for storage, retrieval and browsing of non-textual data.

Thus, the summative assessment for this module consists of:



A coursework that will involve applying and evaluating various concepts and principles introduced in lectures and tested in lab sessions. Specific software and analytical approaches will be explored in these assessments. Submissions will be made through the VLE, with the deadline towards the end of the module. The coursework may assess against all relevant learning outcomes addressed suitably in advance of the deadline.


A 2-hour unseen written examination comprising a mixture of short answer and discussion questions. The examination paper may assess against all learning outcomes.

Formative assessment and feedback

Students will progressively complete structured lab workbooks, each of which must be submitted before progressing to the next. On submission, informative solutions are provided so that students can gauge their progress throughout the module.

Reading list

Reading list for INFORMATION RETRIEVAL : http://aspire.surrey.ac.uk/modules/com2034

Please note that the information detailed within this record is accurate at the time of publishing and may be subject to change. This record contains information for the most up to date version of the programme / module for the 2017/8 academic year.