Skip to main content
School of Electronic Engineering and Computer Science

Dr Thomas Roelleke

Thomas

Senior Lecturer

Email: t.roelleke@qmul.ac.uk
Telephone: +44 20 7882 7988
Room Number: Peter Landin, CS 423
Website: http://www.eecs.qmul.ac.uk/~thor
Office Hours: Tuesday 12:00-14:00

Teaching

Database Systems (Undergraduate)

This module is an introduction to databases and their language systems in theory and practice. The main topics covered by the module are: the principles and components of database management systems; the main modelling techniques used in the construction of database systems; implementation of databases using an object-relational database management system; the main relational database language; Object-Oriented database systems; future trends, in particular information retrieval, data warehouses and data mining.There are two timetabled lectures a week, and one-hour tutorial per week (though not every week). There will be timetabled laboratory sessions (two hours a week) for approximately five weeks.

Research

Research Interests:

My research focuses on three related areas:
1. information retrieval (IR) models and probability theory
2. integration of database (DB) and IR technologies
3. DB+IR+AI technology and advanced statistics for data science

IR models are related to probability theory and the sound derivation of IR models leads to new and general approaches to rank any object, to reason about complex knowledge sources, and to make decisions. Many results of my research over the past 10 years are summarised in the book "IR Models: Foundations and Relationships", Morgan Claypool Publishers, 2013. Currently, my main research interest is in generalisations of probability theory in order to obtain a "new" theory that joins probabilistic and information-theoretic reasoning (logic).

The integration of DB and IR (and AI) is an ongoing research challenge, though, in principle, DB and IR do the same: manage and retrieve data. I have developed probabilistic object-relational, logic-based knowledge representations that are useful for solving tasks in the domain of "semantic" (knowledge-rich) information management tasks. This led to POOL (a probabilistic object-oriented logic) based on AI knowledge representations, and the "Relational Bayes", a patented technology (VLDB Journal 2008).

Both, probability theory and DB+IR+AI technology produce methods and tools for solving complex data science tasks.

Based on the insights into probabilistic reasoning and IR models, and a seamless DB+IR+AI technology, we apply a probabilistic Datalog engine in data science scenarios (data analytics, complex search).

Publications

  • Ketola T, Roelleke T (2024). Document structure-driven investigative information retrieval. nameOfConference


    QMRO: qmroHref
  • Ketola T, Roelleke T (2023). Automatic and Analytical Field Weighting for Structured Document Retrieval. nameOfConference


    QMRO: qmroHref
  • Ketola T, Roelleke T (2022). Formal Constraints for Structured Document Retrieval. Proceedings of the 2022 ACM SIGIR International Conference on Theory of Information Retrieval


    QMRO: qmroHref
  • Roelleke T (2013). Information Retrieval Models, Foundations and Relationships. nameOfConference


    QMRO: qmroHref
  • Bahrani M, Roelleke T (publicationYear). Opinion-Aware Retrieval Models Based on Sentiment and Intensity of Lexical Features. nameOfConference


    QMRO: qmroHref
  • Bahrani M, Roelleke T (2021). ADOR: A New Medical Dataset for Sentiment-based IR. nameOfConference

    DOI: doi

  • Bahrani M, Roelleke T (2020). FDCM. Proceedings of the 29th ACM International Conference on Information & Knowledge Management


  • Ketola T, Roelleke T (2020). BM25-FIC: Information content-based field weighting for BM25F. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Roelleke T, Wang J, Robertson S (2018). Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model. nameOfConference


    QMRO: qmroHref
  • Lipani A, Roelleke T, Lupu M et al. (2018). A systematic approach to normalization in probabilistic models.. nameOfConference


  • Roelleke T, Wang J, Robertson S (2016). Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model. nameOfConference


    QMRO: qmroHref
  • Frommholz I, Roelleke T (2016). Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpirit. nameOfConference


  • Milajevs D, Sadrzadeh M, Roelleke T (2015). IR meets NLP. Proceedings of the 2015 International Conference on The Theory of Information Retrieval


  • Roelleke T, Kaltenbrunner A, Baeza-Yates R (2015). Harmony Assumptions in Information Retrieval and Social Networks. nameOfConference


  • Roelleke T (2013). IR Models. Proceedings of the 2013 Conference on the Theory of Information Retrieval


    QMRO: qmroHref
  • Martinez-Alvarez M, Bonzanini M, Roelleke T (2013). Mathematical Specification and Logic Modelling in the context of IR. Proceedings of the 2013 Conference on the Theory of Information Retrieval


    QMRO: qmroHref
  • Roelleke T, Bonzanini M, Martinez-Alvarez M (2013). On the modelling of ranking algorithms in probabilistic datalog. Proceedings of the 7th International Workshop on Ranking in Databases


    QMRO: qmroHref
  • Bonzanini M, Martinez-Alvarez M, Roelleke T (2013). Extractive summarisation via sentence removal. Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval


    QMRO: qmroHref
  • Roelleke T (2013). Information Retrieval Models. nameOfConference


    QMRO: qmroHref
  • Martinez-Alvarez M, Bellogin A, Roelleke T (2013). Document Difficulty Framework for Semi-automatic Text Classification. nameOfConference


    QMRO: qmroHref
  • Roelleke T, Azzam H, Bonzanini M et al. (2013). The D2Q2 framework: On the relationship and combination of language modelling and TF-IDF. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Bonzanini M, Martinez-Alvarez M, Roelleke T (2012). Investigating the use of extractive summarisation in sentiment classification. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Roelleke T (2012). IR models. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval


    QMRO: qmroHref
  • Bonzanini M, Martinez-Alvarez M, Roelleke T (2012). Opinion summarisation through sentence extraction. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval


    QMRO: qmroHref
  • Azzam H, Yayhaei, Roelleke et al. (2012). A Schema-driven Approach for Knowledge-oriented Retrieval and Query Formulation. KEYS 2012, The 3rd International Workshop on Keyword Search and Structured Data


    QMRO: qmroHref
  • Martinez-Alvarez M, Yahyaei S, Roelleke T (2012). Semi-automatic document classification. nameOfConference


    QMRO: qmroHref
  • Azzam H, Roelleke T, Yahyaei S (2011). Ranking-based processing of SQL queries. Proceedings of the 20th ACM international conference on Information and knowledge management


    QMRO: qmroHref
  • Blank D, Fuhr N, Henrich A et al. (2011). Teaching IR: Curricular Considerations. nameOfConference


    QMRO: qmroHref
  • Martinez-Alvarez M, Roelleke T (2011). A Descriptive Approach to Classification. nameOfConference


    QMRO: qmroHref
  • Azzam H, Roelleke T (2011). A Generic Data Model for Schema-Driven Design in Information Retrieval Applications. nameOfConference


    QMRO: qmroHref
  • Yahyaei S, Bonzanini M, Roelleke T (2011). Cross-Lingual Text Fragment Alignment Using Divergence from Randomness. nameOfConference


    QMRO: qmroHref
  • Azzam H, Klampanos IA, Roelleke T (2011). Large-Scale Logical Retrieval: Technology for Semantic Modelling of Patent Search. nameOfConference


    QMRO: qmroHref
  • Smeraldi F, Martinez-Alvarez M, Frommholz I et al. (2011). On the probabilistic logical modelling of quantum and geometrically–inspired IR. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Azzam H, Roelleke T (2010). An attribute-based model for semantic retrieval. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Azzam H, Roelleke T (2010). SQR. Proceedings of the third workshop on Exploiting semantic annotations in information retrieval


    QMRO: qmroHref
  • Gurrin C, He Y, Kazai G et al. (2010). Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Klampanos IA, Wu HZ, Roelleke T et al. (2010). Logic-Based Retrieval: Technology for Content-Oriented and Analytical Querying of Patent Data. nameOfConference


    QMRO: qmroHref
  • Martinez-Alvarez M, Roelleke T (2010). Modelling Probabilistic Inference Networks and Classification in Probabilistic Datalog. nameOfConference


    QMRO: qmroHref
  • Gurrin C, He YL, Kazai G et al. (2010). Recent Developments in Information Retrieval. nameOfConference


    QMRO: qmroHref
  • Gurrin C, He Y, Kazai G et al. (2010). Recent developments in information retrieval. nameOfConference


    QMRO: qmroHref
  • Klampanos IA, Azzam H, Roelleke T (2009). A case for probabilistic logic for scalable patent retrieval. Proceedings of the 2nd international workshop on Patent information retrieval


    QMRO: qmroHref
  • Forst JF, Tombros A, Roelleke T (2009). Less Is More: Maximal Marginal Relevance as a Summarisation Feature. nameOfConference


    QMRO: qmroHref
  • Roelleke T, Wang J, Robertson S (2009). Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model. nameOfConference


    QMRO: qmroHref
  • Wu HZ, Roelleke T (2009). Semi-subsumed Events: A Probabilistic Semantics of the BM25 Term Frequency Quantification. nameOfConference


    QMRO: qmroHref
  • Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR integration. nameOfConference


    QMRO: qmroHref
  • Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR integration. nameOfConference


    QMRO: qmroHref
  • Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR Integration: Report on the Dagstuhl Seminar "Ranked XML Querying". nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR Integration: Report on the Dagstuhl Seminar "Ranked XML Querying". nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Roelleke T, Wu H, Wang J et al. (2008). Modelling retrieval models in a probabilistic relational algebra with a new operator: the relational Bayes. nameOfConference


    QMRO: qmroHref
  • ROELLEKE T, Wang J (2008). TF-IDF Uncovered: A Study of Theories and Probabilities. 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval


    QMRO: qmroHref
  • Forst JF, Roelleke T, Tombros A (2007). Modelling a summarisation logic in probabilistic datalog. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Forst JF, Tombros A, Rölleke T (2006). Solving the enterprise TREC task with probabilistic data models. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Fuhr N, Rölleke T (1998). HySpirit — A probabilistic inference engine for hypermedia retrieval in large databases. nameOfConference


    QMRO: qmroHref
  • ROELLEKE T, Wang J (2006). A Parallel Derivation of Probabilistic Retrieval Models. 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, US


    QMRO: qmroHref
  • Rölleke T, Fuhr N (1998). Querying for facts and content in hypermedia documents. nameOfConference


    QMRO: qmroHref
  • Rolleke T, Tsikrika T, Kazai G (2006). A general matrix framework for modelling Information Retrieval. nameOfConference


    QMRO: qmroHref
  • Wang J, Roelleke T (2006). Context-specific frequencies and discriminativeness for the retrieval of structured documents. nameOfConference


    QMRO: qmroHref
  • Amer-Yahia S, Case P, Rolleke T et al. (2005). Report on the DB/IR panel at SIGMOD 2005. nameOfConference


    QMRO: qmroHref
  • Roelleke T, Ashoori E, Wu H et al. (2005). The QMUL team with probabilistic SQL at enterprise track. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • ROELLEKE T, de Vries A (2005). Relevance Information: A Loss of Entropy but a Gain for IDF?. 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil


    QMRO: qmroHref
  • Szlavik Z, Rolleke T (2005). Building and experimenting with a heterogeneous collection. nameOfConference


    QMRO: qmroHref
  • Baeza-Yates R, Maarek YS, Roelleke T et al. (2004). Third edition of the "XML and information retrieval" workshop first workshop on integration of IR and DB (WIRD) jointly held at SIGIR'2004, Sheffield, UK, July 29th, 2004. nameOfConference


    QMRO: qmroHref
  • Lalmas M, Rolleke T (2004). Modelling vague content and structure querying in XML retrieval with a probabilistic object-relational framework. nameOfConference


    QMRO: qmroHref
  • ROELLEKE T (2003). A Frequency-based and a Poisson-based Definition of the Probability of Being Informative. 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada


    QMRO: qmroHref
  • Lalmas M, Rolleke T (2003). Four-valued knowledge augmentation for structured document retrieval. nameOfConference


    QMRO: qmroHref
  • Roelleke T (2003). A Frequency-based and a Poisson-based Definition of the Probability of Being Informative. nameOfConference


    QMRO: qmroHref
  • LALMAS M, Roelleke T, Ruthven I (2003). Abductive retrieval for multimedia information seeking. 10th International Conference on Human - Computer Interaction, HCI International, Crete, Greece, vol. 4

    DOI: doi

    QMRO: qmroHref
  • Lalmas M, Rölleke T, Fuhr N (2003). Intelligent Retrieval of Hypermedia Documents. nameOfConference


    QMRO: qmroHref
  • Kazai G, Lalmas M, Roelleke T (2002). Focussed Structured Document Retrieval. nameOfConference


    QMRO: qmroHref
  • Pearmain A, Lalmas M, Moutogianni E et al. (2002). Using MPEG-7 at the consumer terminal in broadcasting. nameOfConference


    QMRO: qmroHref
  • Lalmas M, Roelleke T (2002). Four-valued knowledge augmentation for representing structured documents. nameOfConference


    QMRO: qmroHref
  • Lalmas L, ROELLEKE T, Fuhr N (2002). Intelligent Hypermedia Retrieval. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Roelleke T, Lalmas M, Kazai G et al. (2002). The accessibility dimension for structured document retrieval. nameOfConference


    QMRO: qmroHref
  • Healey P, LALMAS M, Roelleke T et al. (2002). Using MPEG7 at the Consumer Terminal in Broadcasting. nameOfConference

    DOI: doi

    QMRO: qmroHref
  • Rölleke T, Lübeck R, Kazai G (2001). The HySpirit retrieval platform. Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval


    QMRO: qmroHref
  • Kazai G, Lalmas M, Rolleke T (2001). A model for the representation and focussed retrieval of structured documents based on fuzzy aggregation. nameOfConference


    QMRO: qmroHref
  • Lalmas M, Rolleke T, Turra F et al. (2001). Concepts for a graphical user interface for hypermedia retrieval. nameOfConference


    QMRO: qmroHref
  • Pearmain A, Lalmas M, Moutogianni E et al. (2001). Using MPEG-7 at the consumer terminal in broadcasting. WIAMIS 2001 Workshop on Image Analysis for Multimedia Services


    QMRO: qmroHref
  • Fuhr N, Gövert N, Rölleke T (1998). DOLORES: A System for Logic-Based Retrieval Objects. nameOfConference


    QMRO: qmroHref
  • Fuhr N, Gövert N, Rölleke T (1998). DOLORES: a system for logic-based retrieval of multimedia objects. nameOfConference


    QMRO: qmroHref
  • Fuhr N, Rölleke T (1997). A probabilistic relational algebra for the integration of information retrieval and database systems. nameOfConference


    QMRO: qmroHref
  • Roelleke T, Fuhr N (1996). Retrieval of complex objects using a four-valued logic. nameOfConference

    DOI: doi

    QMRO: qmroHref
Back to top