Dr Thomas RoellekeSenior LecturerEmail: t.roelleke@qmul.ac.ukTelephone: +44 20 7882 7988Room Number: Peter Landin, CS 423Website: http://www.eecs.qmul.ac.uk/~thorOffice Hours: Tuesday 12:00-14:00TeachingResearchPublicationsTeachingDatabase Systems (Undergraduate)This module is an introduction to databases and their language systems in theory and practice. The main topics covered by the module are: the principles and components of database management systems; the main modelling techniques used in the construction of database systems; implementation of databases using an object-relational database management system; the main relational database language; Object-Oriented database systems; future trends, in particular information retrieval, data warehouses and data mining.There are two timetabled lectures a week, and one-hour tutorial per week (though not every week). There will be timetabled laboratory sessions (two hours a week) for approximately five weeks.ResearchResearch Interests:My research focuses on three related areas:1. information retrieval (IR) models and probability theory2. integration of database (DB) and IR technologies3. DB+IR+AI technology and advanced statistics for data scienceIR models are related to probability theory and the sound derivation of IR models leads to new and general approaches to rank any object, to reason about complex knowledge sources, and to make decisions. Many results of my research over the past 10 years are summarised in the book "IR Models: Foundations and Relationships", Morgan Claypool Publishers, 2013. Currently, my main research interest is in generalisations of probability theory in order to obtain a "new" theory that joins probabilistic and information-theoretic reasoning (logic).The integration of DB and IR (and AI) is an ongoing research challenge, though, in principle, DB and IR do the same: manage and retrieve data. I have developed probabilistic object-relational, logic-based knowledge representations that are useful for solving tasks in the domain of "semantic" (knowledge-rich) information management tasks. This led to POOL (a probabilistic object-oriented logic) based on AI knowledge representations, and the "Relational Bayes", a patented technology (VLDB Journal 2008).Both, probability theory and DB+IR+AI technology produce methods and tools for solving complex data science tasks.Based on the insights into probabilistic reasoning and IR models, and a seamless DB+IR+AI technology, we apply a probabilistic Datalog engine in data science scenarios (data analytics, complex search).Publications Ketola T, Roelleke T (2024). Document structure-driven investigative information retrieval. nameOfConference DOI: 10.1016/j.is.2023.102315 QMRO: https://qmro.qmul.ac.uk/xmlui/handle/123456789/95558 Ketola T, Roelleke T (2023). Automatic and Analytical Field Weighting for Structured Document Retrieval. nameOfConference DOI: 10.1007/978-3-031-28244-7_31 QMRO: qmroHref Ketola T, Roelleke T (2022). Formal Constraints for Structured Document Retrieval. Proceedings of the 2022 ACM SIGIR International Conference on Theory of Information Retrieval DOI: 10.1145/3539813.3545128 QMRO: qmroHref Roelleke T (2013). Information Retrieval Models, Foundations and Relationships. nameOfConference DOI: 10.1007/978-3-031-02328-6 QMRO: qmroHref Bahrani M, Roelleke T (publicationYear). Opinion-Aware Retrieval Models Based on Sentiment and Intensity of Lexical Features. nameOfConference DOI: 10.3233/faia210228 QMRO: https://uat2-qmro.qmul.ac.uk/xmlui/handle/123456789/83456 Bahrani M, Roelleke T (2021). ADOR: A New Medical Dataset for Sentiment-based IR. nameOfConference DOI: doi QMRO: https://qmro.qmul.ac.uk/xmlui/handle/123456789/83455 Bahrani M, Roelleke T (2020). FDCM. Proceedings of the 29th ACM International Conference on Information & Knowledge Management DOI: 10.1145/3340531.3412151 QMRO: https://uat2-qmro.qmul.ac.uk/xmlui/handle/123456789/69429 Ketola T, Roelleke T (2020). BM25-FIC: Information content-based field weighting for BM25F. nameOfConference DOI: doi QMRO: qmroHref Roelleke T, Wang J, Robertson S (2018). Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model. nameOfConference DOI: 10.1007/978-1-4614-8265-9_919 QMRO: qmroHref Lipani A, Roelleke T, Lupu M et al. (2018). A systematic approach to normalization in probabilistic models.. nameOfConference DOI: 10.1007/s10791-018-9334-1 QMRO: https://uat2-qmro.qmul.ac.uk/xmlui/handle/123456789/53567 Roelleke T, Wang J, Robertson S (2016). Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model. nameOfConference DOI: 10.1007/978-1-4899-7993-3_919-2 QMRO: qmroHref Frommholz I, Roelleke T (2016). Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpirit. nameOfConference DOI: 10.1007/s13222-015-0208-z QMRO: https://uat2-qmro.qmul.ac.uk/xmlui/handle/123456789/22300 Milajevs D, Sadrzadeh M, Roelleke T (2015). IR meets NLP. Proceedings of the 2015 International Conference on The Theory of Information Retrieval DOI: 10.1145/2808194.2809448 QMRO: https://uat2-qmro.qmul.ac.uk/xmlui/handle/123456789/32449 Roelleke T, Kaltenbrunner A, Baeza-Yates R (2015). Harmony Assumptions in Information Retrieval and Social Networks. nameOfConference DOI: 10.1093/comjnl/bxv031 QMRO: https://uat2-qmro.qmul.ac.uk/xmlui/handle/123456789/23398 Roelleke T (2013). IR Models. Proceedings of the 2013 Conference on the Theory of Information Retrieval DOI: 10.1145/2499178.2499203 QMRO: qmroHref Martinez-Alvarez M, Bonzanini M, Roelleke T (2013). Mathematical Specification and Logic Modelling in the context of IR. Proceedings of the 2013 Conference on the Theory of Information Retrieval DOI: 10.1145/2499178.2499197 QMRO: qmroHref Roelleke T, Bonzanini M, Martinez-Alvarez M (2013). On the modelling of ranking algorithms in probabilistic datalog. Proceedings of the 7th International Workshop on Ranking in Databases DOI: 10.1145/2524828.2524832 QMRO: qmroHref Bonzanini M, Martinez-Alvarez M, Roelleke T (2013). Extractive summarisation via sentence removal. Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval DOI: 10.1145/2484028.2484149 QMRO: qmroHref Roelleke T (2013). Information Retrieval Models. nameOfConference DOI: 10.2200/s00494ed1v01y201304icr027 QMRO: qmroHref Martinez-Alvarez M, Bellogin A, Roelleke T (2013). Document Difficulty Framework for Semi-automatic Text Classification. nameOfConference DOI: 10.1007/978-3-642-40131-2_10 QMRO: qmroHref Roelleke T, Azzam H, Bonzanini M et al. (2013). The D2Q2 framework: On the relationship and combination of language modelling and TF-IDF. nameOfConference DOI: doi QMRO: qmroHref Bonzanini M, Martinez-Alvarez M, Roelleke T (2012). Investigating the use of extractive summarisation in sentiment classification. nameOfConference DOI: doi QMRO: qmroHref Roelleke T (2012). IR models. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval DOI: 10.1145/2348283.2348535 QMRO: qmroHref Bonzanini M, Martinez-Alvarez M, Roelleke T (2012). Opinion summarisation through sentence extraction. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval DOI: 10.1145/2348283.2348499 QMRO: qmroHref Azzam H, Yayhaei, Roelleke et al. (2012). A Schema-driven Approach for Knowledge-oriented Retrieval and Query Formulation. KEYS 2012, The 3rd International Workshop on Keyword Search and Structured Data DOI: 10.1145/2254736.2254746 QMRO: qmroHref Martinez-Alvarez M, Yahyaei S, Roelleke T (2012). Semi-automatic document classification. nameOfConference DOI: 10.1007/978-3-642-28997-2_43 QMRO: qmroHref Azzam H, Roelleke T, Yahyaei S (2011). Ranking-based processing of SQL queries. Proceedings of the 20th ACM international conference on Information and knowledge management DOI: 10.1145/2063576.2063614 QMRO: qmroHref Blank D, Fuhr N, Henrich A et al. (2011). Teaching IR: Curricular Considerations. nameOfConference DOI: 10.1007/978-3-642-22511-6_3 QMRO: qmroHref Martinez-Alvarez M, Roelleke T (2011). A Descriptive Approach to Classification. nameOfConference DOI: 10.1007/978-3-642-23318-0_27 QMRO: qmroHref Azzam H, Roelleke T (2011). A Generic Data Model for Schema-Driven Design in Information Retrieval Applications. nameOfConference DOI: 10.1007/978-3-642-23318-0_31 QMRO: qmroHref Yahyaei S, Bonzanini M, Roelleke T (2011). Cross-Lingual Text Fragment Alignment Using Divergence from Randomness. nameOfConference DOI: 10.1007/978-3-642-24583-1_3 QMRO: qmroHref Azzam H, Klampanos IA, Roelleke T (2011). Large-Scale Logical Retrieval: Technology for Semantic Modelling of Patent Search. nameOfConference DOI: 10.1007/978-3-642-19231-9_9 QMRO: qmroHref Smeraldi F, Martinez-Alvarez M, Frommholz I et al. (2011). On the probabilistic logical modelling of quantum and geometrically–inspired IR. nameOfConference DOI: doi QMRO: qmroHref Azzam H, Roelleke T (2010). An attribute-based model for semantic retrieval. nameOfConference DOI: doi QMRO: qmroHref Azzam H, Roelleke T (2010). SQR. Proceedings of the third workshop on Exploiting semantic annotations in information retrieval DOI: 10.1145/1871962.1871976 QMRO: qmroHref Gurrin C, He Y, Kazai G et al. (2010). Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface. nameOfConference DOI: doi QMRO: qmroHref Klampanos IA, Wu HZ, Roelleke T et al. (2010). Logic-Based Retrieval: Technology for Content-Oriented and Analytical Querying of Patent Data. nameOfConference DOI: 10.1007/978-3-642-13084-7_9 QMRO: qmroHref Martinez-Alvarez M, Roelleke T (2010). Modelling Probabilistic Inference Networks and Classification in Probabilistic Datalog. nameOfConference DOI: 10.1007/978-3-642-15951-0_27 QMRO: qmroHref Gurrin C, He YL, Kazai G et al. (2010). Recent Developments in Information Retrieval. nameOfConference DOI: 10.1007/978-3-642-12275-0_1 QMRO: qmroHref Gurrin C, He Y, Kazai G et al. (2010). Recent developments in information retrieval. nameOfConference DOI: 10.1007/978-3-642-12275-0_1 QMRO: qmroHref Klampanos IA, Azzam H, Roelleke T (2009). A case for probabilistic logic for scalable patent retrieval. Proceedings of the 2nd international workshop on Patent information retrieval DOI: 10.1145/1651343.1651345 QMRO: qmroHref Forst JF, Tombros A, Roelleke T (2009). Less Is More: Maximal Marginal Relevance as a Summarisation Feature. nameOfConference DOI: 10.1007/978-3-642-04417-5_37 QMRO: qmroHref Roelleke T, Wang J, Robertson S (2009). Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model. nameOfConference DOI: 10.1007/978-0-387-39940-9_919 QMRO: qmroHref Wu HZ, Roelleke T (2009). Semi-subsumed Events: A Probabilistic Semantics of the BM25 Term Frequency Quantification. nameOfConference DOI: 10.1007/978-3-642-04417-5_43 QMRO: qmroHref Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR integration. nameOfConference DOI: 10.1145/1480506.1480522 QMRO: qmroHref Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR integration. nameOfConference DOI: 10.1145/1462571.1462584 QMRO: qmroHref Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR Integration: Report on the Dagstuhl Seminar "Ranked XML Querying". nameOfConference DOI: doi QMRO: qmroHref Amer-Yahia S, Hiemstra D, Roelleke T et al. (2008). DB&IR Integration: Report on the Dagstuhl Seminar "Ranked XML Querying". nameOfConference DOI: doi QMRO: qmroHref Roelleke T, Wu H, Wang J et al. (2008). Modelling retrieval models in a probabilistic relational algebra with a new operator: the relational Bayes. nameOfConference DOI: 10.1007/s00778-007-0073-y QMRO: qmroHref ROELLEKE T, Wang J (2008). TF-IDF Uncovered: A Study of Theories and Probabilities. 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval DOI: 10.1145/1390334.1390409 QMRO: qmroHref Forst JF, Roelleke T, Tombros A (2007). Modelling a summarisation logic in probabilistic datalog. nameOfConference DOI: doi QMRO: qmroHref (2007). TOIS reviewers January 2006 through May 2007. nameOfConference DOI: 10.1145/1281485.1281486 QMRO: qmroHref Forst JF, Tombros A, Rölleke T (2006). Solving the enterprise TREC task with probabilistic data models. nameOfConference DOI: doi QMRO: qmroHref Fuhr N, Rölleke T (1998). HySpirit — A probabilistic inference engine for hypermedia retrieval in large databases. nameOfConference DOI: 10.1007/bfb0100975 QMRO: qmroHref ROELLEKE T, Wang J (2006). A Parallel Derivation of Probabilistic Retrieval Models. 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, US DOI: 10.1145/1148170.1148192 QMRO: qmroHref Rölleke T, Fuhr N (1998). Querying for facts and content in hypermedia documents. nameOfConference DOI: 10.1007/bfb0056013 QMRO: qmroHref Rolleke T, Tsikrika T, Kazai G (2006). A general matrix framework for modelling Information Retrieval. nameOfConference DOI: 10.1016/j.ipm.2004.11.006 QMRO: qmroHref Wang J, Roelleke T (2006). Context-specific frequencies and discriminativeness for the retrieval of structured documents. nameOfConference DOI: 10.1007/11735106_69 QMRO: qmroHref Amer-Yahia S, Case P, Rolleke T et al. (2005). Report on the DB/IR panel at SIGMOD 2005. nameOfConference DOI: 10.1145/1107499.1107514 QMRO: qmroHref Roelleke T, Ashoori E, Wu H et al. (2005). The QMUL team with probabilistic SQL at enterprise track. nameOfConference DOI: doi QMRO: qmroHref ROELLEKE T, de Vries A (2005). Relevance Information: A Loss of Entropy but a Gain for IDF?. 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil DOI: 10.1145/1076034.1076084 QMRO: qmroHref Szlavik Z, Rolleke T (2005). Building and experimenting with a heterogeneous collection. nameOfConference DOI: 10.1007/11424550_28 QMRO: qmroHref Baeza-Yates R, Maarek YS, Roelleke T et al. (2004). Third edition of the "XML and information retrieval" workshop first workshop on integration of IR and DB (WIRD) jointly held at SIGIR'2004, Sheffield, UK, July 29th, 2004. nameOfConference DOI: 10.1145/1041394.1041400 QMRO: qmroHref Lalmas M, Rolleke T (2004). Modelling vague content and structure querying in XML retrieval with a probabilistic object-relational framework. nameOfConference DOI: 10.1007/978-3-540-25957-2_34 QMRO: qmroHref ROELLEKE T (2003). A Frequency-based and a Poisson-based Definition of the Probability of Being Informative. 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada DOI: 10.1145/860435.860478 QMRO: qmroHref Lalmas M, Rolleke T (2003). Four-valued knowledge augmentation for structured document retrieval. nameOfConference DOI: 10.1142/S0218488503001953 QMRO: qmroHref Roelleke T (2003). A Frequency-based and a Poisson-based Definition of the Probability of Being Informative. nameOfConference DOI: 10.1145/860476.860478 QMRO: qmroHref LALMAS M, Roelleke T, Ruthven I (2003). Abductive retrieval for multimedia information seeking. 10th International Conference on Human - Computer Interaction, HCI International, Crete, Greece, vol. 4 DOI: doi QMRO: qmroHref Lalmas M, Rölleke T, Fuhr N (2003). Intelligent Retrieval of Hypermedia Documents. nameOfConference DOI: 10.1007/978-3-7908-1772-0_20 QMRO: qmroHref Kazai G, Lalmas M, Roelleke T (2002). Focussed Structured Document Retrieval. nameOfConference DOI: 10.1007/3-540-45735-6_21 QMRO: qmroHref Pearmain A, Lalmas M, Moutogianni E et al. (2002). Using MPEG-7 at the consumer terminal in broadcasting. nameOfConference DOI: 10.1155/S1110865702000756 QMRO: qmroHref Lalmas M, Roelleke T (2002). Four-valued knowledge augmentation for representing structured documents. nameOfConference DOI: 10.1007/3-540-48050-1_19 QMRO: qmroHref Lalmas L, ROELLEKE T, Fuhr N (2002). Intelligent Hypermedia Retrieval. nameOfConference DOI: doi QMRO: qmroHref Roelleke T, Lalmas M, Kazai G et al. (2002). The accessibility dimension for structured document retrieval. nameOfConference DOI: 10.1007/3-540-45886-7_19 QMRO: qmroHref Healey P, LALMAS M, Roelleke T et al. (2002). Using MPEG7 at the Consumer Terminal in Broadcasting. nameOfConference DOI: doi QMRO: qmroHref Rölleke T, Lübeck R, Kazai G (2001). The HySpirit retrieval platform. Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval DOI: 10.1145/383952.384095 QMRO: qmroHref Kazai G, Lalmas M, Rolleke T (2001). A model for the representation and focussed retrieval of structured documents based on fuzzy aggregation. nameOfConference DOI: 10.1109/SPIRE.2001.989746 QMRO: qmroHref Lalmas M, Rolleke T, Turra F et al. (2001). Concepts for a graphical user interface for hypermedia retrieval. nameOfConference DOI: 10.1007/978-3-7908-1834-5_28 QMRO: qmroHref Pearmain A, Lalmas M, Moutogianni E et al. (2001). Using MPEG-7 at the consumer terminal in broadcasting. WIAMIS 2001 Workshop on Image Analysis for Multimedia Services DOI: 10.1155/s1110865702000756 QMRO: qmroHref Fuhr N, Gövert N, Rölleke T (1998). DOLORES: A System for Logic-Based Retrieval Objects. nameOfConference DOI: 10.1145/290941.291005 QMRO: qmroHref Fuhr N, Gövert N, Rölleke T (1998). DOLORES: a system for logic-based retrieval of multimedia objects. nameOfConference DOI: 10.1145/290941.291005 QMRO: qmroHref Fuhr N, Rölleke T (1997). A probabilistic relational algebra for the integration of information retrieval and database systems. nameOfConference DOI: 10.1145/239041.239045 QMRO: qmroHref Roelleke T, Fuhr N (1996). Retrieval of complex objects using a four-valued logic. nameOfConference DOI: doi QMRO: qmroHref