Research Areas
You can find the complete list of articles on my Google Scholar or DBLP profiles.
Experience in reviewing or organizing workshops, challenges, special issues, conferences, and journals
Experience in supervision, open research topics
Research Topics & Publications
Large Language Models and Dialogue
I study the evaluation and grounding of LLMs in interactive and multimodal settings, with a focus on behavioral benchmarking, reasoning costs, and grounding in conversation.
- Hakimov, S., Bernard, R., Leiber, T., Osswald, K., Richert, K., Yang, R., Bernardi, R., and Schlangen, D. (2026). The Price of Thought: A Multilingual Analysis of Reasoning, Performance, and Cost of Negotiation in Large Language Models. EACL 2026
- Yang, J., Feldhus, N., Mohtaj, S., Hennig, L., Wang, Q., Metheniti, E., Hakimov, S., Jakob, C., Solopova, V., Rieck, K., Schlangen, D., Möller, S., and Schmitt, V. (2026). Order in the Evaluation Court: A Critical Analysis of NLG Evaluation Trends.
- Hakimov, S., Abdullayeva, Y., Koshti, K., Schmidt, A., Weiser, Y., Beyer, A., and Schlangen, D. (2025). Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models. COLING 2025
- Chalamalasetti, K., Götze, J., Hakimov, S., Madureira, B., Sadler, P., and Schlangen, D. (2023). clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents. EMNLP 2023
- Hakimov, S., and Schlangen, D. (2023). Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks. Findings of ACL 2023
Multimodal Understanding and News Analytics
My multimodal work focuses on image-text reasoning, sentiment and geolocation in news, and robust multimodal modeling for social media and news streams.
- Cheema, G.S., Hakimov, S., Müller-Budack, E., Otto, C., Bateman, J., and Ewerth, R. (2023). Understanding Image-Text Relations and News Values for Multimodal News Analysis. Frontiers in Artificial Intelligence
- Tahmasebzadeh, G., Hakimov, S., Ewerth, R., and Müller-Budack, E. (2023). Multimodal Geolocation Estimation of News Photos. ECIR 2023
- Tahmasebzadeh, G., Hakimov, S., Ewerth, R., and Müller-Budack, E. (2023). MM-Locate-News: Multimodal Focus Location Estimation in News. MMM 2023
- Thakkar, G., Hakimov, S., and Tadic, M. (2024). M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets. LREC-COLING 2024
- Cheema, G.S., Hakimov, S., Müller-Budack, E., and Ewerth, R. (2021). A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods. MMPT 2021
Misinformation and Harmful Content Detection
This line of work develops models and resources for detecting misinformation, claims, and harmful content in multimodal social media and news.
- Tahmasebi, S., Hakimov, S., Ewerth, R., and Müller-Budack, E. (2023). Improving Generalization for Multimodal Fake News Detection. ICMR 2023
- Hakimov, S., Cheema, G.S., and Ewerth, R. (2022). TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes. SemEval 2022
- Cheema, G.S., Hakimov, S., Müller-Budack, E., and Ewerth, R. (2021). On the Role of Images for Analyzing Claims in Social Media. WWW Workshop (CLEOPATRA 2021)
- Cheema, G.S., Hakimov, S., and Ewerth, R. (2020). Check_square at CheckThat! 2020: Claim Detection in Social Media via Fusion of Transformer and Syntactic Features. CLEF 2020
Semantic Parsing and Knowledge Graph QA
I work on semantic parsing, knowledge base question answering, and structured meaning representations for multilingual settings.
- Hakimov, S. (2019). Learning Multilingual Semantic Parsers for Question Answering over Linked Data. PhD Dissertation
- Hakimov, S., Jebbara, S., and Cimiano, P. (2019). Evaluating Architectural Choices for Deep Learning Approaches for Question Answering over Knowledge Bases. ICSC 2019
- Hakimov, S., Jebbara, S., and Cimiano, P. (2017). AMUSE: Multilingual Semantic Parsing for Question Answering over Linked Data. ISWC 2017
- Hakimov, S., Unger, C., Walter, S., and Cimiano, P. (2015). Applying Semantic Parsing to Question Answering over Linked Data: Addressing the Lexical Gap. NLDB 2015
Video Summarization
I build multimodal approaches for summarizing long-form educational and lecture videos, balancing informativeness and compactness.
- Ghauri, J.A., Hakimov, S., and Ewerth, R. (2021). Supervised Video Summarization via Multiple Feature Sets with Parallel Attention. ICME 2021
- Ghauri, J.A., Hakimov, S., and Ewerth, R. (2020). Classification of Important Segments in Educational Videos using Multimodal Features. CIKM Workshops 2020
Event-Centric Analytics
This work focuses on event understanding and classification across multimodal and multilingual sources.
- Müller-Budack, E., Springstein, M., Hakimov, S., Mrutzek, K., and Ewerth, R. (2021). Ontology-driven Event Type Classification in Images. WACV 2021
- Demidova, E., Hakimov, S., Winters, J., and Tadic, M. (2020). Proceedings of the 1st International Workshop on Cross-lingual Event-centric Open Analytics (CLEOPATRA 2020). ESWC 2020 Workshops
