Cross-modal Image Recommendation for News Articles by Multimodal Foundation Models-based Retrieval-Reranking

1. Centre for Research and Technology Hellas

Retrieving relevant images for a given news article is challenging and can be regarded as a special case of the cross-modal retrieval problem. This notebook paper presents our solution for the MediaEval NewsImages 2025 benchmarking task. We propose a retrieval-reranking approach that builds on multimodal foundation models, such as vision-language models (VLMs) and multimodal LLMs, and uses multiple levels of textual granularity. We report the official results of our submitted runs, along with additional internal experiments that evaluate those runs.
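To make the retrieve-then-rerank idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes that article text and images have already been embedded into a shared vector space (for example by a VLM); the toy vectors, the function names `retrieve` and `rerank`, the granularity levels `title` and `body`, and the weights are all hypothetical. The reranking step here is a simple weighted similarity over granularity levels, used as a stand-in for whatever model-based reranker the paper employs.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def retrieve(query_vec, image_vecs, k):
    """Stage 1: keep the k images most similar to a single query embedding."""
    scored = [(img_id, cosine(query_vec, vec)) for img_id, vec in image_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

def rerank(candidates, text_vecs, image_vecs, weights):
    """Stage 2: rescore the shortlist with a weighted sum over text granularities.

    This weighted sum is an illustrative stand-in for a stronger reranker.
    """
    rescored = []
    for img_id, _ in candidates:
        score = sum(
            weights[level] * cosine(vec, image_vecs[img_id])
            for level, vec in text_vecs.items()
        )
        rescored.append((img_id, score))
    rescored.sort(key=lambda pair: pair[1], reverse=True)
    return rescored

# Toy embeddings (hypothetical values, chosen only for illustration).
image_vecs = {
    "img_a": [1.0, 0.0, 0.0],
    "img_b": [0.8, 0.6, 0.0],
    "img_c": [0.0, 0.0, 1.0],
}
text_vecs = {
    "title": [1.0, 0.0, 0.0],  # coarse granularity: headline only
    "body": [0.6, 0.8, 0.0],   # fine granularity: full article text
}
weights = {"title": 0.3, "body": 0.7}

candidates = retrieve(text_vecs["title"], image_vecs, k=2)
reranked = rerank(candidates, text_vecs, image_vecs, weights)
print([img_id for img_id, _ in candidates])
print([img_id for img_id, _ in reranked])
```

In this toy setup the first stage shortlists images using the title embedding alone, and the second stage can reorder that shortlist once the finer-grained body text is taken into account.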

Files

mediaeval2025.pdf (733.5 kB, md5:3f2b84f65db95764ebaffd790392d7ba)

Additional details

European Commission
AI4TRUST - AI-based-technologies for trustworthy solutions against disinformation 101070190

Resource type

Conference paper

Publisher

Zenodo

Conference

2025 Multimedia Evaluation Workshop (MediaEval'25), Dublin, Ireland, Oct. 2025

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: February 21, 2026
Modified: February 21, 2026
