FNDetect: Multimodal Fake News Detection & Conflict-Aware AI Benchmark
1. Motivation and Introduction
Social media has emerged as a key arena for shaping narratives during geopolitical crises. The discussions around the Nakba and the aftermath of the October 7th conflict in Gaza illustrate a complex mix of historical trauma, live conflict reporting, and highly polarized public opinion.
Analyzing what drives a post to “go viral” in this context is essential for understanding information spread, propaganda dynamics, and public sentiment. This shared task introduces a novel challenge: predicting the reach (virality) and engagement (interactions) of posts that are emotionally charged, historically significant, and often multimodal—combining text with graphic or symbolic imagery.
By concentrating on this specialized domain, the task aims to advance the capabilities of NLP and computer vision models in processing context-rich, sensitive, and polarizing content.
2. Data Description
We present a curated, multi-platform dataset specifically designed for this task.
- Data Sources: X (formerly Twitter) and Facebook.
- Dataset Size: 2,000 distinct samples.
- Scope: Content published post-October 7th, 2023, filtered using keywords related to “Gaza,” “Nakba,” “Palestine,” “Israel,” and specific conflict-related terminology.
- Data Fields:
- P Creation: Creation date of the page from which the post was scraped.
- Author Name: Name of the page or account that published the post.
- Post URL: Direct link to the post on Facebook or Twitter/X.
- Post Text: Full original text content of the post.
- Img URL: URL of the image associated with the post.
- Lang: Language of the post (Arabic or English).
- Followers: Number of followers for the page at the time of scraping.
- Post Date: Publication date of the post on the platform.
- App: Platform from which the post was collected (Facebook or Twitter/X).
- Img Name: File name of the image.
- Text Target: Manual label indicating whether the post text is real or fake.
- Img Target: Manual label indicating whether the image is real or fake.
- Page Target: Contextual credibility label of the source page (reliable/unreliable).
Note on Ethics: All data will be anonymized to protect user privacy, given the sensitive nature of the topic. IDs and handles will be hashed.
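To make the schema above concrete, the following minimal sketch loads a release file and verifies that the documented fields are present. It is illustrative only: the CSV format and the file name fndetect_train.csv are assumptions, and the official release may use different column names.

```python
# Minimal sketch: load the dataset and check that the documented fields are present.
# The file name "fndetect_train.csv" and the CSV format are assumptions, not the official release format.
import pandas as pd

EXPECTED_COLUMNS = [
    "P Creation", "Author Name", "Post URL", "Post Text", "Img URL", "Lang",
    "Followers", "Post Date", "App", "Img Name", "Text Target", "Img Target", "Page Target",
]

df = pd.read_csv("fndetect_train.csv")  # hypothetical file name

missing = [col for col in EXPECTED_COLUMNS if col not in df.columns]
if missing:
    raise ValueError(f"Missing expected columns: {missing}")

# Quick sanity checks on the language and label fields described above.
print(df["Lang"].value_counts())         # expected: Arabic / English
print(df["Text Target"].value_counts())  # expected: real / fake
print(df["Img Target"].value_counts())   # expected: real / fake
```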
3. Task Definitions
We propose three distinct tasks to evaluate model performance on different modalities and prediction objectives.
Task 1: Binary Multilingual Fake News Detection
The first subtask focuses on detecting misinformation in Arabic and English posts using textual information only. The dataset includes posts collected from Facebook and Twitter/X, containing fields such as author information, post URL, and textual content. Each post is labeled as real or fake based on the Text Target field.
- Input: Post Text (Arabic and English posts); Label: Text Target (real/fake).
- Objective: Build multilingual classification models that accurately identify misinformation across different languages and contexts.
- Classes:
- Real: Posts whose textual content is accurate and contains no false or misleading information.
- Fake: Posts whose textual content contains false, fabricated, or misleading information.
- Evaluation Metric: Macro-F1 Score (to account for potential class imbalance).
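For reference, Macro-F1 averages the per-class F1 scores so that the minority class counts as much as the majority class. The snippet below shows one way to compute it with scikit-learn; the gold and predicted label lists are toy examples, and the official scorer may differ.

```python
# Minimal sketch of the Task 1 metric: macro-averaged F1 over the real/fake classes.
# The toy label lists below are illustrative only.
from sklearn.metrics import f1_score

gold = ["real", "fake", "fake", "real", "fake"]
pred = ["real", "fake", "real", "real", "fake"]

# Macro-F1 computes F1 per class and averages the two scores,
# which keeps the score sensitive to errors on the minority class.
macro_f1 = f1_score(gold, pred, average="macro")
print(f"Macro-F1: {macro_f1:.3f}")
```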
Task 2: Arabic Cross-Modal Inconsistency Detection
The second subtask targets Arabic posts containing both text and images. It aims to identify inconsistencies between the textual and visual content. Based on the Text Target and Img Target annotations, new labels—consistent, inconsistent, and manipulated—are generated. This enables the detection of cases where textual and visual information either align or contradict each other, supporting the development of models capable of identifying complex multimodal misinformation.
- Input: Post Text (Arabic posts) + Image; Labels: derived from the Text Target and Img Target annotations.
- Objective: Detect cases where textual and visual information align, contradict each other, or have been manipulated, supporting the development of models capable of identifying complex multimodal misinformation.
- Classes (derived from Text Target and Img Target; one plausible derivation is sketched after this task description):
- Consistent: the text and the image convey aligned information.
- Inconsistent: the textual and visual content contradict each other.
- Manipulated: the post contains altered or fabricated content in at least one modality.
- Evaluation Metric: Macro-F1 Score (to account for potential class imbalance).
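The exact rule for deriving the three cross-modal labels from the Text Target and Img Target annotations will be specified with the data release; the sketch below shows one plausible mapping and should be read as an assumption, not the official definition.

```python
# Hypothetical sketch: deriving the Task 2 labels from Text Target and Img Target.
# The mapping below is an assumption for illustration; the organizers' rule may differ.
def derive_cross_modal_label(text_target: str, img_target: str) -> str:
    text_real = text_target.strip().lower() == "real"
    img_real = img_target.strip().lower() == "real"
    if text_real and img_real:
        return "consistent"    # text and image align
    if text_real != img_real:
        return "inconsistent"  # one modality contradicts the other
    return "manipulated"       # both modalities are fake or altered


print(derive_cross_modal_label("real", "fake"))  # -> inconsistent
```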
Task 3: Image Captioning for Conflict Awareness: Gaza–Israel War 2023–2025
Participants must build an image captioning model that generates accurate, descriptive, and unbiased captions for images related to the Gaza–Israel conflict. The images, collected from Facebook and Twitter between October 2023 and August 2025, cover events such as protests, destruction, aid efforts, and media scenes. The system should produce empathetic captions that factually describe each image’s content.
- Input: Image; Reference: the accompanying Description, used as the ground-truth caption.
- Objective: Generate accurate, empathetic, and unbiased captions that factually describe each image's content.
- Evaluation Metric: BLEU (Bilingual Evaluation Understudy) & METEOR
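For reference, the snippet below computes sentence-level BLEU and METEOR with NLTK on a toy caption pair; the official evaluation may use corpus-level scoring or different tokenization.

```python
# Minimal sketch of the Task 3 metrics on a single caption pair, using NLTK.
# The example captions are illustrative only.
import nltk
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction
from nltk.translate.meteor_score import meteor_score

nltk.download("wordnet", quiet=True)   # METEOR relies on WordNet
nltk.download("omw-1.4", quiet=True)

reference = "people gather around a destroyed building after an airstrike".split()
candidate = "a crowd gathers near a destroyed building after an airstrike".split()

smooth = SmoothingFunction().method1   # avoids zero BLEU on short captions
bleu = sentence_bleu([reference], candidate, smoothing_function=smooth)
meteor = meteor_score([reference], candidate)

print(f"BLEU: {bleu:.3f}  METEOR: {meteor:.3f}")
```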
4. Baseline Systems
To assist participants, the organizers will provide the following baselines:
- For Task 1: A multilingual text-only baseline using XLM-RoBERTa-base, where the post text (Arabic and English) is encoded by a pretrained transformer and the CLS representation is passed to a linear classification head with softmax for binary fake news detection.
- For Task 2: A multimodal late-fusion baseline combining MARBERT for Arabic text and ResNet-50 for images. Text and image embeddings are extracted independently, concatenated, and fed into a fully connected layer with softmax to detect cross-modal misinformation (a minimal sketch of this fusion architecture follows this list).
- For Task 3: A CNN–LSTM image captioning baseline, where ResNet-50 is used as an image encoder and an LSTM-based decoder generates descriptive and unbiased captions for conflict-related images.
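As an illustration of the Task 2 baseline, the sketch below wires a MARBERT text encoder and a ResNet-50 image encoder into a late-fusion classifier. The checkpoint name UBC-NLP/MARBERT, the feature dimensions, and the three-way output are assumptions for illustration and may differ from the official baseline code.

```python
# Minimal sketch of a late-fusion baseline: MARBERT text features + ResNet-50 image features,
# concatenated and passed to a linear classifier. Checkpoint names and dimensions are assumptions.
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer
from torchvision.models import resnet50, ResNet50_Weights


class LateFusionClassifier(nn.Module):
    def __init__(self, num_classes: int = 3):  # 3 assumes consistent/inconsistent/manipulated
        super().__init__()
        self.text_encoder = AutoModel.from_pretrained("UBC-NLP/MARBERT")  # 768-d hidden size
        backbone = resnet50(weights=ResNet50_Weights.DEFAULT)
        backbone.fc = nn.Identity()                                       # expose 2048-d pooled features
        self.image_encoder = backbone
        self.classifier = nn.Linear(768 + 2048, num_classes)

    def forward(self, input_ids, attention_mask, pixel_values):
        # Text branch: take the [CLS] token embedding from MARBERT.
        text_feat = self.text_encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state[:, 0]
        # Image branch: pooled ResNet-50 features.
        img_feat = self.image_encoder(pixel_values)
        # Late fusion by concatenation, then a linear head (softmax/cross-entropy applied during training).
        fused = torch.cat([text_feat, img_feat], dim=-1)
        return self.classifier(fused)


tokenizer = AutoTokenizer.from_pretrained("UBC-NLP/MARBERT")
model = LateFusionClassifier()
enc = tokenizer(["مثال توضيحي لنص عربي"], return_tensors="pt", padding=True, truncation=True)
logits = model(enc["input_ids"], enc["attention_mask"], torch.randn(1, 3, 224, 224))
print(logits.shape)  # torch.Size([1, 3])
```

In practice, images should be resized and normalized with the ResNet-50 preprocessing transforms before being passed as pixel_values; the random tensor above only demonstrates the expected input shape.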
5. Significance and Impact
This shared task contributes to the NLP and multimodal AI communities by:
- Multilingual Misinformation Detection: Advancing research on detecting fake news in both Arabic and English social media posts, addressing linguistic diversity, code-switching, and cross-cultural narratives in conflict-related content.
- Cross-Modal Inconsistency Analysis: Enabling the study of alignment and contradiction between textual and visual information in Arabic posts, supporting the detection of manipulated or misleading multimodal content.
- Conflict-Aware Image Understanding: Promoting the development of image captioning systems that generate accurate, descriptive, and unbiased captions, with an emphasis on empathy and factual reporting in sensitive conflict scenarios.
6. Tentative Schedule
- Jan 1: Call for Participation.
- Jan 10: Release of Training Data (3,500 samples).
- Feb 10: Evaluation Period Begins (Test set: 1,000 blinded samples).
- Feb 17: Evaluation Period Ends.
- Feb 21: Release of Results.
- Mar 1: Paper Submission Deadline.
7. Organizers
- Mohamed Ibrahim Ragab, ITCS School at Nile University
- Ensaf Mohamed Hussein, ITCS School at Nile University
- Walaa Medhat Asel, ITCS School at Nile University
Participation Guidelines
- For participation guidelines, please refer to Participation Guidelines.
- Comprehensive instructions for preparing and submitting your paper(s) are available at Paper Submission Guidelines.