Global Claims Dataset (doi:10.57979/I3YY8C)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Global Claims Dataset

Identification Number:

doi:10.57979/I3YY8C

Distributor:

POLEN DataHub

Date of Distribution:

2026-03-19

Version:

1

Bibliographic Citation:

José Reis; Íris Damião; Íris Damião; Angela Rijo; Ana Vranic; Joana Gonçalves-Sá, 2026, "Global Claims Dataset", https://doi.org/10.57979/I3YY8C, POLEN DataHub, V1

Study Description

Citation

Title:

Global Claims Dataset

Identification Number:

doi:10.57979/I3YY8C

Authoring Entity:

José Reis (LIP)

Íris Damião (LIP)

Íris Damião (LIP)

Angela Rijo (LIP)

Ana Vranic (LIP)

Joana Gonçalves-Sá (LIP)

Producer:

Laboratório de Instrumentação e Física Experimental de Partículas

Date of Production:

2026-03-19

Distributor:

POLEN DataHub

Access Authority:

Joana Gonçalves-Sá

Depositor:

Miranda Benta, Zacarias José

Date of Deposit:

2026-03-19

Holdings Information:

https://doi.org/10.57979/I3YY8C

Study Scope

Keywords:

Social Sciences, Ciências Sociais, Fake News

Abstract:

Collection of claims collected from different fact-checking websites, covering various languages and topics. Described in "Global Claims: A Multilingual Dataset of Fact-Checked Claims with Veracity, Topic, and Salience Annotations"

Notes:

factcheck_claims.json A JSON Lines dataset of fact-checked claims. Each entry includes the following fields: factcheck_url: URL of the fact-checking website factcheck_date: Date of the fact-check claim_reviewed : Text of the reviewed claim claim_language: Language of the claim items_reviewed: Source URL of the reviewed claim review_rating: Original rating assigned to the claim review_standardized: Standardized rating (`true`, `false`, `other`, `unknown`) topics: Dictionary of topic probabilities Mean Topic: Inferred main topic of the claim twitter_presence: `True` if the claim’s source URL appears in url_tweets.json, otherwise `False` url_tweets.json A JSON Lines file containing tweets that shared URLs found in claims. Each entry includes: url: URL shared in the tweet tweet_id: List of tweet IDs that shared the URL Reis, J., Damião, Í., Davidson, A., Rijo, A., Vranic, A., & Gonçalves-Sá, J. (2025). Global Claims Dataset [Data set]. Zenodo. https://doi.org/10.5281/zenodo.16942245

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/licenses/by/4.0">CC BY 4.0</a>

Other Study Description Materials

Related Publications

Citation

Title:

Conference proceeding: 10.1145/3746275.3762201 (DOI)

Identification Number:

10.1145/3746275.3762201

Bibliographic Citation:

Conference proceeding: 10.1145/3746275.3762201 (DOI)

Citation

Title:

Computational notebook: 10.5281/zenodo.16942428 (DOI)

Identification Number:

10.5281/zenodo.16942428

Bibliographic Citation:

Computational notebook: 10.5281/zenodo.16942428 (DOI)

Other Study-Related Materials

Label:

factcheck_claims.json

Notes:

application/json

Other Study-Related Materials

Label:

url_tweets.json

Notes:

application/json