Others - D4Science Catalogue

Dataset

SWH Filenames

A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...

ZIP
The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!

Dataset

Multi-Task Faces (MTF) dataset

The Multi-Task Faces (MTF) dataset consists of cropped human faces for classification tasks or other research purposes. Each image in the dataset is labelled according to four...

ZIP
The resource: 'MTF_dataset_20230701' is not accessible as guest user. You must login to access it!

Dataset

VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination ...

We create a publicly available dataset of over 3,100 COVID-19 vaccine-related tweets labeled as one of four stance categories: pro-vaxx, anti-vaxx, vaxx-hesitant, or...

The resource: 'Zenodo Dataset Link' is not accessible as guest user. You must login to access it!

Dataset

Cross-Lingual Dataset of Crisis-Related Social Media

If you use this dataset, please cite the following paper: Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive...

The resource: 'Cross-Lingual Dataset of ...' is not accessible as guest user. You must login to access it!

Dataset

Ukraine-related Disinformation Dataset

Ukraine-related disinformation dataset from "Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation" (accepted at SocInfo...

The resource: 'Zenodo Dataset Link' is not accessible as guest user. You must login to access it!

Dataset

Cherenkov Telescope Data for Ordinal Quantification

This labeled data set is targeted at ordinal quantification. It appears in our research paper "Ordinal Quantification Through Regularization", which we have published at...

The resource: 'Zenodo' is not accessible as guest user. You must login to access it!

Dataset

Learning to quantify: LeQua 2022 datasets

The aim of LeQua 2022 (the 1st edition of the CLEF “Learning to Quantify” lab) is to allow the comparative evaluation of methods for “learning to quantify” in textual...

The resource: 'Zenodo link' is not accessible as guest user. You must login to access it!

7 items found

SWH Filenames

Multi-Task Faces (MTF) dataset

VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination ...

Cross-Lingual Dataset of Crisis-Related Social Media

Ukraine-related Disinformation Dataset

Cherenkov Telescope Data for Ordinal Quantification

Learning to quantify: LeQua 2022 datasets