-
Air Quality Datasets over L'Aquila Region
These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData. -
Frank Experiments
Dataset with experimental results for the "Frank" hybrid decision-making system, with simulated users. Features: - CA. Co-evolutionary Accuracy. Accuracy reached by the user...-
JSON
The resource: 'Frank Experiments Dataset' is not accessible as guest user. You must login to access it!
-
JSON
-
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...-
ZIP
The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
-
ZIP
-
Semantic Networks from news articles (Italian sample)
The Semantic Networks from news articles (Italian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...-
CSV
The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
-
CSV
-
Visual Analytics for Perfomance Analysis: Dataset of preprocessed distributed...
The dataset includes execution traces from Train-ticket, an established open-source microservices system.The execution traces have been generated across various scenarios, and... -
Shopping retail synthetic dataset (GaussianCopula)
Synthetic shopping retail consumption data generated with GaussianCopula. The dataset provides monthly information on the spending of synthetic customers belonging to two...-
CSV
The resource: 'Shopping retail synthetic ...' is not accessible as guest user. You must login to access it!
-
CSV
-
Semantic Networks from news articles (German sample)
The Semantic Networks from news articles (German sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...-
CSV
The resource: 'German_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
-
CSV
-
DNA 31-mers
A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...-
ZIP
The resource: 'DNA 31-mers' is not accessible as guest user. You must login to access it!
-
ZIP
-
The subTHz regime, channel measurements with no line-of-sight conditions in t...
Dataset of channel measurements obtained in the frequency band 500-750 GHz in non line-of-sight conditions. The measurements have been conducted using a Keysight PNA Vector... -
FAIR-SWENG: dataset on gender fairness in software engineering academic lands...
The dataset contains academic performance metrics of Software Engineers worldwide. -
Semantic Networks from news articles (Dutch sample)
The Semantic Networks from news articles (Dutch sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...-
CSV
The resource: 'Dutch_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
-
CSV
-
Semantic Networks from news articles (English sample)
The Semantic Networks from news articles (English sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...-
CSV
The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
-
CSV
-
The subTHz regime, channel measurements with no line-of-sight conditions in t...
Dataset of channel measurements obtained in the frequency band 170-260 GHz in non line-of-sight conditions. The measurements have been conducted using a Keysight PNA Vector... -
DeLag: Microservices execution traces
The dataset contains execution traces collected from the well-know open-source microservices system Train-ticket. The traces are generated over a variety of scenario,...-
parquet
The resource: 'Unnamed resource' is not accessible as guest user. You must login to access it!
-
parquet
-
Twitter dataset on coordinated behavior in 2019 UK General Election
This dataset contains ~11M tweets related to the 2019 United Kingdom General Election, published and collected between November 12, 2019, and December 12, 2019. In addition,... -
Private Telecom Traffic Distribution Dataset
The dataset contains aggregate - hourly, daily, weekly - cellular traffic demand data of individual base stations deployed in the metropolitan area of Milan and Trento... -
Twitter dataset on coordinated behavior in 2020 USA Presidential Election
This dataset contains ~140M tweets related to the 2020 United States Presidential Election, published and collected between October 2, 2020, and December 2, 2020. In addition,... -
Private Vehicular trip dataset extracted from black boxes embedded on vehicles
The data contains vehicular trips denoted by a trip identifier (unique, reset after each vehicle engine shutdown) and a set of latitude/longitude coordinates, representing... -
Twitter Conspiracy Dataset
This repository contains the Twitter dataset used to investigate the traits of 7,394 conspiracy users and 7,394 random users collected in 2022. Both the profile's info and the... -
Dataset on online cryptocurrency discussion on Twitter, Telegram, and Discord
This Dataset contains Twitter, Telegram and Discord data on online discussions on cryptocurrency. Starting from tweets mentioning cryptocurrencies, we leveraged and followed...