-
Measuring the Salad Bowl: Superdiversity on Twitter
Superdiversity refers to large cultural diversity in a population due to immigration. In this paper, we introduce a superdiversity index based on the changes in the emotional... -
Private EnviroStream
This repository contains datasets, queries and a generator for the EnviroStream, a benchmark for Stream Reasoning (SR) systems. SR focuses on applying inference to dynamic... -
Human and mouse gene regulatory networks
The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available... -
Combining Twitter and Mobile Phone Data to Observe Border-Rush: The Turkish-E...
Following Turkey's 2020 decision to revoke border controls, many individuals journeyed towards the Greek, Bulgarian, and Turkish borders. However, the lack of verifiable... -
Private Alternate Training for Multi-Task Neural Networks
In this repository, we publish the code used to implement the Alternate Training through the Epochs (ATE) procedure for training Multi-Task Neural Networks (MTNN) presented in... -
Private Highway driving simulation
The SUMO simulator is used to model scenarios with diferent road topologies and traffc intensities, randomizing the fow of vehicles, to ensure the generation of sufciently... -
Digital footprints of international migration on twitter
Studying migration using traditional data has some limitations. To date, there have been several studies proposing innovative methodologies to measure migration stocks and... -
Private Dynamical Linear Upper Confidence Bound (DynLin-UCB)
The repository contains the code to run DynLin-UCB (Dynamical Linear Upper Confidence Bound). DynLin-UCB is an optimistic regret-minimization algorithm that can be used to... -
Online Learning of Order Flow and Market Impact (OLOFMI)
This library performs regime detection in the aggregated order flow time-series and market impact analysis. The required input file is in the format of the message file of the... -
Score-Driven Bayesian Online Change Point Detection (SD-BOCPD)
This code deals with Bayesian online detection in univariate time-series of changepoints, i.e. abrupt variations in the generative parameters of a data, and regimes, i.e.... -
Debiaser for Multiple Variables (DEMV)
DEMV is a Debiaser for Multiple Variables that aims to increase Fairness in any given dataset, both binary and categorical, with one or more sensitive variables, while keeping...-
ipynb
The resource: 'Tutorial Notebook' is not accessible as guest user. You must login to access it!
-
ipynb
-
Private A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings...
"A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)" is a comprehensive dataset containing a 10 years long... -
Gene Disease Association Data and Features
This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...-
RAR
The resource: 'Gene_Disease_Association_Da ...' is not accessible as guest user. You must login to access it!
-
RAR
-
Online polarization: enriching models with data, understanding data through m...
Development of online polarization dynamics models and application to social media discussion data -
Reddit Echo Chamber dataset
In a digital environment, the term echo chamber refers to an alarming phenomenon in which beliefs are amplified or reinforced by communication repetition inside a closed...-
ZIP
The resource: 'Reddit Echochamber' is not accessible as guest user. You must login to access it!
-
ZIP
-
Twitter EURO2020: BLM debate in Italy
Twitter Dataset for "Will You Take the Knee? Italian Twitter Echo Chambers' Genesis During EURO 2020" The dataset is comprised of the following files:...-
JSON
The resource: 'Twitter EURO2020' is not accessible as guest user. You must login to access it!
-
JSON
-
Brexit dataset
This dataset comprises a set of online footprints extracted from Twitter using the available APIs. It is centered around the Brexit debate on Twitter from the 2nd until the...-
RAR
The resource: 'BrexitDataset' is not accessible as guest user. You must login to access it!
-
RAR
-
Multi-sensor dataset of environmental office room conditions
The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...-
RAR
The resource: 'IoT_dataset_smart_office' is not accessible as guest user. You must login to access it!
-
RAR
-
Multi-sensor dataset of environmental outdoor home conditions
The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in outdoor of a smart domestic room located in... -
The subTHz regime, first Results on channel measurement: 500-750 GHz
The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands 500-750 GHz (W-band). IF bandwidth has...