-
Private Twitter users retweet
The dataset was collected using the tweepy API (http://docs.tweepy.org), a Python library for accessing the Twitter API. We selected 14 Twitter accounts, and we obtained all... -
Italian Common Procurement Vocabulary (CPV)
This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...-
ZIP
-
EVALITA 2020 HT
This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...-
ZIP
-
Private Battery State of Health in smart grids Dataset
Smart Grids are the evolution of traditional electric grids and allow two-way flows of electricity and information between different actors. At the edge of this network,... -
EUR-Lex MOSTA
This dataset contains 4176 non-empty official public EU legal judgments that were finalized between 2008 and 2018, categorized in one or more subject matters, that fall within...-
ZIP
-
Wi-Fi Dataset of wireless channel samplings
The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...-
ZIP
-
dolly-15k-it
This dataset is obtained by automatically translating the dolly 15k dataset. The dolly-15k dataset is an open-source dataset of instruction-following records generated by...-
jsonl
-
Private Environmental Monitoring of Fluorescence Response
We study a novel sequential decision-making setting, namely the dissimilarity bandits. At each round, the learner pulls an arm that provides a stochastic d-dimensional... -
Integrating Direct Intracranial Stimulation with the Human Connectome
Cortical and subcortical direct electrical stimulation (DES) coordinates in MNI space, anonymized patients’ demographic data, and aggregated functional maps for the 12... -
Private Superdiversity dataset
The Superdiversity dataset includes the Superdiversity Index (SI) calculated on the diversity of the emotional content expressed in texts of different communities. The... -
Private ltlf2asp
Linear Temporal Logic over Finite Traces (LTLf) is a popular logic to reason about finite sequences of events. In LTLf, the (bounded) satisfiability problem refers to whether... -
Private Cybersecurity NER BERT-base-cased model
This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that... -
Experimental results from the Empirical Investigation of the Completeness of ...
This is the raw data from the empirical investigation of the paper “Completeness of Datasets Documentation on ML/AI repositories: an Empirical Investigation”. This work aim of... -
Private Optimizing Empty Container Repositioning and Fleet Deployment via Configurabl...
We introduce a novel framework, Configurable SemiPOMDPs, to model this type of problems. Furthermore, we provide a two-stage learning algorithm, “Configure & Conquer”... -
Supporting data for "CoVEffect: Interactive System for Mining the Effects of ...
This repository contains the datasets created and extracted for the paper: Giuseppe Serna García, Ruba Al Khalaf, Francesco Invernici, Stefano Ceri, and Anna Bernasconi. 2022.... -
EnviroStream (Benchmark)
Stream Reasoning (SR) focuses on developing advanced approaches for applying inference to dynamic data streams; it has become increasingly relevant in various application... -
Private Origin and destination attachment from Twitter
The cultural integration of immigrants conditions their overall socio-economic integration as well as natives' attitudes towards globalisation in general and immigration in... -
EMAKG: Enhanced Microsoft Academic Knowledge Graph
The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and... -
Private Cybersecurity NER dataset
Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and... -
Iperf K8s-based Power and Resource consumption dataset
The data were collected in a Prometheus-like data format: each entry has a timestamp, a value and key-value labels containing additional information. Metrics were gathered...-
CSV