115 items found

Types: Dataset Groups: sobigdata-it

Filter Results
  • Access required...

    ×

    Dataset

    Private Twitter users retweet

    The dataset was collected using the tweepy API (http://docs.tweepy.org), a Python library for accessing the Twitter API. We selected 14 Twitter accounts, and we obtained all...
  • Dataset

    Italian Common Procurement Vocabulary (CPV)

    This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...
    • ZIP
      The resource: '10007545' is not accessible as guest user. You must login to access it!
  • Dataset

    EVALITA 2020 HT

    This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...
    • ZIP
      The resource: 'EVALITA_2020_bloom_it' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Battery State of Health in smart grids Dataset

    Smart Grids are the evolution of traditional electric grids and allow two-way flows of electricity and information between different actors. At the edge of this network,...
  • Dataset

    Multi-aspect Integrated Migration Indicators (MIMI) dataset

    The Multi-aspect Integrated Migration Indicators (MIMI) dataset is a new dataset to be exploited in migration studies as a concrete example of this new approach. It includes...
    • HTML
      The resource: 'Multi-aspect Integrated ...' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to scientific article.' is not accessible as guest user. You must login to access it!
  • Dataset

    EUR-Lex MOSTA

    This dataset contains 4176 non-empty official public EU legal judgments that were finalized between 2008 and 2018, categorized in one or more subject matters, that fall within...
    • ZIP
      The resource: 'EUR-Lex MOSTA' is not accessible as guest user. You must login to access it!
  • Dataset

    Wi-Fi Dataset of wireless channel samplings

    The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...
    • ZIP
      The resource: 'SoBigData_Wi-Fi_Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    dolly-15k-it

    This dataset is obtained by automatically translating the dolly 15k dataset. The dolly-15k dataset is an open-source dataset of instruction-following records generated by...
    • jsonl
      The resource: 'dolly-15k-it' is not accessible as guest user. You must login to access it!
  • Dataset

    Integrating Direct Intracranial Stimulation with the Human Connectome

    Cortical and subcortical direct electrical stimulation (DES) coordinates in MNI space, anonymized patients’ demographic data, and aggregated functional maps for the 12...
    • The resource: 'Integrating direct ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Superdiversity dataset

    The Superdiversity dataset includes the Superdiversity Index (SI) calculated on the diversity of the emotional content expressed in texts of different communities. The...
  • Dataset

    Supporting data for "CoVEffect: Interactive System for Mining the Effects of ...

    This repository contains the datasets created and extracted for the paper: Giuseppe Serna García, Ruba Al Khalaf, Francesco Invernici, Stefano Ceri, and Anna Bernasconi. 2022....
    • The resource: 'Supporting data for ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Origin and destination attachment from Twitter

    The cultural integration of immigrants conditions their overall socio-economic integration as well as natives' attitudes towards globalisation in general and immigration in...
  • Dataset

    EMAKG: Enhanced Microsoft Academic Knowledge Graph

    The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and...
    • The resource: 'Link to dataset.' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to scientific ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Cybersecurity NER dataset

    Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and...
  • Dataset

    Iperf K8s-based Power and Resource consumption dataset

    The data were collected in a Prometheus-like data format: each entry has a timestamp, a value and key-value labels containing additional information. Metrics were gathered...
    • CSV
      The resource: '5G_Power_and_Resource_consu ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Stroke and sepsi

    The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by...
    • The resource: 'Stroke and sepsi' is not accessible as guest user. You must login to access it!
  • Dataset

    Know your trees dataset

    A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...
    • ZIP
      The resource: 'Dataset Know Your Trees ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Vegetation of a basin of the Po river Dataset

    We provide two climatological dataset composed by D = 136 (with 1038 samples) and D = 1991 (with 981 samples) continuous climatological features and a scalar target, which...
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Word-in-Context task for Italian

    The general goal of the WiC-ITA task is to establish whether a word w occurring in two different sentences, s_1 and s_2, has the same meaning or not. In particular, our task...