31 items found

Types: Dataset Formats: ZIP Groups: sobigdata-eu

  • Dataset

    Italian Common Procurement Vocabulary (CPV)

    This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...
    • ZIP
  • Dataset

    EVALITA 2020 HT

    This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...
    • ZIP
  • Dataset

    EUR-Lex MOSTA

    This dataset contains 4176 non-empty official public EU legal judgments that were finalized between 2008 and 2018, categorized in one or more subject matters, that fall within...
    • ZIP
  • Dataset

    Wi-Fi Dataset of wireless channel samplings

    The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...
    • ZIP
  • Dataset

    Know your trees dataset

    A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...
    • ZIP
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on politics-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
  • Dataset

    Reddit Echo Chamber dataset

    In a digital environment, the term echo chamber refers to an alarming phenomenon in which beliefs are amplified or reinforced by communication repetition inside a closed...
    • ZIP
  • Dataset

    Fire smoke detection dataset

    Dataset of fire, non-fire, and smoke images
    • ZIP
  • Dataset

    DNA 12-mers

    A 179 MB dataset containing all the ~14M unique 12-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
    • ZIP
  • Dataset

    Spotify track dataset (small)

    The dataset was created using the Spotify API and the track IDs provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...
    • ZIP
  • Dataset

    Shopping retail synthetic dataset (CopulaGAN)

    Synthetic shopping retail consumption data generated with CopulaGAN. The dataset provides monthly information on the spending of synthetic customers belonging to two classes...
    • ZIP
  • Dataset

    Compounds with Activity against the Dopamine D2 Receptor

    Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...
    • ZIP
  • Dataset

    GiveMeSomeCreditSC

    The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...
    • ZIP
  • Dataset

    Santorini Tweets July-August 2021

    This dataset contains 225,501 tweets written by 141,277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
    • ZIP
  • Dataset

    Physical activity, quality of sleep, and quality of life in Italy: the long t...

    From March 2020 to May 2021, several lockdown periods caused by the COVID-19 pandemic limited, with varying degrees of severity, people’s usual activities and mobility in...
    • ZIP
  • Dataset

    360° video footage from a bottom-up street-view survey in Pristina, Kosovo

    This dataset contains 360° video footage from a bottom-up street-view survey in Pristina, Kosovo divided into major neighborhoods.
    • ZIP
    • HTML
  • Dataset

    UNI Fake Giveaway Dataset

    Dataset and study related to a scam that originated on Twitter and lured users into sending their Uniswap (UNI) tokens to a fake giveaway.
    • ZIP
  • Dataset

    GPS Tracks - Milan, Italy - Simulated

    This dataset contains simulated tracks of private cars in Milan. The dataset is generated from a real dataset of people in order to respect some general statistics and...
    • ZIP
  • Dataset

    Emergency Tweets 2012 Emilia earthquake

    This dataset contains 3,170 Italian tweets about the earthquakes that struck the Emilia Romagna regional district in Italy on 20 May 2012 starting from 4 a.m. local time...
    • ZIP
  • Dataset

    Emergency Tweets 2016 Amatrice earthquake

    This dataset contains Italian tweets related to the earthquake of 2016 in the Centre of Italy (https://it.wikipedia.org/wiki/Terremoto_del_Centro_Italia_del_2016_e_d...). is...
    • ZIP