139 items found

Licenses: Academic Free License 3.0
Organisations: SoBigData Catalogue

  • Dataset

    Emergency Tweets 2014 Genoa flood

    This dataset contains Italian tweets collected during and in the aftermath of the floods that occurred near the city of Genoa between 9 and 11 October 2014...
    • ZIP
      'FLO-GEN.zip' (login required)
  • Dataset

    Testing NBA dataset

    Just for platform reviewing
    • CSV
      'nbaStats.csv' (login required)
  • Dataset

    Global Peace Index data

    A dataset of the Global Peace Index (GPI), which ranks 163 independent states and territories according to their level of peacefulness. The GPI covers 99.7 per cent of the...
  • Method

    Modelling Scientific Migration

    This method is an adaptation of the general migration models to understand scientific migration. Under development.
  • Dataset

    CDR data - Tuscany

    The dataset contains mobile phone records collected in Tuscany between September 2015 and August 2016. It contains Call Data Records (CDRs) of phone users, and the corresponding...
  • Dataset


    The yeast dataset is a collection of yeast microarray expressions and phylogenetic profiles which can be used to learn the yeast gene functional categories. One row of this...
    • arff
      'Yeast Dataset' (login required)
  • Dataset

    Emergency Tweets 2013 Sardinia flood

    This dataset is related to the floods that occurred in the Sardinia regional district between 17 and 19 November 2013 (https://en.wikipedia.org/wiki/2013_Sardinia_floods), as...
    • ZIP
      'FLO-SAR.zip' (login required)
  • Method

    A New Topological Approach for the Prediction of Protein-Protein Interactions

    We propose "Maximum-Proteins-Similarity (Topological)": MPS(T). MPS(T) is a topological three-length path method that scores the potential interaction between proteins by...
  • Dataset

    GPS Tracks - Calabria, Italy 2012

    The dataset consists of GPS tracks of private vehicles collected in the Calabria region (Italy). It comprises about 28 million trajectories from about 115,000 users. Data are in the...
    • ZIP
      'Dataset' (login required)
  • Dataset

    Congress Network

    Network built on top of US Congress voting data and made available on the website GovTrack.us. Nodes identify congressmen and edges represent the semantic "have supported the...
    • HTML
      'Original data' (login required)
  • Method

    Prediction of next career moves from scientific profiles

    This is a two-stage predictive model for the mobility of scientists. First, data mining is used to predict which researcher will move in the next year on the basis of their...
  • Dataset

    Facebook Wallpost

    Online interactions between users via the wall feature in the New Orleans regional network.
    • HTML
      'Original data' (login required)
  • Experiment

    A comparison of approaches for type-2 diabetes treatment

    This experiment compares the performance of several GNN-based approaches for predicting the therapy recommended to type-2 diabetes patients.
    • ZIP
      'geo-annotated tweets.zip' (login required)
  • Dataset

    Flickr and Wikipedia Tourism Trajectories

    The dataset contains a knowledge base built with data coming from Flickr and Wikipedia. It covers three Italian cities which are important from a sightseeing point of view and...
    • ZIP
      'TripBuilder' (login required)
  • Experiment

    Prognostic stratification of patients with differentiated thyroid cancer

    Proper risk stratification of patients with differentiated thyroid cancer (DTC) is essential to avoid both unnecessary diagnostic procedures in low-risk patients and clinical...
  • Dataset

    Emergency Tweets 2011 Christchurch earthquake

    This dataset contains tweets related to the devastating earthquake that occurred on 22 February 2011, at around 12 p.m. local time, in Christchurch, New Zealand...
    • CSV
      'EAQ-CHR_tweets.csv' (login required)
  • Dataset

    CDR Data - Rome

    The dataset contains mobile phone records collected in Rome between November 2015 and August 2016. It contains Call Data Records (CDRs) of phone users, and the corresponding...
  • Method

    Nowcasting migration stocks and flows

    This method nowcasts migration stocks and flows by using Twitter data. Under development.