228 items found

Organisations: SoBigData Catalogue

  • Dataset

    GPS Tracks - Calabria, Italy 2012

    The dataset consists of GPS tracks of private vehicles collected in the Calabria region (Italy). It contains about 28 million trajectories from about 115,000 users. Data are in the...
  • Dataset

    Soccer Team Performance

    The dataset contains the performance features (passes, shots, goals, tackles, etc.) of soccer teams during the games of six major European leagues in three seasons. The dataset...
  • Dataset

    Formal network of Estonian companies and board members

    This dataset consists of managed and continuously updated data about Estonian companies and board members since 1994. Technical documentation of data structures and the REST API...
  • Dataset

    Congress Network

    Network built on top of US Congress voting data made available on the website GovTrack.us. Nodes identify congressmen and edges represent the semantic "have supported the...
  • Method

    Prediction of next career moves from scientific profiles

    This is a two-stage predictive model for the mobility of scientists. First, data mining is used to predict which researcher will move in the next year on the basis of their...
  • Dataset

    Facebook Wallpost

    Online interactions between users via the wall feature in the New Orleans regional network.
  • Experiment

    A comparison of approaches for type-2 diabetes treatment

    This experiment compares the performance of several GNN-based approaches for predicting the therapy recommended to type-2 diabetes patients.
  • Dataset

    ISTAT Census zone Tuscany

    This dataset contains the geometry of about 20,000 census sectors and limited demographic information for the Tuscany region (Italy).
  • Dataset

    Flickr and Wikipedia Tourism Trajectories

    The dataset contains a knowledge base built with data coming from Flickr and Wikipedia. It covers three Italian cities which are important from a sightseeing point of view and...
  • Experiment

    Prognostic stratification of patients with differentiated thyroid cancer

    Proper risk stratification of patients with differentiated thyroid cancer (DTC) is essential to avoid both unnecessary diagnostic procedures in low-risk patients and clinical...
  • Dataset

    Emergency Tweets 2011 Christchurch earthquake

    This dataset contains tweets related to the devastating earthquake that occurred on 22 February 2011, at around 12 p.m. local time, in Christchurch, New Zealand...
  • Dataset

    CDR Data - Rome

    The dataset contains mobile phone records collected in Rome between November 2015 and August 2016. It contains Call Data Records (CDRs) of phone users, and the corresponding...
  • Dataset

    NYSE transactions

    This dataset contains financial data on the prices of the 250 most liquid assets on the New York Stock Exchange (NYSE) from 2006 to 2014. The dataset contains transactions,...
  • Dataset

    bond yield_equity log-returns_CDS spreads

    Financial data used to construct a bipartite network of systemically important banks and sovereign bonds.
  • Dataset

    FED data

    Quarterly data on US banks' holdings from March 2001 to September 2013. The number of financial institutions present in the data is fairly stable across quarters, starting from...
  • Method

    Nowcasting migration stocks and flows

    This method nowcasts migration stocks and flows by using Twitter data. Under development.
  • Dataset

    BioTAGME: A comprehensive platform for biological knowledge network analysis

    This network was built with BioTAGME, a system that combines TAGME, an entity-annotation framework based on the Wikipedia corpus, with a network-based inference methodology (i.e.,...
  • Dataset

    European Banks Asset Class exposures

    This is a curated dataset whose original data are taken from the European Banking Authority (EBA), which collects banks' data to perform stress-test systemic risk analysis....