102 items found

Organisations: SoBigData Catalogue

Filter Results
  • Method

    Prediction of next career moves from scientific profiles

    This is a two-stage predictive model for the mobility of scientists. First, data mining is used to predict which researcher will move in the next year on the basis of their...
  • Dataset

    ISTAT Census zone Tuscany

    This dataset contains the geometry of about 20.000 census sectors and limited demographic information of Tuscany region (Italy).
    • ZIP
      The resource: 'Istat Dataset ' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'geo-annotated tweets.zip' is not accessible as guest user. You must login to access it!
  • Dataset

    Flickr and Wikipedia Tourism Trajectories

    The dataset contains a knowledge base built with data coming from Flickr and Wikipedia. It covers three Italian cities which are important from a sightseeing point of view and...
    • ZIP
      The resource: 'TripBuilder' is not accessible as guest user. You must login to access it!
  • Dataset

    Emergency Tweets 2011 Christchurch earthquake

    This dataset contains tweets related to the devastating earthquake occurred on 22 February 2011, at around 12 p.m. local time in Christchurch, New Zealand...
    • CSV
      The resource: 'EAQ-CHR_tweets.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    CDR Data - Rome

    The dataset contains mobile phone records collected in Rome between November 2015 and August 2016. It contains Call Data Records (CDRs) of phone users, and the corresponding...
  • Method

    Nowcasting migration stocks and flows

    This method nowcasts migration stocks and flows by using Twitter data. Under development.
  • Dataset

    Emergency Tweets 2013 Milan blackout

    This dataset is related to a power outage (i.e., a blackout) that occurred in the city of Milan, in northern Italy, in the night between 14 and 15 May 2013. Despite not...
    • CSV
      The resource: 'PWO-MIL_tweets.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    Call Data Record Tuscan cities 2014

    The dataset contains mobile phone records collected in the provinces of Pisa, Lucca, Livorno and Firenze in March 2014. It counts about 50 mln of Call Data Records (CDR) of...
  • Dataset

    City-to-city migration

    Census data recording the migration of people between metropolitan areas in the US
  • Dataset

    Emergency Tweets 2009 L'Aquila earthquake

    This dataset comprises 1,100 Italian tweets shared in the aftermath of the 2009 L’Aquila earthquake (https://en.wikipedia.org/wiki/2009_L%27Aquila_earthquake). The earthquake...
    • ZIP
      The resource: 'EAQ-LAQ.zip' is not accessible as guest user. You must login to access it!
  • Dataset

    GPS Origin Destination Matrix in Tuscany

    This dataset is the origin and destination matrix among the municipalities of Tuscany extracted starting from GPS tracks of private vehicles collected from 2014-02-10 to...
    • CSV
      The resource: ' GPS Origin Destination Matrix' is not accessible as guest user. You must login to access it!
  • Dataset

    Call Data Record District of Pisa 2013 October

    The dataset contains mobile phone records collected in the provinces of Pisa, Lucca, Livorno and Firenze in October 2013. It contains about 60 mln of Call Data Records (CDR),...
  • Dataset

    Official administrative information of Tuscany

    The data contains the spatial partitioning of Tuscany and some statistical information published by the Italian Statistical Bureau.
    • LOD
      The resource: 'Linked Open Data' is not accessible as guest user. You must login to access it!
  • Dataset

    Open data from NervousNet

    This dataset contains anonymized proximity information sent by 154 mobile phones (both Android and iPhone) via phone apps. These information are sent by bluetooth beacons every...
    • ZIP
      The resource: 'open data from NervousNet' is not accessible as guest user. You must login to access it!
  • Dataset

    Car sharing dataset

    The dataset comprises pickup and drop-off times and locations of vehicles in 10 European cities for one of the major free-floating car sharing operator. For nine of these...
  • Dataset

    GeoLife - GPS trajectories dataset

    This (link to a) GPS trajectory dataset was collected in (Microsoft Research Asia) Geolife project by 182 users in a period of over three years (from April 2007 to August 2012)....
    • ZIP
      The resource: 'GeoLife Download page' is not accessible as guest user. You must login to access it!
  • Dataset

    GPS Tracks - Tuscany 2011

    This dataset contains GPS trajectories of private vehicles crossing the region of Tuscany in Italy. It is composed of about 11 mln of trips of 150.000 users collected in May...
  • Dataset

    Twitter dataset about two premier UK music festivals

    The dataset contains twitter posts about two premier UK music festivals: Creamfields 2016 (on August 25th-28th) and VFestival 2016 (on August 20th-21st).
    • Github
      The resource: 'Twitter dataset about two ...' is not accessible as guest user. You must login to access it!
  • Method

    Twitter preprocessor

    Tokeniser, lemmatiser, extraction of negation. Under development.