228 items found

Organisations: SoBigData Catalogue

Filter Results
  • Dataset

    Brexit Tweets Linked Domains

    In this spreadsheet we share domains linked in the UK EU membership referendum tweet collection. Counts for links by leave voters and remain voters are given, enabling sites...
    • ODS
      The resource: 'Brexit Tweets Linked ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Brexit Twitter User Vote Intent

    A list of users for which vote intent in the UK EU membership referendum has been established.
  • Dataset

    Medical Dataset

    The medical dataset contains a corpus of fully anonymized clinical text. Each document in the corpus is associated with a set of ICD-9 codes which represents the diagnosis...
    • ZIP
      The resource: 'Medical Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Call Data Record Pisa 2012

    The dataset contains mobile phone records collected in the province of Pisa in February 2012. It contains about 8 mln of Call Data Records (CDR) of about 230.000 phone users,...
  • Dataset

    Retail market dataset

    The dataset contains purchases of Unicoop Tirreno customers, description and information of the shops (both small shops and supermarkets) and the customers.
  • Dataset

    A public data set of spatio-temporal match events in soccer competitions

    Soccer analytics is attracting increasing interest in academia and industry, thanks to the availability of sensing technologies that provide high-fidelity data streams for every...
    • JSON
      The resource: 'Soccer match event dataset' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'code to reproduce plots in ...' is not accessible as guest user. You must login to access it!
  • Method

    Economic Integration Model

    This model allows to understand the integration process of immigrants starting from retail data. Under development.
  • Dataset

    Emergency Tweets 2014 Genoa flood

    This dataset contains Italian tweets collected during and in the aftermath of the floods that occurred near the city of Genoa between 9 and 11 October 2014...
    • ZIP
      The resource: 'FLO-GEN.zip' is not accessible as guest user. You must login to access it!
  • Dataset

    Testing NBA dataset

    Just for platform reviewing
    • CSV
      The resource: 'nbaStats.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    Global Peace Index data

    A dataset of the Global Peace Index (GPI), which ranks 163 independent states and territories according to their level of peacefulness. The GPI covers 99.7 per cent of the...
  • Dataset

    Sheffield NERD Tweet Corpus

    The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...
    • FINF
      The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
  • Method

    Modelling Scientific Migration

    This method is an adaptation of the general migration models to understand scientific migration. Under development.
  • Dataset

    CDR data - Tuscany

    The dataset contains mobile phone records collected in Tuscany between September 2015 and August 2016. It contains Call Data Records (CDRs) of phone users, and the corresponding...
  • Dataset

    DE webarchive

    The dataset consists of all the content from the .de top level domain as crawled by the Internet Archive.
    • HTML
      The resource: 'Internet Archive Wayback ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Aalto-Twitter

    The dataset consists of about 418 million of tweets from June 25, 2015 to September 19, 2015. Tweets are about trending hashtags gathered though the public Twitter api.
  • Dataset

    Yeast

    The yeast dataset is a collection of yeast microarray expressions and phylogenetic profiles which can be used to learn the yeast gene functional categories. One row of this...
    • arff
      The resource: 'Yeast Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    UK General Election Vote Intent

    A list of Twitter users for whom party political allegiance/vote intent has been established.
  • Dataset

    Emergency Tweets 2013 Sardinia flood

    This dataset is related to the floods that occurred in the Sardinia regional district between 17 and 19 November 2013 (https://en.wikipedia.org/wiki/2013_Sardinia_floods), as...
    • ZIP
      The resource: 'FLO-SAR.zip' is not accessible as guest user. You must login to access it!
  • Method

    A New Topological Approach for the Prediction of Protein-Protein Interactions

    We propose, Maximum-Proteins-Similarity(Topological)": MPS(T). MPS(T) is a topological three-length path method that scores the potential interaction between proteins by...