136 items found

Groups: sobigdata-it

Filter Results
  • Dataset

    DNA 31-mers

    A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
    • ZIP
      The resource: 'DNA 31-mers' is not accessible as guest user. You must login to access it!
  • Dataset

    The subTHz regime, channel measurements with no line-of-sight conditions in t...

    Dataset of channel measurements obtained in the frequency band 500-750 GHz in non line-of-sight conditions. The measurements have been conducted using a Keysight PNA Vector...
    • s2p
      The resource: '14Rot45deg_500750' is not accessible as guest user. You must login to access it!
    • s2p
      The resource: '15Rot90deg_500750' is not accessible as guest user. You must login to access it!
  • Dataset

    FAIR-SWENG: dataset on gender fairness in software engineering academic lands...

    The dataset contains academic performance metrics of Software Engineers worldwide.
  • Dataset

    Semantic Networks from news articles (Dutch sample)

    The Semantic Networks from news articles (Dutch sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Dutch_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (English sample)

    The Semantic Networks from news articles (English sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
  • Dataset

    The subTHz regime, channel measurements with no line-of-sight conditions in t...

    Dataset of channel measurements obtained in the frequency band 170-260 GHz in non line-of-sight conditions. The measurements have been conducted using a Keysight PNA Vector...
    • s2p
      The resource: '14Rot45deg_G' is not accessible as guest user. You must login to access it!
    • s2p
      The resource: '15Rot90deg_G' is not accessible as guest user. You must login to access it!
  • Dataset

    Twitter dataset on coordinated behavior in 2019 UK General Election

    This dataset contains ~11M tweets related to the 2019 United Kingdom General Election, published and collected between November 12, 2019, and December 12, 2019. In addition,...
    • The resource: 'Dataset' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Telecom Traffic Distribution Dataset

    The dataset contains aggregate - hourly, daily, weekly - cellular traffic demand data of individual base stations deployed in the metropolitan area of Milan and Trento...
  • Dataset

    Twitter dataset on coordinated behavior in 2020 USA Presidential Election

    This dataset contains ~140M tweets related to the 2020 United States Presidential Election, published and collected between October 2, 2020, and December 2, 2020. In addition,...
    • The resource: 'dataset' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Vehicular trip dataset extracted from black boxes embedded on vehicles

    The data contains vehicular trips denoted by a trip identifier (unique, reset after each vehicle engine shutdown) and a set of latitude/longitude coordinates, representing...
  • Dataset

    Twitter Conspiracy Dataset

    This repository contains the Twitter dataset used to investigate the traits of 7,394 conspiracy users and 7,394 random users collected in 2022. Both the profile's info and the...
    • The resource: 'Twitter Conspiracy Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Dataset on online cryptocurrency discussion on Twitter, Telegram, and Discord

    This Dataset contains Twitter, Telegram and Discord data on online discussions on cryptocurrency. Starting from tweets mentioning cryptocurrencies, we leveraged and followed...
    • The resource: 'dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    HANSEN: Spoken Text Authorship Analysis

    HANSEN encom- passes meticulous curation of existing speech datasets accompanied by transcripts, along- side the creation of novel AI-generated spo- ken text datasets....
    • The resource: 'Datasets' is not accessible as guest user. You must login to access it!
  • Dataset

    Twitter Newcomers Dataset

    Twitter accounts detected right after registration and monitored for 21 days
    • ZIP
      The resource: 'New Accounts Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Italian Tourism Dataset

    A set of users' comments crawled and scraped from two main touristic websites (Booking.com and Tripadvisor.com) related to main touristic point of interests in Italy and, in...
    • HTML
      The resource: 'tourism-dataset' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private 64-tiles tessellation of Chicago

    Squared tessellation of the city center of Chicago, Illinois, into 64 tiles. Tessellation only of the central part of Chicago, namely the neighborhoods 'LOOP', 'NEAR SOUTH...