44 items found

Types: Dataset Groups: Others

  • Dataset

    Ego Networks of Words in Twitter

    This set of dataframes was used in our last paper: Ollivier K, Boldrini C, Passarella A, Conti M (2022) Structural invariants and semantic fingerprints in the “ego network”...
  • Dataset

    Dataset for Evaluating Abstractive Summaries of Crisis-Related Social Media

    This dataset was created to evaluate summaries generated from social media posts published during five natural disasters. The dataset contains: ground truth reports created by human...
    • Resource: Dataset for Evaluating ...
  • Dataset

    CoPhIR

    The CoPhIR (Content-based Photo Image Retrieval) Test-Collection has been developed to make significant tests on the scalability of the SAPIR project infrastructure (SAPIR:...
    • Resource: cophir.isti.cnr.it
  • Dataset

    WIRE dataset

    This dataset consists of 503 pairs of Wikipedia entities drawn from the New York Times dataset, each with a human-assigned relatedness score. The domain experts based their...
    • HTML: WikipediaRelatedness
    • CSV: WIRE dataset
  • Dataset

    VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination ...

    We create a publicly available dataset of over 3,100 COVID-19 vaccine-related tweets labeled as one of four stance categories: pro-vaxx, anti-vaxx, vaxx-hesitant, or...
    • Resource: Zenodo Dataset Link
  • Dataset

    Amazon Network

    The network was collected by crawling the Amazon website. It is based on the "Customers Who Bought This Item Also Bought" feature of the Amazon website. If a product i is frequently...
    • HTML: Amazon Network
  • Dataset

    Amazon reviews

    This (link to the) dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014. This dataset includes reviews...
    • HTML: Julian McAuley's repository
  • Dataset

    Facebook EuroSys 2009

    This dataset contains social and interaction graphs representing two large-scale Facebook regional networks. Social graphs describe Facebook friendships between users...
    • Resource: The Facebook EuroSys'09 ...
  • Dataset

    MAMe dataset

    The MAMe dataset is an image classification dataset with remarkably high resolution and variable shape properties. The goal of MAMe is to provide a tool for studying the...
    • Resource: MAMe Dataset page
  • Dataset

    Cross-Lingual Dataset of Crisis-Related Social Media

    If you use this dataset, please cite the following paper: Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive...
    • Resource: Cross-Lingual Dataset of ...
  • Dataset

    DBLP Network

    The DBLP computer science bibliography provides a comprehensive list of research papers in computer science. This dataset is a co-authorship network constructed upon the DBLP...
    • HTML: DBLP Network
  • Dataset

    The Italian Music Dataset

    The dataset is built by exploiting the Spotify and SoundCloud APIs. It is composed of over 14,500 different songs of both famous and less famous Italian musicians. Each song...
    • JSON: Dataset
  • Dataset

    Ukraine-related Disinformation Dataset

    Ukraine-related disinformation dataset from "Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation" (accepted at SocInfo...
    • Resource: Zenodo Dataset Link
  • Dataset

    GERDAQ Dataset

    This is a benchmark dataset of annotated search-engine queries. Mentions of entities in search-engine queries are tagged with the entity they refer to. Wikipedia is used as...
    • XML: GERDAQ dataset
  • Dataset

    Facebook - New Orleans regional network

    This dataset contains information about 90,269 users and 3,646,662 friendship links between those users. These users belong to the New Orleans Facebook regional network. The...
    • HTML: New Orleans Facebook dataset
  • Dataset

    German Academic Web

    The dataset contains regular crawls of the websites of German academic institutions.
  • Dataset

    MSN Search query log

    The data consists of an MSN Search query log excerpt with 15 million queries, from US users, sampled over one month of activity. Data attributes made available per query: 1)...
  • Dataset

    A dataset of gamers on Twitter

    This gaming-related dataset consists of 8932 users (labeled as gamers) engaging in game-related conversations. In June 2018, we collected their timelines (the most recent 3200...
    • Resource: Gamers dataset
  • Dataset

    Product Reviews for Ordinal Quantification

    This data set comprises a labeled training set, validation samples, and testing samples for ordinal quantification. It appears in our research paper "Ordinal Quantification...
    • Resource: Zenodo link
  • Dataset

    Wikipedia Word Embeddings

    Embeddings were created by applying word2vec skip-gram to a corpus of non-stub Wikipedia articles from a December 2015 English dump with the following parameters: -cbow 0...
    • Resource: Embeddings