245 items found

Groups: sobigdata-eu

Filter Results
  • Dataset

    Lexical networks from Swedish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Swedish news articles extracted from the dataset described...
    • jsonl
      The resource: 'swedish_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Spotify track dataset (small)

    The dataset is created exploiting the Spotify API and the tracks id provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...
    • ZIP
      The resource: 'std_small' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (French sample)

    The Semantic Networks from news articles (French sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Shopping retail synthetic dataset (CopulaGAN)

    Synthetic shopping retail consumption data generated with CopulaGAN. The dataset provides monthly information on the spending of synthetic customers belonging to two classes...
    • ZIP
      The resource: 'Shopping retail synthetic ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Spanish sample)

    The Semantic Networks from news articles (Spanish sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Spanish_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Identified CNVs from whole exome sequencing data of BRCA1/2 negative breast c...

    This dataset offers a comprehensive analysis of Copy Number Variations (CNVs) identified in Whole Exome Sequencing (WES) data from patients with breast cancer who tested...
  • Dataset

    The subTHz regime, channel measurements with no line-of-sight conditions in t...

    Dataset of channel measurements obtained in the frequency band 75-110 GHz in non line-of-sight conditions. The measurements have been conducted using a Keysight PNA Vector...
    • s2p
      The resource: '70-110 GHz 45 degrees' is not accessible as guest user. You must login to access it!
    • s2p
      The resource: '70-110 GHz 90 degrees' is not accessible as guest user. You must login to access it!
  • Dataset

    Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...

    Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...
    • CSV
      The resource: 'Synthetic Datasets for ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Romanian sample)

    The Semantic Networks from news articles (Romanian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Romanian_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Dataset

    Wi-Fi channel frequency response database for contactless human activity reco...

    This database collects the channel frequency response (CFR) vectors captured through the Nexmon CSI extraction tool from an Asus RT-AC86U IEEE 802.11ac Wi-Fi router working with...
    • The resource: 'Wi-Fi channel frequency ...' is not accessible as guest user. You must login to access it!
  • Method

    Reducing radicalizism in social networks by feeds prioritization - Rebalancin...

    Code and description of the methodology of the paper "Rebalancing Social Feed to Minimize Polarization and Disagreement" funded by SoBigData ++
  • Dataset

    Lexical networks from Polish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Polish news articles extracted from the dataset described...
    • jsonl
      The resource: 'polish_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Shopping retail synthetic dataset (CTGAN)

    Synthetic shopping retail consumption data generated with CTGAN. The dataset provides monthly information on the spending of synthetic customers belonging to two classes (i.e.,...
    • CSV
      The resource: 'Shopping retail synthetic ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Lexical networks from Finnish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Finnish news articles extracted from the dataset...
    • jsonl
      The resource: 'finnish_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Compounds with Activity against the Dopamine D2 Receptor

    Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...
    • ZIP
      The resource: 'compound_activity_dopamine_d2' is not accessible as guest user. You must login to access it!
  • Dataset

    GiveMeSomeCreditSC

    The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...
    • ZIP
      The resource: 'GiveMeSomeCreditSC' is not accessible as guest user. You must login to access it!
  • Dataset

    Santorini Tweets July-August 2021

    This dataset contains 225.501 tweets written by 141.277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
    • ZIP
      The resource: 'tweet_santorini.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    Synthetic Dataset for Causal Analysis

    The dataset is a synthetic version of the well-known German Credit dataset (https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data). It includes variables such as...
    • CSV
      The resource: 'synthetic german data' is not accessible as guest user. You must login to access it!
  • Dataset

    Lexical networks from Lithuanian news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Lithuanian news articles extracted from the dataset...
    • jsonl
      The resource: 'lithuanian_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    FANCY Dataset

    (NLI) FANCY (FActivity, Negation, Common-sense, hYpernimy) is a new dataset with 4000 sentence pairs concerning complex linguistic phenomena such as factivity, negation,...
    • The resource: 'FANCY Dataset' is not accessible as guest user. You must login to access it!