-
Experimental results from the Empirical Investigation of the Completeness of ...
This is the raw data from the empirical investigation of the paper “Completeness of Datasets Documentation on ML/AI repositories: an Empirical Investigation”. This work aim of...
-
Gene Disease Association Data and Features
This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...
RAR
-
Reddit Echo Chamber dataset
In a digital environment, the term echo chamber refers to an alarming phenomenon in which beliefs are amplified or reinforced by communication repetition inside a closed...
ZIP
-
Twitter EURO2020: BLM debate in Italy
Twitter dataset for "Will You Take the Knee? Italian Twitter Echo Chambers' Genesis During EURO 2020". The dataset comprises the following files:...
JSON
-
Papers on Gender Bias in Academic Promotions
This dataset contains the result of a systematic mapping study conducted to analyse how the issue of gender bias in academic promotions has been addressed by the literature....
CSV
-
DNA 12-mers
A 179 MB dataset containing all the ~14M unique 12-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
ZIP
-
Compounds with Activity against the Dopamine D2 Receptor
Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...
ZIP
-
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
ZIP
-
DNA 31-mers
A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
ZIP