-
Lexical networks from Polish news articles
The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Polish news articles extracted from the dataset described...-
jsonl
The resource: 'polish_egoNet_w4' is not accessible as guest user. You must login to access it!
-
jsonl
-
Shopping retail synthetic dataset (CTGAN)
Synthetic shopping retail consumption data generated with CTGAN. The dataset provides monthly information on the spending of synthetic customers belonging to two classes (i.e.,...-
CSV
The resource: 'Shopping retail synthetic ...' is not accessible as guest user. You must login to access it!
-
CSV
-
Lexical networks from Finnish news articles
The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Finnish news articles extracted from the dataset...-
jsonl
The resource: 'finnish_egoNet_w4' is not accessible as guest user. You must login to access it!
-
jsonl
-
Compounds with Activity against the Dopamine D2 Receptor
Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...-
ZIP
The resource: 'compound_activity_dopamine_d2' is not accessible as guest user. You must login to access it!
-
ZIP
-
The subTHz regime, first Results on channel measurement: : 170-260 GHz (nearl...
The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands) 170-260 GHz (nearly G-band). IF... -
The subTHz regime, first Results on channel measurement: 75-110 GHz (W-band)
The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands 75-110 GHz (W-band). IF bandwidth has... -
GiveMeSomeCreditSC
The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...-
ZIP
The resource: 'GiveMeSomeCreditSC' is not accessible as guest user. You must login to access it!
-
ZIP
-
Santorini Tweets July-August 2021
This dataset contains 225.501 tweets written by 141.277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...-
ZIP
The resource: 'tweet_santorini.csv' is not accessible as guest user. You must login to access it!
-
ZIP
-
Synthetic Dataset for Causal Analysis
The dataset is a synthetic version of the well-known German Credit dataset (https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data). It includes variables such as...-
CSV
The resource: 'synthetic german data' is not accessible as guest user. You must login to access it!
-
CSV
-
Lexical networks from Lithuanian news articles
The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Lithuanian news articles extracted from the dataset...-
jsonl
The resource: 'lithuanian_egoNet_w4' is not accessible as guest user. You must login to access it!
-
jsonl
-
FANCY Dataset
(NLI) FANCY (FActivity, Negation, Common-sense, hYpernimy) is a new dataset with 4000 sentence pairs concerning complex linguistic phenomena such as factivity, negation,... -
Semantic Networks from news articles (Danish sample)
The Semantic Networks from news articles (Danish sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...-
CSV
The resource: 'Danish_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
-
CSV
-
Air Quality Datasets over L'Aquila Region
These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData. -
Frank Experiments
Dataset with experimental results for the "Frank" hybrid decision-making system, with simulated users. Features: - CA. Co-evolutionary Accuracy. Accuracy reached by the user...-
JSON
The resource: 'Frank Experiments Dataset' is not accessible as guest user. You must login to access it!
-
JSON
-
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...-
ZIP
The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
-
ZIP
-
Semantic Networks from news articles (Italian sample)
The Semantic Networks from news articles (Italian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...-
CSV
The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
-
CSV
-
Visual Analytics for Perfomance Analysis: Dataset of preprocessed distributed...
The dataset includes execution traces from Train-ticket, an established open-source microservices system.The execution traces have been generated across various scenarios, and... -
Shopping retail synthetic dataset (GaussianCopula)
Synthetic shopping retail consumption data generated with GaussianCopula. The dataset provides monthly information on the spending of synthetic customers belonging to two...-
CSV
The resource: 'Shopping retail synthetic ...' is not accessible as guest user. You must login to access it!
-
CSV
-
Semantic Networks from news articles (German sample)
The Semantic Networks from news articles (German sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...-
CSV
The resource: 'German_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
-
CSV
-
DNA 31-mers
A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...-
ZIP
The resource: 'DNA 31-mers' is not accessible as guest user. You must login to access it!
-
ZIP