-
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
ZIP
-
Semantic Networks from news articles (Italian sample)
The Semantic Networks from news articles (Italian sample) dataset contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
CSV
-
Semantic Networks from news articles (German sample)
The Semantic Networks from news articles (German sample) dataset contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
CSV
-
DNA 31-mers
A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
ZIP
-
Semantic Networks from news articles (Dutch sample)
The Semantic Networks from news articles (Dutch sample) dataset contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
CSV
-
Semantic Networks from news articles (English sample)
The Semantic Networks from news articles (English sample) dataset contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
CSV