-
UK election abuse data
The GATE team (gate.ac.uk) at the University of Sheffield have collected 1.4 million tweets sent to and by UK members of parliament in the months leading up to the 2015 and...-
XLS
The resource: 'uk-election-abuse.tar.gz' is not accessible as guest user. You must login to access it!
-
XLS
-
Articles and comments of major Estonian newspapers
The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016. -
ClueWeb12
The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information... -
DE webarchive
The dataset consists of all the content from the .de top level domain as crawled by the Internet Archive.-
HTML
The resource: 'Internet Archive Wayback ...' is not accessible as guest user. You must login to access it!
-
HTML
-
Introduction to Data Curation
This course is an introduction to data collection, data preparation & transformation and data analysis. It contains the essential concepts for a researcher in order to...-
PDF
The resource: 'Introduction to Data Curation' is not accessible as guest user. You must login to access it!
-
PDF
-
ClueWeb09
The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on... -
Multi-flow Composition in video streaming channels
This experiment is part the project "Streams of conspiratorial folklore" that investigates online media as a stream of performances, rather than as archives of documents. Our... -
Twitter Monitor
The Twitter Monitor is an interactive Web application designed to access the Twitter stream by exploiting the public Twitter Streaming APIs. The application can manage...-
HTML
The resource: 'Twitter Monitor URL' is not accessible as guest user. You must login to access it!
-
The resource: 'Twitter Monitor method' is not accessible as guest user. You must login to access it!
-
HTML
-
Studying the streaming of the Capitol Raid
This experiment is part the project "Streams of conspiratorial folklore" that investigates online media as a stream of performances, rather than as archives of documents. Our... -
Rhythm management in video streaming channels
This experiment is part the project "Streams of conspiratorial folklore" that investigates online media as a stream of performances, rather than as archives of documents. Our... -
SMAPH Query Entity Linker
The SMAPH system links queries to the entities it mentions, disambiguating mentions if needed. Entities are Wikipedia pages. This problem is known as "entity recognition and...-
HTML
The resource: 'SMAPH documentation' is not accessible as guest user. You must login to access it!
-
HTML
-
-
PDF
The resource: 'Research Article' is not accessible as guest user. You must login to access it!
-
PDF
-
-
PDF
The resource: 'Misinformation Detection ...' is not accessible as guest user. You must login to access it!
-
PDF
-
-
PDF
The resource: 'DEAP-FAKED: Knowledge ...' is not accessible as guest user. You must login to access it!
-
PDF