approved
Web Archive Collection Extractor

This method extracts event-centric collections of Web Archives through a focused crawling method. The key of this method is to adapt focused Web crawling to previously collected Web archives and to select documents by iteratively following links from relevant documents.

Tags
Data and Resources
To access the resources you must log in
Additional Info
Field Value
Accessibility Both
AccessibilityMode Download
Availability On-Line
Basic rights Making available to the public
Basic rights Communication
Basic rights Modification
Basic rights Distribution
Basic rights Copying
Basic rights Download
CreationDate 2016-12-15
Creator Gossen, Gerhard, [email protected], orcid.org/0000-0001-8492-1103
Dependencies on Other SW Hadoop
Field/Scope of use Any use
Group Societal Debates and Misinformation
Owner Gossen, Gerhard, [email protected], orcid.org/0000-0001-8492-1103
ProgrammingLanguage Java
RelatedPaper Gerhard Gossen, Elena Demidova, and Thomas Risse. 2016. Analyzing web archives through topic and event focused sub-collections. In Proceedings of the 8th ACM Conference on Web Science (WebSci '16). DOI: 10.1145/2908131.2908175
Sublicense rights No
Territory of use World Wide
Thematic Cluster Web Analytics [WA]
UsageMode Download
system:type Method
Management Info
Field Value
Author Gossen Gerhard
Maintainer Gossen Gerhard
Version 1
Last Updated 16 September 2023, 10:12 (CEST)
Created 6 September 2018, 14:39 (CEST)