approved
DE webarchive

The dataset consists of all the content from the .de top level domain as crawled by the Internet Archive.

Tags
Data and Resources
To access the resources you must log in
  • Internet Archive Wayback MachineHTML

    The original dataset is accessible through the Internet Archive's Wayback...

    The resource: 'Internet Archive Wayback ...' is not accessible as guest user. You must login to access it!
Personal Data Attributes

Description: Personal Data related Information

Field Value
ChildrenData No
Personal Data Yes
Personal data was manifestly made public by the data subject Yes
Sensitive Data No
Additional Info
Field Value
Accessibility Trans National Access
Accessibility Mode API Access
Attribution requirements See https://archive.org/about/terms.php
Availability On-Line
Basic rights Other rights
Consent obtained also covers the envisaged transfer of the personal data outside the EU No
Consent of the data subject No
Creation Date 1994-12-02 - 2013-09-30
Creator Internet Archive, San Francisco
DataProtectionDirective Unknown
DiskSize 60000000
Distribution requirements No re-distribution allowed
Field/Scope of use Research only
Format application/warc,application/arc
Group Societal Debates and Misinformation
IP/Copyrights Content in the dataset may fall under copyright law
Manifestation Type Replica
Processing Degree Primary
Restrictions on use Access according to the Internet Archive's Terms of Use (https://archive.org/about/terms.php). No replicas may be provided. Content in the dataset may fall under copyright and data protection law.
Semantic Coverage germany
SoBigData Node SoBigData EU
Sublicense rights No
Territory of use World Wide
Thematic Cluster Web Analytics [WA]
TimeCoverage 1994-12-02 /2013-09-30
system:type Dataset
Management Info
Field Value
Author Gerhard Gossen
Maintainer Gerhard Gossen
Version 1
Last Updated 28 October 2023, 10:21 (CEST)
Created 6 September 2018, 14:39 (CEST)