approved
Broad Twitter Corpus

The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to provide a representative example of named entities in social media. Its annotations have high agreement and quality, and it has about 12000 entity annotations, of types Person, Location and Organization.

Tags
Data and Resources
To access the resources you must log in
  • Broad Twitter CorpusJSON

    The Broad Twitter Corpus is a named entity-annotated dataset of tweets,...

    The resource: 'Broad Twitter Corpus' is not accessible as guest user. You must login to access it!
Personal Data Attributes

Description: Personal Data related Information

Field Value
ChildrenData No
Personal Data No
Personal data was manifestly made public by the data subject Yes
Additional Info
Field Value
Accessibility Both
Accessibility Mode Download
Accessibility Mode OnLine Access
Availability On-Line
Basic rights Making available to the public
Basic rights Communication
Basic rights Modification
Basic rights Distribution
Basic rights Copying
Basic rights Download
Consent obtained also covers the envisaged transfer of the personal data outside the EU No
Consent of the data subject No
Creation Date 2016-10-01
Creator Derczynski, Leon
DataProtectionDirective Data Protection Act 1998
External Identifier https://gate.ac.uk/wiki/broad-twitter-corpus.html
Field/Scope of use Any use
Format JSON
Group Societal Debates and Misinformation
Language eng, English
Manifestation Type Virtual
PersonalSensitiveData Select PersonalSensitiveData
Processing Degree Secondary
RelatedPaper L. Derczynski, K. Bontcheva, I. Roberts. Broad Twitter Corpus: A Diverse Named Entity Recognition Resource. Proceedings of COLING, 2016
Restrictions on use Credit must be given and the license linked.
SoBigData Node SoBigData EU
Sublicense rights No
Territory of use World Wide
Thematic Cluster Text and Social Media Mining [TSMM]
TimeCoverage 2009-01-01 /2014-12-31
system:type Dataset
Management Info
Field Value
Author Gorrell Genevieve
Maintainer Leon Derczynski, Kalina Bontcheva, Ian Roberts
Version 1
Last Updated 28 October 2023, 10:18 (CEST)
Created 6 September 2018, 14:38 (CEST)