-
Introduction to Data Curation
This course is an introduction to data collection, data preparation and transformation, and data analysis. It contains the essential concepts for a researcher in order to...
PDF
-
Twitter social bots
Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
-
Broad Twitter Corpus
The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...
JSON
-
Gene-specific regularization for COPD partial-correlation estimation
We introduce a gene-specific regularization factor when computing the Partial Correlation score to make the indeterminate regression feasible. We decided to slightly modify...
-
Twitter fake followers
Fake followers are fake accounts created en masse to follow a target account; they can be bought from online markets. In other words, their goal is that of increasing the...
-
Twitter dataset about two premier UK music festivals
The dataset contains Twitter posts about two premier UK music festivals: Creamfields 2016 (on August 25th-28th) and VFestival 2016 (on August 20th-21st).
GitHub
-
Measurement Expression Annotator
Annotates numbers and measurement expressions in text. This method recognises many types of measurements including length, temperature, time and speed, and calculates their...
method-engine
-
Python library for direct and indirect discrimination prevention in data mining
This Python library implements the discrimination discovery and prevention method proposed in the paper: “A methodology for direct and indirect discrimination prevention in...
GitHub
-
SWAT
SWAT is an entity-salience system that identifies, on the fly, the semantic focus of a document, expressed by its Salient Wikipedia Entities. The core of this technology is...
-
Twitter Opinion Mining English
This tool recognises opinionated sentences in English tweets and classifies them as positive or negative. It also indicates emotion type, author and target of the opinion,...
method-engine
-
Summa Text Summarization (Es)
The SUMMA Text Summarization (ES) uses the SUMMA toolkit developed by Horacio Saggion to provide a generic Spanish document summarizer.
method-engine
-
GATE Cloud COVID-19 Misinformation Categoriser
A machine learning classifier trained to categorise claims about COVID-19 into 10 categories proposed by the Reuters Institute for the Study of Journalism - Public authority...
method-engine
-
DecarboNet Environmental Annotator
The DecarboNet environmental annotation service identifies named entities, environmental terms, linguistic features and sentiment in social media texts.
method-engine
-
WAT
WAT is an entity linker, namely a tool that identifies meaningful substrings (called "spots") in an unstructured English text and links each of them to the unambiguous entity...
HTML
-
Part Of Speech Tagger For Tweets
This service tags tweets with part-of-speech information, e.g. nouns and verbs.
method-engine
-
TriplEx - Explaining with Triples
TRIPLEX is an explainability package for Transformer-based models fine-tuned on Natural Language Inference, Semantic Text Similarity, or Text Classification tasks. TRIPLEX...
-
GATE Cloud Rumour Veracity Classifier
User-generated content such as tweets often makes claims that are unsubstantiated and possibly untrue. This service attempts to classify whether a text is discussing a rumour...
method-engine
-
GSP - Geo-Semantic-Parsing
GSP receives a text document as input and returns an enriched document, where all mentions of places/locations are associated with the corresponding geographic coordinates. To...
-
German Named Entity Recognizer For Tweets
This method analyses German tweets for names of persons, locations and organizations. It also performs normalization of abbreviations and common Twitter slang.
method-engine
-
GATE Cloud Brexit Tweet Analysis
A pipeline designed to detect political topics, hashtags, URLs, user mentions, and hashtag-based voting intentions expressed in tweets about the UK referendum on membership of...
method-engine