133 items found

Tags: Text mining

  • TrainingMaterial

    Introduction to Data Curation

    This course is an introduction to data collection, data preparation & transformation and data analysis. It contains the essential concepts for a researcher in order to...
    • PDF
  • Dataset

    Twitter social bots

    Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
  • Dataset

    Broad Twitter Corpus

    The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...
    • JSON
  • Method

    Gene-specific regularization for COPD partial-correlation estimation

    We introduce a gene-specific regularization factor when computing the Partial Correlation score to make the indeterminate regression feasible. We decided to slightly modify...
  • Dataset

    Twitter fake followers

    Fake followers are fake accounts created en masse to follow a target account, and they can be bought from online markets. In other words, their goal is that of increasing the...
  • Dataset

    Twitter dataset about two premier UK music festivals

    The dataset contains Twitter posts about two premier UK music festivals: Creamfields 2016 (August 25th-28th) and VFestival 2016 (August 20th-21st).
    • GitHub
  • Method

    Measurement Expression Annotator

    Annotates numbers and measurement expressions in text. This method recognises many types of measurements including length, temperature, time and speed, and calculates their...
    • method-engine
  • Method

    Python library for direct and indirect discrimination prevention in data mining

    This Python library implements the discrimination discovery and prevention method proposed in the paper: “A methodology for direct and indirect discrimination prevention in...
    • GitHub
  • Application

    SWAT

    SWAT is an entity-salience system which identifies, on the fly, the semantic focus of a document, expressed by its Salient Wikipedia Entities. The core of this technology is...
  • Method

    Twitter Opinion Mining English

    This tool recognises opinionated sentences in English tweets and classifies them as positive or negative. It also indicates emotion type, author and target of the opinion,...
    • method-engine
  • Method

    Summa Text Summarization (Es)

    The SUMMA Text Summarization (ES) uses the SUMMA toolkit developed by Horacio Saggion to provide a generic Spanish document summarizer.
    • method-engine
  • Method

    GATE Cloud COVID-19 Misinformation Categoriser

    A machine learning classifier trained to categorise claims about COVID-19 into 10 categories proposed by the Reuters Institute for the Study of Journalism - Public authority...
    • method-engine
  • Method

    DecarboNet Environmental Annotator

    The DecarboNet environmental annotation service identifies named entities, environmental terms, linguistic features and sentiment in social media texts.
    • method-engine
  • Application

    WAT

    WAT is an entity linker, namely a tool that identifies meaningful substrings (called "spots") in an unstructured English text and links each of them to the unambiguous entity...
    • HTML
  • Method

    Part Of Speech Tagger For Tweets

    This service tags tweets with part-of-speech information, e.g. nouns and verbs.
    • method-engine
  • Method

    TriplEx - Explaining with Triples

    TRIPLEX is an explainability package for Transformer-based models fine-tuned on Natural Language Inference, Semantic Text Similarity, or Text Classification tasks. TRIPLEX...
  • Method

    GATE Cloud Rumour Veracity Classifier

    User-generated content such as tweets often makes claims that are unsubstantiated and possibly untrue. This service attempts to classify whether a text is discussing a rumour...
    • method-engine
  • Method

    GSP - Geo-Semantic-Parsing

    GSP receives a text document as input and returns an enriched document, where all mentions of places/locations are associated with the corresponding geographic coordinates. To...
  • Method

    German Named Entity Recognizer For Tweets

    This method analyses German tweets for names of persons, locations and organizations. It also performs normalization of abbreviations and common Twitter slang.
    • method-engine
  • Method

    GATE Cloud Brexit Tweet Analysis

    A pipeline designed to detect political topics, hashtags, URLs, user mentions, and hashtag-based voting intentions expressed in tweets about the UK referendum on membership of...
    • method-engine