30 items found

Tags: Text mining

Filter Results
  • Method

    ArchiveSpark

    ArchiveSpark is an Apache Spark framework for easy data access, processing, extraction as well as derivation for Web archives and archival collections. It has a simple and...
    • The resource: 'ArchiveSpark on GitHub' is not accessible as guest user. You must login to access it!
  • Dataset

    Product Reviews for Ordinal Quantification

    This data set comprises a labeled training set, validation samples, and testing samples for ordinal quantification. It appears in our research paper "Ordinal Quantification...
    • The resource: 'Zenodo link' is not accessible as guest user. You must login to access it!
  • Dataset

    Wikipedia Word Embeddings

    Embeddings were created through applying word2vec skipgram to a corpus of wikipedia non-stub articles from a December 2015 English dump with the following parameters: -cbow 0...
    • The resource: 'Embeddings' is not accessible as guest user. You must login to access it!
  • Dataset

    Cherenkov Telescope Data for Ordinal Quantification

    This labeled data set is targeted at ordinal quantification. It appears in our research paper "Ordinal Quantification Through Regularization", which we have published at...
    • The resource: 'Zenodo' is not accessible as guest user. You must login to access it!
  • Dataset

    Learning to quantify: LeQua 2022 datasets

    The aim of LeQua 2022 (the 1st edition of the CLEF “Learning to Quantify” lab) is to allow the comparative evaluation of methods for “learning to quantify” in textual...
    • The resource: 'Zenodo link' is not accessible as guest user. You must login to access it!
  • Method

    Ariadne Dutch Archaeology Named Entity Recognizer

    Identifies terms and phrases in Dutch for analysing archaeological text. The method delivers named entities of archaeological context, physical object, material, time...
    • method-engine
      The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to the library' is not accessible as guest user. You must login to access it!
  • Method

    Ariadne English Archaeology Named Entity Recognizer

    Identifies terms and phrases in English for analysing archaeological text. The method delivers named entities of archaeological context, physical object, material, time...
    • method-engine
      The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
  • Dataset

    Wikinews dataset

    This dataset consists of a sample of 365 news published by Wikinews from November 2004 to June 2014 and annotated with about 5000 entities, each associated with a saliency...
    • JSON
      The resource: 'entity-saliency' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Ecology of the digital world of Wikipedia

    Wikipedia, a paradigmatic example of online knowledge space is organized in a collaborative, bottom-up way with voluntary contributions, yet it maintains a level of reliability...