228 items found

Organisations: SoBigData Catalogue Groups: sobigdata-eu

Filter Results
  • Dataset

    Supporting data for "CoVEffect: Interactive System for Mining the Effects of ...

    This repository contains the datasets created and extracted for the paper: Giuseppe Serna García, Ruba Al Khalaf, Francesco Invernici, Stefano Ceri, and Anna Bernasconi. 2022....
    • The resource: 'Supporting data for ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Application

    Private Generative Datalog

    Generative Datalog is an extension of Datalog that incorporates constructs for referencing parameterized probability distributions. This augmentation transforms the evaluation...
  • Application

    NetMe

    The huge amount of biological literature, which daily increases, represents a strategic resource to automatically extract and gain knowledge concerning relations among...
    • The resource: 'NetME English Tutorial' is not accessible as guest user. You must login to access it!
  • Experiment

    EnviroStream (Benchmark)

    Stream Reasoning (SR) focuses on developing advanced approaches for applying inference to dynamic data streams; it has become increasingly relevant in various application...
    • The resource: 'EnviroStream Benchmark' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Origin and destination attachment from Twitter

    The cultural integration of immigrants conditions their overall socio-economic integration as well as natives' attitudes towards globalisation in general and immigration in...
  • Dataset

    EMAKG: Enhanced Microsoft Academic Knowledge Graph

    The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and...
    • The resource: 'Link to dataset.' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to scientific ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Bark Beetle Outbreak Czech Republic

    Repository containing satellite dataset created for bark beetle outbreak detection in satellite (Sentinel-1 and Sentinel-2) images. The dataset refer to scenes observed in...
    • The resource: 'Czech Republic' is not accessible as guest user. You must login to access it!
  • Dataset

    PaintNet: Unstructured Multi-Path Learning from 3D Point Clouds for Robotic S...

    We introduce the PaintNet dataset to accelerate research on supervised learning for multi-path prediction conditioned on free-shape 3D objects. PaintNet includes more than 800...
    • The resource: 'PaintNet: Unstructured ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Cybersecurity NER dataset

    Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and...
  • Dataset

    Iperf K8s-based Power and Resource consumption dataset

    The data were collected in a Prometheus-like data format: each entry has a timestamp, a value and key-value labels containing additional information. Metrics were gathered...
    • CSV
      The resource: '5G_Power_and_Resource_consu ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Air Traffic Data International Mobility Indicators for the UK

    The Air Traffic Data International Mobility Indicators for the UK results from the investigation on air passenger data. Starting from air passenger traffic volumes from each...
  • Dataset

    Stroke and sepsi

    The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by...
    • The resource: 'Stroke and sepsi' is not accessible as guest user. You must login to access it!
  • Dataset

    Know your trees dataset

    A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...
    • ZIP
      The resource: 'Dataset Know Your Trees ...' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Vegetation of a basin of the Po river Dataset

    We provide two climatological dataset composed by D = 136 (with 1038 samples) and D = 1991 (with 981 samples) continuous climatological features and a scalar target, which...
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Word-in-Context task for Italian

    The general goal of the WiC-ITA task is to establish whether a word w occurring in two different sentences, s_1 and s_2, has the same meaning or not. In particular, our task...
  • Access required...

    ×

    Dataset

    Private EnviroStream

    This repository contains datasets, queries and a generator for the EnviroStream, a benchmark for Stream Reasoning (SR) systems. SR focuses on applying inference to dynamic...
  • Dataset

    Human and mouse gene regulatory networks

    The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available...
    • The resource: 'Human and mouse gene ...' is not accessible as guest user. You must login to access it!