-
Private Cybersecurity NER BERT-base-cased model
This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that... -
Private Cybersecurity NER dataset
Our dataset was created by merging the APTNER and CyNER datasets and contains 13,601 sentences, 347,779 tokens, and 37,684 entities. The split ratio was roughly 70% for training and... -
Cybersecurity NER SecureBERT model
This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...-
Cybersecurity NER RoBERTa-base model
This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...-
Gene Disease Association Data and Features
This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...-
Multi-sensor dataset of environmental office room conditions
The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...-
Multi-sensor dataset of environmental outdoor home conditions
The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed outside a smart domestic room located in... -
UWB RADAR dataset of human activity detection in smart office
The UWB RADAR dataset consists of time series data acquired from a UWB RADAR deployed in a smart office room located in ICAR-CNR, for human activity detection. Raw...-
Multi-sensor dataset of environmental conditions in smart office
The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in a smart office located in the ICAR CNR IoT...-
User preference-interest dataset
The User preference-interest dataset is a comprehensive collection of preferences generated by a sequence of 6 regimes following the rules below: - initially, we have... -
Weather and Pollution in Smart Cities
A set of weather and climatic conditions gathered during the Toolsmart PoN project (Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT-based...-
Multi-sensor dataset of environmental indoor home conditions
The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed inside a smart domestic room located in...-
Private Smart Cities Weather and Pollution conditions
A set of weather and climatic conditions gathered during the Toolsmart PoN project (Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT-based... -
Private Italian Thesaurus for Tourism domain
An Italian thesaurus for the tourism domain, comprising 2,684 concepts organized according to semantic relationships (equivalence, hierarchical and associative). The... -
Compounds with Activity against the Dopamine D2 Receptor
Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...-
Santorini Tweets July-August 2021
This dataset contains 225,501 tweets written by 141,277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...-
FANCY Dataset
(NLI) FANCY (FActivity, Negation, Common-sense, hYpernimy) is a new dataset with 4000 sentence pairs concerning complex linguistic phenomena such as factivity, negation,... -
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...-
Italian Tourism Dataset
A set of user comments crawled and scraped from two major tourism websites (Booking.com and Tripadvisor.com), related to the main tourist points of interest in Italy and, in...-