approved
Cybersecurity NER RoBERTa-base model

This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be fed to the model. The method's output is a list of predictions, i.e. the labels assigned to each token of each of the sentences fed to the model. INPUT= sentences: string[] OUTPUT= predictions: list(list(dict(string, string))) The transformers model RoBERTa was pretrained in a self-supervised manner using a sizable corpus of English data. This indicates that it was pretrained using only the raw texts and an algorithmic procedure to create inputs and labels from those texts. No human labeling was done at all, which allows it to use a large amount of publicly available data.

Tags
Data and Resources
To access the resources you must log in
  • configJSON

    The resource: 'config' is not accessible as guest user. You must login to access it!
  • mergesTXT

    The resource: 'merges' is not accessible as guest user. You must login to access it!
  • modelBIN

    The resource: 'model' is not accessible as guest user. You must login to access it!
  • model_argsJSON

    The resource: 'model_args' is not accessible as guest user. You must login to access it!
  • schedulerZIP

    The resource: 'scheduler' is not accessible as guest user. You must login to access it!
  • special_tokens_mapJSON

    The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
  • tokenizer_configJSON

    The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
  • training_argsZIP

    The resource: 'training_args' is not accessible as guest user. You must login to access it!
  • tokenizerJSON

    The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
  • vocabJSON

    The resource: 'vocab' is not accessible as guest user. You must login to access it!
  • optimizerZIP

    The resource: 'optimizer' is not accessible as guest user. You must login to access it!
  • inferencepy

    The resource: 'inference' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Both
AccessibilityMode Download
Availability On-Line
Basic rights Download
CreationDate 2024-06-20 11:00
Creator Russo, Giuseppe Felice, [email protected], orcid.org/0009-0001-2090-9647
Field/Scope of use Non-commercial research only
Group Others
License term 2024-06-21 11:45/2027-06-21 11:45
Owner Russo, Giuseppe Felice, [email protected], orcid.org/0009-0001-2090-9647
ProgrammingLanguage Python
Semantic Coverage Cybersecurity
SoBigData Node SoBigData EU
SoBigData Node SoBigData IT
Sublicense rights No
Territory of use World Wide
Thematic Cluster Web Analytics [WA]
input sentences: string[]
output predictions: list(list(dict(string, string)))
system:type Method
Management Info
Field Value
Author Russo Giuseppe Felice
Maintainer Russo Giuseppe Felice
Version 1
Last Updated 23 November 2024, 16:02 (CET)
Created 23 November 2024, 16:02 (CET)