HANSEN: Spoken Text Authorship Analysis

HANSEN encompasses meticulous curation of existing speech datasets accompanied by transcripts, alongside the creation of novel AI-generated spoken text datasets. In total, it comprises 17 human datasets and AI-generated spoken texts created using 3 prominent LLMs: ChatGPT, PaLM2, and Vicuna13B.

Data and Resources
  • Datasets
Personal Data Attributes

Description: Personal Data related Information

Field Value
Anonymised No
General Data Yes
Personal Data No
Sensitive Data No
Additional Info
Field Value
Accessibility Both
Accessibility Mode OnLine Access
Basic rights Download
Basic rights Copying
Basic rights Distribution
Basic rights Communication
Basic rights Making available to the public
Creation Date 2023-10-23 17:55
Creator Lee, Dongwon, orcid.org/0000-0001-8371-7629
Field/Scope of use Research only
Group Social Impact of AI and explainable ML
Language eng, English
Manifestation Type Virtual
SoBigData Node SoBigData EU
SoBigData Node SoBigData IT
Thematic Cluster Social Data [SD]
system:type Dataset
Management Info
Field Value
Author SETZU MATTIA
Maintainer SETZU MATTIA
Version 1
Last Updated 25 November 2023, 13:24 (CET)
Created 25 November 2023, 13:23 (CET)