approved
Human and mouse gene regulatory networks

The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available at Gene Expression Omnibus (GEO) (www.ncbi.nlm.nih.gov/geo/). Overall, 161 and 174 raw samples were considered for mouse and human organisms, respectively (see "heterogeneous" folder). All the samples were processed according to the workflow adopted for the DREAM5 challenge (Marbach et al., 2012, DOI: 10.1038/nmeth.2016), that led to a dataset of 5404 mouse genes and to a dataset of 15345 human genes. Each gene was also associated with six features (one for each organ), by averaging the expression levels measured within the same organ (see "homogeneous" folder). Finally, the dataset of the interactions among genes was built by considering all the possible pairs of genes (excluding the self-links), each associated with the concatenation of the feature vectors of the genes involved in the interaction. We extracted 235706 validated human gene interactions and 14613 validated mouse gene interactions from BioGRID (available at https://thebiogrid.org). As regards the unlabeled examples of interactions, we randomly selected, without replacement, a balanced number (i.e. equals to the number of labeled examples) of interactions involving at least one gene that appears in the set of labeled interactions.

Tags
Data and Resources
To access the resources you must log in
Personal Data Attributes

Description: Personal Data related Information

Field Value
ChildrenData No
Personal Data No
Personal data was manifestly made public by the data subject No
Additional Info
Field Value
Accessibility Both
Accessibility Mode Download
Associate Project FAIR
Availability On-Line
Basic rights Download
Basic rights Copying
Basic rights Distribution
Basic rights Communication
Creation Date 2017-01-01
Creator Mignone, Paolo, [email protected], orcid.org/0000-0002-8641-7880
Dataset Citation Cite the following works where the dataset was created and effectively used: - DOI: 10.1093/bioinformatics/btz781 - DOI: 10.1016/j.bdr.2024.100456
Dataset Re-Use Safeguards none
Field/Scope of use Any use
Group Health Studies
Group Others
License term 2024-07-04 /3024-07-04
Manifestation Type Virtual
Processing Degree Primary
Retention Period 3024-07-04 /4024-07-04
SoBigData Node SoBigData EU
SoBigData Node SoBigData IT
Sublicense rights No
Territory of use World Wide
Thematic Cluster Other
system:type Dataset
Management Info
Field Value
Author Mignone Paolo
Maintainer Mignone Paolo
Version 1
Last Updated 23 November 2024, 16:01 (CET)
Created 23 November 2024, 16:01 (CET)