PAG stands for Pathway, Annotated-list and Gene-signature. A PAG geneset represents a set of genes that share a common feature, whether they belong within the same pathway, share common biological processes, are co-differentially expressed under a particular condition of an experiment or clinical sample, or share a common downstream target or function. Each P-type PAG refers to a connected set of molecules (genes/proteins/metabolites), among which some detail of curated mechanism of actions, e.g. protein interactions, reactions, or gene regulations, are available (Biological Pathway, http://www.genome.gov/27530687). Each A-type PAG refers to a curated list of genes/proteins identified from a specific biological context, e.g. a shared GO category or a shared protein family. Each G-type PAG refers to a list of genes/proteins derived from any given high throughput Omics experiment, e.g. functional genomics, under a shared biological condition.
Pathway and Annotated-list Gene-signature Electronic Repository for Coronavirus-related disease (PAGER-CoV) is an online systems biology tool for constructing and visualizing gene and PAG networks from multiple PAG genesets.
PAGER-CoV contains domain-expert curated PAGs curated from published high-impact SARS-CoV-2 studies as well as curated PAGs relating to infectious disease inherited from PAGER 2.0. PAGER-CoV was designed specifically for COVID-19 drug repositioning and translational research studies, as well as to help COVID-19 researchers quickly identify relevant information based on a queried geneset. PAGs from PAGER-CoV can be used with GSEA software for parallel analysis.
PAGER-CoV currently contains a total of 3263 PAGs from 27 data sources, 281 of these PAGs being manually curated by a domain expert, and is being updated weekly. We welcome you to add your COVID-19-related PAGs to PAGER-CoV ( link ).
For full breakdown on PAG sources, refer to the table below:
|Category||Source or Journal||Description||Count|
|PubChem||PubChem||A compiled list of PubChem data related to COVID-19 (Coronavirus Disease 2019) and SARS-CoV-2 (Severe acute respiratory syndrome coronavirus 2)||1549||1549|
|PAGER||PAGER-MSigDB||Immune signatures from GSEA (imported from PAGER 2.0).||6139||10015|
|PAGER-GOA_AGG||Gene Ontology Annotation database (is_a relation aggregation).||1888|
|PAGER-GAD||Gene signatures from Genetic Association Database.||70|
|PAGER-GOA||Gene Ontology Annotation database.||1030|
|PAGER-Pfam||A Large collection of protein families represented by multiple sequence alignments and hidden Markov models (HMMs).||82|
|PAGER-GeneSigDB||A curated database of gene expression signatures.||390|
|PAGER-Protein Lounge||Interactive web-based biological database.||30|
|PAGER-Spike||Signaling Pathways Integrated Knowledge Engine, regulation from highly curated human pathway information.||3|
|PAGER-PheWAS||Phenome-wide association studies database.||57|
|PAGER-PharmGKB||A database containing information relating to human genetic variation on drug response.||4|
|PAGER-Reactome||An interactive pathway database.||25|
|PAGER-BioCarta||Online maps of metabolic and signaling pathways.||105|
|PAGER-GWAS Catalog||GWAS Catalog A Catalog of Published Genome-Wide Association Studies provided by NHGRI||79|
|PAGER-KEGG||A database containing high-level functions and utilities of the biological system.||38|
|PAGER-NGS Catalog||A Database of Next Generation Sequencing Studies in Humans.||1|
|PAGER-DSigDB||A drug signatures database for gene set analysis||49|
|PAGER-GTEx||The Genotype-Tissue Expression (GTEx) project||2|
|PAGER-NCI-Nature Curated||Gathered from published research literature and reviewed by expert scientists before publication.||13|
|PAGER-WikiPathway||A database of biological pathways maintained by and for the scientific community.||10|
|Curation||Am J Respir Crit Care Med||Clinical journals related to pulmonary, critical care, and sleep-related fields.||2||271|
|Microbiology and Molecular Biology Reviews||Research journals related to immunology, molecular, and cell biology.||1|
|Zenodo||general-purpose open-access repository developed under the European OpenAIRE program and operated by CERN.||1|
|Mouse Genome Informatics Database||international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data for researching human health.||5|
|Nature Cell Discovery||Cell molecular biology journal.||4|
|GenBank/UniProt||Databases used to manually curate gene to protein mapping and protein annotations.||33|
|Cell||A research journal relating to cell biology||5|
|The Annual Review of Cell and Developmental Biology||A research journal relating to developments in the field of cell and developmental biology.||1|
|Nature||A leading multidisciplinary science journal.||111|
|Drugbank||Unique bioinformatics and cheminformatics resource that combines detailed drug data with comprehensive drug target information.||96|
|Nature Medicine||A monthly journal publishing original peer-reviewed research in all areas of medicine.||1|
|Cell Host and Microbe||A research journal that focuses broadly on the study of microbes, with an emphasis on the interface between the microbe and its host.||1|
|bioRxiv||COVID-19 SARS-CoV-2 preprints from medRxiv and bioRxiv.||10|
PAGER-CoV has several unique features available:
For examples of how to use PAGER, please see Use Cases.
Users can perform a basic search query for PAGs in PAGER-CoV by entering a keyword or gene symbol into the search bar on the home page:
Users can enter a list of genes obtained from their experiment or other data source to order to search for related PAGs in PAGER-CoV:
The RP-score (standing for rank prioritization-score) is used to rank genes within the PAG, allowing for output of a PAGER-ranked gene list.
We use RP-score to calculate and assign the gene weight in every PAG:
where p and q are the indexes of proteins from the selected module, k is a constant (k=2 in this study). The term conf(p, q) is the interaction confidence score assigned by HAPPI-2, where conf(p, q) is between 0 and 1. N(p, q) holds the value of 1 if protein p interacts with q or the value of 0 if protein p does not interact with q. We then rank the genes by the RP-score to pull out potential important genes in the top rank list.
The nCoCo score takes into account PAG size bias when determining quality score of PAG results. Users can view PAGs similar to their queried gene list by nCoCo score by clicking on header arrows:
The nCoCo score represents a PAG quality metric and is improved from the initial CoCo score (a Cohesion Coefficient derived from the measure of statistically significant coverage of gene-gene functional correlations in gene pairs or gene trios) in our PAGER publication .
Yes! Users may refer to this link for detailed instructions.
Yes! Users may refer to this link for detailed instructions.
The PAGER-CoV API allows developers to analyze gene lists, get enrichment results, and download files of enrichment results. We provide a demo using two R scripts in performing the PAGER-CoV analysis using API.
Zongliang Yue, Qi Zheng, Michael T. Neylon, Minjae Yoo, Jimin Shin, Zhiying Zhao, Aik Choon Tan, and Jake Y. Chen* (2017) PAGER 2.0: an update to the pathway, annotated-list and gene-signature electronic repository for Human Network Biology. Nucleic Acids Research, gkx1040, https://doi.org/10.1093/nar/gkx1040
Zongliang Yue, Madhura Kshirsagar, Thanh Nguyen, Chayaporn Suphavilai, Michael T. Neylon, Liugen Zhu, Timothy Ratliff, and Jake Y. Chen* (2015) PAGER: Constructing PAGs and new PAG-PAG Relationships for Network Biology. ISMB/ECCB Special Issue, Bioinformatics, Vol.31, No.12, pp.i250-i257. doi: 10.1093/bioinformatics/btv265
Hui Huang, Xiaogang Wu, Madhankumar Sonachalam, Sammed N Mandape, Ragini Pandey, Karl F MacDorman, Ping Wan*, and Jake Y. Chen* (2012) PAGED: a pathway and gene-set enrichment database to enable molecular phenotype discoveries. BMC Bioinformatics, Vol.13, Suppl.15, s2
Sudhir R. Chowbina, Xiaogang Wu, Fan Zhang, Peter M. Li, Ragini Pandey, Harini N. Kasamsetty, and Jake Y. Chen* (2009) HPD: An Online Integrated Human Pathway Database Enabling Systems Biology Studies. BMC Bioinformatics, Vol.10, Suppl.11, S5.
Zongliang Yue, Michael T. Neylon, Thanh Nguyen, Timothy Ratliff, and Jake Y. Chen* (2017) "Super gene set" causal relationship discovery from functional genomics data, IEEE Transactions on Computational Biology and Bioinformatics, DOI: 10.1109/TCBB.2018.2858755
Jake Y. Chen*, Zongliang Yue, Michael T. Neylon, Thanh Nguyen, Nafisa Bulsara, Itika Arora, and Timothy Ratliff (2016) Towards constructing "Super Gene Sets" regulatory networks. Proceeding of the 2016 international Conference on Bioinformatics, & Biomedicine, Shenzhen, China, doi:10.1109/BIBM.2016.7822534
Xiaogang Wu, Hui Huang, Tao Wei, Ragini Pandey, Christoph Reinhard, Shuyu D. Li and Jake Y. Chen* (2012) Network Expansion and Pathway Enrichment Analysis towards Biologically Significant Findings from Microarrays. Journal of Integrative Bioinformatics, Vol.9, No 2, 213.
Lijie Sun, Hong Mei Li, Manfredo J. Seufferheld, Kent R. Walters, Venu M. Margam, Amber Jannasch, Naomi Diaz, Catherine P. Riley, Weilin Sun, Yueh Feng Li, William M. Muir, Jun Xie, Jing Wu, Fan Zhang, Jake Y. Chen, Eric L. Barker, Jiri Adamec, and Barry R. Pittendrigh (2011) Systems-scale analysis reveals pathways involved in cellular response to methamphetamine. PLOS ONE, 6(4):e18215.
Sudhir Chowbina, Youping Deng*, Junmei Ai, Xiaogang Wu, Xin Guan, Mitchell S Wilbanks, Barbara Lynn Escalon, Sharon A Meyer, Edward J Perkins, and Jake Y Chen* (2012) A new approach to construct pathway connected networks and its application in dose responsive gene expression profiles of rat liver regulated by 2,4DNT. BMC Genomics. 2010; 11(Suppl 3): S4.