TY - GEN
T1 - PEGR
T2 - 2020 Conference on Practice and Experience in Advanced Research Computing: Catch the Wave, PEARC 2020
AU - Shao, Danying
AU - Kellogg, Gretta
AU - Mahony, Shaun
AU - Lai, William
AU - Pugh, B. Franklin
N1 - Publisher Copyright: © 2020 ACM.
PY - 2020/7/26
Y1 - 2020/7/26
N2 - Genome sequencing has advanced rapidly, with developments including high-throughput next-generation sequencing (NGS) technologies, automation of biological experiments, new bioinformatics tools, and the use of high-performance and cloud computing. ChIP-based NGS technologies, e.g., ChIP-seq and ChIP-exo, are widely used to detect the binding sites of DNA-interacting proteins in the genome and provide a deeper mechanistic understanding of genomic regulation. As ChIP-based NGS pipelines generate sequencing data at an unprecedented pace, there is an urgent need for a metadata management system. To meet this need, we developed the Platform for Eukaryotic Genomic Regulation (PEGR), a web service platform that logs metadata for samples and sequencing experiments, manages data processing workflows, and provides reporting and visualization. PEGR links together people, samples, protocols, DNA sequencers, and bioinformatics computation. With PEGR, scientists can form a more integrated view of their sequencing data and better understand the mechanisms of genomic regulation. In this paper, we present the architecture and major functionalities of PEGR, share our experience in developing the application, and discuss future directions.
UR - http://www.scopus.com/inward/record.url?scp=85089274040&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089274040&partnerID=8YFLogxK
U2 - 10.1145/3311790.3396621
DO - 10.1145/3311790.3396621
M3 - Conference contribution
AN - SCOPUS:85089274040
T3 - ACM International Conference Proceeding Series
SP - 285
EP - 292
BT - PEARC 2020 - Practice and Experience in Advanced Research Computing 2020
PB - Association for Computing Machinery
Y2 - 27 July 2020 through 31 July 2020
ER -