Skip to main content

Software Introduction

This section contains installation guides to setup primary software packages and development infrastructure for the Pugh/Lai labs.

Every graduate student (WET AND DRY) in the lab will need to perform bioinformatic analysis at some point. Referencing these guides sooner rather than later can SAVE YOU AND THE DRY-BENCH MEMBERS OF THE LAB A LOT OF TIME.

Infrastructure

The Pugh Lab, Lai Lab, and Epigenomics Core (EGC) are supported by an integrated infrastructure of software and tools that facilitates our research.


PughInfrastructure

  • PEGR - Manage metadata associated with all sequencing runs for looking up samples and downloading processed datafiles (e.g. BAM formatted alignments).
  • Galaxy - Create data analysis & visualization pipelines for large-scale data processing. The workflow results from these intitial analyses are displayed on PEGR.
  • ScriptManger - For day-to-day bioinformatics. Generalized toolbox of scripts we use to perform both our standard ChIP-exo analyses and more customized analyses for most of our papers
  • GenoPipe - Quality check tool we use to confirm the genotypes of our samples which ultimately contributes to the reproducibility of our research.

Our Resources

ProjectSoftware stackComputing resources used
GalaxyExternally developed by The Galaxy ProjectGalaxy is run on Cornell CAC servers and also leverages ACCESS (fromerly XSEDE) resources for compute requirements.
Platform for Epigenetic and Genomic Research (PEGR)MySql, Grails, GroovyPEGR is also run on Cornell CAC servers.
Yeast Epigenome Project (YEP)MERN software stack*
Protein Capture Reagent Program Validation (PCRP)MERN software stack*

*MERN Stack is made up of MongoDB, Express, React, NodeJS