GenePattern
GenePattern is a freely available computational biology open-source software package originally created and developed at the Broad Institute for the analysis of genomic data. Designed to enable researchers to develop, capture, and reproduce genomic analysis methodologies, GenePattern was first released in 2004. GenePattern is currently developed at the University of California, San Diego.
Developer(s) | Broad Institute, University of California, San Diego |
---|---|
Stable release | 3.9.11 rc4 b216
/ May 2019 |
Operating system | |
Type | genomic analysis |
License | BSD |
Website | www |
Functionality
GenePattern is a powerful scientific workflow system that provides access to hundreds of genomic analysis tools. Use these analysis tools as building blocks to design sophisticated analysis pipelines that capture the methods, parameters, and data used to produce analysis results. Pipelines can be used to create, edit and share reproducible in silico results.
Project Objectives
- Accessibility: Run over 200 regularly updated analysis and visualization tools (that support data preprocessing, gene expression analysis, proteomics, Single nucleotide polymorphism (SNP) analysis, flow cytometry, and next-generation sequencing) and create analytic workflows without any programming through a point and click user interface.
- Reproducibility: Automated history and provenance tracking with versioning so that any user can share, repeat and understand a complete computational analysis
- Extensibility: Computational users can import their methods and code for sharing using tools that support easy creation and integration
- Multiple interfaces: Web browser, application, and programmatic interfaces make analysis modules and pipelines available to a broad range of users; public hosted server
Features
- A regularly updated repository of hundreds of computational analysis modules that support data preprocessing, gene expression analysis, proteomics, single nucleotide polymorphism (SNP) analysis, flow cytometry, and short-read sequencing.
- A programmatic interface that makes analysis modules available to computational biologists and developers from Python, Java, MATLAB, and R.
- The GenePattern Notebook Environment: Built on the Jupyter Notebook environment, GenePattern Notebook allows researchers to run GenePattern analyses within notebooks that interleave text, graphics, and executable code, creating a single "research narrative."
- GParc: Repository and community for GenePattern users to share and discuss their own GenePattern modules
Availability
GenePattern is available:
- As a free public web application,[1] hosted on Amazon Web Services. Users can create accounts, perform analyses, and create pipelines on the server.
- As open-source software that can be downloaded and installed locally.[2]
- Public web servers hosted by other organizations.[3]
Notes
- "GenePattern". cloud.genepattern.org. Retrieved 2012-05-07.
- "GenePattern: Download GenePattern". Broadinstitute.org. Archived from the original on 2012-05-09. Retrieved 2012-05-07.
- "Use GenePattern". genepattern.org. 2006-10-07. Archived from the original on 2012-06-07. Retrieved 2012-05-07.
References
- The GenePattern Notebook Environment Reich M, Tabor T, Liefeld T, Thorvaldsdóttir H, Hill B, Tamayo P, Mesirov JP. Cell Syst. 2017 Aug 23;5(2):149-151.e1. doi: 10.1016/j.cels.2017.07.003. Epub 2017 Aug 16. PMID 28822753; PMC 5572818.
- Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace Qu K, Garamszegi S, Wu F, Thorvaldsdottir H, Liefeld T, Ocana M, Borges-Rivera D, Pochet N, Robinson JT, Demchak B, Hull T, Ben-Artzi G, Blankenberg D, Barber GP, Lee BT, Kuhn RM, Nekrutenko A, Segal E, Ideker T, Reich M, Regev A, Chang HY, Mesirov JP. Nat Methods. 2016 Mar;13(3):245-247. doi: 10.1038/nmeth.3732. Epub 2016 Jan 18. PMID 26780094; PMC 4767623.
- Using GenePattern for Gene Expression Analysis Kuehn, H., Liberzon, A., Reich, M. and Mesirov, J. P. Current Protocols in Bioinformatics. 2008. 22:7.12:7.12.1–7.12.39.
- GenePattern 2.0 Michael Reich, Ted Liefeld, Joshua Gould, Jim Lerner, Pablo Tamayo & Jill P. Mesirov. Nature Genetics - 38, 500 - 501 (2006)
External links
Related software: