To install this package, start R and enter:

## try http:// if https:// URLs are not supported

In most cases, you don't need to download the package archive at all.



Surrogate Variable Analysis

Bioconductor version: Release (3.2)

The sva package contains functions for removing batch effects and other unwanted variation in high-throughput experiment. Specifically, the sva package contains functions for the identifying and building surrogate variables for high-dimensional data sets. Surrogate variables are covariates constructed directly from high-dimensional data (like gene expression/RNA sequencing/methylation/brain imaging data) that can be used in subsequent analyses to adjust for unknown, unmodeled, or latent sources of noise. The sva package can be used to remove artifacts in three ways: (1) identifying and estimating surrogate variables for unknown sources of variation in high-throughput experiments (Leek and Storey 2007 PLoS Genetics,2008 PNAS), (2) directly removing known batch effects using ComBat (Johnson et al. 2007 Biostatistics) and (3) removing batch effects with known control probes (Leek 2014 biorXiv). Removing batch effects and using surrogate variables in differential expression analysis have been shown to reduce dependence, stabilize error rate estimates, and improve reproducibility, see (Leek and Storey 2007 PLoS Genetics, 2008 PNAS or Leek et al. 2011 Nat. Reviews Genetics).

Author: Jeffrey T. Leek <jtleek at>, W. Evan Johnson <wej at>, Hilary S. Parker <hiparker at>, Elana J. Fertig <ejfertig at>, Andrew E. Jaffe <ajaffe at>, John D. Storey <jstorey at>

Maintainer: Jeffrey T. Leek <jtleek at>, John D. Storey <jstorey at>, W. Evan Johnson <wej at>

Citation (from within R, enter citation("sva")):


To install this package, start R and enter:

## try http:// if https:// URLs are not supported


To view documentation for the version of this package installed in your system, start R and enter:



PDF sva tutorial
PDF   Reference Manual


biocViews BatchEffect, Microarray, MultipleComparison, Normalization, Preprocessing, RNASeq, Sequencing, Software, StatisticalMethod
Version 3.18.0
In Bioconductor since BioC 2.9 (R-2.14) (4 years)
License Artistic-2.0
Depends R (>= 2.8), mgcv, genefilter
Suggests limma, pamr, bladderbatch, BiocStyle, zebrafishRNASeq, testthat
Depends On Me SCAN.UPC
Imports Me ballgown, ChAMP, charm, DeSousa2013, edge, ENmix, MEAL, PAA, trigger
Suggests Me curatedBladderData, curatedCRCData, curatedOvarianData, RnBeads, SomaticSignatures
Build Report  

Package Archives

Follow Installation instructions to use this package in your R session.

Package Source sva_3.18.0.tar.gz
Windows Binary (32- & 64-bit)
Mac OS X 10.6 (Snow Leopard) sva_3.18.0.tgz
Mac OS X 10.9 (Mavericks) sva_3.18.0.tgz
Subversion source (username/password: readonly)
Git source
Package Short Url
Package Downloads Report Download Stats

Documentation »


R / CRAN packages and documentation

Support »

Please read the posting guide. Post questions about Bioconductor to one of the following locations:

Fred Hutchinson Cancer Research Center