Using Bioconductor To Analyse Microarray Data

From Bridges Lab Protocols
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.


Software Requirements

  • R, get from [CRAN]
  • Bioconductor, get from [Bioconductor]
  • Bioconductor packages. Install as needed:
    • Biobase
    • GEOquery - [1]
    • Limma
source("http://www.bioconductor.org/biocLite.R")
biocLite("PACKAGE")

Obtaining GEO Datasets

  • Open a R terminal
  • Load Biobase and GEOquery packages
libary(Biobase)
library(GEOquery)
  • Can load:
    • datasets - GDS
    • measurements - GSM
    • platforms - GPL
    • series - GSE
gds <- getGEO("GDS162")  #load GDS162 dataset
Meta(gds)  #show extracted meta data
table(gds)[1:10,]  #show first ten rows of dataset
eset <- GDS2eSet(gds, do.log=TRUE)  #convert to expression set, by default obtains annotation (GPL) data with log2 transformation
pData(eset)  #phenotype data
sampleNames(eset)  #sample names (GSM)

Microarray Analysis

  • set up design matrix. Use a different integer for each treatment group. The following example is for a contrast between the first seven groups and the last eight groups. For details on other design matrices see chapter 8 of [limma User Guide]
library(limma)  #load limma package
pData(eset)  #to see phenotype annotation data
design <- model.matrix(~(c(1,1,1,1,1,1,1,0,0,0,0,0,0,0,0)),eset)  #for four replicates of each treatment group,
colnames(design) <- c("resistant","sensitive")  # give names to the treatment groups
design  #check the design matrix
fit <- lmFit(eset,design)
fit.eb <- eBayes(fit)