Using Bioconductor To Analyse Microarray Data

From Bridges Lab Protocols
Revision as of 00:22, 27 July 2009 by Davebrid (talk | contribs)
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.


Software Requirements

  • R, get from [CRAN]
  • Bioconductor, get from [Bioconductor]
  • Bioconductor packages. Install as needed:
    • Biobase
    • GEOquery - [1]
source("http://www.bioconductor.org/biocLite.R")
biocLite("PACKAGE")

Obtaining GEO Datasets

  • Open a R terminal
  • Load Biobase and GEOquery packages
libary(Biobase)
library(GEOquery)
  • Can load:
    • datasets - GDS
    • measurements - GSM
    • platforms - GPL
    • series - GSE
gds <- getGEO("GDS162")  #load GDS162 dataset
Meta(gds)  #show extracted meta data
table(gds)[1:10,]  #show first ten rows of dataset
eset <- GDS2eSet(gds)  #convert to expression set, by default obtains annotation (GPL) data and no log transformation.
pData(eset)  #phenotype data
sampleNames(eset)  #sample names (GSM)

Microarray Analysis

  • set up design matrix. Use a different integer for each treatment group.
pData(eset)  #to see phenotype annotation data
design <- model.matrix(~0+factor(c(1,1,1,1,2,2,2,2)),eset)  #for four replicates of each treatment group
colnames(design) <- c("resistant","sensitive")  # give names to the treatment groups