Introduction (lecture)
Paper discussion
From raw data to lists of differentially expressed genes (step-by-step analysis of a microarray data set using the R language)
Lab discussion
Biological interpretation of microarray data (Gene ontology analysis using the R language + online research of candidate genes)
A microarray is a solid support (such as a membrane or glass microscope slide) on which DNA of known sequence is deposited in a grid-like array.
The amount of RNA hybridised at each grid location can be measured and serves as a proxy for the expression level of the corresponding gene.
Different samples can be represented on one microarray by labelling them with different fluorescent molecules (fluorophores).
Expression ratio: $log_2({\color{red}{Cy5} \over \color{green}{Cy3}})$
Reminder: for $x > 0$, $log_2(x)$ is the unique real number $y$ such that $2^y = x$.
For example: $log_2(8) = 3$ because $2^3 = 8$.
$\color{red}{Cy5}/\color{green}{Cy3}$ | $log_2({\color{red}{Cy5}/\color{green}{Cy3}})$ |
---|---|
4 | 2 |
2 | 1 |
1 | 0 |
0.5 | -1 |
0.25 | -2 |
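The table can be reproduced with a few lines of code. The following is a minimal Python sketch (the course itself uses R); the function name `log2_ratio` and the intensity values are hypothetical and chosen only so that the ratios match the table. It illustrates why the log scale is convenient: a two-fold increase and a two-fold decrease become +1 and -1, symmetric around 0.

```python
import math


def log2_ratio(cy5: float, cy3: float) -> float:
    """Return log2(Cy5 / Cy3) for two positive fluorescence intensities."""
    if cy5 <= 0 or cy3 <= 0:
        raise ValueError("intensities must be positive")
    return math.log2(cy5 / cy3)


# Hypothetical (Cy5, Cy3) intensity pairs giving the ratios 4, 2, 1, 0.5, 0.25.
pairs = [(400.0, 100.0), (200.0, 100.0), (100.0, 100.0), (50.0, 100.0), (25.0, 100.0)]
ratios = [log2_ratio(cy5, cy3) for cy5, cy3 in pairs]

for (cy5, cy3), r in zip(pairs, ratios):
    print(f"Cy5/Cy3 = {cy5 / cy3:<5} log2 ratio = {r:+.0f}")
```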
Null hypothesis ($H_0$): gene $x$ is not differentially expressed between two treatments
Mean:
$\overline{x}={{1 \over M} \sum_{i=1}^{M}{x_i}}$, where $x_i$ is the expression of gene $x$ in replicate $i$ of $M$ replicates
Variance:
$S_{x}^2 = {{1 \over {M-1}} \sum_{i=1}^{M}({x_i^2 - \overline{x}^2})}$
T-statistic:
$T_x={{ \overline{x_{C_1}} - \overline{x_{C_2}} } \over \sqrt{ {S_{x_{C_1}}^2 \over M} + {S_{x_{C_2}}^2 \over N} } }$
where $M$ and $N$ are the numbers of replicates in conditions $C_1$ and $C_2$.
The t-test compares only two groups of samples. To compare more than two groups, ANOVA (ANalysis Of VAriance) is used.
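The mean, variance, and t-statistic defined above can be sketched in plain Python as follows. This is an illustration, not the analysis used in the course (which is done in R): the function names and the expression values are hypothetical. The variance is computed from squared deviations, $\sum (x_i - \overline{x})^2$, which is algebraically equal to the $\sum (x_i^2 - \overline{x}^2)$ form given above.

```python
import math


def mean(values):
    """Arithmetic mean of the replicate measurements."""
    return sum(values) / len(values)


def sample_variance(values):
    """Unbiased sample variance (divides by M - 1)."""
    m = mean(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)


def t_statistic(c1, c2):
    """T-statistic for two conditions with M and N replicates."""
    standard_error = math.sqrt(
        sample_variance(c1) / len(c1) + sample_variance(c2) / len(c2)
    )
    return (mean(c1) - mean(c2)) / standard_error


# Hypothetical log2 expression values of one gene in two conditions.
treated = [2.1, 1.9, 2.4, 2.0]   # condition C1, M = 4 replicates
control = [0.2, -0.1, 0.3, 0.0]  # condition C2, N = 4 replicates

t = t_statistic(treated, control)
print(f"mean C1 = {mean(treated):.2f}, mean C2 = {mean(control):.2f}, T = {t:.2f}")
```

A large absolute value of $T$ means the difference between the two means is large relative to the variability among replicates, which is evidence against $H_0$.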
Null hypothesis ($H_0$): gene A is not differentially expressed between two treatments
With thousands of genes tested at once, the Bonferroni correction is generally considered too conservative; controlling the False Discovery Rate (FDR) is usually preferred.
P-value threshold | Observed number of genes | Expected number of false positives | FDR (%) |
---|---|---|---|
$10^{-2}$ | 184 | 64 | 35 |
$10^{-3}$ | 35 | 6 | 18 |
$10^{-4}$ | 15 | 0.6 | 4 |
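The arithmetic behind the table can be sketched as follows. Under $H_0$, the expected number of false positives at a given threshold is the number of genes tested multiplied by the p-value threshold, and the FDR is that number divided by the number of genes observed below the threshold. The total of 6,400 tests is an assumption inferred from the first row (64 expected false positives at $10^{-2}$); it is not stated in the source. The function names are hypothetical, and the code is Python rather than the R used in the course.

```python
def expected_false_positives(n_tests: int, p_threshold: float) -> float:
    """Expected number of genes passing the threshold by chance alone."""
    return n_tests * p_threshold


def estimated_fdr(n_tests: int, p_threshold: float, n_observed: int) -> float:
    """Estimated fraction of the observed genes that are false positives."""
    return expected_false_positives(n_tests, p_threshold) / n_observed


# Assumption: about 6,400 genes tested, inferred from 64 expected at p = 0.01.
N_TESTS = 6400

# P-value threshold -> observed number of genes (values from the table).
observed = {1e-2: 184, 1e-3: 35, 1e-4: 15}

fdr_percent = {
    p: 100 * estimated_fdr(N_TESTS, p, n) for p, n in observed.items()
}

for p, n in observed.items():
    expected = expected_false_positives(N_TESTS, p)
    print(f"p <= {p:g}: observed {n}, expected {expected:.2f}, FDR ~ {fdr_percent[p]:.0f}%")
```

Under this assumption the computed percentages round to the values in the FDR column, which suggests that the column is expressed in percent and that the expected counts in the table were rounded.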
MIAME (Minimum Information About a Microarray Experiment): the information needed to interpret the results of an experiment unambiguously and, potentially, to reproduce it.
What type of research?
Why N. attenuata?
The chip: cDNA array with 15,264 potato genes from TIGR (The Institute for Genomic Research)
The veggies | The vegetarian |
---|---|
*Solanum nigrum* (black nightshade) | *Manduca sexta* |
*Nicotiana attenuata* (coyote tobacco) | |
- Question:
  - Do tobacco and black nightshade plants respond differently to caterpillar attack?