Principal Component Analysis (PCA) is an effective tool for summarizing data along the components that account for the greatest variance in the dataset. In other words, PCA projects your data onto a few directions of maximal variance, so that samples with similar profiles fall close together, which makes it useful for analyzing gene expression data. Microarrays are at the center of a revolution in biotechnology, allowing researchers to simultaneously monitor the expression of tens of thousands of genes.
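As a rough illustration of how PCA separates samples by variance, here is a minimal sketch using NumPy's singular value decomposition. The function name `pca` and the toy matrix are made up for this example and do not come from any particular dataset or package; real analyses would normally start from log-transformed, normalized expression values.

```python
import numpy as np

def pca(expression, n_components=2):
    """Project samples onto the top principal components.

    expression: array of shape (n_samples, n_genes).
    Returns (scores, explained_variance_ratio).
    """
    X = np.asarray(expression, dtype=float)
    # Center each gene (column) so components describe variance, not mean level.
    Xc = X - X.mean(axis=0)
    # SVD of the centered matrix: the rows of Vt are the principal axes.
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    # Sample coordinates along each principal axis.
    scores = U[:, :n_components] * S[:n_components]
    # Fraction of total variance captured by each component.
    explained = (S ** 2) / np.sum(S ** 2)
    return scores, explained[:n_components]

# Hypothetical toy data: 4 samples x 3 genes, two samples per condition.
toy = np.array([
    [1.0, 5.0, 2.0],
    [1.2, 5.1, 2.1],
    [4.0, 1.0, 2.0],
    [4.1, 0.9, 1.9],
])
scores, ratio = pca(toy)
```

In this toy case the first component captures almost all of the variance, and the sign of each sample's first score separates the two conditions. The sign itself is arbitrary, so only the relative positions of the samples are meaningful.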

Gene expression data analysis methods will likely develop much as sequence analysis methods have over the past decades. The amount of gene expression data will continue to grow, and the data will become more systematic.

Differential gene expression with sequencing-based technologies (count data): 2 x 2 contingency table; statistical tests: chi-square test, Fisher's exact test, Poisson regression.

SAGExplore is a web server designed to exploit the major benefits of the SAGE technique, namely assisting the processes of gene discovery and annotation.

A microarray is a multiplex lab-on-a-chip. It is a two-dimensional array on a solid substrate.
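To make the 2 x 2 contingency-table approach for count data concrete, the following is a minimal sketch of a two-sided Fisher's exact test written with the standard library only. The function name `fisher_exact_2x2` and the tag counts are hypothetical; the table layout (tags for one gene versus all other tags, in two libraries) is an assumption about how such a comparison is typically set up.

```python
from math import comb

def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    # Feasible range of the top-left cell given the margins.
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    probs = [prob(x) for x in range(lo, hi + 1)]
    p_obs = prob(a)
    # Sum over all tables that are no more likely than the observed one.
    p = sum(q for q in probs if q <= p_obs * (1 + 1e-7))
    return min(1.0, p)

# Hypothetical counts: 30 tags for a gene out of 10,000 in library A,
# versus 10 tags out of 10,000 in library B.
p_gene = fisher_exact_2x2(30, 9970, 10, 9990)

# Small textbook-style table for a sanity check.
p_small = fisher_exact_2x2(3, 1, 1, 3)
```

The exact test is usually preferred when some cell counts are small; the chi-square test is an approximation that becomes reasonable as counts grow.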

Always log transform your gene expression data. Expression ratios are heavily skewed on a linear scale: half of the data points (the lower-expressed genes) fall between 0 and 1 (with 1 meaning no change), while the other half (the higher-expressed genes) range from 1 to positive infinity. After a log transform, the two halves are symmetric around 0.

Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product. For a specific cell at a specific time, only a subset of the genes encoded in the genome is expressed. Transcriptional control is critical in gene expression regulation.

Digital Gene Expression (DGE) is a cost-effective, sequence-based, quantitative approach for simple transcript quantification. By sequencing one read per molecule of RNA, this technique can be used to efficiently count transcripts while obviating the need for transcript-length normalization.

Gene Expression Omnibus (GEO) is a public functional genomics data repository supporting MIAME-compliant data submissions. Array- and sequence-based data are accepted. Tools are provided to help users query and download experiments and curated gene expression profiles.
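Returning to the log-transform advice above, a short sketch shows why the transform helps: ratios below and above 1 become symmetric around 0 on a log2 scale. The helper name `log2_fold_changes` and the example ratios are illustrative only.

```python
from math import log2

def log2_fold_changes(ratios):
    """Convert linear expression ratios to log2 fold changes."""
    result = []
    for r in ratios:
        if r <= 0:
            # The logarithm is undefined for zero or negative ratios.
            raise ValueError("expression ratios must be positive")
        result.append(log2(r))
    return result

# A 4-fold decrease (0.25) and a 4-fold increase (4.0) end up
# the same distance from 0, with opposite signs.
linear = [0.25, 0.5, 1.0, 2.0, 4.0]
logged = log2_fold_changes(linear)
print(logged)  # [-2.0, -1.0, 0.0, 1.0, 2.0]
```

On the linear scale the same five values are crowded between 0 and 1 on one side and spread out to 4 on the other; after the transform, no change sits at 0 and equal fold changes in either direction have equal magnitude.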

