Affymetrix Facility Documentation
 
Affy Facility Docs Home
Running Chips
Pricing
Sample Submission
SBEAMS
Download Data
Annotate Samples
GetExpression
Get Affy Intensity
Analize Data
Analysis Pipeline
MEV
Affy Help Pages
Affy Core Help
SBEAMS Help Pages
Analysis Help Pages
Tutorials and Presentations
Other Resources
Affymetrix Home Page

Analyze Data

After analysis through the pipeline the data can be loaded into the GetExpression SBEAMS table which allows a user to query the data receive a results set and then combine the data with other types of data for analysis in Cytoscape


Follow to find a previous analysis run.

1)Log into Sbeams
2)Select the Microarray Link in the main navigation bar
3)Select the Project of interest at the top of the page
4)Select the Data Pipeline Button on the main navigation bar
5)If the project has Affy data a link to "Affy Analysis Pipeline" will be displayed at the bottom of the page, Click the link
6)Click on the "Analysis Results" tab to see any previous runs
7)Click on the "Show files" link to view the run of interest
8)Click the link Under the heading Add Results to Get Expression --> Add Data Link

Image could not be found
If the names of the conditions about to be upload are unique there should be a simple looking form, similar to the one below. The condition names can be edited if you wish
Image could not be found
If the Condition Name already exists in the database a warning will indicate so. There is two options at this point, change the name and click the button "Check condition Name" to see if it is unique or delete the data in the database by using the same condition name. Be careful you don't stomp someone else's data if you see a warning.
Image could not be found
The data from the SAM analysis is parsed and Added to certain database fields. See below for the mapping. In addition an attempt is made to produce a Canonical Name that will be useful for merging an Affy Expression results sets with other types of data. See below for more details.
SAM Column NameSBEAM Column NameData Description
Probe_set_id reporter_name & gene_name Affy Probe set id
Gene_Symbol common_name Gene Symbol, usually from NCBI
Gene_Title full_name Full gene Name
Unigene external_identifier Unigene ID
LocusLink second_name LocusLink ID
Public_ID canonical_name Canonical Name ***See below how it was derived from the Affy data
FDR false_discovery_rate False Discovery Rate
Log_10_Ratio log10_ratio Log 10 expression ratio
mu_X mu_X Mean linear expression value for the non-reference sample
mu_Y mu_Y Mean linear expression value for the Reference sample
D_stat NOT USED SAM statistic. Based on the ratio of change in gene expression to standard deviation in the data for a gene
The Canonical name is chosen in a very simple manor. Initially during analysis Affymetrix provides a file with a bunch of annotation for each probe_set. One of the columns called 'Representative Public ID' holds a DNA accession number usually from GenBank. Initially this was used as the canonical name but it was difficult it to match it up with other data sets. So additional columns of annotation were pulled from the Affy annotation file which including 'RefSeq Protein ID' and 'Locus Link ID'. During the Upload process a better canonical name is chosen by seeing if a RefSeq Protein ID is present, if so it becomes the new canonical name. If not then the Unigene name is used and finally the original ID from Affymetrix is used the so called ' Representative Public ID'. In the event a column of annotation holds more then one accession number (which happens quite frequently) the first accession number is chosen

1)Canonical Names in GetExpression Could come from a combination of places

Potential Names Used for the Canonical Name
RefSeq Protein ID
LocusLink ID
Affy Representative Public ID
Go Back to the Pipeline Overview
Go Back