|
For each annotated gene, NimbleScan searches for peaks that appear in a specified promoter region around the transcription start site (TSS). The region searched is design-specific; for most mammalian designs, the search region spans from 5kb upstream to 1kb downstream of the TSS.
You can view the promoter reports using spreadsheet software, such as Microsoft Excel:
- Report_All_Peaks – Lists all peaks with an FDR ≤ 0.2 and maps them to promoter regions. Each row in the report lists a peak-transcript pair. For each transcript, if more than one peak lies within the promoter region, there will be multiple rows for that transcript.
- Report_Nearest_Peaks – Lists all peaks with an FDR ≤ 0.2 and maps them to promoter regions. Each row in the report lists a peak-transcript pair. For each transcript, if more than one peak lies within the promoter region, only the peak nearest to the TSS is reported.
To effectively analyze peak data, you should sort the data in promoter reports according to FDR, peak score, gene name, chromosome, distance to TSS, etc. To sort data in Microsoft Excel, highlight row 1 and choose Data -> Filter -> Auto Filter. You can then sort individual columns by ascending/descending values, top 10 values, or individual values.
The table below identifies the fields on the promoter reports (.xls):
| Field |
Description |
| PEAK_ID |
An ID for each peak. |
| CHROMOSOME |
Chromosome associated with the peak. |
| PEAK_START |
First base of the peak on the chromosome. |
| PEAK_END |
Last base of the peak on the chromosome. |
| PEAK_SCORE |
The log2-ratio of the fourth highest probe in the peak. |
| PEAK_FDR |
FDR value of the peak. |
| FEATURE_TRACK |
The annotation track against which peaks were mapped; it is the transcription start site for promoter reports. |
| FEATURE_STRAND |
Strand of the transcript. |
| FEATURE_START |
First base of the feature on the chromosome. |
| |
Note: For the transcription start site, feature size is 1; therefore, start and end positions are the same. |
| FEATURE_END |
Last base of the feature on the chromosome. |
| |
Note: For the transcription start site, feature size is 1; therefore, start and end positions are the same. |
| FEATURE_TO_PEAK_DISTANCE |
Center-to-center distance of peak to feature. |
| Name |
Gene symbol of the transcript. |
| accession |
GenBank accession number of the transcript. |
| description |
Full gene name of the transcript. |
| ncbi_gene_id |
NCBI Entrez GeneID of the transcript. |
| synonyms |
Other alias symbol(s) of the transcript. |
| Parent |
The internal identification number of the transcript from which this transcription start site is generated. |
|