For each annotated gene, NimbleScan searches for peaks that fall within a specified promoter region around the transcription start site (TSS). The search region is design-specific; for most mammalian designs, it spans from 5 kb upstream to 1 kb downstream of the TSS.
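As an illustration of the default mammalian search region, the following minimal Python sketch computes the window around a TSS. It is not NimbleScan's own code: the function names are hypothetical, and both the strand handling (upstream lies at higher coordinates on the minus strand) and the "any overlap" test are assumptions, since this guide does not state how NimbleScan decides that a peak appears in the region.

```python
def promoter_window(tss, strand, upstream=5000, downstream=1000):
    """Return the (start, end) search region around a TSS.

    Defaults follow the 5 kb upstream / 1 kb downstream region described
    for most mammalian designs. Strand handling is an assumption.
    """
    if strand == "+":
        return tss - upstream, tss + downstream
    if strand == "-":
        # On the minus strand, upstream lies at higher coordinates.
        return tss - downstream, tss + upstream
    raise ValueError("strand must be '+' or '-'")


def peak_in_window(peak_start, peak_end, window):
    """Return True if the peak overlaps the window (assumed criterion)."""
    start, end = window
    return peak_start <= end and peak_end >= start


plus_window = promoter_window(10000, "+")   # (5000, 11000)
minus_window = promoter_window(10000, "-")  # (9000, 15000)
```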
You can view the following summary reports using spreadsheet software, such as Microsoft Excel:
- Report All Peaks – Lists all peaks and maps them to promoter regions. Each row in the report lists a peak-transcript pair. For each transcript, if more than one peak lies within the promoter region, there will be multiple rows for that transcript.
- Report Nearest Peak – Lists all peaks and maps them to promoter regions. Each row in the report lists a peak-transcript pair. For each transcript, if more than one peak lies within the promoter region, only the peak nearest to the TSS is reported.
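The relationship between the two reports can be sketched in Python: starting from Report All Peaks rows, keep one row per transcript. This is a hedged sketch, not NimbleScan's implementation. It assumes that transcripts can be keyed by the Accession field, that FEATURE_TO_PEAK_DISTANCE may be signed (so absolute values are compared), and that ties keep the first row seen. The function name and the sample rows are hypothetical.

```python
def nearest_peak_rows(all_peak_rows):
    """Keep, for each transcript, the row whose peak is closest to the TSS."""
    nearest = {}
    for row in all_peak_rows:
        key = row["Accession"]  # assumed to identify the transcript
        best = nearest.get(key)
        distance = abs(row["FEATURE_TO_PEAK_DISTANCE"])
        if best is None or distance < abs(best["FEATURE_TO_PEAK_DISTANCE"]):
            nearest[key] = row
    return list(nearest.values())


# Hypothetical rows: transcript TX_A has two peaks in its promoter region.
rows = [
    {"PEAK_ID": 1, "Accession": "TX_A", "FEATURE_TO_PEAK_DISTANCE": -3200},
    {"PEAK_ID": 2, "Accession": "TX_A", "FEATURE_TO_PEAK_DISTANCE": 450},
    {"PEAK_ID": 3, "Accession": "TX_B", "FEATURE_TO_PEAK_DISTANCE": -120},
]
nearest = nearest_peak_rows(rows)
print([row["PEAK_ID"] for row in nearest])  # [2, 3]
```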
To analyze peak data effectively, sort the summary reports by peak score, gene name, chromosome, distance to TSS, or other fields. To sort data in Microsoft Excel, highlight row 1 and select Data -> Filter -> Auto Filter. You can then sort individual columns in ascending or descending order, or filter them by top 10 values or by individual values.
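If you prefer to sort outside a spreadsheet, a short Python script can do the same job. The sketch below assumes the report has been saved as tab-delimited text with the field names in the first row; that file layout is an assumption, not something this guide confirms. The function name and sample values are hypothetical.

```python
import csv
import io


def sort_report(handle, column="PEAK_SCORE", descending=True):
    """Read a tab-delimited report and sort its rows by a numeric column."""
    rows = list(csv.DictReader(handle, delimiter="\t"))
    return sorted(rows, key=lambda r: float(r[column]), reverse=descending)


# Hypothetical report content; a real report has more columns.
sample = io.StringIO(
    "PEAK_ID\tCHROMOSOME\tPEAK_SCORE\tFEATURE_TO_PEAK_DISTANCE\n"
    "1\tchr1\t2.5\t-3200\n"
    "2\tchr1\t6.1\t450\n"
    "3\tchr2\t4.0\t-120\n"
)
top = sort_report(sample)
print([row["PEAK_ID"] for row in top])  # ['2', '3', '1']
```

This version handles numeric columns only; sorting by a text field such as the gene symbol would need a string key instead of `float`.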
The table below identifies the fields on the summary reports (.xls):
| Field | Description |
| --- | --- |
| PEAK_ID | An ID for each peak. |
| CHROMOSOME | Chromosome associated with the peak. |
| PEAK_START | First base of the peak on the chromosome. |
| PEAK_END | Last base of the peak on the chromosome. |
| PEAK_SCORE | The peak score, which is the average of the -log10 p-values of the probes within that peak. |
| FEATURE_TRACK | The annotation track against which peaks were mapped; for summary reports, it is the transcription start site. |
| FEATURE_STRAND | Strand of the transcript. |
| FEATURE_START | First base of the feature on the chromosome. Note: For the transcription start site, feature size is 1; therefore, start and end positions are the same. |
| FEATURE_END | Last base of the feature on the chromosome. Note: For the transcription start site, feature size is 1; therefore, start and end positions are the same. |
| FEATURE_TO_PEAK_DISTANCE | Center-to-center distance of peak to feature. |
| Name | Gene symbol of the transcript. |
| Accession | GenBank accession number of the transcript. |
| description | Full gene name of the transcript. |
| ncbi_gene_id | NCBI Entrez GeneID of the transcript. |
| synonyms | Other alias symbol(s) of the transcript. |
| Parent | The internal identification number of the transcript from which this transcription start site is generated. |
| PEAK_ATTR | Attribute field from the peak GFF file. |
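
The PEAK_SCORE and FEATURE_TO_PEAK_DISTANCE definitions can be worked through with a small Python sketch. It follows the descriptions in the table (average of -log10 p-values, center-to-center distance), but the function names are hypothetical and the sign convention (peak center minus feature center, with no strand adjustment) is an assumption; the guide does not say how NimbleScan signs the distance.

```python
import math


def peak_score(p_values):
    """Average of -log10(p) over the probes within a peak."""
    return sum(-math.log10(p) for p in p_values) / len(p_values)


def center_to_center_distance(peak_start, peak_end, feature_start, feature_end):
    """Distance from the feature center to the peak center.

    Assumed sign convention: peak center minus feature center.
    """
    peak_center = (peak_start + peak_end) / 2
    feature_center = (feature_start + feature_end) / 2
    return peak_center - feature_center


# Two probes with p-values 0.01 and 0.0001 give -log10 values of 2 and 4.
score = peak_score([0.01, 0.0001])

# For a TSS, feature start and end are the same position.
distance = center_to_center_distance(1000, 1400, 1500, 1500)
```

Here the score is about 3.0, and the distance is -300 because the peak center (1200) lies 300 bases below the TSS coordinate (1500).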