MRC Clinical Sciences Centre
Bioinformatics Core Team
If you run high throughput sequencing at the Clinical Science Centre you will be delivered a path to your data in Fastq format.Â
The Bioinformatics Core will process optical data to base calls presented as Fastq.
A member of the Core team will have assessed that your data meets minimal sequencing standards.
Fastq maybe stored gzipped.
Many programs wont mind.
You will have to  unpack to view sequences manually.
Â
•A small number of molecules in each cluster run ahead (prephasing) or fall behind(phasing) of the current cycle.
•Corrected for during base calling
•Clusters filtered based on signal purity
•Only one cycle is allowed to have chastity value of < 0.6 for the first 25 cycles.
Lazer Cross Talk
•Two lasers are used to excite the dye attached to each nucleotide.
•The spectra from the four dyes overlap, such that the images are no independent.
•This cross-talk is resolved with a cross-talk matrix.
•Qscores = estimated probability of an incorrect base call
•Qscore = 10log10(e)
•Represented as ASCII characters to save space
For more information on Phred scores come here
Cross-Coverage Score =(Coverage0 – CoverageN)/Coverage0
Frag_CC = Cross-coverage score at fragment length.