Sunday, December 13, 2009

Bioinformatics:Open Reading Frame (ORF)

ADVERTISEMENTS

Bioinformatics:Open Reading Frame (ORF)

The Open Reading Frame or (ORF) is a sequence of DNA located between the start-code sequence (initiation codon) and the stop-code sequence (termination codon).

The ORF finder softwares or algorithms are used to locate a gene in a given sequence by locating the initiation codon and the termination codon.

The initiation and termination codon can occur by chance so they could falsify our results, but in general the sequence found between them is not long enough, so to make sure its an ORF, we have to make sure that the sequence between the initiation and termination codon is long so it can represent a GENE.

The DNA sequence can be read in SIX different reading frames, 3 for each strand (because every codon have 3 bases).

In eucaryotic DNA we may find overlapping sequences withing a gene, these overlapping sequences are called INTRONS and they do not code for proteines.

Example: if we have a mRNA sequence:

1 st reading frame: AGUAAGAUGGCGAAUCUU
2 nd reading frame: - GUAAGAUGGCGAAUCUU
3 rd reading frame: - - UAAGAUGGCGAAUCUU

We can see that the first reading frame contains an initiation codon (AUG), the 2nd doesn't contain anything, the 3rd contains a stop codon (UAA).

So if we are about to choose a correct reading frame we would choose the first one.

There are many softwares dedicated for ORF detection, GeneMark is one of the best, it is a family of gene prediction programs developed at
Georgia Institute of Technology, Atlanta, Georgia, USA. You can access it from HERE.

Any questions you're welcome.

0 comments:

Post a Comment