REP

Uncategorized — Titus Barik on August 30, 2004 at 4:09 pm

Wrote a batch processing front-end to REP that uses the annotated P. falciparum strain 3D7 as its test data source from PlasmoDB and processes the results. The program is written using Python and the Biopython library. The Python Regular Expression tutorial and the Python course in Bioinformatics provided essential information along the way. To ensure correctness, the results were compared against a PlasmoDB gene query for the Pfam domain.

In Biopython, processing FASTA file formats are almost trivial:

it = Bio.Fasta.Iterator(handle,
     Bio.Fasta.SequenceParser())

seq = it.next()
while seq:
    print seq.description
    print seq.seq
    seq = it.next()

The comp.lang.python newsgroup is also very helpful. And to close, here are some useful resources available online for Python and Biopython:

0 Comments »

No comments yet.

RSS feed for comments on this post. TrackBack URI

Leave a comment

Powered by WP Hashcash

titus@barik.net | The Weblog of Titus Barik