Protein Matching

From Purdue Genomics Database Facility

Jump to: navigation, search

To access the parts add the filename after the following url: http://sapporo.genomics.purdue.edu/~thechief/

For example, the url for blah.c is: http://sapporo.genomics.purdue.edu/~thechief/blah.c

blah.c - C program that lets things be fed to the calc class compare.pl - Perl script that reads input and communicates with process that actually does the computing of sequences calc.h - Header for the class that does the computing calc.cpp - Implementation of the class that does the computing

gribskov code

here is some revised code for doing the windowed matching. Not very well commented I'm afraid, and parameters have to be changed in the header file. The most time consuming part of this code is writing the output. compare.c therefore writes an unformatted file that must be reformatted with format.c. The score calculation has NOT been carefully validated. urrently an NxN comparison with N=43,000 takes about 20 min. Time depends largely on the distance cutoff.

research Groups