PFP is designed to predict GO annotations for a query protein sequence beyond what can be found by searching conventional databases.The PFP algorithm has been shown to increase coverage of sequence-based function annotation more than fivefold by extending a PSI-BLAST search to extract and score GO terms individually and include information from distantly related sequences. It applies a novel data mining tool, the Function Association Matrix (FAM), to score significantly associating pairs of annotations.


Enter your protein sequence here:


Databases to search:

All organisms

Mus musculus [House mouse]
Rattus norvegicus [Brown Rat]
Arabidopsis thaliana [Thale cress]
Escherichia coli
Caenorhabditis elegans
Drosophila melanogaster [Fruit fly]
Danio rerio [zebra fish]
Homo sapiens [human]
Bos taurus [Cow]
Cannabis sativa
Hepatitis C virus
Tetraodon nigroviridis
Gallus gallus [chicken]
Xenopus laevis [African clawed frog]
Paramecium tetraurelia
Saccharomyces cerevisiae [baker's yeast]
Schizosaccharomyces pombe [fission yeast]
Aedes aegypti [yellow fever mosquito]

Evidence codes to use:

All evidence codes

IDA (inferred from direct assay)
IEA (inferred from electronic annotation)
IEP (inferred from expression pattern)
IPI (inferred from physical interaction)
ISS (inferred from sequence or structural similarity)
NAS (non-traceable author statement)

Databases to search by organisms categories:

Bacteria and Archaea (Coming Soon)
Eukaryotes (Coming Soon)
Plants (Coming Soon)
Virus (Coming Soon)
Mamals (Coming Soon)



Enter your email address here to receive results:




PFP is operated and maintained by the Kihara Lab at Purdue University, West Lafayette, IN, USA.

last updated December 21, 2007