Databases of protein domains and functional sites have become vital resources for the prediction of protein functions. During the last decade, several signature-recognition methods have evolved to address different sequence analysis problems, resulting in rather different and, for the most part, independent databases. Diagnostically, these resources have different areas of optimum application owing to the different strengths and weaknesses of their underlying analysis methods. Thus, for best results, search strategies should ideally combine all of them. InterPro ([1]) is a collaborative project aimed at providing an integrated layer on top of the most commonly used signature databases by creating a unique, non-redundant characterisation of a given protein family, domain or functional site. The InterPro database integrates PROSITE, PRINTS, Pfam, ProDom, SMART, TIGRFAMs, PIR superfamily, SUPERFAMILY Gene3D and PANTHER databases. The InterPro project home page is available at http://www.ebi.ac.uk/interpro.
References
The InterPro Consortium (*R.Apweiler, T.K.Attwood, A.Bairoch, A.Bateman, E.Birney, M.Biswas, P.Bucher, L.Cerutti, F.Corpet, M.D.R.Croning, R.Durbin, L.Falquet, W.Fleischmann, J.Gouzy, H.Hermjakob, N.Hulo, I.Jonassen, D.Kahn, A.Kanapin, Y.Karavidopoulou, R.Lopez, B.Marx, N.J.Mulder, T.M.Oinn, M.Pagni, F.Servant, C.J.A.Sigrist, E.M.Zdobnov). "The InterPro database, an integrated documentation resource for protein families, domains and functional sites." Nucleic Acids Research, 2001, 29(1): 37-40.
Bucher P., Karplus K., Moeri N., and Hofmann K. "A Flexible Motif Search Technique Based on Generalized Profiles." Comput Chem, 1996, 20(1): 3-23.
Scordis P., Flower D.R., and Attwood T.K. "Fingerprintscan: Intelligent Searching of the Prints Motif Database>" Bioinformatics, 1999, 15(10): 799-806.
Eddy S.R. "Profile Hidden Markov Models." Bioinformatics, 1998, 14(9): p. 755-63.
Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., and Lipman D.J. "Gapped Blast and Psi-Blast: A New Generation of Protein Database Search Programs." Nucleic Acids Res, 1997, 25(17): p. 3389-402.