References

  1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool.

    J Mol Biol 1990, 215:403-410. PubMed Abstract | Publisher Full Text OpenURL

  2. NCBI BLASt [http://www.ncbi.nlm.nih.gov/BLAST/]
  3. WU-BLAST [http://blast.wustl.edu/]
  4. Baxevanis AD, Ouellette BFF, (eds):

    Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins. John Wiley; 1998. OpenURL

  5. Durbin R, Eddy S, Krogh A, Mitchison G:

    Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge: Cambridge University Press; 1998. OpenURL

  6. Higgins D, Taylor W, (eds):

    Bioinformatics: Sequence, Structure and Databanks. New York: Oxford University Press; 2000. OpenURL

  7. Kanehisa M:

    Post-Genome Informatics. New York: Oxford University Press; 2000. OpenURL

  8. Gibas L, Jambeck P:

    Developing Bioinformatics Computer Skills. Sebastopol, California: O'Reilly and Associates; 2001. OpenURL

  9. Wake DB: Comparative terminology.

    Science 1994, 265:268-269. OpenURL

  10. Wake DB: Homoplasy, homology and the problem of 'sameness' in biology.

    Novartis Found Symp 1999, 222:24-33. PubMed Abstract OpenURL

  11. Reeck GR, de Haen C, Teller DC, Doolittle RF, Fitch WM, Dickerson RE, Chambon P, McLachlan AD, Margoliash E, Jukes TH, et al.: "Homology" in proteins and nucleic acids: a terminology muddle and a way out of it.

    Cell 1987, 50:667. PubMed Abstract | Publisher Full Text OpenURL

  12. Pearson WR, Lipman DJ: Improved tools for biological sequence comparison.

    Proc Natl Acad Sci USA 1988, 85:2444-2448. PubMed Abstract OpenURL

  13. Altschul SF, Boguski MS, Gish W, Wootton JC: Issues in searching molecular sequence databases.

    Nat Genet 1994, 6:119-129. PubMed Abstract OpenURL

  14. Pearson WR: Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms.

    Genomics 1991, 11:635-650. PubMed Abstract OpenURL

  15. Koski LB, Golding GB: The closest BLAST hit is often not the nearest neighbor.

    J Mol Evol 2001, 52:540-542. PubMed Abstract | Publisher Full Text OpenURL

  16. Henikoff S, Henikoff JG: Amino acid substitution matrices from protein blocks.

    Proc Natl Acad Sci USA 1992, 89:10915-10919. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  17. Dayhoff MO, Schwartz RM, Orcutt BC: A model of evolutionary change in proteins.

    In: Atlas of Protein Sequence and Structure, vol. 5. Edited by Dayhoff MO. Washington DC: National Biomedical Research Foundation; 1978, 345-352. OpenURL

  18. States DJ, Gish W, Altschul SF: Improved sensitivity of nucleic acid database searches using application-specific scoring matrices.

    Methods: A Companion to Methods in Enzymology 1991, 3:66-70. OpenURL

  19. Henikoff S, Henikoff JG: Protein family classification based on searching a database of blocks.

    Genomics 1994, 19:97-107. PubMed Abstract | Publisher Full Text OpenURL

  20. Henikoff S, Henikoff JG: Automated assembly of protein blocks for database searching.

    Nucleic Acids Res 1991, 19:6565-6572. PubMed Abstract OpenURL

  21. NCBI FTP directory - BLAST matrices [ftp://ncbi.nlm.nih.gov/blast/matrices]
  22. Needleman SB, Wunsch CD: A general method applicable to the search for similarities in the amino acid sequence of two proteins.

    J Mol Biol 1970, 48:443-453. PubMed Abstract OpenURL

  23. Smith TF, Waterman MS: Identification of common molecular subsequences.

    J Mol Biol 1981, 147:195-197. PubMed Abstract OpenURL

  24. Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Rapp BA, Wheeler DL: GenBank.

    Nucleic Acids Res 2000, 28:15-18. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  25. GenBank [http://www.ncbi.nlm.nih.gov/Genbank/]
  26. Bairoch A, Apweiler R: The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

    Nucleic Acids Res 2000, 28:45-48. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  27. SWISS-PROT [http://www.expasy.ch/sprot/]
  28. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

    Nucleic Acids Res 1997, 25:3389-3402. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  29. Karlin S, Altschul SF: Applications and statistics for multiple high-scoring segments in molecular sequences.

    Proc Natl Acad Sci USA 1993, 90:5873-5877. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  30. Lamperti ED, Kittelberger JM, Smith TF, Villa-Komaroff L: Corruption of genomic databases with anomalous sequence.

    Nucleic Acids Res 1992, 20:2741-2747. PubMed Abstract OpenURL

  31. Kristensen T, Lopez R, Prydz H: An estimate of the sequencing error frequency in the DNA sequence databases.

    DNA Seq 1992, 2:343-346. PubMed Abstract OpenURL

  32. Lopez R, Kristensen T, Prydz H: Database contamination.

    Nature 1992, 355:211. PubMed Abstract | Publisher Full Text OpenURL

  33. States DJ, Botstein D: Molecular sequence accuracy and the analysis of protein coding regions.

    Proc Natl Acad Sci USA 1991, 88:5518-5522. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  34. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM, et al.: Whole-genome random sequencing and assembly of Haemophilus influenzae Rd.

    Science 1995, 269:496-512. PubMed Abstract OpenURL

  35. Ichikawa T, Suzuki Y, Czaja I, Schommer C, Lessnick A, Schell J, Walden R: Identification and role of adenylyl cyclase in auxin signalling in higher plants.

    Nature 1997, 390:698-701. PubMed Abstract | Publisher Full Text OpenURL

  36. Ichikawa T, Suzuki Y, Czaja I, Schommer C, Lessnick A, Schell J, Walden R: Identification and role of adenylyl cyclase in auxin signalling in higher plants.

    Nature 1998, 396:390. PubMed Abstract | Publisher Full Text OpenURL

  37. Karlin S, Altschul SF: Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

    Proc Natl Acad Sci USA 1990, 87:2264-2268. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  38. Full list of the BLAST Advanced options [http://www.ncbi.nlm.nih.gov/BLAST/full_options.html]