<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-8-r176</ui>
   <ji>GBJ</ji>
   <fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Distribution patterns of small-molecule ligands in the protein universe and implications for origin of life and drug discovery</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Ji</snm>
					<fnm>Hong-Fang</fnm>
					<insr iid="I1"/>
					<email>jhf@sdut.edu.cn</email>
				</au>
				<au id="A2">
					<snm>Kong</snm>
					<fnm>De-Xin</fnm>
					<insr iid="I1"/>
					<email>dxkong@163.com</email>
				</au>
				<au id="A3">
					<snm>Shen</snm>
					<fnm>Liang</fnm>
					<insr iid="I1"/>
					<email>shen@sdut.edu.cn</email>
				</au>
				<au id="A4">
					<snm>Chen</snm>
					<fnm>Ling-Ling</fnm>
					<insr iid="I1"/>
					<email>llchen@sdut.edu.cn</email>
				</au>
				<au id="A5">
					<snm>Ma</snm>
					<fnm>Bin-Guang</fnm>
					<insr iid="I1"/>
					<email>mbgmbg@163.com</email>
				</au>
				<au id="A6" ca="yes">
					<snm>Zhang</snm>
					<fnm>Hong-Yu</fnm>
					<insr iid="I1"/>
					<email>zhanghy@sdut.edu.cn</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Shandong Provincial Research Center for Bioinformatic Engineering and Technique, Center for Advanced Study, Shandong University of Technology, Zibo 255049, PR China</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2007</pubdate>
			<volume>8</volume>
			<issue>8</issue>
			<fpage>R176</fpage>
			<url>http://genomebiology.com/2007/8/8/R176</url>
			<xrefbib>
				<pubidlist>
					<pubid idtype="pmpid">17727706</pubid>
					<pubid idtype="doi">10.1186/gb-2007-8-8-r176</pubid>
				</pubidlist>
			</xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>4</day>
					<month>2</month>
					<year>2007</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>22</day>
					<month>8</month>
					<year>2007</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>29</day>
					<month>8</month>
					<year>2007</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>29</day>
					<month>08</month>
					<year>2007</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2007</year>
			<collab>Ji et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>Protein-ligand interactions</p>
		</shorttitle>
		<shortabs>
			<p>Ligand-protein mapping was found to follow a power law and the preferential attachment principle, leading to the identification of the molecules, mostly nucleotide-containing compounds, that are likely to have evolved earliest.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Extant life depends greatly on the binding of small molecules (such as ligands) with macromolecules (such as proteins), and one ligand can bind multiple proteins. However, little is known about the global patterns of ligand-protein mapping.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>By examining 2,186 well-defined small-molecule ligands and thousands of protein domains derived from a database of druggable binding sites, we show that a few ligands bind tens of protein domains or folds, whereas most ligands bind only one, which indicates that ligand-protein mapping follows a power law. Through assigning the protein-binding orders (early or late) for bio-ligands, we demonstrate that the preferential attachment principle still holds for the power-law relation between ligands and proteins. We also found that polar molecular surface area, H-bond acceptor counts, H-bond donor counts and partition coefficient are potential factors to discriminate ligands from ordinary molecules and to differentiate super ligands (shared by three or more folds) from others.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>These findings have significant implications for evolution and drug discovery. First, the chronology of ligand-protein binding can be inferred by the power-law feature of ligand-protein mapping. Some nucleotide-containing ligands, such as ATP, ADP, GDP, NAD, FAD, dihydro-nicotinamide-adenine-dinucleotide phosphate (NDP), nicotinamide-adenine-dinucleotide phosphate (NAP), flavin mononucleotide (FMN) and AMP, are found to be the earliest cofactors bound to proteins, agreeing with the current understanding of evolutionary history. Second, the finding that about 30% of ligands are shared by two or more domains will help with drug discovery, such as in finding new functions from old drugs, developing promiscuous drugs and depending more on natural products.</p>
				</sec>
			</sec>
		</abs>
	</fm>
   <meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010001">Biochemistry and structural biology</classification>
		</classifications>
	</meta>
   <bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Life is essentially a molecular network, not only in the individual sense but also at the ecosystem level <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp>. The network depends greatly on the binding of small molecules (for example, ligands and cofactors) with macromolecules (for example, proteins). Small-molecule ligands not only participate in many basic enzymatic reactions (as coenzymes or substrates) to build metabolic networks, but also act as extra- and intra-cellular signals to help construct regulation networks <abbrgrp>
					<abbr bid="B3">3</abbr>
					<abbr bid="B4">4</abbr>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
				</abbrgrp>. The great potential of small-molecule ligands to make links between different proteins means that one ligand can bind to diverse targets <abbrgrp>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
					<abbr bid="B12">12</abbr>
					<abbr bid="B13">13</abbr>
				</abbrgrp>. In fact, some ligands are extremely powerful in contacting proteins, which are termed hubs of biochemical networks <abbrgrp>
					<abbr bid="B14">14</abbr>
					<abbr bid="B15">15</abbr>
					<abbr bid="B16">16</abbr>
					<abbr bid="B17">17</abbr>
				</abbrgrp>. However, little is known about the global patterns of ligand-protein mapping, which stimulated our interest to do a comprehensive analysis and explore the biological and chemical bases underlying the mapping patterns. Since ligand-protein binding is one of the most basic biochemical processes, the present study has significant implications for tracing the important events in the origin of life and as well as for understanding the new paradigms in drug discovery.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Distribution patterns of ligands in the protein universe</p>
				</st>
				<p>Although considerable efforts have been devoted to constructing ligand databases <abbrgrp>
						<abbr bid="B18">18</abbr>
						<abbr bid="B19">19</abbr>
						<abbr bid="B20">20</abbr>
						<abbr bid="B21">21</abbr>
						<abbr bid="B22">22</abbr>
						<abbr bid="B23">23</abbr>
						<abbr bid="B24">24</abbr>
						<abbr bid="B25">25</abbr>
						<abbr bid="B26">26</abbr>
					</abbrgrp>, it is still a great challenge to select clearly defined ligands from them. Thanks to the endeavor of Rognan and co-workers, a well-defined ligand database, the Annotated Database of Druggable Binding Sites from the PDB (sc-PDB), was released recently <abbrgrp>
						<abbr bid="B27">27</abbr>
					</abbrgrp>. For this database, the ligands were collected according to the following criteria: only host proteins with high-resolution (&lt;2.5 &#197;) crystal structures were considered; water molecule, metal ions and other 'unwanted molecules' (for example, solvents, detergents and covalently bound ligands) were removed; only small-molecular-weight ligands (ranging from 70 to 800 Da for heavy atoms) were selected; and only ligands with a limited solvent-exposed surface (that is, less than 50% of their surface exposed to the solvent) were picked. In addition, the corresponding binding sites were also extracted and were defined by all of the protein residues with at least one atom within 6.5 &#197; of any ligand atom. Taken together, the clear definition for the ligands in sc-PDB guarantees the repeatability of the present analysis, which gives sc-PDB an advantage over other ligand databases.</p>
				<p>Through searching sc-PDB, 2,186 small-molecule ligands were selected, which are bound by 5,740 domains (the domains were counted at a non-redundant level and constituted domain space; Additional data file 1). According to SCOP 1.69 <abbrgrp>
						<abbr bid="B28">28</abbr>
						<abbr bid="B29">29</abbr>
					</abbrgrp>, these domains were classified into 591 folds. As one fold may cover multiple domains and bind more than one ligand, the fold occurrences amounted to 3,224, which constituted the fold universe.</p>
				<p>As shown in Additional data file 1, ligands do not distribute evenly in the domain space. A few ligands cover 100+ domains, 681 ligands (31.2%) are shared by 2 or more domains and 1,505 (68.8%) bind only one. Moreover, ligands also populate unevenly in the protein architecture universe. For instance, 1,833 ligands (83.9%) are bound by only one fold, 185 (8.5%) by two, while 24 ligands (1.1%) are bound by 10+ folds (Additional data file 1). The most common ligand, ATP (adenosine-5'-triphosphate), is shared by 35 folds. As illustrated in Figure <figr fid="F1">1</figr>, the number of ligands (<it>N</it>) decays with increasing number (<it>L</it>) of domains and folds that bind the ligand and follows the power law <it>N </it>= <it>aL</it>
					<sup>-<it>b </it>
					</sup>(<it>P </it>&lt; 0.0001). It is interesting to note that most of the widely shared ligands (such as those shared by 15+ folds; Additional data file 1) are hubs of metabolic networks <abbrgrp>
						<abbr bid="B14">14</abbr>
						<abbr bid="B15">15</abbr>
						<abbr bid="B16">16</abbr>
					</abbrgrp> and are vital to metabolism (especially energy metabolism).</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Power-law behaviors of ligand-protein binding</p>
					</caption>
					<text>
						<p>Power-law behaviors of ligand-protein binding. The number of ligands (<it>N</it>) decays with an increase in the number (<it>L</it>) of <b>(a) </b>domains and <b>(b) </b>folds that bind the ligand and follows the equation <it>N </it>= <it>aL</it>
							<sup>-<it>b</it>
							</sup>. The figure illustrates that a few ligands cover tens of protein domains or folds, while most ligands bind only one domain or fold.</p>
					</text>
					<graphic file="gb-2007-8-8-r176-1"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Biological basis underlying the power-law behaviors of ligand-protein binding</p>
				</st>
				<p>Although power law is a central concept in network sciences and has been implicated in most biological networks <abbrgrp>
						<abbr bid="B14">14</abbr>
						<abbr bid="B15">15</abbr>
						<abbr bid="B16">16</abbr>
					</abbrgrp>, it is a challenge to elucidate the mechanisms underlying the rule. The most popular theoretical models resort to preferential attachment principle, which attributes the different connections of nodes to their different emerging orders, that is to say, the more connected nodes originated earlier than the less connected nodes <abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp>. Although the preferential attachment principle has been justified for protein networks <abbrgrp>
						<abbr bid="B31">31</abbr>
						<abbr bid="B32">32</abbr>
						<abbr bid="B33">33</abbr>
					</abbrgrp>, it remains unclear whether it can be applied to protein-ligand binding.</p>
				<p>As a large part of the sc-PDB-derived ligands are synthetic, to explore the applicability of the preferential attachment principle to protein-ligand binding, we extracted bio-ligands from the ligand dataset. To do this, the MetaCyc database (9.5; a metabolic-pathway database that contains 5,253 metabolites) <abbrgrp>
						<abbr bid="B34">34</abbr>
					</abbrgrp> was employed to filter the non-metabolic ligands. As a result, 128 bio-ligands were obtained, which bind to 1,662 domains (counted at a non-redundant level). According to SCOP 1.69 <abbrgrp>
						<abbr bid="B28">28</abbr>
						<abbr bid="B29">29</abbr>
					</abbrgrp>, these domains were classified into 207 folds. As one fold may cover multiple domains and bind more than one ligand, the fold occurrences amounted to 574. Although these ligands are only metabolism-relevant, they also follow power-law distribution in the protein universe (Additional data file 2).</p>
				<p>As the quantity of bio-ligands is limited, to guarantee statistical significance, the 128 bio-ligands were classified into only two categories: first, 70 early ligands, which are owned by both prokaryotic (<it>Escherichia coli</it>) and eukaryotic (yeast or higher) species; and second, 54 late ligands, which are owned only by eukaryotic (yeast or higher) species (4 ligands failed in age assignment) (Additional data file 3). It is interesting to note that early ligands cover 7.1 folds on average, in contrast to late ligands, which cover only 1.2 folds on average, and that all (100%) super ligands (shared by 3+ folds) originated early, while most (64.8%) ordinary ligands (bind to 3 or less folds) appeared late. All of these findings strongly suggest that the preferential attachment principle still holds for ligand-protein binding to a large extent.</p>
			</sec>
			<sec>
				<st>
					<p>Chemical basis underlying the power-law behaviors of ligand-protein binding</p>
				</st>
				<p>It has been widely accepted that protein folds are among the most conserved elements of life <abbrgrp>
						<abbr bid="B35">35</abbr>
						<abbr bid="B36">36</abbr>
						<abbr bid="B37">37</abbr>
					</abbrgrp>. However, the present analysis indicates that 353 ligands (16.1%) are shared by 2 or more folds and 104 ligands (4.8%) can cover 3+ folds, which suggests that ligand binding is not constrained by the global architecture of proteins. This finding is consistent with a recent concept that the local structures around an active site are more basic than folds to describe a protein's biological space (binding site for potential ligands) <abbrgrp>
						<abbr bid="B38">38</abbr>
					</abbrgrp>. This phenomenon can be elucidated, at least in part, in terms of the structure-function relationships of proteins. First, binding sites and ligands are quite flexible and plastic <abbrgrp>
						<abbr bid="B39">39</abbr>
						<abbr bid="B40">40</abbr>
						<abbr bid="B41">41</abbr>
					</abbrgrp>, and therefore, binding-site selection is, to certain extent, ligand dependent <abbrgrp>
						<abbr bid="B42">42</abbr>
						<abbr bid="B43">43</abbr>
						<abbr bid="B44">44</abbr>
					</abbrgrp>. Second, ligand binding is governed by a few conserved residues and, thus, is a local rather than a global property of proteins <abbrgrp>
						<abbr bid="B10">10</abbr>
						<abbr bid="B11">11</abbr>
					</abbrgrp>. However, the structural factors underlying the strong protein-binding ability of the super ligands still remain unknown. In addition, it is also of interest to explore the structural features discriminating ligands from ordinary molecules. Therefore, the chemical space consisting of ligands and ordinary molecules was charted to reveal the relationship between the ligand distribution patterns in the protein universe and in the chemical space.</p>
				<p>The chemical space is composed of 2,176 ligands derived from sc-PDB (due to the lack of atomic parameters, 10 of the 2,186 ligands failed to go through the descriptor calculations) and 2,184 small molecules randomly selected from ACD-SC (Available Chemicals Directory-Screening Compounds, Version 2005.1, Molecular Design Ltd. Information Systems Inc., San Leardo, CA, USA; which collects chemicals that are commercially available and is broadly regarded as a source of ordinary molecules <abbrgrp>
						<abbr bid="B45">45</abbr>
					</abbrgrp>). Seventy descriptors characterizing the structural features of these molecules were calculated, of which 13 were calculated by Sybyl (Tripos Inc., St Louis, Missouri, USA <abbrgrp>
						<abbr bid="B46">46</abbr>
					</abbrgrp>), 49 by Cerius2 (Version 4.10L, Accelrys Inc., San Diego, CA, USA <abbrgrp>
						<abbr bid="B47">47</abbr>
					</abbrgrp>) and 8 by an in-house program written in Perl (Table <tblr tid="T1">1</tblr>).</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Descriptors of chemical space consisting of sc-PDB-derived ligands and ACD-SC-derived ordinary molecules and corresponding loadings (Varimax normalized) for the first two factors*</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Descriptors</p>
							</c>
							<c ca="left">
								<p>Characterization</p>
							</c>
							<c cspan="2" ca="center">
								<p>Factor loadings</p>
							</c>
							<c ca="left">
								<p>Software</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AREA</p>
							</c>
							<c ca="left">
								<p>Total molecular surface area</p>
							</c>
							<c ca="center">
								<p>
									<b>0.974</b>
								</p>
							</c>
							<c ca="center">
								<p>0.103</p>
							</c>
							<c ca="left">
								<p>Sybyl</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PSA</p>
							</c>
							<c ca="left">
								<p>Polar molecular surface area</p>
							</c>
							<c ca="center">
								<p>0.255</p>
							</c>
							<c ca="center">
								<p>
									<b>0.892</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PV</p>
							</c>
							<c ca="left">
								<p>Polar molecular volume</p>
							</c>
							<c ca="center">
								<p>0.501</p>
							</c>
							<c ca="center">
								<p>0.741</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>VOL</p>
							</c>
							<c ca="left">
								<p>Total molecular volume</p>
							</c>
							<c ca="center">
								<p>
									<b>0.991</b>
								</p>
							</c>
							<c ca="center">
								<p>0.062</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>MOLWEIGHT</p>
							</c>
							<c ca="left">
								<p>Molecular weight</p>
							</c>
							<c ca="center">
								<p>
									<b>0.958</b>
								</p>
							</c>
							<c ca="center">
								<p>0.206</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Acceptor</p>
							</c>
							<c ca="left">
								<p>H-bond acceptor counts</p>
							</c>
							<c ca="center">
								<p>0.464</p>
							</c>
							<c ca="center">
								<p>
									<b>0.799</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Donor</p>
							</c>
							<c ca="left">
								<p>H-bond donor counts</p>
							</c>
							<c ca="center">
								<p>0.376</p>
							</c>
							<c ca="center">
								<p>
									<b>0.817</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>BondCount</p>
							</c>
							<c ca="left">
								<p>Total bond counts</p>
							</c>
							<c ca="center">
								<p>
									<b>0.972</b>
								</p>
							</c>
							<c ca="center">
								<p>0.060</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Chiral</p>
							</c>
							<c ca="left">
								<p>Counts of chiral center</p>
							</c>
							<c ca="center">
								<p>0.367</p>
							</c>
							<c ca="center">
								<p>0.617</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Hydrophobe</p>
							</c>
							<c ca="left">
								<p>Hydrophobic fragment counts</p>
							</c>
							<c ca="center">
								<p>0.767</p>
							</c>
							<c ca="center">
								<p>-0.417</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RingCount</p>
							</c>
							<c ca="left">
								<p>Ring counts</p>
							</c>
							<c ca="center">
								<p>0.686</p>
							</c>
							<c ca="center">
								<p>-0.069</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RotBonds</p>
							</c>
							<c ca="left">
								<p>Number of rotatable bonds</p>
							</c>
							<c ca="center">
								<p>0.630</p>
							</c>
							<c ca="center">
								<p>0.428</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>HeavyAtoms</p>
							</c>
							<c ca="left">
								<p>Number of non-H atoms</p>
							</c>
							<c ca="center">
								<p>
									<b>0.978</b>
								</p>
							</c>
							<c ca="center">
								<p>0.149</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Carbons</p>
							</c>
							<c ca="left">
								<p>Number of carbons atoms</p>
							</c>
							<c ca="center">
								<p>
									<b>0.943</b>
								</p>
							</c>
							<c ca="center">
								<p>-0.228</p>
							</c>
							<c ca="left">
								<p>Perl</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Oxygens</p>
							</c>
							<c ca="left">
								<p>Number of oxygen atoms</p>
							</c>
							<c ca="center">
								<p>0.425</p>
							</c>
							<c ca="center">
								<p>0.793</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Nitrogens</p>
							</c>
							<c ca="left">
								<p>Number of nitrogen atoms</p>
							</c>
							<c ca="center">
								<p>0.475</p>
							</c>
							<c ca="center">
								<p>0.324</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Sulfurs</p>
							</c>
							<c ca="left">
								<p>Number of sulfur atoms</p>
							</c>
							<c ca="center">
								<p>0.141</p>
							</c>
							<c ca="center">
								<p>-0.009</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Phosphorus</p>
							</c>
							<c ca="left">
								<p>Number of phosphorus atoms</p>
							</c>
							<c ca="center">
								<p>0.162</p>
							</c>
							<c ca="center">
								<p>0.617</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Halides</p>
							</c>
							<c ca="left">
								<p>Number of halide atoms</p>
							</c>
							<c ca="center">
								<p>0.076</p>
							</c>
							<c ca="center">
								<p>-0.170</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>DoubleBonds</p>
							</c>
							<c ca="left">
								<p>Number of double bonds</p>
							</c>
							<c ca="center">
								<p>0.527</p>
							</c>
							<c ca="center">
								<p>0.378</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>TripleBonds</p>
							</c>
							<c ca="left">
								<p>Number of triple bonds</p>
							</c>
							<c ca="center">
								<p>-0.009</p>
							</c>
							<c ca="center">
								<p>-0.109</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RadOfGyration</p>
							</c>
							<c ca="left">
								<p>Radius of gyration</p>
							</c>
							<c ca="center">
								<p>0.888</p>
							</c>
							<c ca="center">
								<p>0.004</p>
							</c>
							<c ca="left">
								<p>Cerius 2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowXY</p>
							</c>
							<c ca="left">
								<p>Surface area projections</p>
							</c>
							<c ca="center">
								<p>
									<b>0.967</b>
								</p>
							</c>
							<c ca="center">
								<p>0.076</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowXZ</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.951</b>
								</p>
							</c>
							<c ca="center">
								<p>0.053</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowYZ</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.877</p>
							</c>
							<c ca="center">
								<p>0.093</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowXYfrac</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-0.610</p>
							</c>
							<c ca="center">
								<p>-0.027</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowXZfrac</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-0.421</p>
							</c>
							<c ca="center">
								<p>-0.002</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowYZfrac</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-0.289</p>
							</c>
							<c ca="center">
								<p>0.039</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Shadownu</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.268</p>
							</c>
							<c ca="center">
								<p>-0.117</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowXlength</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.849</p>
							</c>
							<c ca="center">
								<p>-0.008</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowYlength</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.798</p>
							</c>
							<c ca="center">
								<p>0.075</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ShadowZlength</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.756</p>
							</c>
							<c ca="center">
								<p>0.059</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Density</p>
							</c>
							<c ca="left">
								<p>Density</p>
							</c>
							<c ca="center">
								<p>-0.089</p>
							</c>
							<c ca="center">
								<p>0.354</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PMImag</p>
							</c>
							<c ca="left">
								<p>Principal moment of inertia</p>
							</c>
							<c ca="center">
								<p>0.819</p>
							</c>
							<c ca="center">
								<p>0.134</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AlogP</p>
							</c>
							<c ca="left">
								<p>Log of the partition coefficient using Ghose and Crippen's method.</p>
							</c>
							<c ca="center">
								<p>0.425</p>
							</c>
							<c ca="center">
								<p>-0.727</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AlogP98</p>
							</c>
							<c ca="left">
								<p>Log of the partition coefficient, atom-type value, using latest parameters.</p>
							</c>
							<c ca="center">
								<p>0.365</p>
							</c>
							<c ca="center">
								<p>
									<b>-0.852</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Fh2o</p>
							</c>
							<c ca="left">
								<p>Desolvation free energy for water</p>
							</c>
							<c ca="center">
								<p>-0.479</p>
							</c>
							<c ca="center">
								<p>-0.762</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Foct</p>
							</c>
							<c ca="left">
								<p>Desolvation free energy for octanol</p>
							</c>
							<c ca="center">
								<p>-0.578</p>
							</c>
							<c ca="center">
								<p>-0.617</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>LogP</p>
							</c>
							<c ca="left">
								<p>Log of the partition coefficient.</p>
							</c>
							<c ca="center">
								<p>-0.022</p>
							</c>
							<c ca="center">
								<p>
									<b>-0.892</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>MR</p>
							</c>
							<c ca="left">
								<p>Molar refractivity using Hopfinger's method.</p>
							</c>
							<c ca="center">
								<p>0.835</p>
							</c>
							<c ca="center">
								<p>-0.110</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>MolRef</p>
							</c>
							<c ca="left">
								<p>Molar refractivity using linear additive method based on AlogP atom types</p>
							</c>
							<c ca="center">
								<p>
									<b>0.986</b>
								</p>
							</c>
							<c ca="center">
								<p>-0.033</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>JX</p>
							</c>
							<c ca="left">
								<p>Balaban indices</p>
							</c>
							<c ca="center">
								<p>-0.567</p>
							</c>
							<c ca="center">
								<p>0.027</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Kappa1</p>
							</c>
							<c ca="left">
								<p>Kappa topological indices</p>
							</c>
							<c ca="center">
								<p>
									<b>0.969</b>
								</p>
							</c>
							<c ca="center">
								<p>0.189</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Kappa2</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.926</b>
								</p>
							</c>
							<c ca="center">
								<p>0.026</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Kappa3</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.691</p>
							</c>
							<c ca="center">
								<p>0.033</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Kappa1AM</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.958</b>
								</p>
							</c>
							<c ca="center">
								<p>0.220</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Kappa2AM</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.901</b>
								</p>
							</c>
							<c ca="center">
								<p>0.050</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Kappa3AM</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.630</p>
							</c>
							<c ca="center">
								<p>0.046</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PHI</p>
							</c>
							<c ca="left">
								<p>Molecular flexibility index</p>
							</c>
							<c ca="center">
								<p>0.800</p>
							</c>
							<c ca="center">
								<p>0.078</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>SC0</p>
							</c>
							<c ca="left">
								<p>Subgraph topological counts</p>
							</c>
							<c ca="center">
								<p>
									<b>0.980</b>
								</p>
							</c>
							<c ca="center">
								<p>0.147</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>SC1</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.973</b>
								</p>
							</c>
							<c ca="center">
								<p>0.125</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>SC2</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.943</b>
								</p>
							</c>
							<c ca="center">
								<p>0.186</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>SC3P</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.904</b>
								</p>
							</c>
							<c ca="center">
								<p>0.141</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>SC3C</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.749</p>
							</c>
							<c ca="center">
								<p>0.389</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>SC3CH</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.016</p>
							</c>
							<c ca="center">
								<p>-0.086</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHI0</p>
							</c>
							<c ca="left">
								<p>Kier and Hall Chi connectivity indices</p>
							</c>
							<c ca="center">
								<p>
									<b>0.974</b>
								</p>
							</c>
							<c ca="center">
								<p>0.190</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHI1</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.983</b>
								</p>
							</c>
							<c ca="center">
								<p>0.115</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHI2</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.958</b>
								</p>
							</c>
							<c ca="center">
								<p>0.210</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHI3P</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.939</b>
								</p>
							</c>
							<c ca="center">
								<p>0.136</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHI3C</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.655</p>
							</c>
							<c ca="center">
								<p>0.484</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHI3CH</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.015</p>
							</c>
							<c ca="center">
								<p>-0.087</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHIV0</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.990</b>
								</p>
							</c>
							<c ca="center">
								<p>0.076</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHIV1</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.971</b>
								</p>
							</c>
							<c ca="center">
								<p>0.120</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHIV2</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0.913</b>
								</p>
							</c>
							<c ca="center">
								<p>0.137</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHIV3P</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.838</p>
							</c>
							<c ca="center">
								<p>0.096</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHIV3C</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.476</p>
							</c>
							<c ca="center">
								<p>0.148</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CHIV3CH</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>0.016</p>
							</c>
							<c ca="center">
								<p>-0.088</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Wiener</p>
							</c>
							<c ca="left">
								<p>Wiener topological index</p>
							</c>
							<c ca="center">
								<p>0.854</p>
							</c>
							<c ca="center">
								<p>0.186</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>logZ</p>
							</c>
							<c ca="left">
								<p>Logarithm of Hosoya topological index</p>
							</c>
							<c ca="center">
								<p>-0.220</p>
							</c>
							<c ca="center">
								<p>-0.131</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Zagreb</p>
							</c>
							<c ca="left">
								<p>Zagreb topological index</p>
							</c>
							<c ca="center">
								<p>
									<b>0.958</b>
								</p>
							</c>
							<c ca="center">
								<p>0.162</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>*The first factor explains 52.8% of the variance and the second explains 12.7%. Factors with high loadings (>0.9 for first factors and >0.8 for second factors) are shown in bold.</p>
					</tblfn>
				</tbl>
				<p>We used factor analysis to visualize the diversity of the molecules. Factor analysis is widely used to study the patterns of relationship among many dependent variables, with the goal of discovering something about the nature of the independent variables (called factors) that affect them <abbrgrp>
						<abbr bid="B48">48</abbr>
						<abbr bid="B49">49</abbr>
					</abbrgrp>. In the present analysis, two factors, which can explain 65.5% of the variance, were extracted by principal component analysis and rotated by the Varimax method <abbrgrp>
						<abbr bid="B50">50</abbr>
					</abbrgrp> to chart the two-dimensional chemical space of small molecules. The factor loadings (Varimax normalized) are listed in Table <tblr tid="T1">1</tblr>.</p>
				<p>From the factor loadings, we see that the first factor, explaining 52.8% of the variance, contains high loadings (>0.9; shown in bold in Table <tblr tid="T1">1</tblr>) from constitutional properties (such as total molecular surface area, total molecular volume, molecular weight, total bond counts, number of non-hydrogen atoms and number of carbons atoms) and topological properties (such as Kappa topological indices, subgraph topological counts, Kier and Hall Chi connectivity indices and Zagreb topological Index). In comparison, the second factor, explaining 12.7% of the variance, contains important contributions (with loadings of higher than 0.8; shown in bold in Table <tblr tid="T1">1</tblr>) from electronic properties, such as polar molecular surface area, H-bond acceptor counts (whose loading is 0.799), H-bond donor counts and partition coefficient (measured by AlogP98 and LogP).</p>
				<p>In the chemical space formed by the two factors (Figure <figr fid="F2">2</figr>), one can find some differences between the distribution patterns of ligands and ordinary molecules. That is, ligands (in red) occupy the relatively upper part of the space, while ordinary molecules (in blue) hold the relatively lower part, which implies that it is the second factor that discriminates ligands from ordinary molecules. As a consequence, it can be deduced that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are likely responsible for the differences between ligands and ordinary molecules, which agrees well with the current understanding of the chemical basis of ligand-protein binding that electrostatic interactions (including H-bond) and hydrophobic interactions make major contributions to the binding. More interestingly, as shown in Figure <figr fid="F3">3</figr>, super ligands (in blue and red) do not distribute randomly in the chemical space, but concentrate in the relatively upper part of the space, which suggests that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are also key factors discriminating super ligands from others.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Chemical space consisting of ligands (derived from sc-PDB) and ordinary molecules (randomly selected from ACD-SC), defined by the first two factors derived from 70 descriptors</p>
					</caption>
					<text>
						<p>Chemical space consisting of ligands (derived from sc-PDB) and ordinary molecules (randomly selected from ACD-SC), defined by the first two factors derived from 70 descriptors. The figure illustrates that ligands (in red) occupy the relatively upper part of the space, while ordinary molecules (in blue) occupy the relatively lower part, which means that it is the second factor that discriminates ligands from ordinary molecules. From the loadings of the second factor, it can be deduced that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are likely responsible for the differences between ligands and ordinary molecules, which is supported by the different average values of the four kinds of parameters for ligands and ordinary molecules (Table 2).</p>
					</text>
					<graphic file="gb-2007-8-8-r176-2"/>
				</fig>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Chemical space consisting of sc-PDB-derived ligands, defined by the first two factors derived from 70 descriptors</p>
					</caption>
					<text>
						<p>Chemical space consisting of sc-PDB-derived ligands, defined by the first two factors derived from 70 descriptors. The figure illustrates that super ligands (shared by 3+ folds; in blue), especially those that are shared by 10+ folds (in red), concentrate in the relatively upper part of the space (the area of the circle is directly proportional to the number of folds that bind the ligand), which suggests that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are responsible for the strong protein-binding potential of the super ligands, which is supported by the different average values of the four kinds of parameters for ligands with different protein-binding potentials (Table 2).</p>
					</text>
					<graphic file="gb-2007-8-8-r176-3"/>
				</fig>
				<p>To shed more light on the above findings, the average values of descriptors characterizing polar molecular surface area, H-bond donors, H-bond acceptors and partition coefficient were calculated for ordinary molecules, ligands and super ligands. From Table <tblr tid="T2">2</tblr>, it can be seen that there indeed exist correlations between protein-binding ability and the four kinds of parameters. The protein-binding potential of ligands is positively correlated with polar molecular surface area, H-bond donor and acceptor counts, and negatively correlated with partition coefficient (measured by AlogP98 and LogP).</p>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Average values of descriptors characterizing polar molecular surface area, H-bond donors, H-bond acceptors, partition coefficient and rotatable bonds for ordinary molecules, ligands and ligands with different protein-binding potentials</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Descriptor*</p>
							</c>
							<c ca="left">
								<p>Small molecules<sup>&#8224;</sup>
								</p>
							</c>
							<c ca="center">
								<p>Average values</p>
							</c>
							<c ca="center">
								<p>Standard error</p>
							</c>
							<c ca="center">
								<p>Number of molecules</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PSA</p>
							</c>
							<c ca="left">
								<p>Molecules</p>
							</c>
							<c ca="center">
								<p>111.81</p>
							</c>
							<c ca="center">
								<p>1.79</p>
							</c>
							<c ca="center">
								<p>2,184</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands</p>
							</c>
							<c ca="center">
								<p>230.59</p>
							</c>
							<c ca="center">
								<p>2.79</p>
							</c>
							<c ca="center">
								<p>2,176</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8804; 3)</p>
							</c>
							<c ca="center">
								<p>225.71</p>
							</c>
							<c ca="center">
								<p>2.80</p>
							</c>
							<c ca="center">
								<p>2,072</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (4-9)</p>
							</c>
							<c ca="center">
								<p>304.28</p>
							</c>
							<c ca="center">
								<p>15.67</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8805; 10)</p>
							</c>
							<c ca="center">
								<p>406.83</p>
							</c>
							<c ca="center">
								<p>33.10</p>
							</c>
							<c ca="center">
								<p>24</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Donor</p>
							</c>
							<c ca="left">
								<p>Molecules</p>
							</c>
							<c ca="center">
								<p>1.51</p>
							</c>
							<c ca="center">
								<p>0.04</p>
							</c>
							<c ca="center">
								<p>2,184</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands</p>
							</c>
							<c ca="center">
								<p>3.97</p>
							</c>
							<c ca="center">
								<p>0.07</p>
							</c>
							<c ca="center">
								<p>2,176</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8804; 3)</p>
							</c>
							<c ca="center">
								<p>3.87</p>
							</c>
							<c ca="center">
								<p>0.07</p>
							</c>
							<c ca="center">
								<p>2,072</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (4-9)</p>
							</c>
							<c ca="center">
								<p>5.24</p>
							</c>
							<c ca="center">
								<p>0.43</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8805; 10)</p>
							</c>
							<c ca="center">
								<p>8.21</p>
							</c>
							<c ca="center">
								<p>0.90</p>
							</c>
							<c ca="center">
								<p>24</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Acceptor</p>
							</c>
							<c ca="left">
								<p>Molecules</p>
							</c>
							<c ca="center">
								<p>3.35</p>
							</c>
							<c ca="center">
								<p>0.05</p>
							</c>
							<c ca="center">
								<p>2,184</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands</p>
							</c>
							<c ca="center">
								<p>5.87</p>
							</c>
							<c ca="center">
								<p>0.09</p>
							</c>
							<c ca="center">
								<p>2,176</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8804; 3)</p>
							</c>
							<c ca="center">
								<p>5.74</p>
							</c>
							<c ca="center">
								<p>0.09</p>
							</c>
							<c ca="center">
								<p>2,072</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (4-9)</p>
							</c>
							<c ca="center">
								<p>7.69</p>
							</c>
							<c ca="center">
								<p>0.53</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8805; 10)</p>
							</c>
							<c ca="center">
								<p>11.00</p>
							</c>
							<c ca="center">
								<p>1.18</p>
							</c>
							<c ca="center">
								<p>24</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AlogP98</p>
							</c>
							<c ca="left">
								<p>Molecules</p>
							</c>
							<c ca="center">
								<p>2.87</p>
							</c>
							<c ca="center">
								<p>0.05</p>
							</c>
							<c ca="center">
								<p>2,184</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands</p>
							</c>
							<c ca="center">
								<p>0.81</p>
							</c>
							<c ca="center">
								<p>0.06</p>
							</c>
							<c ca="center">
								<p>2,176</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8804; 3)</p>
							</c>
							<c ca="center">
								<p>0.92</p>
							</c>
							<c ca="center">
								<p>0.06</p>
							</c>
							<c ca="center">
								<p>2,072</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (4-9)</p>
							</c>
							<c ca="center">
								<p>-1.33</p>
							</c>
							<c ca="center">
								<p>0.25</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8805; 10)</p>
							</c>
							<c ca="center">
								<p>-1.80</p>
							</c>
							<c ca="center">
								<p>0.38</p>
							</c>
							<c ca="center">
								<p>24</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>LogP</p>
							</c>
							<c ca="left">
								<p>Molecules</p>
							</c>
							<c ca="center">
								<p>0.77</p>
							</c>
							<c ca="center">
								<p>0.08</p>
							</c>
							<c ca="center">
								<p>2,184</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands</p>
							</c>
							<c ca="center">
								<p>-2.27</p>
							</c>
							<c ca="center">
								<p>0.10</p>
							</c>
							<c ca="center">
								<p>2,176</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8804; 3)</p>
							</c>
							<c ca="center">
								<p>-2.10</p>
							</c>
							<c ca="center">
								<p>0.10</p>
							</c>
							<c ca="center">
								<p>2,072</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (4-9)</p>
							</c>
							<c ca="center">
								<p>-5.06</p>
							</c>
							<c ca="center">
								<p>0.50</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8805; 10)</p>
							</c>
							<c ca="center">
								<p>-8.11</p>
							</c>
							<c ca="center">
								<p>0.96</p>
							</c>
							<c ca="center">
								<p>24</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RotBond</p>
							</c>
							<c ca="left">
								<p>Molecules</p>
							</c>
							<c ca="center">
								<p>4.88</p>
							</c>
							<c ca="center">
								<p>0.09</p>
							</c>
							<c ca="center">
								<p>2,184</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands</p>
							</c>
							<c ca="center">
								<p>7.49</p>
							</c>
							<c ca="center">
								<p>0.11</p>
							</c>
							<c ca="center">
								<p>2,176</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8804; 3)</p>
							</c>
							<c ca="center">
								<p>7.43</p>
							</c>
							<c ca="center">
								<p>0.11</p>
							</c>
							<c ca="center">
								<p>2072</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (4-9)</p>
							</c>
							<c ca="center">
								<p>8.00</p>
							</c>
							<c ca="center">
								<p>0.50</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Ligands (&#8805; 10)</p>
							</c>
							<c ca="center">
								<p>11.33</p>
							</c>
							<c ca="center">
								<p>1.19</p>
							</c>
							<c ca="center">
								<p>24</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>* PSA, polar molecular surface area; Donor, H-bond donor counts; Acceptor, H-bond acceptor counts; AlogP98, log of the partition coefficient, atom-type value, using latest parameters; LogP, log of the partition coefficient; RotBond, number of rotatable bonds. <sup>&#8224;</sup>Molecules, ACD-SC-derived ordinary molecules; Ligands, sc-PDB-derived ligands; Ligands (&#8804; 3), ligands covering &#8804; 3 folds; Ligands (4-9), ligands covering 4-9 folds; Ligands (&#8805; 10), ligands covering &#8805; 10 folds.</p>
					</tblfn>
				</tbl>
				<p>Recently, through examining the conformational diversity of some very common ligands (that is, ATP, NAD and FAD) bound to proteins, Stockwell and Thornton <abbrgrp>
						<abbr bid="B41">41</abbr>
					</abbrgrp> suggested that molecular flexibility is important for ligands to bind diverse proteins. This opinion is partially supported by the present analysis. Although the contribution from the number of rotatable bonds (RotBonds) to the second factor is not very strong (the loading is 0.428; Table <tblr tid="T1">1</tblr>), there is a correlation between the protein-binding ability of ligands and index RotBonds. As listed in Table <tblr tid="T2">2</tblr>, the average RotBonds for ligands is significantly higher than that for ordinary molecules (independent samples <it>t</it>-test shows that <it>P </it>&lt; 0.0001), and it is clear that the more folds the ligands cover, the higher the average RotBonds are for the ligands.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>Since ligand-protein binding is one of the most basic biochemical processes, the present findings have broad biological and medical implications.</p>
			<sec>
				<st>
					<p>Implications for tracing the chronology of ligand binding to proteins</p>
				</st>
				<p>The most challenging issue in life sciences may be elucidating how organisms originated from inorganic scratches (gases, water and clays), during which one of the most important missions is to establish the chronology of the important biological events. Thanks to the continuing efforts of chemists and biologists, the chronologies of the evolution of amino acids and proteins have been established in principle <abbrgrp>
						<abbr bid="B37">37</abbr>
						<abbr bid="B51">51</abbr>
						<abbr bid="B52">52</abbr>
						<abbr bid="B53">53</abbr>
						<abbr bid="B54">54</abbr>
						<abbr bid="B55">55</abbr>
					</abbrgrp>. However, as many proteins bind ligands that are essential for their functions and the ligands are likely to have originated independently of proteins <abbrgrp>
						<abbr bid="B56">56</abbr>
						<abbr bid="B57">57</abbr>
						<abbr bid="B58">58</abbr>
						<abbr bid="B59">59</abbr>
					</abbrgrp>, the binding of ligands with primordial proteins would also be a critical step in the origin of life. Thus, it is intriguing to explore the chronology of ligand-protein binding and answer the following questions: which ligand was first recognized by a protein and what kind of architecture did the host protein have. Nevertheless, since there is no fossil of the last universal common ancestor, let alone the more ancestral organisms, it is a great challenge to trace the protein-binding history of early ligands.</p>
				<p>As stated above, through determining the protein-binding ages of ligands, a rough temporal order (early or late) for ligand-protein binding can be inferred (as shown in Additional data file 3). However, considering the fact that fold distribution pattern in the sequence universe helps greatly to reveal the chronology of the evolution of protein architecture <abbrgrp>
						<abbr bid="B37">37</abbr>
						<abbr bid="B53">53</abbr>
						<abbr bid="B54">54</abbr>
					</abbrgrp>, we speculate that the power-law distribution of ligands in the protein universe may implicate a more explicit temporal order for ligand-protein binding. In fact, the preferential attachment principle underlying the power-law behavior of ligand-protein mapping suggests that the more widely a ligand is shared, the earlier it bound to proteins. As protein architecture is more conserved than sequence <abbrgrp>
						<abbr bid="B35">35</abbr>
						<abbr bid="B36">36</abbr>
						<abbr bid="B37">37</abbr>
					</abbrgrp>, the fold-based inference is believed to be more robust than the domain-based one. Therefore, the nine bio-ligands that are most popular in the fold universe (covering 15+ folds; Table <tblr tid="T3">3</tblr>) are considered to have bound their host proteins relatively earlier than others and to follow the order (from early to late): ATP, ADP (adenosine-5'-diphosphate), GDP (guanosine-5'-diphosphate), NAD (nicotinamide-adenine-dinucleotide), FAD (flavin-adenine dinucleotide), NDP (dihydro-nicotinamide-adenine-dinucleotide phosphate), NAP (nicotinamide-adenine-dinucleotide phosphate), FMN (flavin mononucleotide) and AMP (adenosine monophosphate).</p>
				<tbl id="T3">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>The most prevalent bio-ligands in the fold universe (shared by 15+ folds) and the most common folds used by host proteins of each ligand</p>
					</caption>
					<tblbdy cols="3">
						<r>
							<c ca="left">
								<p>Ligands</p>
							</c>
							<c ca="center">
								<p>Number of folds</p>
							</c>
							<c ca="left">
								<p>Most common folds</p>
							</c>
						</r>
						<r>
							<c cspan="3">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Adenosine-5'-triphosphate (ATP)</p>
							</c>
							<c ca="center">
								<p>35</p>
							</c>
							<c ca="left">
								<p>P-loop containing nucleoside triphosphate hydrolases (c.37)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Adenosine-5'-diphosphate (ADP)</p>
							</c>
							<c ca="center">
								<p>31</p>
							</c>
							<c ca="left">
								<p>P-loop containing nucleoside triphosphate hydrolases (c.37)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Guanosine-5'-diphosphate (GDP)</p>
							</c>
							<c ca="center">
								<p>29</p>
							</c>
							<c ca="left">
								<p>P-loop containing nucleoside triphosphate hydrolases (c.37)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Nicotinamide-adenine-dinucleotide (NAD)</p>
							</c>
							<c ca="center">
								<p>27</p>
							</c>
							<c ca="left">
								<p>NAD(P)-binding Rossmann-fold domains (c.2)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Flavin-adenine dinucleotide (FAD)</p>
							</c>
							<c ca="center">
								<p>21</p>
							</c>
							<c ca="left">
								<p>FAD/NAD(P)-binding domain (c.3)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Dihydro-nicotinamide-adenine-dinucleotide phosphate (NDP)</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="left">
								<p>NAD(P)-binding Rossmann-fold domains (c.2)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Nicotinamide-adenine-dinucleotide phosphate (NAP)</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="left">
								<p>NAD(P)-binding Rossmann-fold domains (c.2)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Flavin mononucleotide (FMN)</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="left">
								<p>
									<a href="http://scop.berkeley.edu/data/scop.b.d.cg.html">Flavodoxin-like</a> (c.23)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Adenosine monophosphate (AMP)</p>
							</c>
							<c ca="center">
								<p>15</p>
							</c>
							<c ca="left">
								<p>Adenine nucleotide alpha hydrolase-like (c.26)</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>A close inspection of ATP's host proteins reveals that although ATP covers 35 folds and 97 domains, most domains belong to a small group of folds, indicating that power law is still effective (Additional data file 4). According to the preferential attachment principle of fold usage <abbrgrp>
						<abbr bid="B37">37</abbr>
					</abbrgrp>, it is reasonable to infer that the most prevalent fold, P-loop hydrolase (c.37), was employed by ATP's first host (Table <tblr tid="T3">3</tblr>). Interestingly, c.37 is the most ancient fold predicted by a phylogenomic analysis of protein architectures <abbrgrp>
						<abbr bid="B37">37</abbr>
						<abbr bid="B53">53</abbr>
						<abbr bid="B54">54</abbr>
					</abbrgrp>. Similar analyses allowed us to deduce the most ancestral host proteins of the other eight early ligands (Additional data file 4, Table <tblr tid="T3">3</tblr>). It is interesting to note that the predicted earliest hosts for the nine bio-ligands appeared in roughly the same order as the protein structures deduced by a phylogenomic analysis (that is, c.37 is the earliest, followed by c.2, c.23, c.3 and c.26, all of which belong to the &#945;/&#946; class) <abbrgrp>
						<abbr bid="B37">37</abbr>
						<abbr bid="B53">53</abbr>
						<abbr bid="B54">54</abbr>
					</abbrgrp>. Although no consensus has been reached on the exact temporal order of protein architectures, &#945;/&#946; is generally considered to be the most ancient protein class <abbrgrp>
						<abbr bid="B37">37</abbr>
						<abbr bid="B53">53</abbr>
						<abbr bid="B54">54</abbr>
						<abbr bid="B60">60</abbr>
						<abbr bid="B61">61</abbr>
						<abbr bid="B62">62</abbr>
					</abbrgrp>. In addition, based on an extensive analysis of sequences and structures of numerous proteins, Trifonov and co-workers <abbrgrp>
						<abbr bid="B63">63</abbr>
						<abbr bid="B64">64</abbr>
						<abbr bid="B65">65</abbr>
					</abbrgrp> also inferred that some P-loop ATP-binding domains represent the most ancient proteins. Recently, through a phylogenomic analysis on protein architectures of modern metabolic networks, Caetano-Anoll&#233;s and co-workers <abbrgrp>
						<abbr bid="B66">66</abbr>
					</abbrgrp> indicated that enzymes with the P-loop hydrolase fold engaged in nucleotide (especially purine) metabolism may be the most primitive members of metabolic systems. Through examining the structures and functions of these members, we found that most (approximately 80%) of them need ATP to work normally. Therefore, the present speculations on the chronology of ligand-protein binding are self-consistent and are in line with the up-to-date knowledge on protein evolutionary history.</p>
				<p>To get a deeper insight into the evolutionary features of ligands, the building block usage of 128 bio-ligands was analyzed. As shown in Additional data file 5, nucleic acid bases are the most frequently used building blocks, followed by carbohydrates and amino acids, which is in accordance with Nobeli <it>et al</it>.'s <abbrgrp>
						<abbr bid="B67">67</abbr>
					</abbrgrp> finding that nucleic acid bases are the most common fragments of metabolites. More interestingly, many early bio-ligands (45.0%) contain nucleic acid bases; in particular, the nine earliest bio-ligands all contain one or more bases. In contrast, carbohydrates or amino acids are contained by only a small proportion of early bio-ligands (25.0% and 7.5%, respectively). This provides further evidence to support the notion that early ligands are vestiges of the RNA world <abbrgrp>
						<abbr bid="B56">56</abbr>
					</abbrgrp>.</p>
				<p>As mentioned above, the presently revealed chronology of early ligands' host proteins is roughly in line with the previously deduced evolutionary history of protein architectures <abbrgrp>
						<abbr bid="B37">37</abbr>
						<abbr bid="B53">53</abbr>
						<abbr bid="B54">54</abbr>
					</abbrgrp>. Thus, it is interesting to ask: is the accordance between both events fortuitous? Our answer is maybe not. Considering the prevalent ligand-induced protein folding <abbrgrp>
						<abbr bid="B68">68</abbr>
						<abbr bid="B69">69</abbr>
						<abbr bid="B70">70</abbr>
						<abbr bid="B71">71</abbr>
						<abbr bid="B72">72</abbr>
					</abbrgrp>, we conjecture that early ligands might have facilitated protein formation as catalysts (to assemble amino acids or peptide segments), as molecular chaperons (to help protein folding) and/or as selectors (because of the important functions of the early ligands), which naturally resulted in the accordance between both events. This conjecture implicates that the origin of primitive proteins benefited from ligand binding, which is reasonable in terms of the thermodynamics of ligand binding and protein folding.</p>
				<p>It has been found that some early ligands, such as ADP and GDP, can bind proteins related to the very old P-loop hydrolase fold (for example, preprotein translocase SecA (1M74), ADP-ribosylation factor-like protein 3 (1FZQ) and GTP-binding protein (1A4R)) with an affinity (free energy) of 10-15 kcal/mol <abbrgrp>
						<abbr bid="B73">73</abbr>
					</abbrgrp>, which is just in the range of the free energy loss (10-20 kcal/mol) during protein folding <abbrgrp>
						<abbr bid="B74">74</abbr>
						<abbr bid="B75">75</abbr>
					</abbrgrp>. Thus, the free energy release during ligand binding may meet the free energy demand during protein folding. It is tempting to examine the conjecture of ligand-induced formation and/or folding of primordial proteins through experimentation. To do that, <it>in vitro </it>selection may be an appropriate methodology <abbrgrp>
						<abbr bid="B76">76</abbr>
					</abbrgrp>. It is interesting to note that <it>in vitro </it>selection of proteins (consisting of 80 residues) targeted to bind ATP has been performed <abbrgrp>
						<abbr bid="B77">77</abbr>
					</abbrgrp>. The randomly generated proteins indeed belong to the &#945;/&#946; class, but are not related to P-loop hydrolases fold <abbrgrp>
						<abbr bid="B78">78</abbr>
					</abbrgrp>. However, considering the fact that the shortest protein sequence for the P-loop hydrolase fold contains 94 residues (according to the Protein Databank), we suggest that to explore whether the formation of the most ancient proteins was induced by ATP, one should adopt longer protein sequences in the <it>in vitro </it>selection experiments and use small amino acids as building blocks, because in the primordial world only these amino acids were available <abbrgrp>
						<abbr bid="B51">51</abbr>
						<abbr bid="B55">55</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Implications for understanding the new paradigms in drug discovery</p>
				</st>
				<p>Nowadays, the pharmaceutical industry is facing an unprecedented challenge. Global research funding has doubled since 1991, whereas the number of approved new drugs has fallen by 50% <abbrgrp>
						<abbr bid="B79">79</abbr>
						<abbr bid="B80">80</abbr>
					</abbrgrp>. To meet the more-investment-less-outcome challenge, some novel drug discovery strategies have appeared in recent years, which include finding new functions from old drugs, developing promiscuous drugs rather than selective agents and depending more on natural products than on combinatorial libraries of synthetic compounds to derive drug leads. Since the essence of drug action is the binding between drugs and target biomolecules (most of which are proteins), the ligand-protein binding features revealed in the present study have important implications for understanding these new drug discovery strategies.</p>
				<p>As indicated above, approximately 30% of ligands are bound by two or more domains (this number gets ~15%, if counted on fold level), which suggests that if a ligand can bind to a protein, it has great potential to bind to others. Considering the fact that the US Food and Drug Administration (FDA) has approved approximately 2,000 drugs (chemical entities) and there exist only 2,000-3,000 druggable genes and 600-1,500 drug targets <abbrgrp>
						<abbr bid="B81">81</abbr>
						<abbr bid="B82">82</abbr>
					</abbrgrp>, it is truly possible to find new functions from these old 'safe' drugs, which supports an increasingly shared notion in drug development that the most fruitful basis for the discovery of a new drug is to start with an old drug <abbrgrp>
						<abbr bid="B83">83</abbr>
						<abbr bid="B84">84</abbr>
						<abbr bid="B85">85</abbr>
					</abbrgrp>.</p>
				<p>Since most human diseases, such as cancer, diabetes, heart disease, arthritis and neurodegenerative diseases, involve multiple pathogenetic factors, the more-investment-less-outcome predicament is attributed in part to the limitations of the current one-drug-one-target paradigm in drug discovery <abbrgrp>
						<abbr bid="B79">79</abbr>
						<abbr bid="B86">86</abbr>
					</abbrgrp>. Therefore, more and more efforts are devoted to finding new therapeutics aimed at multiple targets <abbrgrp>
						<abbr bid="B86">86</abbr>
					</abbrgrp>, which is becoming a new paradigm in drug discovery. To hit the multiple targets implicated in complex diseases, two strategies are conceivable. One is called the multicomponent therapeutic strategy, which incorporates two or more active ingredients in one drug <abbrgrp>
						<abbr bid="B86">86</abbr>
						<abbr bid="B87">87</abbr>
						<abbr bid="B88">88</abbr>
						<abbr bid="B89">89</abbr>
					</abbrgrp>, as was applied in some traditional medicines (in China and many other countries) and in recently developed drug cocktails. The other is to hit the multiple targets with a single component, which is termed the one-ligand-multiple-targets strategy or promiscuous drug strategy <abbrgrp>
						<abbr bid="B89">89</abbr>
						<abbr bid="B90">90</abbr>
						<abbr bid="B91">91</abbr>
						<abbr bid="B92">92</abbr>
						<abbr bid="B93">93</abbr>
						<abbr bid="B94">94</abbr>
						<abbr bid="B95">95</abbr>
						<abbr bid="B96">96</abbr>
						<abbr bid="B97">97</abbr>
						<abbr bid="B98">98</abbr>
						<abbr bid="B99">99</abbr>
					</abbrgrp>. Compared with the former strategy, the latter might take advantage of lower risks of drug-drug interactions and more predictable pharmacokinetic behaviors <abbrgrp>
						<abbr bid="B91">91</abbr>
						<abbr bid="B92">92</abbr>
					</abbrgrp> and thus has been paid more and more attention. The feasibility of the one-ligand-multiple-targets strategy is supported by the present findings, because a certain proportion of ligands do indeed bind to two or more domains (even folds). In addition, the presently revealed structural features of super ligands are of significance for selecting and/or designing multipotent agents. Of course, the new strategy should be treated with wariness, because of the potential side effects of the promiscuous ligands.</p>
				<p>Another feature of the recent drug discovery paradigm shift is that more attention has been given to natural-product repositories rather than combinatorial libraries of synthetic compounds for finding novel drug leads <abbrgrp>
						<abbr bid="B100">100</abbr>
						<abbr bid="B101">101</abbr>
					</abbrgrp>. Due to their biosynthetic origin, natural products are natively bound to proteins (synthases). In light of the present findings, one can conclude that natural products have more potential than synthetic compounds to bind proteins, including those of human, which helps to understand the natural product-based drug discovery strategy. In addition, it can be inferred that it is rather easy to build a protein-ligand network on the basis of naturally occurring small-molecule ligands, which definitely benefits the birth of networked life and facilitates the formation of links within different species.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Data selection/collection</p>
				</st>
				<p>Until June 2006, 2,721 ligands had been recorded in sc-PDB. As our interest was focused on non-peptide ligands, 433 peptides were eliminated. After removing 102 repeated ligands (which have the same structures to others but were given different release names), 2,186 small-molecule ligands remained (Additional data file 1), which bind to 5,740 non-redundant domains (to remove the redundancy of domains, only one domain was chosen from each species). Domain is defined as an independently folded unit within a protein, often joined by a flexible segment of the polypeptide chain <abbrgrp>
						<abbr bid="B102">102</abbr>
					</abbrgrp>. For a small proportion of ligands that are shared by two domains, both domains were counted. According to SCOP 1.69 <abbrgrp>
						<abbr bid="B28">28</abbr>
						<abbr bid="B29">29</abbr>
					</abbrgrp>, these domains were classified into 591 folds. As one fold may cover multiple domains and hold more than one ligand, the fold occurrences amounted to 3,224.</p>
				<p>Since sc-PDB is a subset of the PDB, one may be concerned about the robustness of the conclusions derived when using it. However, considering the facts that the present inferences were made mainly on the level of protein fold and that folds are much more conserved than domains, and thus fold increase is much slower than that of domains in the PDB <abbrgrp>
						<abbr bid="B103">103</abbr>
					</abbrgrp>, it is believed that the present conclusions are solid. In fact, even if the latest data of the sc-PDB (containing 396 new ligands and 827 new domains, which were kindly provided by Dr Rognan and have not been uploaded on the website) are considered, all of the present conclusions still hold.</p>
			</sec>
			<sec>
				<st>
					<p>Descriptor calculation</p>
				</st>
				<p>Seventy descriptors characterizing the structural features of 2,186 ligands selected from the sc-PDB and 2,184 small molecules randomly selected from ACD-SC were calculated by Sybyl (13 descriptors) <abbrgrp>
						<abbr bid="B46">46</abbr>
					</abbrgrp>, Cerius 2 (49 descriptors) <abbrgrp>
						<abbr bid="B47">47</abbr>
					</abbrgrp> and an in-house program written in Perl (8 descriptors). Then, the calculated data were linked together with Perl for further analysis. Because of the lack of atomic parameters for ten ligands (that is, 2,3,4,5,6-pentafluorobenzyl alcohol, 2-amino-4-oxo-4,7-dihydro-3h-pyrrolo [2,3-d] pyrimidine-5-carbonitrile, 3,5,3',5'-tetraiodo-l-thyronine, 6,7-dinitroquinoxaline-2,3-dione, 9-hydroxy aristolochic acid, 3,5,7-trihydroxy-2-(4-hydroxyphenyl)-4h-chromen-4-one, 5-hydroxy-2-(4-hydroxyphenyl)-1-benzofuran-7-carbonitrile, 3,3',5,5'-tetraiodothyroacetic acid, 3,5,7,3',4'-pentahydroxyflavone and radicicol), some descriptors could not be calculated for these molecules. Hence, only 2,176 ligands went through the calculation. However, as each of the ten ligands covers only one fold, their absence has no impact on the conclusion of the present study.</p>
			</sec>
			<sec>
				<st>
					<p>Factor analysis</p>
				</st>
				<p>SPSS 13.0 (SPSS Inc., Chicago, IL, USA) was employed to do the factor analysis. The factors were extracted by means of principal component analysis <abbrgrp>
						<abbr bid="B48">48</abbr>
						<abbr bid="B49">49</abbr>
					</abbrgrp> and the parameter settings were as follows: a correlation matrix was used; and two factors were extracted to visualize the two-dimensional chemical space of ligands and ordinary molecules. In order to simplify the interpretation of the extracted factors, factor rotation was performed, during which the most popular orthogonal rotation method, Varimax, developed by Kaiser <abbrgrp>
						<abbr bid="B50">50</abbr>
					</abbrgrp>, was employed. For other variables, default parameters were adopted.</p>
			</sec>
			<sec>
				<st>
					<p>Age assignment for bio-ligands</p>
				</st>
				<p>An early bio-ligand is defined as that owned by both prokaryotic (<it>E. coli</it>) and eukaryotic (yeast or higher) species, while a late bio-ligand is defined as that owned only by eukaryotic (yeast or higher) species. As there is no direct information on ligand ownership, we used the information of their host proteins to deduce their ages. That is, a ligand is early, provided that at least one of its host proteins is owned by both <it>E. coli </it>and yeast (or higher species); and a ligand is late if none of its host proteins is owned by <it>E. coli </it>but at least one is owned by yeast or higher species. During the age-assigning process, not only the host proteins recorded in sc-PDB were checked, but also the corresponding homologous proteins retrieved from Swiss-Prot <abbrgrp>
						<abbr bid="B104">104</abbr>
					</abbrgrp> were considered.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Abbreviations</p>
			</st>
			<p>ACD-SC, Available Chemicals Directory-Screening Compounds; RotBond, rotatable bond; sc-PDB, Annotated Database of Druggable Binding Sites from the PDB.</p>
		</sec>
		<sec>
			<st>
				<p>Authors' contributions</p>
			</st>
			<p>H.-Y.Z. designed the study. H.-F.J., D.-X.K. and L.S. collected the data and performed the calculation. All authors analyzed the data. H.-Y.Z., H.-F.J. and L.-L.C. wrote the paper.</p>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> lists ligands and the numbers of domains and folds that bind them. Additional data file <supplr sid="S2">2</supplr> illustrates the power-law behaviors of metabolism-relevant ligands. Additional data file <supplr sid="S3">3</supplr> provides building blocks and ownerships of metabolism-relevant ligands. Additional data file <supplr sid="S4">4</supplr> illustrates the power-law behaviors of folds for proteins binding ATP, ADP and NAD. Additional data file <supplr sid="S5">5</supplr> illustrates the building block usage of bio-ligands.</p>
			<suppl id="S1">
				<title>
					<p>Additional data file 1</p>
				</title>
				<caption>
					<p>Ligands and the numbers of domains and folds that bind them</p>
				</caption>
				<text>
					<p>Ligands and the numbers of domains and folds that bind them.</p>
				</text>
				<file name="gb-2007-8-8-r176-S1.doc">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S2">
				<title>
					<p>Additional data file 2</p>
				</title>
				<caption>
					<p>Power-law behaviors of metabolism-relevant ligands</p>
				</caption>
				<text>
					<p>Power-law behaviors of metabolism-relevant ligands.</p>
				</text>
				<file name="gb-2007-8-8-r176-S2.doc">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S3">
				<title>
					<p>Additional data file 3</p>
				</title>
				<caption>
					<p>Building blocks and ownerships of metabolism-relevant ligands</p>
				</caption>
				<text>
					<p>Building blocks and ownerships of metabolism-relevant ligands.</p>
				</text>
				<file name="gb-2007-8-8-r176-S3.doc">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S4">
				<title>
					<p>Additional data file 4</p>
				</title>
				<caption>
					<p>Power-law behaviors of folds for proteins binding ATP, ADP and NAD</p>
				</caption>
				<text>
					<p>Power-law behaviors of folds for proteins binding ATP, ADP and NAD.</p>
				</text>
				<file name="gb-2007-8-8-r176-S4.doc">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S5">
				<title>
					<p>Additional data file 5</p>
				</title>
				<caption>
					<p>Building block usage of bio-ligands</p>
				</caption>
				<text>
					<p>Building block usage of bio-ligands.</p>
				</text>
				<file name="gb-2007-8-8-r176-S5.doc">
					<p>Click here for file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
   <bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank Prof. Antonio Lazcano and Dr Xin-Min Li for fruitful discussions and Dr Didier Rognan for generously providing the sdf file of sc-PDB and the latest data. This work was partially supported by National Basic Research Program of China (2003CB114400) and National Natural Science Foundation of China (30570383 and 30600119).</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Network biology: understanding the cell's functional organization.</p>
				</title>
				<aug>
					<au>
						<snm>Barab&#225;si</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Oltvai</snm>
						<fnm>ZN</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>101</fpage>
				<lpage>113</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1272</pubid>
						<pubid idtype="pmpid" link="fulltext">14735121</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Social insect networks.</p>
				</title>
				<aug>
					<au>
						<snm>Fewell</snm>
						<fnm>JH</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>301</volume>
				<fpage>1867</fpage>
				<lpage>1870</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1088945</pubid>
						<pubid idtype="pmpid" link="fulltext">14512616</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Allelochemics: chemical interactions between species.</p>
				</title>
				<aug>
					<au>
						<snm>Whittaker</snm>
						<fnm>RH</fnm>
					</au>
					<au>
						<snm>Feeny</snm>
						<fnm>PP</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1971</pubdate>
				<volume>171</volume>
				<fpage>757</fpage>
				<lpage>770</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.171.3973.757</pubid>
						<pubid idtype="pmpid" link="fulltext">5541160</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Natural products and plant disease resistance.</p>
				</title>
				<aug>
					<au>
						<snm>Dixon</snm>
						<fnm>RA</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2001</pubdate>
				<volume>411</volume>
				<fpage>843</fpage>
				<lpage>847</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35081178</pubid>
						<pubid idtype="pmpid" link="fulltext">11459067</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Bacterial small-molecule signaling pathways.</p>
				</title>
				<aug>
					<au>
						<snm>Camilli</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bassler</snm>
						<fnm>BL</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2006</pubdate>
				<volume>311</volume>
				<fpage>1113</fpage>
				<lpage>1116</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1121357</pubid>
						<pubid idtype="pmpid" link="fulltext">16497924</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Volatile signaling in plant-plant interactions: 'talking trees' in the genomics era.</p>
				</title>
				<aug>
					<au>
						<snm>Baldwin</snm>
						<fnm>IT</fnm>
					</au>
					<au>
						<snm>Halitschke</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Paschold</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>von Dahl</snm>
						<fnm>CC</fnm>
					</au>
					<au>
						<snm>Preston</snm>
						<fnm>CA</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2006</pubdate>
				<volume>311</volume>
				<fpage>812</fpage>
				<lpage>815</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1118446</pubid>
						<pubid idtype="pmpid" link="fulltext">16469918</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Communication in bacteria: an ecological and evolutionary perspective.</p>
				</title>
				<aug>
					<au>
						<snm>Keller</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Surette</snm>
						<fnm>MG</fnm>
					</au>
				</aug>
				<source>Nat Rev Microbiol</source>
				<pubdate>2006</pubdate>
				<volume>4</volume>
				<fpage>249</fpage>
				<lpage>258</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrmicro1383</pubid>
						<pubid idtype="pmpid" link="fulltext">16501584</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Bacterially speaking.</p>
				</title>
				<aug>
					<au>
						<snm>Bassler</snm>
						<fnm>BL</fnm>
					</au>
					<au>
						<snm>Losick</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2006</pubdate>
				<volume>125</volume>
				<fpage>237</fpage>
				<lpage>246</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.cell.2006.04.001</pubid>
						<pubid idtype="pmpid" link="fulltext">16630813</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Rheostat control of gene expression by metabolites.</p>
				</title>
				<aug>
					<au>
						<snm>Ladurner</snm>
						<fnm>AG</fnm>
					</au>
				</aug>
				<source>Mol Cell</source>
				<pubdate>2006</pubdate>
				<volume>24</volume>
				<fpage>1</fpage>
				<lpage>11</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.molcel.2006.09.002</pubid>
						<pubid idtype="pmpid" link="fulltext">17018288</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Classification of proteins based on the properties of the ligand-binding site: The case of adenine-binding proteins</p>
				</title>
				<aug>
					<au>
						<snm>Cappello</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Tramontano</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Koch</snm>
						<fnm>U</fnm>
					</au>
				</aug>
				<source>Proteins</source>
				<pubdate>2002</pubdate>
				<volume>47</volume>
				<fpage>106</fpage>
				<lpage>115</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/prot.10070</pubid>
						<pubid idtype="pmpid" link="fulltext">11933058</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>When fold is not important: A common structural framework for adenine and AMP binding in 12 unrelated protein families.</p>
				</title>
				<aug>
					<au>
						<snm>Denessiouk</snm>
						<fnm>KA</fnm>
					</au>
					<au>
						<snm>Johnson</snm>
						<fnm>MS</fnm>
					</au>
				</aug>
				<source>Proteins</source>
				<pubdate>2000</pubdate>
				<volume>38</volume>
				<fpage>310</fpage>
				<lpage>326</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/(SICI)1097-0134(20000215)38:3&lt;310::AID-PROT7>3.0.CO;2-T</pubid>
						<pubid idtype="pmpid" link="fulltext">10713991</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Emergence of diverse biochemical activities in evolutionarily conserved structural scaffolds of proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Anantharaman</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Aravind</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Koonin</snm>
						<fnm>EV</fnm>
					</au>
				</aug>
				<source>Curr Opin Chem Biol</source>
				<pubdate>2003</pubdate>
				<volume>7</volume>
				<fpage>12</fpage>
				<lpage>20</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1367-5931(02)00018-2</pubid>
						<pubid idtype="pmpid" link="fulltext">12547421</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Supersites within superfolds. Binding site similarity in the absence of homology.</p>
				</title>
				<aug>
					<au>
						<snm>Russell</snm>
						<fnm>RB</fnm>
					</au>
					<au>
						<snm>Sasieni</snm>
						<fnm>PD</fnm>
					</au>
					<au>
						<snm>Sternberg</snm>
						<fnm>MJE</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1998</pubdate>
				<volume>282</volume>
				<fpage>903</fpage>
				<lpage>918</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1998.2043</pubid>
						<pubid idtype="pmpid" link="fulltext">9743635</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>The large-scale organization of metabolic networks.</p>
				</title>
				<aug>
					<au>
						<snm>Jeong</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Tombor</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Albert</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Oltvai</snm>
						<fnm>ZN</fnm>
					</au>
					<au>
						<snm>Barab&#225;si</snm>
						<fnm>AL</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2000</pubdate>
				<volume>407</volume>
				<fpage>651</fpage>
				<lpage>654</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35036627</pubid>
						<pubid idtype="pmpid" link="fulltext">11034217</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>The small world inside large metabolic networks.</p>
				</title>
				<aug>
					<au>
						<snm>Wagner</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Fell</snm>
						<fnm>DA</fnm>
					</au>
				</aug>
				<source>Proc R Soc Lond Ser B</source>
				<pubdate>2001</pubdate>
				<volume>268</volume>
				<fpage>1803</fpage>
				<lpage>1810</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1098/rspb.2001.1711</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Reconstruction of metabolic networks from genome data and analysis of their global structure for various organisms.</p>
				</title>
				<aug>
					<au>
						<snm>Ma</snm>
						<fnm>HW</fnm>
					</au>
					<au>
						<snm>Zeng</snm>
						<fnm>AP</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<fpage>270</fpage>
				<lpage>277</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/19.2.270</pubid>
						<pubid idtype="pmpid" link="fulltext">12538249</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>The metabolic world of <it>Escherichia coli </it>is not small.</p>
				</title>
				<aug>
					<au>
						<snm>Arita</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2004</pubdate>
				<volume>101</volume>
				<fpage>1543</fpage>
				<lpage>1547</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">341771</pubid>
						<pubid idtype="pmpid" link="fulltext">14757824</pubid>
						<pubid idtype="doi">10.1073/pnas.0306458101</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>SuperLigands-a database of ligand structures derived from the Protein Data Bank.</p>
				</title>
				<aug>
					<au>
						<snm>Michalsky</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Dunkel</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Goede</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Preissner</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>122</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1173082</pubid>
						<pubid idtype="pmpid" link="fulltext">15943884</pubid>
						<pubid idtype="doi">10.1186/1471-2105-6-122</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Databases for protein-ligand complexes.</p>
				</title>
				<aug>
					<au>
						<snm>Hendlich</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Acta Crystallogr</source>
				<pubdate>1998</pubdate>
				<volume>D54</volume>
				<fpage>1178</fpage>
				<lpage>1182</lpage>
			</bibl>
			<bibl id="B20">
				<title>
					<p>LIGAND: chemical database for enzyme reactions.</p>
				</title>
				<aug>
					<au>
						<snm>Goto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Nishioka</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Kanehisa</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>1998</pubdate>
				<volume>14</volume>
				<fpage>591</fpage>
				<lpage>599</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/14.7.591</pubid>
						<pubid idtype="pmpid" link="fulltext">9730924</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>PDB-Ligand: a ligand database based on PDB for the automated and customized classification of ligand-binding structures.</p>
				</title>
				<aug>
					<au>
						<snm>Shin</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Cho</snm>
						<fnm>DH</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<volume>33</volume>
				<fpage>D238</fpage>
				<lpage>D241</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">540013</pubid>
						<pubid idtype="pmpid" link="fulltext">15608186</pubid>
						<pubid idtype="doi">10.1093/nar/gki059</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>A searchable database for comparing protein-ligand binding sites for the analysis of structure-function relationships.</p>
				</title>
				<aug>
					<au>
						<snm>Gold</snm>
						<fnm>ND</fnm>
					</au>
					<au>
						<snm>Jackson</snm>
						<fnm>RM</fnm>
					</au>
				</aug>
				<source>J Chem Inf Model</source>
				<pubdate>2006</pubdate>
				<volume>46</volume>
				<fpage>736</fpage>
				<lpage>742</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1021/ci050359c</pubid>
						<pubid idtype="pmpid" link="fulltext">16563004</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>A complete small molecule dataset from the protein data bank.</p>
				</title>
				<aug>
					<au>
						<snm>Feldmana</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>Snydera</snm>
						<fnm>KA</fnm>
					</au>
					<au>
						<snm>Ticolla</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Pintiliea</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Hogue</snm>
						<fnm>CWV</fnm>
					</au>
				</aug>
				<source>FEBS Lett</source>
				<pubdate>2006</pubdate>
				<volume>580</volume>
				<fpage>1649</fpage>
				<lpage>1653</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.febslet.2006.02.003</pubid>
						<pubid idtype="pmpid" link="fulltext">16494871</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>SitesBase: a database for structure-based protein-ligand binding site comparisons.</p>
				</title>
				<aug>
					<au>
						<snm>Gold</snm>
						<fnm>ND</fnm>
					</au>
					<au>
						<snm>Jackson</snm>
						<fnm>RM</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2006</pubdate>
				<volume>34</volume>
				<fpage>D231</fpage>
				<lpage>D234</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1347425</pubid>
						<pubid idtype="pmpid" link="fulltext">16381853</pubid>
						<pubid idtype="doi">10.1093/nar/gkj062</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>AffinDB: a freely accessible database of affinities for protein-ligand complexes from the PDB.</p>
				</title>
				<aug>
					<au>
						<snm>Block</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Sotriffer</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Dramburg</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Klebe</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2006</pubdate>
				<volume>34</volume>
				<fpage>D522</fpage>
				<lpage>D526</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1347402</pubid>
						<pubid idtype="pmpid" link="fulltext">16381925</pubid>
						<pubid idtype="doi">10.1093/nar/gkj039</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>MSDsite: A database search and retrieval system for the analysis and viewing of bound ligands and active sites.</p>
				</title>
				<aug>
					<au>
						<snm>Golovin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Dimitropoulos</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Oldfeld</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Rachedi</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Henrick</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Proteins</source>
				<pubdate>2005</pubdate>
				<volume>58</volume>
				<fpage>190</fpage>
				<lpage>199</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/prot.20288</pubid>
						<pubid idtype="pmpid" link="fulltext">15468317</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>sc-PDB: an annotated database of druggable binding sites from the Protein Data Bank.</p>
				</title>
				<aug>
					<au>
						<snm>Kellenberger</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Muller</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Schalon</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Bret</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Foata</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Rognan</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>J Chem Inf Model</source>
				<pubdate>2006</pubdate>
				<volume>46</volume>
				<fpage>717</fpage>
				<lpage>727</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1021/ci050372x</pubid>
						<pubid idtype="pmpid" link="fulltext">16563002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>SCOP: a structural classification of proteins database for the investigation of sequences and structures.</p>
				</title>
				<aug>
					<au>
						<snm>Murzin</snm>
						<fnm>AG</fnm>
					</au>
					<au>
						<snm>Brenner</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Hubbard</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Chothia</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1995</pubdate>
				<volume>247</volume>
				<fpage>536</fpage>
				<lpage>540</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1995.0159</pubid>
						<pubid idtype="pmpid" link="fulltext">7723011</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>SCOP database in 2004: refinements integrate structure and sequence family data.</p>
				</title>
				<aug>
					<au>
						<snm>Andreeva</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Howorth</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Brenner</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Hubbard</snm>
						<fnm>TJP</fnm>
					</au>
					<au>
						<snm>Chothia</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Murzin</snm>
						<fnm>AG</fnm>
					</au>
				</aug>
				<source>Nucleic Acid Res</source>
				<pubdate>2004</pubdate>
				<volume>32</volume>
				<fpage>D226</fpage>
				<lpage>D229</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308773</pubid>
						<pubid idtype="pmpid" link="fulltext">14681400</pubid>
						<pubid idtype="doi">10.1093/nar/gkh039</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Emergence of scaling in random networks.</p>
				</title>
				<aug>
					<au>
						<snm>Barab&#225;si</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Albert</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1999</pubdate>
				<volume>286</volume>
				<fpage>509</fpage>
				<lpage>512</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.286.5439.509</pubid>
						<pubid idtype="pmpid" link="fulltext">10521342</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Preferential attachment in the protein network evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Eisenberg</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Levanon</snm>
						<fnm>EY</fnm>
					</au>
				</aug>
				<source>Phys Rev Lett</source>
				<pubdate>2003</pubdate>
				<volume>91</volume>
				<fpage>138701</fpage>
				<lpage>138704</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1103/PhysRevLett.91.138701</pubid>
						<pubid idtype="pmpid" link="fulltext">14525344</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>What properties characterize the hub proteins of the protein-protein interaction network of <it>Saccharomyces cerevisiae</it>?</p>
				</title>
				<aug>
					<au>
						<snm>Ekman</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Light</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Bj&#246;rklund</snm>
						<fnm>&#197;K</fnm>
					</au>
					<au>
						<snm>Elofsson</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<fpage>R45</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1779539</pubid>
						<pubid idtype="pmpid" link="fulltext">16780599</pubid>
						<pubid idtype="doi">10.1186/gb-2006-7-6-r45</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Protein function, connectivity, and duplicability in yeast.</p>
				</title>
				<aug>
					<au>
						<snm>Prachumwat</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>WH</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2006</pubdate>
				<volume>23</volume>
				<fpage>30</fpage>
				<lpage>39</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/molbev/msi249</pubid>
						<pubid idtype="pmpid" link="fulltext">16120800</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>MetaCyc: a multiorganism database of metabolic pathways and enzymes.</p>
				</title>
				<aug>
					<au>
						<snm>Caspi</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Foerster</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Fulcher</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Hopkinson</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ingraham</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Kaipa</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Krummenacker</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Paley</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Pick</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Rhee</snm>
						<fnm>SY</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2006</pubdate>
				<volume>34</volume>
				<fpage>D511</fpage>
				<lpage>D514</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1347490</pubid>
						<pubid idtype="pmpid" link="fulltext">16381923</pubid>
						<pubid idtype="doi">10.1093/nar/gkj128</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Protein family and fold occurrence in genomes: power-law behaviour and evolutionary model.</p>
				</title>
				<aug>
					<au>
						<snm>Qian</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Luscombe</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Gerstein</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>2001</pubdate>
				<volume>313</volume>
				<fpage>673</fpage>
				<lpage>681</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.2001.5079</pubid>
						<pubid idtype="pmpid" link="fulltext">11697896</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>The structure of the protein universe and genome evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Koonin</snm>
						<fnm>EV</fnm>
					</au>
					<au>
						<snm>Wolf</snm>
						<fnm>YI</fnm>
					</au>
					<au>
						<snm>Karev</snm>
						<fnm>GP</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>420</volume>
				<fpage>218</fpage>
				<lpage>223</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01256</pubid>
						<pubid idtype="pmpid" link="fulltext">12432406</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>An evolutionarily structured universe of protein architecture.</p>
				</title>
				<aug>
					<au>
						<snm>Caetano-Anoll&#233;s</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Caetano-Anoll&#233;s</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>1563</fpage>
				<lpage>1571</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">403752</pubid>
						<pubid idtype="pmpid" link="fulltext">12840035</pubid>
						<pubid idtype="doi">10.1101/gr.1161903</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Identification of protein fold topology shared between different folds inhibited by natural products.</p>
				</title>
				<aug>
					<au>
						<snm>McArdle</snm>
						<fnm>BM</fnm>
					</au>
					<au>
						<snm>Quinn</snm>
						<fnm>RJ</fnm>
					</au>
				</aug>
				<source>ChemBioChem</source>
				<pubdate>2007</pubdate>
				<volume>8</volume>
				<fpage>788</fpage>
				<lpage>798</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/cbic.200700035</pubid>
						<pubid idtype="pmpid" link="fulltext">17429823</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Plasticity of enzyme active sites.</p>
				</title>
				<aug>
					<au>
						<snm>Todd</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Orengo</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Thornton</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>Trends Biochem Sci</source>
				<pubdate>2002</pubdate>
				<volume>27</volume>
				<fpage>419</fpage>
				<lpage>426</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0968-0004(02)02158-8</pubid>
						<pubid idtype="pmpid" link="fulltext">12151227</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Ligand selectivity and competition between enzymes <it>in silico</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Macchiarulo</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Nobeli</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Thornton</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2004</pubdate>
				<volume>22</volume>
				<fpage>1039</fpage>
				<lpage>1045</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt999</pubid>
						<pubid idtype="pmpid" link="fulltext">15286657</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>Conformational diversity of ligands bound to proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Stockwell</snm>
						<fnm>GR</fnm>
					</au>
					<au>
						<snm>Thornton</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>2006</pubdate>
				<volume>356</volume>
				<fpage>928</fpage>
				<lpage>944</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.jmb.2005.12.012</pubid>
						<pubid idtype="pmpid" link="fulltext">16405908</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Molecular recognition in the post-reductionist era.</p>
				</title>
				<aug>
					<au>
						<snm>Van Regenmortel</snm>
						<fnm>MHV</fnm>
					</au>
				</aug>
				<source>J Mol Recognit</source>
				<pubdate>1999</pubdate>
				<volume>12</volume>
				<fpage>1</fpage>
				<lpage>2</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/(SICI)1099-1352(199901/02)12:1&lt;1::AID-JMR449>3.0.CO;2-P</pubid>
						<pubid idtype="pmpid" link="fulltext">10398391</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Folding funnels and binding mechanisms.</p>
				</title>
				<aug>
					<au>
						<snm>Ma</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Kumar</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Tsai</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Nussinov</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Protein Eng</source>
				<pubdate>1999</pubdate>
				<volume>12</volume>
				<fpage>713</fpage>
				<lpage>720</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/protein/12.9.713</pubid>
						<pubid idtype="pmpid" link="fulltext">10506280</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Multiple diverse ligands binding at a single protein site: a matter of pre-existing populations.</p>
				</title>
				<aug>
					<au>
						<snm>Ma</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Shatsky</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Wolfson</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>Nussinov</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Protein Sci</source>
				<pubdate>2002</pubdate>
				<volume>11</volume>
				<fpage>184</fpage>
				<lpage>197</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1110/ps.21302</pubid>
						<pubid idtype="pmpid" link="fulltext">11790828</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Available Chemicals Directory-Screening Compounds</p>
				</title>
				<url>http://www.akosgmbh.eu/acd-sc.htm</url>
			</bibl>
			<bibl id="B46">
				<title>
					<p>SYBYL 7.0</p>
				</title>
				<url>http://www.tripos.com/index.php?family=modules,SimplePage,,,&amp;page=comp_informatics</url>
			</bibl>
			<bibl id="B47">
				<title>
					<p>Cerius2</p>
				</title>
				<url>http://www.accelrys.com/products/cerius2/</url>
			</bibl>
			<bibl id="B48">
				<aug>
					<au>
						<snm>Kim</snm>
						<fnm>JO</fnm>
					</au>
					<au>
						<snm>Mueller</snm>
						<fnm>CW</fnm>
					</au>
				</aug>
				<source>Factor Analysis: Statistical Methods and Practical Issues</source>
				<publisher>Thousand Oaks, CA: Sage Publications</publisher>
				<pubdate>1978</pubdate>
			</bibl>
			<bibl id="B49">
				<aug>
					<au>
						<snm>Reyment</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Joreskog</snm>
						<fnm>KG</fnm>
					</au>
				</aug>
				<source>Applied Factor Analysis in the Natural Sciences</source>
				<publisher>Cambridge: Cambridge University Press</publisher>
				<pubdate>1993</pubdate>
			</bibl>
			<bibl id="B50">
				<title>
					<p>The varimax criterion for analytic rotation in factor analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Kaiser</snm>
						<fnm>HF</fnm>
					</au>
				</aug>
				<source>Psychometrika</source>
				<pubdate>1958</pubdate>
				<volume>23</volume>
				<fpage>187</fpage>
				<lpage>200</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1007/BF02289233</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p>
						<it>Primordia vita </it>deconvolution from modern sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Trifonov</snm>
						<fnm>EN</fnm>
					</au>
					<au>
						<snm>Gabdank</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Barash</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Sobolevsky</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>Orig Life Evol Biosph</source>
				<pubdate>2006</pubdate>
				<volume>36</volume>
				<fpage>559</fpage>
				<lpage>565</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s11084-006-9042-5</pubid>
						<pubid idtype="pmpid" link="fulltext">17120122</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Coevolutionary theory of the genetic code at age thirty.</p>
				</title>
				<aug>
					<au>
						<snm>Wong</snm>
						<fnm>JT-F</fnm>
					</au>
				</aug>
				<source>BioEssays</source>
				<pubdate>2005</pubdate>
				<volume>27</volume>
				<fpage>416</fpage>
				<lpage>425</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/bies.20208</pubid>
						<pubid idtype="pmpid" link="fulltext">15770677</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B53">
				<title>
					<p>Universal sharing patterns in proteomes and evolution of protein fold architecture and life.</p>
				</title>
				<aug>
					<au>
						<snm>Caetano-Anoll&#233;s</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Caetano-Anoll&#233;s</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>2005</pubdate>
				<volume>60</volume>
				<fpage>484</fpage>
				<lpage>498</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s00239-004-0221-6</pubid>
						<pubid idtype="pmpid" link="fulltext">15883883</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Phylogenomic reconstruction of the protein world based on a genomic census of protein fold architecture.</p>
				</title>
				<aug>
					<au>
						<snm>Wang</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Boca</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Kalelkar</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Mittenthal</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Caetano-Anoll&#233;s</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Complexity</source>
				<pubdate>2006</pubdate>
				<volume>12</volume>
				<fpage>27</fpage>
				<lpage>40</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1002/cplx.20141</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>Exploring the evolution of standard amino-acid alphabet: when genomics meets thermodynamics.</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>H-Y</fnm>
					</au>
				</aug>
				<source>Biochem Biophys Res Commun</source>
				<pubdate>2007</pubdate>
				<volume>359</volume>
				<fpage>403</fpage>
				<lpage>405</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.bbrc.2007.05.115</pubid>
						<pubid idtype="pmpid" link="fulltext">17543275</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>Coenzymes as fossils of an earlier metabolic state.</p>
				</title>
				<aug>
					<au>
						<snm>White</snm>
						<fnm>HB</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>1976</pubdate>
				<volume>7</volume>
				<fpage>101</fpage>
				<lpage>104</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/BF01732468</pubid>
						<pubid idtype="pmpid">1263263</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B57">
				<title>
					<p>Prebiotic syntheses of vitamin coenzymes: I. Cysteamine and 2-mercaptoethanesulfonic acid (coenzyme M).</p>
				</title>
				<aug>
					<au>
						<snm>Miller</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Schlesinger</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>1993</pubdate>
				<volume>36</volume>
				<fpage>302</fpage>
				<lpage>307</lpage>
				<xrefbib>
					<pubid idtype="pmpid">11536534</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B58">
				<title>
					<p>Prebiotic syntheses of vitamin coenzymes: II. Pantoic acid, pantothenic acid, and the composition of coenzyme A.</p>
				</title>
				<aug>
					<au>
						<snm>Miller</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Schlesinger</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>1993</pubdate>
				<volume>36</volume>
				<fpage>308</fpage>
				<lpage>314</lpage>
				<xrefbib>
					<pubid idtype="pmpid">11536535</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B59">
				<title>
					<p>RNA-catalyzed CoA, NAD, and FAD synthesis from phosphopantetheine, NMN, and FMN.</p>
				</title>
				<aug>
					<au>
						<snm>Huang</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Bugg</snm>
						<fnm>CW</fnm>
					</au>
					<au>
						<snm>Yarus</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Biochemistry</source>
				<pubdate>2000</pubdate>
				<volume>39</volume>
				<fpage>15548</fpage>
				<lpage>15555</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1021/bi002061f</pubid>
						<pubid idtype="pmpid" link="fulltext">11112541</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B60">
				<title>
					<p>How old is your fold?</p>
				</title>
				<aug>
					<au>
						<snm>Winstanley</snm>
						<fnm>HF</fnm>
					</au>
					<au>
						<snm>Abeln</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Deane</snm>
						<fnm>CM</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>Suppl 1</issue>
				<fpage>i449</fpage>
				<lpage>i458</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti1008</pubid>
						<pubid idtype="pmpid" link="fulltext">15961490</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B61">
				<title>
					<p>Fold usage on genomes and protein fold evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Abeln</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Deane</snm>
						<fnm>CM</fnm>
					</au>
				</aug>
				<source>Proteins</source>
				<pubdate>2005</pubdate>
				<volume>60</volume>
				<fpage>690</fpage>
				<lpage>700</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/prot.20506</pubid>
						<pubid idtype="pmpid" link="fulltext">16001400</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B62">
				<title>
					<p>Protein architecture chronology deduced from structures of amino acid synthases.</p>
				</title>
				<aug>
					<au>
						<snm>Ji</snm>
						<fnm>H-F</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>H-Y</fnm>
					</au>
				</aug>
				<source>J Biomol Struct Dyn</source>
				<pubdate>2007</pubdate>
				<volume>24</volume>
				<fpage>321</fpage>
				<lpage>323</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">17206848</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B63">
				<title>
					<p>Protein sequences yield a proteomic code.</p>
				</title>
				<aug>
					<au>
						<snm>Berezovsky</snm>
						<fnm>IN</fnm>
					</au>
					<au>
						<snm>Kirzhner</snm>
						<fnm>VM</fnm>
					</au>
					<au>
						<snm>Kirzhner</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Rosenfeld</snm>
						<fnm>VR</fnm>
					</au>
					<au>
						<snm>Trifonov</snm>
						<fnm>EN</fnm>
					</au>
				</aug>
				<source>J Biomol Struct Dyn</source>
				<pubdate>2003</pubdate>
				<volume>21</volume>
				<fpage>317</fpage>
				<lpage>325</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">14616028</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B64">
				<title>
					<p>Spelling protein structure.</p>
				</title>
				<aug>
					<au>
						<snm>Berezovsky</snm>
						<fnm>IN</fnm>
					</au>
					<au>
						<snm>Kirzhner</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Kirzhner</snm>
						<fnm>VM</fnm>
					</au>
					<au>
						<snm>Trifonov</snm>
						<fnm>EN</fnm>
					</au>
				</aug>
				<source>J Biomol Struct Dyn</source>
				<pubdate>2003</pubdate>
				<volume>21</volume>
				<fpage>327</fpage>
				<lpage>339</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">14616029</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B65">
				<title>
					<p>Early molecular evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Trifonov</snm>
						<fnm>EN</fnm>
					</au>
				</aug>
				<source>Israel J Ecol Evol</source>
				<pubdate>2006</pubdate>
				<fpage>375</fpage>
				<lpage>387</lpage>
			</bibl>
			<bibl id="B66">
				<title>
					<p>The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture.</p>
				</title>
				<aug>
					<au>
						<snm>Caetano-Anoll&#233;s</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Mittenthal</snm>
						<fnm>JE</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2007</pubdate>
				<volume>104</volume>
				<fpage>9358</fpage>
				<lpage>9363</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1073/pnas.0701214104</pubid>
						<pubid idtype="pmpid" link="fulltext">17517598</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B67">
				<title>
					<p>A structure-based anatomy of the <it>E. coli </it>metabolome.</p>
				</title>
				<aug>
					<au>
						<snm>Nobeli</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Ponstingl</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Krissinel</snm>
						<fnm>EB</fnm>
					</au>
					<au>
						<snm>Thornton</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>2003</pubdate>
				<volume>334</volume>
				<fpage>697</fpage>
				<lpage>719</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.jmb.2003.10.008</pubid>
						<pubid idtype="pmpid" link="fulltext">14636597</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B68">
				<title>
					<p>Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm.</p>
				</title>
				<aug>
					<au>
						<snm>Wright</snm>
						<fnm>PE</fnm>
					</au>
					<au>
						<snm>Dyson</snm>
						<fnm>HJ</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1999</pubdate>
				<volume>293</volume>
				<fpage>321</fpage>
				<lpage>331</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1999.3110</pubid>
						<pubid idtype="pmpid" link="fulltext">10550212</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B69">
				<title>
					<p>Coupling of folding and binding for unstructured proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Dyson</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>Wright</snm>
						<fnm>PE</fnm>
					</au>
				</aug>
				<source>Curr Opin Struct Biol</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>54</fpage>
				<lpage>60</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0959-440X(02)00289-0</pubid>
						<pubid idtype="pmpid" link="fulltext">11839490</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B70">
				<title>
					<p>Natively unfolded proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Fink</snm>
						<fnm>AL</fnm>
					</au>
				</aug>
				<source>Curr Opin Struct Biol</source>
				<pubdate>2005</pubdate>
				<volume>15</volume>
				<fpage>35</fpage>
				<lpage>41</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.sbi.2005.01.002</pubid>
						<pubid idtype="pmpid" link="fulltext">15718131</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B71">
				<title>
					<p>Intrinsically unstructured proteins and their functions.</p>
				</title>
				<aug>
					<au>
						<snm>Dyson</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>Wright</snm>
						<fnm>PE</fnm>
					</au>
				</aug>
				<source>Nat Rev Mol Cell Biol</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>197</fpage>
				<lpage>208</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrm1589</pubid>
						<pubid idtype="pmpid" link="fulltext">15738986</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B72">
				<title>
					<p>Cloning, overexpression and characterization of micro-myoglobin, a minimal heme-binding fragment.</p>
				</title>
				<aug>
					<au>
						<snm>Grandori</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Schwarzinger</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>M&#252;ller</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Eur J Biochem</source>
				<pubdate>2000</pubdate>
				<volume>267</volume>
				<fpage>1168</fpage>
				<lpage>1172</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1432-1327.2000.01114.x</pubid>
						<pubid idtype="pmpid" link="fulltext">10672027</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B73">
				<title>
					<p>The PDBbind database: methodologies and updates.</p>
				</title>
				<aug>
					<au>
						<snm>Wang</snm>
						<fnm>RX</fnm>
					</au>
					<au>
						<snm>Fang</snm>
						<fnm>XL</fnm>
					</au>
					<au>
						<snm>Lu</snm>
						<fnm>YP</fnm>
					</au>
					<au>
						<snm>Yang</snm>
						<fnm>CY</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>SM</fnm>
					</au>
				</aug>
				<source>J Med Chem</source>
				<pubdate>2005</pubdate>
				<volume>48</volume>
				<fpage>4111</fpage>
				<lpage>4119</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1021/jm048957q</pubid>
						<pubid idtype="pmpid" link="fulltext">15943484</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B74">
				<title>
					<p>Protein folding: a perspective from theory and experiment.</p>
				</title>
				<aug>
					<au>
						<snm>Dobson</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>&#352;ali</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Karplus</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Angew Chem Int Ed</source>
				<pubdate>1998</pubdate>
				<volume>37</volume>
				<fpage>868</fpage>
				<lpage>893</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1002/(SICI)1521-3773(19980420)37:7&lt;868::AID-ANIE868>3.0.CO;2-H</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B75">
				<aug>
					<au>
						<snm>Fersht</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding</source>
				<publisher>New York: Freeman</publisher>
				<pubdate>1999</pubdate>
			</bibl>
			<bibl id="B76">
				<title>
					<p>
						<it>In vitro </it>selection of functional nucleic acids.</p>
				</title>
				<aug>
					<au>
						<snm>Wilson</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>Szostak</snm>
						<fnm>JW</fnm>
					</au>
				</aug>
				<source>Ann Rev Biochem</source>
				<pubdate>1999</pubdate>
				<volume>68</volume>
				<fpage>611</fpage>
				<lpage>648</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1146/annurev.biochem.68.1.611</pubid>
						<pubid idtype="pmpid" link="fulltext">10872462</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B77">
				<title>
			