| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
GlaxoSmithKline, Discovery Research, Research Triangle Park, North Carolina 27709
Address all correspondence and requests for reprints to: Timothy M. Willson, GlaxoSmithKline, Discovery Research, NTH-M.1421.1A, Research Triangle Park, North Carolina 27709-3398. E-mail: tmw20653{at}gsk.com.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The nuclear receptors are transcription factors that regulate expression of specific target genes (1). The ability of cell-permeable hormones to regulate the activity of these transcription factors provides a mechanism for small organic molecules to directly regulate pathways of gene expression within cells. So there has been great interest in the identification of nuclear receptors and the characterization of the biochemical pathways that they regulate. Pursuit of this knowledge has invariably encompassed a marriage of chemistry and biology from the early days of endocrine hormone isolation to the current era of genomics (2). The purpose of this review is to illustrate how chemical, structural, and functional genomics have expanded our understanding of nuclear receptors and led to the identification of new pharmaceutical drug targets.
| THE NUCLEAR RECEPTOR GENE FAMILY |
|---|
|
|
|---|
|
cDNA was isolated around the same time through three independent strategies. In one case, antibodies were used to isolate human ER
from expression libraries (7). Alternatively, partial sequencing of purified human ER protein allowed design of ER-specific oligonucleotides to screen
-phage cDNA libraries (8), and in the third case CHO cells transformed with human cDNA were screened using radiolabeled E2 (9). These studies emphasize how early advances of nuclear receptor molecular biology relied on the reagents and expertise of the past biochemical era.
Comparison of the newly emerging nuclear receptor sequences gave rise to the exciting observation that significant regional homology existed between the steroid hormone receptors as well as the cellular receptors for thyroid hormone (10, 11) and RA (12, 13). Two hallmark domains were identified; a central DNA-binding domain (DBD) and a C-terminal ligand-binding domain (LBD) (Fig. 1
). Drosophila nuclear receptors that were identified through genetic screens (14) also shared these conserved domains. These observations implied that the human and Drosophila nuclear receptors were members of a gene family (now termed the nuclear receptor superfamily) that arose from a common primordial gene. Importantly, this knowledge allowed the identification of additional members of the nuclear receptor superfamily using sequence data alone, without the requirement for biochemical characterization (e.g. hormone binding).
Additional nuclear receptor genes were rapidly identified using a variety of genetic, biochemical, and molecular biology techniques. Initially, oligonucleotides representing conserved nuclear receptor motifs (such as the DBD) were employed as molecular probes to perform low-stringency hybridizations to cDNA libraries. A large number of mammalian (15) and Drosophila nuclear receptors (16, 17) were identified in this manner. Another powerful approach employed the use of degenerate PCR primers representing the highly conserved P box and Gly-Met junction within the DBD (18, 19). Through application of these cloning methods, the number of orphan nuclear receptors quickly surpassed the number of classical nuclear hormone receptors (20, 21, 22) (Tables 1
and 2
).
|
|
Recently, complete genome sequences have become available for three organisms. In contrast to EST databases, genome sequence allows identification of all gene family members without being limited by temporal, spatial, or quantitative aspects of mRNA expression. So, the mining of genome sequence databases has the potential to dramatically expand the nuclear receptor set for a given organism by identifying genes expressed at low levels or restricted to specific tissues. When the Caenorhabditis elegans genome sequence was reported, more than 200 orphan nuclear receptors were quickly identified (25). Furthermore, analysis of the C. elegans genome revealed multiple new subfamilies of nuclear receptors with divergent P box motifs in their DBDs that would not have been cloned using degenerate PCR primers. This dramatic increase in the number of known C. elegans nuclear receptors led to speculation that the human nuclear receptor set would be dramatically expanded through genome sequencing (26). Surprisingly, however, when the Drosophila genome was reported, only 21 nuclear receptor sequences were found (27, 28). With this puzzling backdrop, the nuclear receptor community was keen to learn whether the total set of human receptors would reflect the diversity seen in C. elegans or parallel the limited number found in Drosophila.
In 2000, the complete human genome sequence became available from both the International Human Genome Sequencing Consortium and Celera Genomics. Large numbers of novel ion channel, protease, and seven-transmembrane receptor genes were identified through analysis of the sequence. In contrast, analysis of the complete human genome identified only three new candidate nuclear receptor sequences (27, 28, 29). The sequences were most closely related to the hepatocyte nuclear factor 4 (HNF4), the farnesoid X receptor (FXR), and the chicken ovalbumin upstream promoter transcription factor, but unfortunately all three contained multiple stop codons within their coding regions. The three sequences are likely to be nonfunctional pseudogenes (27), even though the FXR-related gene is expressed in a limited number of tissues (27, 29). Therefore, the complete human nuclear receptor set comprises only the 48 genes that were identified using methods that predated the completion of the human genome project.
The total number of nuclear receptor genes reflects only one level of complexity within an organism. The diversity of nuclear receptors at the protein level (the proteome) is also a product of differential transcriptional and posttranscriptional processing events. Undoubtedly, additional characterization of the nuclear receptor superfamily will encompass functional variants of each receptor that are present at the protein level. Because the complement of nuclear receptor genes provides the foundation to tackle these proteomic questions, the tools of the genomic era will provide the springboard for a new wave of biochemical research.
| CHEMICAL GENOMICS |
|---|
|
|
|---|
Three approaches have proved successful in the hunt for orphan receptor ligands. The first approach employed focused screening of carefully selected sets of candidate ligands (20). Many investigators have assembled collections of test ligands by reasoning that orphan receptors are likely to bind to small lipophilic molecules. By this approach the RXR, PPAR, liver X receptor (LXR), and FXR were shown to bind to naturally occurring retinoids (31), fatty acids (32), oxysterols (33), and bile acids (34, 35), respectively. Because retinoids have traditionally been employed for chemotherapy (36), identification of RXR as a retinoid receptor led to the use of RXR ligands in treatment of various cancers. For the PPARs, their role as receptors for dietary and endogenous fatty acids accounts for the diverse therapeutic potential of synthetic PPAR ligands. Fatty acids have been shown to affect insulin sensitivity, lipid metabolism, and skin development (37). Likewise, PPAR ligands have shown clinical utility in the treatment of type 2 diabetes, dyslipidemia, and psoriasis (38). In the case of LXR, the discovery that it functions as a nuclear oxysterol receptor was pivotal in uncovering its role in the regulation of cholesterol metabolism. Synthetic LXR ligands increase the efflux of cholesterol from macrophages and raise high-density lipoprotein cholesterol, which suggests that they will be powerful drugs for combating atherosclerosis (39). Finally, the identification of endogenous bile acids as FXR ligands led to the study of the role of this receptor in bile acid and lipid metabolism. Synthetic FXR ligands suppress bile acid biosynthesis, lower serum triglycerides, and may have utility in the treatment of diseases of abnormal bile acid metabolism (40).
The second successful approach for identifying orphan receptor ligands has employed random screening of known drug molecules. Synthetic drugs have been shown to bind and activate PXR, the constitutive androstane receptor (CAR), and the estrogen related receptor (ERR) (2, 39). Remarkably, the biological activity attributed to each of these orphan receptors was previously considered to be an unexplained side effect of the drug in question. For example, a series of structurally unrelated drugs share the common property of inducing cytochrome P450 (CYP)3A levels in the liver and binding to PXR (41), implicating the orphan receptor as the unifying factor responsible for transcriptional regulation of this class of CYP. The connection is further supported by the species specificity of PXR ligands. The human-specific PXR agonist rifampicin (Fig. 2
) induces CYP3A in humans but not rodents; conversely the rodent-specific PXR agonist pregnenolone 16
-carbonitrile induces CYP3A in rodent but not human hepatocytes (42). A similar line of reasoning led to the association of CAR with the regulation of the CYP2B class of CYP. The barbiturate phenobarbital, which is known to induce expression of CYP2B, was shown to cause translocation of CAR from the cytoplasm to the nucleus of hepatocytes (43). Although phenobarbital activates CAR by an indirect mechanism, TCPOBOP (Fig. 2
), a structurally unrelated CYP2B inducer, has been shown to bind and activate the receptor. Thus, CAR activators share a common property of CYP2B induction, thereby linking this orphan receptor with transcriptional regulation of this class of cytochrome P450 (44). In the case of ERR
, diethylstilbestrol and 4-hydroxytamoxifen (Fig. 2
) were recently reported to bind to the receptor and repress its constitutive transcriptional activity (45). Both of these drug molecules have a long history of use in womens health, through their opposing effects on the activity of the classical ER
. Although the full implication of these findings is still unknown, it has been proposed that ERR ligands may have utility in the treatment of breast cancer (46).
|
| STRUCTURAL GENOMICS |
|---|
|
|
|---|
. In the following 6 yr, the structures of an additional 16 nuclear receptor LBDs have been disclosed. Many of these structures have been obtained as complexes with agonist and antagonist ligands, as well as with fragments of coactivator and corepressor proteins (49, 50). With LBD structures solved for more than a third of the mammalian nuclear receptors, it is now possible to conduct an analysis of the impact of structural genomics on our understanding of this gene family.
The nuclear receptor LBDs have a common fold composed of a three-layer sandwich of
-helices. This fold became apparent from comparison of the first two retinoid receptor structures and has been seen in all subsequent LBD structures (Fig. 3
). The domain has four functional regions (49). The top half of the LBD (Fig. 3
) forms the structural core of the protein and is relatively invariant between receptors. In contrast, the ligand-binding site, in the bottom half of the LBD (Fig. 3
), shows differences between receptors commensurate with their recognition of unique hormones and ligands. In particular, the number of ß-strands in this region coupled with the presence or absence of helix 2 contributes to the variability in the ligand-binding site. The third functional region is a hydrophobic cleft on the surface of the LBD that serves as the docking site for coactivator and corepressor proteins (50). This cleft is bounded by a charge clamp (51) formed between a Lys at the C-terminal end of helix 3 and a Glu on the activation function 2 (AF2) helix. The precise positioning of the AF2 helix is controlled by the binding of ligands to the LBD, which allows small molecules to regulate the transcriptional activity of the receptor. The fourth key region of the LBD comprises a dimerization interface along the outside of helix 10. Homodimers show C2 symmetry around this interface, whereas RXR heterodimers form an asymmetric interface with additional contributions from amino acids in helix 9 (52).
|
The PPAR ligand-binding site stands in stark contrast with the structure of HNF4, a member of the nuclear receptor gene family that was originally cloned in a screen for liver-enriched transcription factors (55). The crystal structure of the HNF4
LBD (Fig. 3B
) reveals a smaller ligand-binding site of only 625Å3, which is occupied entirely by a fatty acid that copurifies with the protein (Wisely, B., and S. P. Williams, personal communication). The fatty acid is bound into this site with its acidic head group buried within the core of the protein, in the opposite orientation to that seen in PPAR (56). In HNF4, the fatty acid does not contact the AF2 helix and there is no obvious way for it to modulate the activity of the receptor. Notably, HNF4 is a constitutively active transcription factor in cells. One explanation for this phenomenon is that the fatty acid is not a true ligand, but plays a permanent role in maintaining the structural fold of the LBD in an active conformation. Heterozygous mutations in HNF4
that decrease receptor activity cause the genetic disorder of maturity onset diabetes of the young, in which patients show defects in glucose-stimulated insulin secretion. Therefore, there is intense pharmaceutical interest in modulating the activity of HNF4 as a potential approach to treating type 2 diabetes. Unfortunately, the collected structural and biochemical data suggest that HNF4 may be a constitutively active transcription factor that is not regulated by conventional small molecule ligands (56, 57). Instead, HNF4 may function as a coactivator-docking site (58) or may be regulated by posttranslational modification of the protein (59, 60), which will require new approaches to regulate its activity with synthetic drug molecules.
A third contrast in LBD structure is shown by PXR (Fig. 3C
), a nuclear receptor that functions as a promiscuous xenobiotic sensor to regulate the expression of CYP3A in the liver and intestine. Although the PXR ligand-binding site, at 1150Å3, is not the largest, it has features that are ideally suited to its role as a xenosensor (61). The receptor has five strands of ß-sheet instead of the usual two or three strands, and a large insert of 45 amino acids that replaces helix 2. These elements lead to a reshaping of the protein such that helix 6 is unwound, and part of the 45-amino acid insert is used as the floor of a spherical ligand-binding site that is lined with predominantly hydrophobic amino acids. In addition, a flexible loop that replaces helix 6 may expand and contract to accommodate ligands of various sizes, including the macrocyclic antibiotic rifampicinprobably the largest known nuclear receptor ligand (Fig. 2
). The cocrystal structure of PXR with the cholesterol-lowering drug SR12813 (Fig. 2
) revealed another surprise (61). The drug was bound in multiple orientations, in which each unique orientation employed a different set of interactions with the protein. Apparently, the spherical pocket does not require ligands to satisfy a single shape or arrangement of hydrogen bonding interactions. These molecular features may give PXR the ability to recognize a wide range of structurally unrelated xenobiotics.
The crystal structures of the remaining orphan nuclear receptor LBDs have the potential to uncover important clues about their physiological function. The recent structure of RORß (62), which has a large ligand-binding site and shows partial occupancy by stearic acid, suggests that the RORs may be another family of fatty acid-sensing receptors akin to the PPARs. In contrast, receptors that crystallize with tightly bound lipids, as is seen with HNF4 (56), or lack a ligand-binding pocket, as is predicted for RevErbA (63), may be examples of constitutively active transcription factors. Interestingly, many of the remaining orphan nuclear receptors show constitutive activation or repression of transcription in cells. The act of systematically solving the LBD structures of the remaining orphan nuclear receptors will undoubtedly provide important insight into which ones are likely to be good targets for small molecule pharmaceutical drugs and, conversely, which ones may be less tractable using traditional chemical technologies.
| FUNCTIONAL GENOMICS |
|---|
|
|
|---|
Reverse endocrinology exploits the availability of potent and selective small-molecule ligands to identify nuclear receptor target genes by differential gene expression (DGE) (2). This approach is especially pertinent for nuclear receptors because they are transcription factors that directly regulate gene expression. In cases where clues about the biology of the receptor are available, the differential effects of ligands on candidate genes can be examined on an individual basis. However, the advent of high-throughput DGE technologies [e.g. Affymetrix (Santa Clara, CA) microarrays and Curagen Corp. (New Haven, CT) Gene Calling] has enabled comprehensive, unbiased analysis of gene expression (64). Global DGE studies in the liver using the selective FXR ligand GW4064 (Fig. 2
) have uncovered an elaborate nuclear receptor-regulatory cascade for the maintenance of bile acid homeostasis (65). Among the mRNA whose expression was induced by the FXR ligand was the short heterodimer partner (SHP), an atypical nuclear receptor that lacks a DBD. SHP, a constitutive repressor of gene expression, mediates the effects of FXR in the suppression of bile acid biosynthesis. Based on these data it has been proposed that cholestasis, a serious and life-threatening disease that is characterized by accumulation of bile acids in the liver, could be treated with potent FXR agonists (40). Other recently reported global DGE studies include the identification of PPAR
target genes that led to a model for PPAR
action in insulin sensitization (66), and differentiation of CAR-dependent and CAR-independent effects of phenobarbital in the liver (67). As DGE technologies become widely accessible, the use of reverse endocrinology in functional analysis of orphan nuclear receptors is likely to remain popular. Future improvements in efficiency will depend less on the speed of DGE data generation and more on the development of improved bioinformatic tools to discern key pathways of gene regulation within complex data sets.
Traditional molecular genetics continues to play an important role in nuclear receptor functional analysis. Importantly, genetic analysis of receptor function does not require the availability of a ligand. Key insights into the physiological function of several orphan nuclear receptors without known ligands have been derived from analysis of knockout mice generated using homologous recombination (68). For example, nerve growth factor immediate early gene B (NGFI-B) ß knockout mice fail to develop midbrain dopaminergic neurons (69). The knockout phenotype led to the suggestion that NGFI-Bß ligands might be useful for the treatment of disorders of dopamine signaling such as Parkinsons disease. One general drawback of conventional nuclear receptor gene knockouts is that about half of them result in embryonic lethality. To circumvent this problem, new genetic strategies such as the conditional site-specific recombination using the Cre/lox system are being used to introduce temporal or tissue-specific somatic mutations of genes (70, 71). Recently, an inducible Cre/lox system was used to knock out steroidogenic factor 1 (SF1) in the anterior pituitary of the mouse, limiting the severe phenotype of the conventional SF1 knockout. The phenotype of the conditional knockout mice established an essential role of SF1 in pituitary production of the gonadotropes LH and FSH (72).
Nuclear receptor knockouts can now be accomplished in a broader range of cells using two new genetic techniques. Antisense oligonucleotides (73) and small RNA interference technologies (74) introduce sequence-specific, posttranscriptional gene silencing, allowing the suppression of nuclear receptor expression in cell culture. The effect of HNF4
suppression on gene expression in human hepatocytes has recently been evaluated using antisense RNA (75). In these experiments, an adenoviral vector was used to efficiently deliver an HNF4
antisense RNA expression construct to hepatocytes in culture, resulting in nearly undetectable levels of HNF4
protein. Specific ablation of HNF4
led to dysregulation of multiple drug-metabolizing CYPs.
In the postgenome era, human genetic studies will be increasingly used as a source of new information on orphan nuclear receptor function. One of the fruits of the human genome project is a wealth of information on human DNA sequence variation. Although the overall difference between any two human genomes is less than 0.1%, there are still several million single nucleotide polymorphisms (SNPs) within each individual (76, 77). The availability of a dense set of SNPs increases the probability of finding nuclear receptor variants that can be associated with human diseases. Recently, SNP analysis has led to association of NGFI-Bß with schizophrenia and manic depressive disorder (78). Another study based on a SNP analysis of patients with maturity onset diabetes of the young associated mutations in SHP with mild obesity (79). This field is burgeoning with new information and is one of the most exciting avenues for future exploration into orphan nuclear receptor function in humans.
| CONCLUSIONS AND FUTURE DIRECTIONS |
|---|
|
|
|---|
To date, almost all of the natural ligands that have been identified are agonists that increase receptor activity. These compounds have proven to be powerful tools in analysis of receptor function by reverse endocrinology (2). However, the development of antagonist ligands would provide an additional capacity to perform chemical knockouts of receptor function in mature animals or in animal models of disease. Increased understanding of the molecular mechanism of receptor activation combined with the availability of cocrystal structures with coactivators and corepressors may soon permit the rational design of potent antagonist ligands. This combination of chemical and structural genomics will be an important undertaking for future medicinal chemists.
Perhaps the most controversial concept that has emerged from structural analysis of the orphan nuclear receptors is the possibility that some of the remaining family members may not be small molecule receptors. Several of the orphans that may fall into this category are highly validated as pharmaceutical drug targets because of their genetic association with human diseases (e.g. HNF4
with diabetes, SHP with obesity, and NGFI-Bß with schizophrenia) (22). If it transpires that the activity of these transcription factors cannot be modulated by conventional small-molecule ligands, researchers may need to consider alternative approaches by which to develop therapeutic agents. Early indications are that short peptides can be developed that disrupt the action coactivator proteins through competition with the binding cleft on the surface of the receptor (80, 81). These peptides are able to block receptor function when introduced into cells. Thus, it may be possible to directly modulate cofactor binding to these transcription factors if issues of cell penetration can be overcome through the design and synthesis of peptidomimetic molecules (82).
Less than two decades after the cloning of the GR cDNA, we now know that there are a remarkably small number of human nuclear receptors. Each of these transcription factors appears to play an important role in mammalian physiology, and many of them have emerged as important drug targets. The physiological functions of at least half of the mammalian nuclear receptors are now known, and increasingly rapid progress is being made to uncover the biology of the remaining members of the gene family. At the current pace, it now seems possible that we will complete the functional analysis of the orphan nuclear receptors within the next decade. This achievement will undoubtedly lead to new opportunities for the development of human therapeutic agents.
The half-time score: all tied with everything to play for!
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Received for publication January 22, 2002. Accepted for publication March 1, 2002.
| REFERENCES |
|---|
|
|
|---|
. Nature 383:728731[CrossRef][Medline]
. Nature 375:377382[CrossRef][Medline]
. Nature 415:813816[Medline]
. Nature 395:137143[CrossRef][Medline]
. J Mol Biol 302:831851[CrossRef][Medline]
1: modulation by the F domain. Mol Cell Biol 22:16261638
activation has coordinate effects on gene expression in multiple insulin-sensitive tissues. Endocrinology 142:12691277
-helix. J Am Chem Soc 123:53825383[CrossRef][Medline]
This article has been cited by other articles:
![]() |
S. C. Wu and Y. Zhang Minireview: Role of Protein Methylation and Demethylation in Nuclear Hormone Signaling Mol. Endocrinol., September 1, 2009; 23(9): 1323 - 1334. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D. Conzen Minireview: Nuclear Receptors and Breast Cancer Mol. Endocrinol., October 1, 2008; 22(10): 2215 - 2228. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Howarth, S. H. W. Law, B. Barnes, J. M. Hall, D. E. Hinton, L. Moore, J. M. Maglich, J. T. Moore, and S. W. Kullman Paralogous Vitamin D Receptors in Teleosts: Transition of Nuclear Receptor Function Endocrinology, May 1, 2008; 149(5): 2411 - 2422. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. D. Moore, S. Kato, W. Xie, D. J. Mangelsdorf, D. R. Schmidt, R. Xiao, and S. A. Kliewer International Union of Pharmacology. LXII. The NR1H and NR1I Receptors: Constitutive Androstane Receptor, Pregnene X Receptor, Farnesoid X Receptor {alpha}, Farnesoid X Receptor beta, Liver X Receptor {alpha}, Liver X Receptor beta, and Vitamin D Receptor Pharmacol. Rev., December 1, 2006; 58(4): 742 - 759. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Lempiainen, F. Molnar, M. Macias Gonzalez, M. Perakyla, and C. Carlberg Antagonist- and Inverse Agonist-Driven Interactions of the Vitamin D Receptor and the Constitutive Androstane Receptor with Corepressor Protein Mol. Endocrinol., September 1, 2005; 19(9): 2258 - 2272. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Pardee, J. Reinking, and H. Krause Nuclear Hormone Receptors, Metabolism, and Aging: What Goes Around Comes Around Sci. Aging Knowl. Environ., November 24, 2004; 2004(47): re8 - re8. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Pawar and D. B. Jump Unsaturated Fatty Acid Regulation of Peroxisome Proliferator-activated Receptor {alpha} Activity in Rat Primary Hepatoctes J. Biol. Chem., September 19, 2003; 278(38): 35931 - 35939. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Maglich, J. A. Caravella, M. H. Lambert, T. M. Willson, J. T. Moore, and L. Ramamurthy The first completed genome sequence from a teleost fish (Fugu rubripes) adds significant diversity to the nuclear receptor superfamily Nucleic Acids Res., July 15, 2003; 31(14): 4051 - 4058. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Y. Liu, A. K. Death, and D. J. Handelsman Androgens and Cardiovascular Disease Endocr. Rev., June 1, 2003; 24(3): 313 - 340. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Liu, X. Huang, and C. D. Sigmund Identification of a Nuclear Orphan Receptor (Ear2) as a Negative Regulator of Renin Gene Transcription Circ. Res., May 16, 2003; 92(9): 1033 - 1040. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Endocrinology | Endocrine Reviews | J. Clin. End. & Metab. |
| Molecular Endocrinology | Recent Prog. Horm. Res. | All Endocrine Journals |