Page 91 - Hematology_ Basic Principles and Practice ( PDFDrive )
P. 91

62    Part I  Molecular and Cellular Basis of Hematology

                                                                        12
           Deciphering this basic protein building block is key for under-  transduction.   Unstructured  segments  are  well  suited  for  protein
        standing  the  structure  and  evolution  of  proteins.  Kinetically,  the   interactions controlled by posttranslational modifications. For example,
        domain structure of a protein may simplify the folding process into   sites  of  tyrosine  phosphorylation  are  typically  unstructured  and
                      3
        a step-wise course.  Thus a long amino acid sequence may fold into   therefore accessible for modification, but after phosphorylation they
        multiple domains rapidly and correctly. For many proteins, individ-  become  ordered  upon  phosphorylation-dependent  binding  to  a
        ual domains fold in a cotranslational manner; from the N-terminal   partner protein.
        region, a growing nascent polypeptide chain immediately begins to   Most  proteins  are  composed  of  multiple  domains,  which  may
        fold domain-by-domain during translation from the ribosome in a   confer multiple functions, couple a targeting function to a catalytic
                         4
        very  efficient  manner.   Genetically,  it  was  long  suspected  that  the   function, or provide for allosteric regulation. In the following sections
        exon structure of genes was correlated with the domain structure of   we highlight the structure of a few proteins and domains and that
               5
        proteins.  Subsequent multigenome analysis did find a strong corre-  are of central and recurring importance in hematology in order to
        lation between domain organization and exon–intron arrangement   illustrate the relationship between domain architecture and function.
        in genomic DNA. The exon–domain correlation facilitates extensive   We discuss representative examples from the extracellular space (the
                                     6
        exon shuffling events during evolution,  although it is not necessar-  immunoglobulin  domain),  from  intracellular  signaling  (protein
        ily  always  one-exon/one-domain.  This  mechanism  ensures  that  a   kinase  domain),  and  from  the  cell  membrane  (G  protein–coupled
        stable and functionally efficient domain can be repeatedly used as a   receptors and the vitamin K receptor).
        module  assembled  into  many  proteins  with  shared  functions.  A
        well-known early example is the nucleotide-binding domain identi-
        fied  in  various  dehydrogenases;  its  robust  alternate  β-strand–α-  The Immunoglobulin Domain and Variations
        helix–β-strand  fold  provides  a  common  structural  unit  for  these
        enzymes. 7                                            As implied by its name, the immunoglobulin (Ig) domain was first
                                                                                13
           Recent computational approaches demonstrate that almost all the   recognized in antibodies.  A detailed discussion on antibody biology
        growing number of known sequences come from new combinations   can be found in Chapter 24. The human genome project has identi-
        of  various  domains,  and  more  than  70%  of  all  sequences  can  be   fied the Ig superfamily (IgSF) as the largest superfamily in human
                                                          8
        partially modeled from known structures with homologous domains.    genome,  due  to  its  extensive  usage  in  a  more  recently  developed
                                                                                     9
                                                  9
        This has been reflected in the human genome sequence.  Impressive   immune system in vertebrates.  In fact, the Ig domain is an evolu-
        progress has already been made in computational protein prediction   tionarily ancient structural unit that can be found in Caenorhabditis
                                                                   14
        and design, principally based on the known structural elements. 10  elegans.  Although Ig-like domains also exist in a few intracellular
           Importantly, not all protein sequences fold into a compact domain.   proteins, they are found predominantly in the extracellular space and
        Depending  on  computational  methods  used,  35%  to  50%  of  the   are the most abundant structural unit found in cell surface receptors,
        human  proteome  is  estimated  to  lack  a  folded  three-dimensional   serving key recognition functions in both the immune and nervous
        structure. Nevertheless, these intrinsically disordered proteins (or more   systems.  Along  with  a  handful  of  other  modular  domains  such  as
        often, intrinsically disordered regions within proteins) can perform   fibronectin  type  III  domains  and  epidermal  growth  factor  (EGF)
        critically  important  biologic  functions  that  complement  those  of   domains, they form modular structures of most receptor molecules
                       11
        structured proteins.  Intrinsically disordered protein segments typi-  on the cell surface. 15
        cally  contain  relatively  few  bulky  hydrophobic  amino  acids  such     An Ig domain is composed of roughly 100 residues, folding into
        as  tryptophan,  phenylalanine,  leucine,  and  isoleucine,  which  are   two β-sheets packing face-to-face, forming a β-barrel. This distinc-
        required to form the hydrophobic core of a domain. Many disordered   tively folded structure is commonly known as the immunoglobulin
        regions  mediate  protein–protein  interactions  and  may  undergo  a   fold (Fig. 6.3A). An intact IgG antibody consists of two heavy chains
        disorder-to-order transition upon binding to an interacting partner,   and two light chains. Each heavy chain contains four Ig domains,
        a  process  dubbed  folding-upon-binding.  They  function  in  crucial   one “variable” domain, and three “constant” domains; whereas each
        areas such as transcriptional regulation, translation, and cellular signal   light chain contains two Ig domains, one constant and one variable.


                                CDR3


                          CDR1


                                                                                     C    F    G
                        CDR2      C     F                      C
                             C’           B     A                   F    G
                        C”           E                  D                                       A
                               D                            E   B    A                     B
                                              G                                      E


                                                  A’
                                                                                                 A’


                      A                                B                         C
                        Fig. 6.3  Ig DOMAIN ARCHITECTURE. (A) V-set Ig domain (Protein Data Bank [PDB] entry 3IDG).
                        (B) C-set Ig domain (3IDG). (C) I-set Ig domain, which can be described as a truncated V-set domain (PDB
                        entry 2V5M). Disulfide bonds are highlighted in orange.
   86   87   88   89   90   91   92   93   94   95   96