Review of TFIIIA

By Michael Bumbulis, Ph.D

1996 (Introduction from Ph.D. Thesis)

Recognition of specific DNA sequences is a crucial step associated with many cellular processes and among eucaryotes, the zinc finger is a ubiquitous DNA-binding motif involved in this step (Klug and Rhodes, 1987). Since this motif was first i dentified in Xenopus laevis transcription factor IIIA (TFIIIA) in 1985 (Miller et al., 1985), literally hundreds of proteins, many of them transcription factors, have been shown to contain zinc fingers. In fact, all eucaryotes tested to date cont ain one or more sequences that hybridize to zinc finger-derived probes (Chowdhury et al., 1987; Schuh et al, 1986) and over 100 human DNA sequences encoding putative zinc finger protein genes have been isolated (Hoovers, et al., 1992). Among the better c haracterized zinc finger proteins are yeast transcription factor ADR1 (Bemis and Denis, 1988), human transcription factor Sp1 (Gidoni et al., 1984), mouse transcription factor Zif268 (Christy et al., 1988) and the gene product from human gene ZFY, the testis-determining factor (Kochoyan at al., 1991). The zinc finger motif has also been implicated in Drosophila development, being encoded by the segmentation genes Kruppel (Rosenberg et al., 1986) and Hunchback (Tautz et al., 1987) and by genes thought to be involved in tumorigenesis, including GLI (Kinzler et al., 1988) and a candidate for the Wilm's tumor gene (Gessler et al., 1990). A survey of zinc finger proteins shows a significant variation in the number of zinc finger motifs per protein, ranging from 2 (ADR1), 3 (Sp1, Zif268), 9 (TFIIIA), 13 (ZFY) and up to 37 (Xfin, a Xenopus RNA-binding protein).

 

Isolation and characterization of TFIIIA

TFIIIA is considered the archetypal zinc finger protein. Not only was it the first protein discovered to contain the zinc finger motif, but it was also the first eucaryotic transcription factor to be purified. Because of the great abundance of TFI IIA in Xenopus oocytes, where it exists in a complex with 5S RNA (the cytoplasmic 7S storage particle), it was purified to homogeneity from ovarian extracts (Engelke et al., 1980; Pelham and Brown, 1980). TFIIIA was purified as a 40-kDa protein that was specifically required for accurate transcription of 5S RNA genes by RNA polymerase III (Engelke et al., 1980). It binds specifically to the internal control region (ICR) of the 5S RNA gene, a region approximately 50 base pairs long that begins about 45 base pairs downstream of the transcription initiation site (Engelke et al., 1980; Sakonju and Brown, 1981). TFIIIA alone binds reversibly to the ICR and once bound, it promotes the subsequent formation of a complex involving other transcription factors, TFIIIC and TFIIIB (Bogenhagen et al., 1982; Lasser et al., 1983). TFIIIC does not specifically bind to the ICR itself, but binds to the TFIIIA-5S RNA gene complex and stabilizes it. TFIIIB then binds to complete the preinitiation complex and RNA polymer ase is then recruited to initiate transcription (Bieker et al., 1985). The transcription complex remains bound to the DNA through multiple passages of the RNA polymerase (Bogenhagen et al, 1982).

Because TFIIIA protected a 45-50 base pair region from DNase I digestion, yet was only 38-40-KDa in size, it was initially argued that more than one protein bound to the ICR (Engelke et al., 1980). However, a lack of dyad structure in the ICR coup led with low protein-to-gene molar ratios necessary for binding suggested that one or two protein molecules were sufficient for binding along the entire length of the ICR (Sakonju and Brown, 1982). Footprint titrations, transcription assays, and mixing e xperiments determined that TFIIIA binds to DNA with a binding stoichiometry of 1:1(Smith et al., 1984), suggesting that it has a very elongated structure in association with its binding site.

When the amino acid composition of TFIIIA was determined (Picard and Wegnez, 1979), the surprisingly frequent occurrence of cysteines led Hanas et al. (1983) to suspect that metal ions were involved in TFIIIA function (sulfhydryl groups are capable of metal coordination). They discovered that zinc was necessary for the transcription of 5S RNA genes by RNA polymerase III. It was also determined that preparations of TFIIIA treated with chelating agents could not bind to the 5S RNA gene, yet binding could be restored by the addition of zinc ions to the TFIIIA preparations (Hanas et al., 1983). The stability of 7S particles was also improved by excluding chelating agents (Miller et al. 1985). Atomic absorption spectroscopy analyses showed that prep arations of 7S particles contained 7-11 atoms of zinc per mole of particle (Klug and Rhodes, 1987). While is was clear that zinc was essential for TFIIIA function, the actual role of zinc in DNA-binding was not known.

It was not until the amino acid sequence of TFIIIA was deduced from a cDNA clone that the basis of TFIIIA's zinc-dependence was determined (Miller et al., 1985). Analysis of the amino acid sequence of TFIIIA revealed nine tandem similar units (Figu re 1), each containing about 30 amino acids, including two cysteines and two histidines located at invariant positions within each unit. These four residues were proposed to be zinc-ligands which interact with zinc to form a tetrahedal complex necessary for the proper tertiary structure of the minidomain, termed a "zinc finger." This interpretation was supported by proteolysis experiments, where trypsin digests of TFIIIA yielded fragments that differed in size by about 3 kDa (Miller et al, 1985). A ser ies of individual zinc coordinating domains, each containing about 30 amino acids, would then account for both the zinc content of the protein and a periodic arrangement of 3-kDa fragments.

Further support for the importance of the zinc ligands was forthcoming. An EXAFS (extended X-ray absorption fine structure) study confirmed that approximately nine zinc atoms are contained within a 7S particle and the coordination sphere of the zi nc ions contains two cysteines and two histidines (Diakun, et al., 1986). A peptide, containing finger two of TFIIIA, was also purified and shown to fold properly in the presence of zinc (Frankel et al, 1987). Site-directed mutagenesis was used to repla ce the first conserved histidine with asparagine in each of the nine fingers (Del Rio et al., 1993). The mutant proteins

 

 

 

 

 

 

 

 

 

Figure 1. Amino acid sequence of Xenopus laevis TFIIIA. Sequence is aligned to show repeated units 1-9. Zinc ligands and conserved hydrophobic residues are highlighted. a and b symbols correspond to predicted position of alpha helix and beta sheet, respectively, based upon crystallographic data. Sequence in parentheses correspond to C-terminal domain.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

demonstrated localized regions of disruption associated with the region of mutation, a decreased affinity for the 5S RNA gene, and when compared wild type TFIIIA, allowed for localized regions of increased susceptibility to DNase I digestion in DNA-TFI IIA complexes. Zinc has also been determined to be essential for the proper folding and binding activity of transcription factor Sp1 (Kuwahura and Coleman, 1990).

Several other features of the amino acid sequence of TFIIIA are functionally relevant. The consensus sequence of the nine repeats is

-Cys-X2-5-Cys-X12-His-X3-4-His-. Within each repe at, there are three hydrophobic residues found at invariant positions (Figure 2) that interact with each other to stabilize the minidomain. The looping region (X12) between the second Cy s and the first His is enriched with amino acids known to interact with the phosphate backbone of DNA in gene repressor protein/DNA binding studies (Figure 2). In addition, it is also rich in arginine residues which often form hydrogen bonds with specifi c base pairs in previously analyzed DNA-protein complexes (Seeman et al., 1976). Thus, it is likely that some of these amino acids interact with the DNA and impart binding specificity. Approximately 80% of the protein's mass corresponds to its zinc fing er DNA-binding domain (Vrana et al., 1988).

Each repeat is joined by a linker region of 2-6 amino acids. The linker regions between fingers 1-2 and 2-3 contain a pentapeptide sequence, TGEKP/N, that is also found in Kruppel, Zif268, and many other zinc

 

 

 

 

 

 

 

Figure 2. Analysis of average distribution of amino acids along the length of each repeated unit. The height of the histogram corresponds to the number of times that class of amino acid occurs within a particular position along the repeated unit. Cl assification of amino acids are as follows: a) DNA-binding - Lys, Arg, His, Asn, Gln, Thr; b) Basic - Arg, Lys, His; c) Acidic - Asp, Glu; d) Hydrophobic - Leu, Ile, Val, Phe, Tyr, Trp. The four zinc-ligands (cysteines and histidines) are represented in each set of histograms. (Figure taken from Miller et al., 1985).

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

finger proteins (Schuh, et al., 1986). There is no obvious pattern among the linkers except that fingers 1,3,5,7,and 9 have more basic residues in

their C-terminal halves than fingers 2,4,6, and 8 (Churchill et al., 1990). Compared to other fingers, the linkers going into and out of the sixth

finger are unusually short. This suggests that finger six probably interacts with the DNA in a manner that is different from the other fingers.

Following the nine zinc finger domains is a C-terminal region that contains about 70 amino acids. The first half of this region is enriched

with basic residues (Figure 1). Miller et al. (1985) proposed that this region corresponds to the 10-kDa domain that is revealed at one end of TFIIIA by mild proteolysis (Smith et al., 1984). This domain can be removed by proteolysis without affectin g the affinity of DNA-binding or the pattern of protection seen with DNase I footprinting (Smith et al., 1984; Vrana et al., 1988). However, the C-terminal domain is required for efficient RNA synthesis. The 30-kDa fragment has only about 20% of the act ivity of the full-length 40-kDa fragment (Smith et al., 1984). This C-terminal region probably interacts with TFIIIC in the formation of the preinitiation complex (Lasser et al., 1983).

 

Structure of zinc finger domain

Knowledge of recurring substructures in crystallographically characterized metalloproteins was used to predict the three-dimensional structure of the zinc finger motif (Berg, 1988). It was observed that the zinc finger consensus sequence could be divided into a Cys-Cys loop (Hyd - X - Cys - X2-4 - Cys - X3 - Hyd) and a His-His loop (X - Leu - X2 - His - X3-4 - His). The sequence that characterizes the Cys-Cys loop was seen in two other metalloproteins, aspartate transcarbamylase and rubredoxin. Crystallographic studies of these proteins showed that the cysteines coordinated metal ions and the three-dimensional structure consisted of two antiparallel beta strands. Similarly, the His-His loop was seen in thermolysin and hemerythr in, and crystallographic studies of these proteins showed that each histidine coordinated a metal ion and was part of a helical structure. Berg (1988) then proposed that the zinc finger motif could be understood by attaching these two substructures, resu lting in a domain centered around a tetrahedal metal center and consisting of an antiparallel beta sheet followed by an alpha helix. Such modeling also had the appealing feature of allowing the three conserved hydrophobic residues to be packed together.

Two-dimensional NMR studies of peptides from ADR1 containing either one or two zinc fingers (Parraga et al., 1988), finger 31 from Xfin (Lee et al., 1989), finger 2 from yeast transcription factor SWI5 (Neuhaus et al., 1992) and the C-terminal fing er of human enhancer binding protein (Omichinski et al., 1990) essentially confirmed this model (Figure 3). Specifically, the N-terminal region of the finger forms a two-stranded beta sheet, containing both metal-binding cysteine residues, that terminat es, bends, and gives rise to an alpha helix containing the metal-binding histidine residues. The two cysteine residues and two histidine residues coordinate the zinc ion and the conserved hydrophobic

 

 

 

 

 

 

 

 

 

 

Figure 3. Schematic illustration of the tertiary structure of a zinc finger domain. Interactions between zinc-ligands and conserved hydrophobic residues are shown. (Figure taken from Krizek et al., 1991).

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

residues do indeed form a core, which further stabilizes the tertiary structure of the finger. The linker residues connect the adjacent fingers. While all zinc fingers characterized share this common structure, finger one from SWI5 has an additional beta strand N-terminal to the formal start of the finger motif (Neuhaus et al., 1992). This third strand plays a role in stabilizing the structure of this finger.

 

TFIIIA/5S RNA Gene Interaction

5S RNA Gene Structure

TFIIIA binds specifically to an intragenic region, called the internal control region (ICR), of the 5S RNA gene and protects nucleotide positions 45-96 (where 1 denotes the transcription start site) from DNase I digestion (Pelham and Brown, 198 0; Engelke et al., 1980). A series of deletion mutants initially defined residues 50-83 to be required for transcription initiation (Bogenhagen et al., 1980). Methylation interference experiments showed that the main DNA contact points with TFIIIA occur at the extreme 3' end of the ICR, where six critical G residues on the noncoding strand were identified between positions 80-90 (Sakonju and Brown, 1982). These data suggest TFIIIA interacts with almost every nucleotide on the noncoding strand from regi on 80-90. A series of point mutations were introduced into the 5S RNA gene to determine their effect on transcription initiation (Pieler et al., 1985a; Pieler et al., 1985b). As a result, the ICR was defined to include residues 45-97 and discovered to b e tripartite in structure. The three regions are defined as box A (54-60), the intermediate element (67-72) and box C (80-97). Box A is a common, conserved class III promoter domain, while the intermediate element and box C are 5S-gene-specific. The sp acer regions between these elements are quite tolerant of sequence alterations (Pieler et al., 1985b) and are variably sensitive to insertions and deletions (Pieler et al., 1985b; Pieler et al., 1987).

A series of 5S RNA gene point mutants were also analyzed in terms of their effect on TFIIIA binding affinity (You et al., 1991; Veldhoen et al., 1994). In agreement with the data derived from methylation interference experiments (Sakonju and Brown , 1982), extensive protein-DNA contacts were detected in the C box. Substitution of GC base pairs at positions 81, 85, 86, 89, and 91 resulted in three to fivefold reductions in TFIIIA binding affinity (Veldhoen et al., 1994). Other bases within the C bo x (residues 80-91) were also important in binding. Within the intermediate element, a significant reduction in TFIIIA affinity was seen when positions 70 and 71 were mutated, but no reduction in affinity was seen when positions 67-69 were mutated. (Veldh oen et al., 1994). Substitutions of sequences 57-62 in box A resulted in a three to fourfold reduction in binding (You et al., 1991).

The 5S RNA gene also has the ability to bind zinc in a sequence-specific fashion (Martinez-Balbas, et al., 1995). The strongest zinc-binding sites correspond to GGG trinucleotide repeats, although the strength of binding depends on flanking sequen ces. It is interesting to note that residues determined to be important for TFIIIA binding (79-82 and 84-86 in the C box and 70-71 in the intermediate element) have a strong affinity for zinc. These data raise the intriguing possibility that the structu ral zinc ions contained within TFIIIA may also interact with the DNA.

 

Modeling TFIIIA binding to the ICR

TFIIIA binds to the ICR so that its carboxyl terminus is towards the 5' end (the region containing the A box) and its amino terminus is towards the 3' end (the region containing the C box) (Miller et al., 1985; Vrana et al., 1988). The protein is extended so that the nine zinc fingers interact with 50 nucleotide pairs (Smith et al., 1984). Initially it was predicted that each finger would therefore interact with one half turn or five base pairs of DNA (Miller et al., 1985). This idea was sup ported by sequence analysis of the ICR, where one or more G residues are found approximately every five base pairs (Rhodes and Klug, 1986). Probing the accessibility of the DNA in the TFIIIA-ICR complex to dimethysulfate and micrococcal nuclease further supported this hypothesis, as nine possible patches of protection were identified approximately five base pairs apart (Fairall et al., 1986).

Two models of TFIIIA-ICR interaction were initially proposed (Fairall et al., 1986). In model I, TFIIIA snakes around the DNA with the fingers following the helical path of the major groove (Figure 4a) (Smith et al.,

 

 

 

 

 

Figure 4. Various models representing the proposed interactions between TFIIIA and the 5S RNA gene. a) Two models originally proposed by Fairall et al. (1986). Model I shows the the fingers of TFIIIA wrapping around the DNA so that they follow the helical path of the major groove. Model II shows TFIIIA positioned on one side of the helix.

b) Modified version of model II, where every other linker residue (shown as dotted line) crosses the minor groove (Figure taken from Churchill et al., 1990). c) Representative of non-uniform model, where fingers 1-3 and 7-9 wrap around the DNA much like Zif268 (Pavletich and Pabo, 1991). In this particular model, finger 5 interacts with the major groove and fingers 4 and 6 span the minor groove (Figure taken from Clemens et al., 1992). d) Sites of DNase I footprint alterations associated with par ticular broken finger mutants (Figure taken from Del Rio et al., 1993).

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

1984; Berg, 1988). In this model, each finger would make structurally equivalent contacts with the major groove. In model II, TFIIIA lies on one face of the helix with successive fingers interacting with alternate sides of the DNA (Figure 4a) (Fairal l et al., 1986). In this model, only every other finger would make structurally equivalent contacts every 10 base pairs. Model II suggests the amino and carboxyl terminus of each finger

would exit from the same end of the finger, while model I suggests the amino and carboxyl termini would exit from opposite ends of the finger. The predicted three-dimensional structure of the zinc finger domain (Berg, 1988), that was later confirmed b y 2-D NMR studies, clearly supported model I. On the other hand, methylation and micrococcal nuclease protection studies showed alternating strong and weak sites of protection every five nucleotides (Fairall et al., 1986). DNase I footprinting also show ed that primarily one face of the DNA is protected (Rhodes, 1985). These data clearly supported model II. Hydroxyl radical footprinting further supported model II, as regions of protection every 10 base pairs were seen, along with asymmetric protection of the two strands (Churchill et al., 1990). Churchill et al. (1990) then proposed a model to reconcile the experimental data with the known structure of the zinc finger domain. In this model, a two finger periodicity was proposed where each finger sits on the same side of the helix, but after every second finger, the linker region would span the minor groove (Figure 4b). All of these models assume essentially equivalent interactions between each finger (or pairs of fingers) and the DNA, yet the tripar tite structure of the ICR, determined by mutagenic studies of the 5S RNA gene, did not support this assumption.

Several studies demonstrated a non-uniform interaction between the nine zinc fingers of TFIIIA and the ICR. By considering the length of the linkers connecting finger six to its adjacent fingers, Berg (1990) proposed that finger six is itself a li nker connecting two sets of fingers (1-5 and 7-9) that wrap along the major groove. Truncated versions of TFIIIA were used in hydroxyl-radical footprint experiments and further showed non-uniform interaction between TFIIIA and the ICR (Vrana, et al., 198 8; Hayes and Clemens, 1992). These experiments revealed two continuous regions of protection at both ends of the ICR, corresponding to roughly one turn of the helix. However, the middle 20 base pairs showed a more complex pattern of protection, where a short stretch of DNA is protected, but is flanked by two unprotected regions. These three clustered regions of protection were in good agreement with the tripartite structure of the ICR determined by Pieler et al. (1985a,b). Various further models were t hen proposed. While they differ slightly in their positioning of fingers 4-6, they all agree that fingers 1-3 and 7-9 wrap around the DNA, while fingers 4-6 span only one side, crossing the minor groove twice (Figure 4c). These non-uniform models were further supported by a reconsideration of the DNase I footprinting experiments (Fairall and Rhodes, 1992), missing nucleoside experiments (Hayes and Tullius, 1992) and further refined hydroxyl-radical footprinting experiments using truncated versions of T FIIIA (Clemens et al., 1992; Hayes and Clemens, 1992). To account for the 22 or so base pairs that interact with fingers 4-6, it has been noted that TFIIIA bends the DNA upon binding (Schroth et al., 1989, Schroth et al., 1991). Most of the experimental evidence used to propose this model of non-uniform interactions employed truncated versions of TFIIIA. Therefore, the above interpretations depend upon the assumption that truncated forms of TFIIIA interact with the 5S RNA gene essentially as they woul d in the context of the full-length protein.

Site-directed mutagenesis was used to generate mutant versions of TFIIIA containing structural disruptions of individual zinc fingers (Del Rio et al., 1993). DNase I footprinting analyses of these 'broken finger' mutants suggest that TFIIIA's mod e of interaction with the ICR may be even more complex than that which is proposed in the non-uniform model (Figure 4d). For example, while the pattern of footprint alterations associated with zinc-ligand mutations in finger 2 or 5 correspond to the pos itioning of these fingers in the non-uniform models (Clemens et al., 1992; Hayes and Clemens, 1992), the same does not hold true for fingers 6,7,8, and 9. The sites of loss-of-protection associated with these finger mutants were positioned further toward s the 5' end of the ICR than would be predicted by the non-uniform models. In fact, disruption of finger 8 was associated with two separate patches of loss-of-protection suggesting that finger 8 interacts with two sites on the ICR well separated along th e linear dimension of the DNA. These data do, however, show that TFIIIA does interact with the helix along the length of the ICR as a series of sequence-specific, localized contacts.

Cross-linking experiments have shown that of the 20 T residues that span a 61-bp fragment containing the ICR, >90% of the cross-linking occurs in two positions corresponding to residues 84 and 88 on the noncoding strand and coding strands, respe ctively (Lee et al., 1991). Amino acid analysis of the peptides cross-linked to the photoactive, noncoding strand indicates that finger two interacts with position 84 on the noncoding strand. This is in good agreement with the footprinting results obtai ned with broken finger mutants (Del Rio et al., 1993) and truncation mutants (Vrana et al., 1988).

 

TFIIIA binding

Although zinc finger proteins are thought to be essentially modular, TFIIIA interacts with its binding site in a manner apparently different than the sum of its parts. An individual finger from TFIIIA does not give a clearly defined DNase I footpr int (Frankel et al., 1987) and is probably not sufficient for sequence-specific binding (Parraga et al., 1988). And while polypeptides containing fingers 1-3 demonstrate sequence-specific binding, this is not seen with polypeptides containing fingers 1-2 or 2-4 (Liao et al., 1991). Deletion mutants and broken finger mutants demonstrate that the binding energy contributed by each finger is not equal (Clemens et al., 1994; Del Rio et al., 1993). In fact, TFIIIA binding is not merely the additive effect of the nine separate fingers interacting with the DNA (Clemens et al., 1994; Del Rio et al., 1993).

Several studies employing truncation mutants have suggested that most of the binding energy of TFIIIA is conferred by the N-terminal fingers (Clemens et al., 1994; Liao et al., 1992; Vrana et al., 1988). Gel mobility shift assays that compare the dissociation constants of a deletion mutant containing only fingers 1-3 with full-length protein show that the three-finger mutant binds with a DGo that is approximately 90% of the DGo for full-length TFIIIA binding (Liao et al., 1992). These studies are suppor ted by observations where removal of fingers from the amino-terminus abolish DNA-binding, assessed by footprinting assays (Fiser-Little et al., 1988), while removal of carboxyl-terminus fingers has little effect on DNA-binding (Vrana et al., 1988). Howev er, all of these studies assume that the fingers contained in a truncated polypeptide behave essentially as they would in the context of the full-length protein, and this assumption is probably not valid. Thermodynamic analyses of broken finger mutants c ontaining single- and double-finger disruptions demonstrate an unfavorable interaction between fingers in DNA-binding (Del Rio et al., 1993; Kehres et al., submitted for publication). In fact, a truncation mutant containing fingers 4-9 binds to the ICR w ith 83% of the binding energy seen with wild-type TFIIIA (Kehres et al., submitted for publication). If polypeptides containing fingers 1-3 and 4-9 contribute 90% and 83% of the binding energy, respectively, clearly this suggests that simultaneous bindin g of all nine fingers, in the context of full-length protein, involves energetically unfavorable interactions (either with the DNA, among fingers, or both).

Crystal Structures of TFIIIA-Like Proteins

A TFIIIA-DNA co-crystal structure has yet to be described; given the conserved aspects of the zinc finger motif, however, descriptions of TFIIIA-like protein-DNA crystal structures are likely to be relevant. To date, the crystallographic analyses of three such proteins have been published: Zif268 (Pavletich and Pabo, 1991), tramtrak (Fairall et. al; 1993), and GLI (Pavletich and Pabo, 1993).

Zif268 is a mouse transcription factor that contains three zinc fingers sharing the C2H2-type consensus sequence o f TFIIIA. A 73 amino acid polypeptide (corresponding to residues 349 to 421 of the native protein) that contains the zinc fingers was crystallized in the presence of a 11-bp DNA fragment containing the 9-bp sequence recognized by Zif268 (Pavletich and P abo, 1991). The structure of this complex was solved at 2.1 A. In agreement with the NMR studies, the zinc fingers contain a C-terminal antiparallel beta-ribbon followed by an N-terminal alpha helix, held together by zinc ion coordination with the conse rved cysteines and histidines and by a set of hydrophobic interactions. The three fingers are arranged in a semicircular structure that fit into the major groove of the DNA and the most N-terminal finger binds to the 3' end of the of the DNA so that the overall arrangement of the peptide is "antiparallel" with the DNA (Figure 5).

Each finger makes primary contacts with a 3-bp subsite of the DNA through the N-terminal portion of the alpha helix (in every case, the peptide interacts with guanines on the same strand of DNA). While the

 

 

 

 

 

 

 

 

 

 

 

Figure 5. Crystal structure of the Zif268-DNA complex (From Pavletich and Pabo, 1991).

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

helix fits snugly into the major groove, it is tipped at a steeper angle than the angle of the major groove. The beta-sheet is oriented away from the major groove on the back of the alpha helix. The first strand of the beta sheet does not make any co ntacts with the DNA while the second strand contacts the sugar phosphate backbone along one strand of the DNA.

As a whole, the residues involved in making base contacts are found immediately preceding the start of the alpha helix (-1) and at the second(+2), third (+3), and sixth (+6) position of the alpha helix. Fingers 1 and 3 have identical residues at t hese positions (Arg, Asp, Glu, Arg) and interact with the same 3-bp subsite sequence (GCG). In both of these fingers, the arginines found at positions -1 and +6 make primary interactions with the GCG subsite (Figure 6a). The aspartic acid at position +2 (which is conserved in all three fingers) does not make direct contact with the DNA, but instead stabilizes the side chain of arginine at position -1 by forming hydrogen bond-salt bridges with the guanidinium group already hydrogen-bonded to the first G of each subsite. The second G in the subsite (at the 5' end) is hydrogen bonded to the arginine at position +6.

Finger 2 interacts with the DNA differently than fingers 1 and 3 (Figure 6b). It recognizes a TGG subsite and contains Arg, Asp, His, and Thr at positions -1, +2, +3, and +6 respectively. Like fingers 1 and 3, the arginine at position -1 hydrogen bonds with the first G of the subsite with stabilizing interactions from the aspartic acid at position +2. However, instead of glutamic acid, finger 2 contains histidine (not a zinc-ligand) at position +3. This histidine forms a hydrogen bond with the second G in

 

 

 

 

 

 

 

 

 

Figure 6. Schematic representations of amino acid-nucleotide contacts seen in the fingers of ZIF268, tramtrak, and GLI. Numbers represent position of amino acids with respect to the start of the alpha helix.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

the subsite. The threonine at position +6 does not interact with the DNA.

In addition to the base contacts made by Zif268, several contacts are made with the sugar-phosphate backbone of the DNA. In each finger, the first histidine that contributes to the coordination of the zinc ion also hydrogen bonds to a phosphate on the primary strand of the DNA

(creating a zinc-histidine-phosphate contact). Thus, in addition to coordinating zinc, this particular residue also contributes to the overall binding energy of the peptide. A conserved arginine on the second beta-sheet (two residues downstream of th e second conserved cysteine) also contacts phosphate on the DNA. There are also four other phosphate contacts involving Arg3 (which precedes the first finger), Arg87 (two residues downstream of the second conserved histidine in finger 3), Ser45 (at position -2 in finger 2), and Ser75 (at position +1 in finger 3). The Ser75 contact is the only one that involves the secondary strand of the DNA.

Other residues in the alpha helix make important contributions to protein function, yet do not involve contact with the DNA. In the alpha helix of the first finger, the threonine-CO at position +5 and the arginine-CO at position +6 are too far awa y from their normal hydrogen bond partners, and as a result, a kink is introduced into the helix giving rise to a 310 helical arrangement. This kink bends the second half of the alpha hel ix (which resumes at position +8) toward the zinc binding site, putting the second histidine in a better position to coordinate the zinc ion. The same kink is also seen at the same positions in fingers 2 and 3.

Contacts between fingers were also observed. An arginine at the ninth position of the alpha helix of finger 1 forms a hydrogen bond with the backbone carbonyl of a serine at the -2 position of finger 2. The same interaction is seen in fingers 2 a nd 3, where an arginine at position 9 of the alpha helix of finger 2 interacts with the carbonyl of an alanine at position -2 of finger 3.

Several generalized features of the Zif268 complex are evident: 1. Each zinc finger constitutes an independent, modular unit which recognizes a distinct 3 bp contact site; 2. Most contacts are made with a single strand of the DNA; 3. The contacts i nvolve basic or polar amino acids that are found at specific positions in the alpha helix, namely, positions -1, and either +3 or +6. These features, in turn, have been suggested as part of a generalized recognition code between TFIIIA-like proteins and their binding sites (Berg, 1992; Jacobs, 1992). However, the analyses of two other crystalized zinc finger-DNA complexes indicate it is unlikely there is any simple, general code describing zinc finger-DNA interactions.

Tramtrack (TTK) is a transcriptional regulator of the Drosophila development gene fushi-tarazu. A 66-residue portion of this protein's DNA-binding domain, containing two zinc finger motifs, was crystallized in the presence of an 18-b p oligonucleotide and the complex was solved to 2.8 A (Fairall et. al; 1993). As with Zif268, the zinc fingers function as independent modular units that recognize a specific 3-bp sequence, but unlike Zif268, the binding sites overlap slightly. The fing ers fit into the major groove and make contact through amino acids in the N-terminal region of the alpha helix.

The structure of finger 2, and its interaction with the DNA, is essentially the same as that seen in Zif268 (Figure 6d). An arginine at position -1 (relative to the start of the alpha helix) hydrogen bonds with the second G of the 5'AGG3' subsite. An asparagine at position +3 makes contact with the 5' A. Like Zif268, there is an aspartic acid at position +2, but instead of merely stabilizing the arginine interaction, this residue actually hydrogen bonds with the C on the opposite strand of DNA t hat is base-paired to the most 3' G in the subsite (thus, finger 2 of TTK contacts three bases compared to two contacted by Zif268). Phosphate contacts are also similar to those seen in Zif268 and involve the first conserved histidine and a lysine residu e on the second beta-sheet (two residues downstream of the second conserved cysteine). The other phosphate contact involves a lysine in the alpha helix at position +1.

While finger 2 is very similar to the fingers of Zif268, finger 1 showed several unanticipated features. The residues N-terminal to the conventional finger motif fold to form a third strand to the beta-sheet. This feature has been previously seen in the first finger of protein SWI5 (Neuhaus et al., 1992). While these residues do not interact with the DNA, they are required for DNA-binding. The zinc-histidine-phosphate contact observed in all three fingers of Zif268 and finger 2 of TTK was not s een in finger 1. This was surprising as this interaction was expected to be widely conserved among zinc finger-DNA complexes since it involved the invariant histidine. In finger 1 of TTK, this interaction is substituted by a phosphate contact with tyros ine at position -3 (relative to the start of the alpha helix). Unlike the three fingers of Zif268 and finger 2 of TTK, there are many other phosphate contacts with residues in the second beta strand and turn region between this second beta strand and the alpha helix (arginine at position -5, threonine at position

-2, and histidine at position-1, all relative to the start of the alpha helix). The numerous contacts with the phosphate backbone are probably due to a significant bend in the DNA.

The recognition code employed by Zif268 and finger 2 of TTK does not strictly apply to finger 1. At position -1, instead of a base contact, there is a backbone contact. As with all fingers noted thus far, a 3-bp subsite (5'GAT3') is contacted. A n arginine at position +6 hydrogen bonds to the 5' G, an asparagine at position +3 hydrogen bonds to the middle A, and a serine at position +2 hydrogen bonds to the 3' T (Figure 6c).

The third protein that has been crystallized and complexed to DNA is the product of the human GLI oncogene, a DNA-binding protein that is amplified in human glioblastomas. Five zinc fingers of GLI were bound to a 21-bp DNA fragment obtained by in vitro selection from genomic DNA (Pavletich and Pabo, 1993). Since it is not known whether this DNA fragment contains the biologically relevant binding site, caution is required when interpreting these results. The complex was solved to 2.6 A.

There are several features of GLI structure and its interaction with DNA that are quite different from Zif268 and tramtrak. The first zinc finger does not contact the DNA, but instead makes extensive protein-protein contacts with finger 2. In ad dition, fingers 1,2 and 3 together make only one base contact (a tyrosine at position +2 in finger 2). Given that this complex may not contain the biologically relevant binding site, the anomalous features of the first three fingers may not be all that m eaningful.

Fingers 4 and 5 do make extensive base contacts in a 9-bp region, more so than those seen in either Zif268 or TTK. As with Zif268 and TTK, the contacts predominate in the N-terminal portion of the alpha helices of fingers 4 and 5. Finger 4 has fo ur residues that make contact with 5 bases (Figure 6e). An alanine at the position that corresponds to the first position (+1) of the alpha helix makes a van der Waals contact with the methyl group of T (the most 5' base in the recognition sequence). A serine at the second position (+2) hydrogen bonds with a T that is immediately 3' to the first T that interacts with Ala. The same serine also makes van der Waals contact with a G that is immediately 3' to the second T. This is the first example of a si ngle amino acid residue contacting more than one nucleotide. An aspartic acid at the third position (+3) hydrogen bonds to a C that is on the other strand of DNA. Finally, a lysine that corresponds to the sixth position in the alpha helix hydrogen bonds to a G immediately 3' to the third G in the recognition sequence (it is also base-paired to the C recognized by the aspartic acid).

Finger 5 involves five residues that make contacts with five bases (Figure 6f). The first residue, aspartic acid, is found at the position immediately preceding the alpha helix (position -1). It accepts hydrogen bonds from two adjacent cytosines . The next residue is a serine that corresponds to the second position (+2) of the alpha helix. It hydrogen bonds with a G that is base-paired with the most 5' C bound to aspartic acid. A serine in the third position (+3) hydrogen bonds to an A that is 5' to C. An arginine at the fifth position (+5) in the alpha helix hydrogen bonds to the same G bonded to the serine in the second position. Finally, a lysine at position +6 hydrogen bonds with a G that is 5' of the A (bond to the serine in the third p osition).

While the residues that make contact in fingers 4 and 5 show a correlation with the base contact positions in Zif268, many of these contacts could not have been predicted from a knowledge of Zif268 contacts. Furthermore, while Zif268 recognizes ba ses on only one strand of the DNA, GLI recognizes bases from both strands, where the bases recognized by finger 4 of GLI are found on the opposite strand from the bases recognized by finger 5. Figure 7 summarizes the base contact positions in Zif268, TTK , and GLI demonstrating that previous attempts to assign base contact residues to positions -1, +3, and +6 (Berg, 1992; Jacobs, 1992) are too simplistic.

In addition to base contacts, several residues in fingers 4 and 5 make contact with the phosphodiester oxygens. In finger 4, such contacts are made with residues which correspond to the following positions (relative to the start of the alpha helix ): positions +4, +7, +11, and +16. In finger 5,

 

 

 

 

 

 

 

Figure 7. Summary of all base contact positions seen in Zif268, tramtrak, and fingers 4 and 5 from GLI. a symbols represent alpha helical region.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

residues at positions -3, -2, and +5 make contact with phosphate groups.

 

Inter-species comparisons of TFIIIA

TFIIIA has been most extensively analyzed in Xenopus laevis. However, it has also been sequenced and partially characterized in four other species of frogs (Xenopus borealis, Rana pipiens, Rana catesbeiana, and Bufo americanus), human, and Saccharomyces cerevisiae. While all proteins are composed of an N-terminal region that

contains nine C2H2 zinc finger motifs, followed by a positively-charged C-terminal region, sequence analysis of the TF IIIA genes has revealed a surprising degree of divergence.

Among frogs, biochemical assays and sequence analysis indicate divergence between genera is greater than that which has occurred within genera. Immunoblot assays revealed that TFIIIA from both R. catesbeiana and R. pipiens reacted with anti-X.laevis TFIIIA antisera, but to a considerably lesser extent than TFIIIA from X. laevis and X. borealis (Gaskins et al., 1989). DNase I- protection analyses demonstrated that the Rana TFIIIAs imparted less protect ion of the 5' half of the Xenopus ICR than either Xenopus TFIIIA. In vitro transcription assays showed that both Rana TFIIIAs were less efficient at promoting transcription of a Xenopus 5S RNA gene than was either of the Xenopus TFIIIAs (Gaskins et al., 1989). The cDNAs for TFIIIA from Xenopus borealis and Rana catesbeiana were cloned and sequenced (Gaskins and Hanas, 1990). X. borealis TFIIIA is composed of 339 amino acids and shar es 84% sequence homology with X. laevis., while R. catesbeiana TFIIIA is composed of 335 amino acids and exhibits 63% sequence homology with X. laevis. The C-terminal halves of both proteins contain more non-conservative changes (wit h respect to the X. laevis sequence) than the N-terminal halves, with the most conserved region consisting of the first two zinc fingers and the most divergent region contained in the C-terminal domain. The Rana TFIIIA contains an insertio n of 4 amino acids between the 6th and 7th finger not seen in either Xenopus TFIIIA.

The cDNA clones of TFIIIA were later obtained from both Bufo americanus and Rana pipiens (Gaskins et al., 1992). Like X. borealis and R. catesbeiana, each gene contained nine tandem repeats corresponding to the C2H2 zinc finger motif seen in Xenopus laevis. In both cases, the zinc finger domains were found in the N-terminal thre e-fourths of the protein and both contained a non-finger C-terminal domain. TFIIIA from B. americanus contains 339 amino acids with 136 changes relative to X. laevis TFIIIA, resulting in a homology of 61%. TFIIIA from R. pipiens con tains 335 amino acids with 131 changes relative to X. laevis TFIIIA, resulting in a homology of 62%. R. pipiens TFIIIA also has a 13 amino acid deletion in the C-terminal domain, while B. americanus has single amino acid deletions in the N-terminal region and the seventh zinc finger, along with a three amino acid deletion in the C-terminal domain. Among X.laevis, B. americanus, and R. pipiens, finger one shows the highest overall conservation and the C-terminal domain has a highly conserved overall positive charge. In addition, there is also significant sequence conservation between a region comprising the last 5 amino acids of finger nine and the first 5-6 amino acids of the C-terminal domain. Footprinting st udies also showed that B. americanus TFIIIA protects the same region of the X. borealis somatic-type 5S gene ICR from DNase I digestion as does X. laevis TFIIIA (Gaskins et al., 1992). The only difference between the two is the loss of a hypersensitive site (position +43) seen with X. laevis TFIIIA-binding but not seen with B. americanus TFIIIA-binding. R. pipiens TFIIIA gives strong protection from nucleotide +96 to +78, but weaker protection from nucleotide +7 8 to +43 (corresponding to weaker protection imparted by the C-terminal fingers).

A 42-kDa human TFIIIA was purified from HeLa cell extracts (Moorefield and Roeder, 1994). The DNase I protection pattern of purified human TFIIIA on a human 5S RNA gene (from +44 to +97) is very similar to the pattern seen with Xenopus TFII IA and its binding site suggesting that their DNA-protein interactions are quite similar. Furthermore, both human and Xenopus 5S genes compete efficiently for human TFIIIA binding, in contrast to the inefficient binding of the yeast 5S gene with human TFIIIA. Immunological cross-reactivity between human and Xenopus TFIIIA was seen. Polyclonal antisera raised against Xenopus TFIIIA reacted with human TFIIIA. A monoclonal antibody that reacts with a 13-kDa CNBr fragment containing the N-t erminal portion and first two fingers of Xenopus TFIIIA also reacted with human TFIIIA, although to a lesser extent. However, another monoclonal antibody that reacts with a 20-kDa CNBr fragment of Xenopus TFIIIA that contains fingers 4-8 di d not react with human TFIIIA. These data suggest that the first two fingers of Xenopus and human TFIIIA are more conserved than fingers 4-8.

A clone isolated from a human brain cDNA library (Drew et al., 1995) possesses a deduced amino acid sequence with 57% homology to X. laevis TFIIIA and 59% homology to R. pipiens TFIIIA. The gene product encodes 363 amino acids that contain nine C2H2-type zinc finger motifs which, like Xenopus TFIIIA, include a linker region preceding the second and third fingers containing the TGEK/RP amino acid element. These data strongly suggest this is the human TFIIIA gene, but the gene product of this clone has not yet been expressed and characterized. The greatest homology between human and frog sequen ce is found in fingers 8 and 9.

Yeast TFIIIA is a 50-kDa protein containing nine C2H2 zinc fingers. Sequence analysis of the cloned yeast gene (Archambault et al., 1992) revealed the greatest extent of divergence yet seen among TFIIIAs. Homology between yeast and Xenopus TFIIIA is restricted to those amino acids that characterize the zinc finger motif (the four zinc-ligands and three conserved hydrophobic residues). The remaining amino acids demonstrate significant sequence divergence, where homology ranges from 20-40% (allowi ng for insertions and deletions). Fingers 6 and 7 show the least homology and the N-terminal and C-terminal regions differ in size (the N-terminal region of yeast TFIIIA has 36 more amino acids than Xenopus TFIIIA and the C-terminal region is 28 amino ac ids shorter). While comparisons between the TFIIIAs of X. borealis, X. laevis, and R. catesbeiana revealed exact conservation of Cys-Cys, Cys-His, and His-His spacing in homologous fingers (Gaskins and Hanas, 1990), many differences in the spacing of Cys and His residues are seen when yeast TFIIIA is compared with Xenopus TFIIIA. Yeast TFIIIA also contains a unique 81-amino acid domain that separates fingers 8 and 9 that is enriched with basic residues.

The sequence divergence observed between Xenopus and yeast TFIIIA is consistent with biochemical data showing species-specific activity. While Xenopus TFIIIA protects 5S RNA gene positions +96 to +43 from DNase I digestion, yeast TFI IIA protects a smaller region of the yeast 5S RNA gene ranging from +96 to +63 (Braun et al., 1989). Antibodies raised against Xenopus TFIIIA do not react with yeast TFIIIA and cDNA probes encoding Xenopus TFIIIA do not hybridize to yeast D NA (Wang and Weil, 1989). Xenopus TFIIIA also bound 100-1000-fold less efficiently to the yeast 5S RNA gene than did yeast TFIIIA (Wang and Weil, 1989).

While no other TFIIIA genes have been identified, C. elegans extracts that transcribe C. elegans 5S RNA genes were isolated (Honda et al., 1986). Species-specific activity was again seen, as these extracts could not transcribe X enopus 5S RNA genes and Xenopus extracts that transcribe Xenopus 5S RNA genes could not transcribe C. elegans 5S RNA genes.

TFIIIA is a rapidly evolving zinc finger protein. While human and frog TFIIIA show only 57-59% homology, significantly less divergence is seen in another zinc finger protein, erythroid transcription factor (Eryf 1) (Trainor et al., 1990). Mouse a nd human Eryf 1 demonstrate only 2 conservative substitutions among the 108 amino acids that comprise the two C2C2 zinc f ingers of these protein. Human and chicken Eryf 1 have just three conservative and five non-conservative substitutions. The rapid rate of TFIIIA evolution is surprising given that the ICR sequence remains largely conserved among species. For example, w hile there are 131 amino acid differences between R. pipiens and X. laevis TFIIIA (resulting in 62% homology), there are only 7 nucleotide substitutions in the 5S RNA gene from nucleotides +40 to +100. TFIIIA's rapid rate of evolution sugg ests it might serve as a good molecular clock for determining phylogenetic relationships between species (Nei, 1987).

A sequence analysis of 221 C2H2 zinc finger proteins containing 1340 zinc fingers shows that fingers in the same p rotein tend to be more similar to each other than fingers from other proteins (Jacobs, 1992). This suggests that the primary mode of zinc finger evolution involves internal duplication. This view is supported by the gene structure of Xenopus TFII IA, where introns exist between each of the domains corresponding to the first seven fingers (Tso et al., 1986).

Sequence comparison of the 1340 zinc fingers found the tip region and alpha helix to be the most conserved, the linker region had an intermediate level of conservation, and the beta sheet region was the least conserved. Surprisingly, the most conserv ed residue (that is not a zinc-ligand or conserved hydrophobic residue) is a lysine located two positions downstream of the second cysteine that coordinates zinc. Arginines in this position were seen to make contact with the phosphate backbone of DNA in all three fingers of Zif268 (Pavletich and Pabo, 1991). The next most-conserved residue is a glycine that immediately follows the second cysteine that serves as a zinc-ligand. The crystal structure of Zif268 shows the angle associated with the amino aci d in this position is easily adopted only by glycine. The third most conserved residue is a serine at alpha helical position +2. In Zif268, this is the position where transition from the beta sheet to the alpha helix has been completed and the regular a lpha helix begins. It has been proposed that serine at this position could cap the N-terminus of the helix by hydrogen bonding with residues at position -1 (Desjarlais and Berg, 1992). Within the alpha helical region, the positions that correspond to th e base contact sites seen in Zif268, namely, positions -1, +3, and +6, are strongly conserved.

While this sequence analysis correlates nicely with Zif268 structure, several considerations are relevant when extrapolating to TFIIIA sequence. First, only 49 of the 221 proteins analyzed contained 10 or more zinc fingers. Secondly, one of the u nderlying assumptions behind this analysis was that all or most zinc fingers interact with their ligands in the same manner. This is unlikely to be the case with TFIIIA given footprint analyses of truncation and broken finger mutants (Del Rio et al., 199 3, Clemens et al., 1994). Thirdly, the three most conserved residues (that are not zinc-ligands or conserved hydrophobic residues) mentioned above are not well-represented in TFIIIA. While a Lys or Arg is found in the position two residues downstream of the second zinc-coordinating cysteine in 6/9 fingers, the conserved Gly that immediately follows this cysteine is seen in only 3/9 fingers, and the conserved serine in alpha helical position +2 is found in only one of the fingers.

 

Mutational Analysis of Zif268 and other TFIIIA-like proteins

Although the crystal structure of Zif268 bound to its binding site has been solved, mutational analyses have proved fruitful in addressing the following questions: 1) How adaptable is the zinc finger motif when presented with variant DNA sites?, 2) What is the relative importance of amino acids that make contact with the DNA?, 3) Are there other amino acids essential for binding that were not detected by the static analysis of crystallography?, and 4) Are other residues outside the finger motif imp ortant in binding?

Mutagenic analyses have been largely consistent with the crystal data and have helped to uncover the relative importance of amino acids that bind to the DNA. Krox-20 is a serum-inducible transcription factor that is almost identical to Zif268 (Cha vrier et al., 1988). The three zinc fingers of Krox-20 bind to the target sequence:5' GCG GGG GCG 3'. This sequence is very similar to the target sequence of Sp1: 5' GGG GCG GGG 3'. Like Krox-20/Zif268, Sp1 is a DNA-binding protein containing three zi nc fingers (Kadonaga, et al., 1987). When the amino acid residues that form the alpha helices of both proteins are compared, a pattern emerges that reflects the similarities in binding sites: fingers 1 and 3 of Krox-20 and finger 2 of Sp1 bind to GCG and contain Glu at position +3 (relative to the start of the alpha helix) and Arg at position +6. Fingers 1 and 3 of Sp1 and finger 2 of Krox-20 bind to GGG and contain His at position +3. Site directed mutagenesis was used to convert His and Thr (position s +3, +6) in finger 2 of Krox-20 to Glu and Arg respectively (making it more like fingers 1 and 3) (Nardelli et al., 1991). The resulting protein was no longer able to bind to the Krox-20 consensus binding site, but instead formed abundant complexes with oligonucleotides carrying the sequence 5' GCG GCG GCG3' (this sequence was not recognized by wild-type Krox-20) (Nardelli et al., 1991). These data clearly suggest that the Glu and Arg residues in these positions recognize a GCG target site. To delimit the contribution of each amino acid, point mutants containing either the His-to-Glu or Thr-to-Arg substitutions were made in the second finger (Nardelli et al., 1991). The protein containing the His-to-Glu substitution at position +3 did not bind effici ently to either target sequence. The protein containing the Thr-to-Arg substitution at position +6 was able to bind efficiently to the wild-type target site. It also bound to the mutant sequence, 5' GCG GCG GCG3', but with a slightly reduced affinity co mpared to the double-substitution mutant. These results indicate that amino acids in positions +3 and +6 of fingers 1 and 3 combine to discriminate a single base pair, with position +6 being the strongest determinant of specificity. Gel retardation ana lyses of Krox-20 have also determined that as long as the 3' nucleotide of the binding site (the one that interacts with the Arg at position +6) is guanine, the second finger possesses a rather broad range of binding-specificity (Nardelli et al., 1991). Similar levels of binding were seen with different DNA-binding sites as long as they conformed to the 3-bp consensus sequence, 5'NG/AG3' (Nardelli et al., 1992).

While the ability to alter the sequence of finger two so that it interacts with the DNA like fingers 1 and 3 suggests a significant amount of modularity (where each finger acts as an independent unit that interacts with a specific nucleotide subsit e), other data indicate there are limits to the modularity. A Zif268-like protein was made containing amino acids QDR-RER-RHR in positions -1, +3, and +6 of fingers 1,2, and 3 respectively (Desjarlais and Berg, 1993). It bound with high affinity to the sequence 5'GGG GCG GCT3'. Another protein was made with the order of the first two fingers being switched: RER-QDR-RHR. In support of the modular hypothesis , it bound with high affinity to the permuted sequence 5'GGG GCT GCG3'. However, perfect modula rity was not seen, as the QDR-RER-RHR protein discriminated between its preferred site and the permuted site by a factor of 180, while the protein RER-QDR-RHR discriminated between its preferred site and the permeated site only by a factor of 7. Desjarla is and Berg (1993) explain this in terms of end effects, where the specificity due to any given residue depends upon whether it occurs within the central finger or at one of the ends. That is, fingers at the end would be exposed to less constraint since they would not be sandwiched by two sets of protein-DNA interactions. Since most of the fingers of TFIIIA (7/9) are not at the ends, deviance from perfect modularity would not be surprising in the case of TFIIIA. In fact, end effects are likely to be mo re significant with Zif268 (a three finger protein) than with TFIIIA (a nine finger protein), thus any degree of versatility and/or modularity seen in Zif268 (or other three finger proteins) may not be validly extrapolated to TFIIIA (and other larger zinc finger proteins).

Numerous point mutations have also been randomly introduced into Zif268 to determine other residues important for function. When a lexA-NGFI-A-Gal4 chimeric transcriptional activator protein was expressed in yeast, it was found to inhibit growth ( Wilson et al., 1992). Mutations in NGFI-A (also called Zif268) released cells from this growth inhibition and 90 point mutants were identified (Wilson et al., 1992). Crystal analysis of Zif268 made it possible to ascribe functions to 57 of the 83 residu es among the finger domains (Pavletich and Pabo, 1991) and of the 90 point mutants isolated by Wilson et al., 88 involved these residues. These data suggest that the 26 residues not seen to be important in the crystal analysis indeed contribute little to the informational content of the protein. The two residues not shown by cystallography to be important, yet detected by mutational analysis, were two lysines that were both part of the TGQ/EKP conserved pentapeptide element prior to the second and third fingers. In both cases, the Lys was changed to Asn.

All three fingers were shown to be important for binding, as a mutation in any single zinc finger could eliminate binding. Furthermore, each finger was about equally represented with 26, 36, and 28 mutants isolated affecting fingers 1,2, and 3 res pectively. Mutations affecting all four zinc-ligands and all three conserved hydrophobic residues were also identified.

Several mutations were discovered that further indicate the zinc finger motifs are not perfectly modular. Mutants were isolated that lacked the entire second finger. The resulting two-finger protein could not bind to the sequence 5'GCG GCG3' even though the modular hypothesis would predict such binding. However, there may not have been enough high-affinity contacts to detect binding with this two-finger protein. Furthermore, the two linker regions were not equally sensitive to mutations. Even t hough the amino acid and nucleotide sequences are similar in the regions associated with the linkers between fingers 1,2 and 2,3, the first linker was only mutated three times compared to 17 isolated mutations affecting the second linker.

Phage display (Barbas III, 1991; Barbas III, 1992) is a powerful tool for sorting zinc finger libraries that bind to different DNA sequences. The technique essentially involves the creation of a fusion protein containing the zinc finger domain and phage-coat protein that is expressed on the surface of phage. The zinc finger sequence is randomized, and expressing phages are passed over beads with specified binding sites. Those fingers that are more likely to bind to beads complexed with DNA, and the beads enriched with protein are eluted. A pool of phage expressing zinc finger motifs that bind to specified DNA is then created and the nucleotide sequence of the zinc-finger/coat fusion proteins (in a particular pool) is used to determine the amino acid sequences that bind.

Several residues (-1, +1, +2, +3, +5 and +6 relative to the start of the alpha helix) of the second finger of Zif268 were randomized (excluding Phe, Trp, Tyr, and Cys) and expressed on the surface of fd phage (Choo and Klug, 1994). A phage displa y library was made where the modified Zif268 proteins were selected by using a modified Zif268 operator sequence in which the middle DNA triplet was altered to the sequence of interest. Sequence analysis of the library showed that families of proteins re cognized specific sequences. For example, the clones selected with the triplet GAT all had Asn at position +3 and Arg at position +6. Most of the results are consistent with the x-ray crystallography data indicating base contacts with alpha helix positi ons -1, +3, and +6. When guanine is at the 5' or 3' end of the triplet, most of the selected proteins contained Arg at position +6 or -1 respectively. When guanine is in the middle position of the triplet, His at position +3 predominates in the selected family of proteins. Adenine at the 3' end of the triplet selects Gln at position -1, while it selects Asn at position +6 when it is at the 5' end. When it is in the middle position, it tends to select Asn at position +3. Cytosine at the 3' end of the triplet selects an acidic residue at position -1, together with Arg at position +1. When cytosine is in the middle position of the triplet, it tends to select Thr, Val, or Leu at position +3. Thymine in the middle position also selects Ser, Val, or Ala at position +3. Thymine at the 5' or 3' end selects a variety of polar amino acids.

The first finger of Zif268 was also subjected to random mutagenesis at its base contact positions (-1, +2, +3, and +6) and these mutant proteins were selected and sorted by phage display in the context of different operator sequences (Jamieson et al., 1994). It was determined that Arg at position +6 was strongly selected for when the binding site contained a 5' guanine. If the 5' position of the binding site contained thymine, Lys was strongly selected. These data are consistent with the nee d for a hydrogen bond donor that can make contact with guanine as seen in the crystal structure (Pavletich and Pabo, 1991). The crystal structure also showed Arg at position -1 forms a salt bridge with Asp at position +2, and this salt bridge is importan t in stabilizing the Arg so that it can make contact with the 3' position of the binding site. The mutagenesis analysis was consistent with the existence of an intramolecular interaction between these positions, as no mutant was found that contained resi dues of like charges at these two positions (most were of complementary charge). However, it was also clear that DNA sequences near the site of direct contact also influenced amino acid preference. When the sequence GCG was used to select mutagenized Zi f268/phage coat fusion proteins, a general consensus of Arg and Asp at positions -1 and +2, respectively, was seen. But changing the middle base so that the sequence used to select mutant proteins was GTG resulted in an entirely different consensus, wher e Glu and Arg appeared at positions -1 and +2, respectively.

Although the crystal structure of of Zif268 showed that Glu at position +3 in finger 1 did not make any contact with the DNA (Pavletich and Pabo, 1991), phage display analysis also shows a strong selection for Glu or Asp at this position when the WT sequence (GCG) was used (Jamieson et al., 1994) as the binding site..

A phage display analysis that involved mutations spanning the alpha helices of each of the three zinc fingers of Zif268 was consistent with earlier phage display results (Choo and Klug, 1994; Jamieson et al., 1994; however, several surprising featu res were uncovered (Wu et al., 1995). While proline residues have not been observed in TFIIIA-like zinc finger proteins beyond alpha helical position +2 (Jacobs, 1992), several clones (over 30%) containing proline residues within alpha helical region -2 to +6 survived the phage display selection. These data suggest that the DNA-binding function of some proteins can tolerate significant alterations in the secondary structure that has been typically proposed. Furthermore, there was no simple relationship between deviation from wild-type amino acid sequence and the binding affinity of some proteins. For example, a finger-one mutant had as much as a 13-fold increase in affinity for binding the wild-type sequence (GCG) although its amino acid sequence did not deviate greatly from wild-type (RSDELTR to KSADLKR) at positions -1 to +6. Yet another finger-three mutant had only a two-fold decrease in affinity for binding to the wild-type sequence (GCG) while its amino acid sequence deviated radically from wild -type (RSDERKR to SSGQWWR) at positions -1 to +6. The collective interpretation of all mutant sequences suggests that no simple coding relationship exists between protein and target DNA sequence.

While zinc finger phage display results demonstrate a high degree of versatility in protein-DNA interactions, several findings also suggest that such versatility is not without limit. For example, two studies were unable to produce zinc finger pha ge that bound to the sequence CCT (Choo and Klug, 1994; Rebar and Pabo, 1994). When the sequence of finger two was varied, specific binding was not seen with any nucleotide triplet that lacked a 5' and 3' guanine (Choo and Klug, 1994). Specific binding of finger one was also not seen when triplets ACG, CGC, ATA, and TAT were used to select zinc finger phage (Jamieson et al., 1994). While such sequence bias may be due to the limited extent of the Zif268 mutagenesis (where only select positions were rand omized), it does raise the possibility that zinc finger proteins may not be able to specifically bind certain spans of nucleotide sequence. This would not be surprising from an evolutionary perspective, as zinc finger proteins constitute only one class o f transcription factors.

In an attempt to decipher a 'zinc finger recognition code,' a data base analysis was used to guide the mutagenesis of finger two of Zif268 to alter its specificity (Nardelli et al., 1992). Gln was found to be the most common amino acid at position -1; thus, it was expected to be highly base-specific. Yet mutants that contained Gln at this position showed either little affinity for a variety of binding sites or little specificity among these binding sites. Molecular graphics, using the Zif268 coo rdinates, determined that when His was at position +3, Gln at position -1 was too far from the DNA to make stable hydrogen bonds. Re-examination of the data base showed a preference for small sidechains (with Asp and Asn being the most common) at positio n +3 when Gln was present at position -1. Such context dependence of Gln binding specificity at position -1, also seen in Sp1 (Desjarlais and Berg, 1992) and ADR1 (Thukral, 1991), again suggests that a generalized 'zinc finger recognition code' may be un obtainable.

A data base analysis of over 200 zinc finger proteins was used to guide the mutagenesis of the second finger of Sp1 (Desjarlais, and Berg, 1992). Specifically, the amino acid distribution for positions -1, +2, and +3 was determined. Gln was the m ost frequently seen residue at position -1, followed by Arg and Thr. Since Gln can make hydrogen bonds with adenine, analogous to the manner in which Arg hydrogen bonds with guanine (Seeman et al., 1976), it was hypothesized that changing Arg at position -1 (found in finger 2 of Sp1) to Gln would alter its binding specificity. However, further sequence analysis revealed amino acid correlations showing that the amino acids at positions +2 and +3 are strongly correlated with the identity of the amino acid at position -1. If position -1 is occupied by Gln, residues at positions +2 and +3 tend to be Ser and Asp, respectively. If Arg occupies position -1, the residue at position +2 tends to be Asp (the crystal structure of Zif268 demonstrated that Asp at +2 contributes to the binding activity by stabilizing the sidechain of Arg, a base contact residue at -1), while the residue at position +3 tends to be either His or Glu. These data were then used to guide the mutagenesis of the second finger of Sp1 to d etermine if sequence-specific binding for 5' GGG GCG GGG3' (the wild-type binding site) could be altered to permit recognition of 5'GGG GCA GGG3'. The importance of context dependence was clearly seen, as high affinity binding for the 5'GGG GCA GGG3' wa s seen only with a protein containing three amino acid substitutions: Arg to Gln at position -1, Asp to Ser at position +2, and Glu to Asp at position +3. All other single and double mutants failed to specifically bind the altered site.

Alanine-scanning site-directed mutagenesis was used to determine the residues of ADR1 (a yeast transcription factor) that are important for DNA-binding (Thukral et al., 1991). ADR1 possesses two zinc fingers of the TFIIIA type (C2H2). The results were largely consistent with the mode of binding seen in Sp1 and Zif268. When Arg at position -1 in both fingers was mu tated to alanine there was a significant reduction in affinity, although the loss of the Arg in finger two had a more severe effect. In addition , His at position +3 and Arg at position +6 in finger one were determined to be important for binding. Finge r two also has Arg at position +6, but it contributes less binding energy than the Arg at the same position in finger one. An Asp in position +2 of finger two is also important for binding and probably functions (like Zif268) in stabilizing the sidechain of Arg at position -1. A more extensive mutagenic analysis of ADR1 showed that positions -1, +3 and +6 in finger one would not tolerate several other amino acids in addition to alanine (Thukral et al., 1992).

Although the DNA-binding activity of both fingers involves an Arg at position -1 that interacts with guanine (Thukral et al., 1992) , the relative importance of both residues and their mode of interaction with the DNA appears to be different. Not only is this Arg more important for binding in finger two than in finger one, but in finger one, it can be replaced with Lys without any loss in affinity, while an Arg-to-Lys substitution at this position in finger two eliminates binding. Thus, even if t wo fingers contain the same amino acid-nitrogenous base interaction at the same position, one cannot assume their general mode of interaction is the same. In this case, the Arg in finger two probably hydrogen bonds with guanine in a similar manner to Zif 268 (since an Asp residue is at position +2), while phosphate contact with the Arg in finger one is suggested by the ability of Lys to functionally replace Arg.

It also appears that the residues in positions -1, +3, and +6 do not function independently. The first finger of wild-type ADR1 binds to the nucleotide sequence 5'GAG3' (Thukral, et al., 1992). When the Arg at position -1 in finger one is mutated to Gln, a change in specificity is seen where the mutant protein preferentially recognizes 5'GAA3'. Another mutant contains an Arg-to-Asn substitution at position +6. This mutant preferentially binds sequence 5'AAG3'. Based upon these results, one wou ld predict a double mutant, Arg-to-Gln at position -1 and Arg-to-Asn at position +6, would preferentially bind sequence 5'AAA3' if the positions functioned independently. However, the double mutant not only failed to bind to sequence 5'AAA3', it failed t o bind to all other sequences tested. Thus, concurrent contributions from several positions within the alpha helix appear to be essential for the proper orientation of the helix within the major groove so that sequence-specific binding can occur.

While conservation of sequence usually implies functional importance, it is interesting to note that several conserved residues of ADR1 were mutated with little or no effect. When leucine at position +4 in the first finger was mutated to alanine, no change in affinity was seen (Thukral, et al., 1991). Only a two-fold reduction in affinity was seen when the analogous leucine in finger 2 was mutated. This is surprising given that leucine is strongly conserved at this position among all TFIIIA-lik e proteins and is known to contribute to the hydrophobic pocket that stabilizes the tertiary structure of the zinc finger domain. Other positions containing amino acids conserved between the two fingers of ADR1 were changed without any effect on affinity . The linker region between the two fingers also contains residues strongly conserved among TFIIIA-like zinc finger proteins (TNEKPYP), yet a reduction in affinity is seen only when Thr, the N-term Pro, and the conserved Tyr are mutated.

There are 12 residues between the most C-terminal Cys ligand and the most N-terminal His ligand in ADR1 (such spacing is strongly conserved among all TFIIIA-like zinc fingers). An ADR1 deletion mutant was made so that only 11 residues separated th ese two ligands in the second finger (Parraga et al., 1990) and spectroscopic studies showed that this deletion peptide could not bind zinc tetrahedally nor was it capable of stable folding (Parraga et al., 1990).

A general gain-of-function mutant was identified that suppressed ADR1 mutants (Camier et al., 1992). An Arg-to-Lys substitution in the N-terminal region, about 15 amino acids N-terminal to the first finger suppressed loss of binding activity in a non-allele and non-finger specific fashion. This mutation also resulted in a higher affinity for ADR1 lacking mutations in the zinc finger domains. In all cases, there was a twofold to sevenfold increase in binding. When this Arg was mutated to Gly, a twofold reduction in affinity was seen. These data suggest, that in the case of ADR1, at least one residue outside the zinc finger domains is important for DNA-binding activity.

Mutagenesis analyses of Zif268, Sp1, and ADR1 show that positions

-1, +3, and +6, relative to the start of the alpha helix, are important for binding, consistent with the crystal-structure data. Furthermore, arginine-guanine contacts appear to be quite common among zinc fingers. However, the same data demonstrate z inc fingers are not perfectly modular and amino acid-nucleic acid interactions are dependent on the context of both molecules. Thus, while it is possible to rationally alter binding-specificity in a limited manner, by using sequence data and known amino acid-nucleotide contacts from other fingers, a simple, generalized code for zinc-finger/nucleic acid interaction is unlikely to be found.

 

Mutagenesis of TFIIIA

Although Xenopus TFIIIA has been subjected to several mutagenesis experiments in an attempt to uncover amino acid residues important for DNA-binding, the genetic dissection has not been as extensive as in the case of Zif268, Sp1, and ADR1. TFIIIA's large size, compared to the two/three finger proteins, renders the phage display technique impractical. Furthermore, since Xenopus is a poor genetic system, all mutagenesis studies to date have involved in vitro mutagenesis (oft en using truncation proteins) followed by in vitro characterization. Nevertheless, such studies have significantly contributed to our understanding of TFIIIA function.

Point mutations were unintentionally introduced into a TFIIIA truncation mutant containing fingers 1-7 (Clemens et al., 1994). A Gln-to-Arg substitution at position +3 (relative to the start of the alpha helix) of finger 4 was identified and resul ted in an 8.5-fold reduction in affinity relative to the same truncation mutant with wild-type sequence. Another seven-finger polypeptide contained a Thr-to-Ile substitution at position -1 in finger 6. This resulted in a 17-fold reduction in affinity. A seven-finger mutant, containing both substitutions, failed to bind specifically to the 5S RNA gene. It is of interest to point out that in both cases, a substitution has occurred in a position known to make base contacts within the major groove (as see n in the crystal structures of Zif268 and GLI). Yet several models for TFIIIA binding to DNA (Clemens et al., 1992; Hayes and Tullius, 1992) propose that fingers 4 and 6 do not bind within the major groove.

Site-directed mutagenesis was used to determine if the linker regions (the region between the last histidine of one finger and the first conserved hydrophobic residue of the next) of a truncated finger 1-3 protein were involved in DNA-binding (Clem ens et al., 1994; Choo and Klug, 1993). Residues within the linker region are likely to constrain the relation of one finger to the next and might even make contact with the DNA. The sequence TGEKP (often called the Kruppel -type linker because i t is typified by the Drosophila protein Kruppel ) between fingers 1-2 and 2-3 was targeted, as it is commonly found within the linker region of many TFIIIA-like zinc finger proteins (Gibson et al., 1988). TGEK was changed to the p43 sequen ce, SEQK (Clemens et al. 1994). p43 is a nine-finger 5S RNA-binding protein, homologous to TFIIIA, that does not bind to the 5S RNA gene (Joho et al., 1990; Darby and Joho, 1992). These substitutions between fingers 1-2 and fingers 2-3 abolished binding activity as determined by gel retardation assays. Replacing the linker between the first two fingers with residues that make up the linker between fingers 3 and 4 also abolished binding (Choo and Klug, 1993).

A truncation mutant containing the first three fingers was also made with a Thr-to-Ser substitution in the TGEK sequence between the first and second finger (Clemens et al., 1994). Although this is a conservative substitution, this mutant showed a 10-fold reduction in affinity due largely to a faster rate of dissociation. In a separate study, the same Thr was mutated to Leu and a 10-fold reduction in affinity was again seen (Choo and Klug, 1993). The crystal structure of Zif268 shows that the

-OH group from this Thr hydrogen bonds to the backbone amide of the third linker residue in Zif268, Gln. Thus, this mutation probably disrupts the linker structure, which in turn alters the positioning of the adjacent fingers.

Despite the strong conservation of sequence seen in this linker element, not all residues in the linker are equally sensitive to mutation (Choo and Klug, 1993). When the Glu of the TGEKP sequence seen in the first linker was changed to Lys no chan ge in affinity was seen. Similarly, there was only a slight reduction in affinity seen when the Asn of the TGEKN sequence of the second linker was changed to Pro. On the other hand, Lys-to-Ser and Gly-to-Pro substitutions in the first linker resulted in more than a ten-fold reduction in affinity. The latter substitution is not surprising as Gly has exceptional conformational freedom while Pro is quite constrained. Lys-to-Glu and Gly-to-Ser substitutions in the first linker, in the context of full-leng th protein, also resulted in a significant loss of binding (Smith et al., 1991) as seen in DNase I footprints. Interestingly, when the first linker sequence (TGEKP) or the second linker sequence (TGEKN) was replaced with the corresponding Xfin sequence f rom finger 31 (KNERP), DNA-binding affinity (Zang et al., 1995) of full-length TFIIIA was not significantly altered. Likewise, partial substitution of the third linker sequence, NIKI, with KNKI or GEKI, resulted in less that a two-fold reduction in affin ity.

Table I summarizes the mutagenesis data that pertain to the linker sequence between fingers one and two. These data show that the linker is not a passive polypeptide in the interaction with DNA as several amino acid substitutions a cross the length of the linker reduce binding. A proline appears to be required at the C-terminal end of the linker, suggesting conformation restriction is desirable at this position. A basic residue is required immediately N-terminal to the Pro and it is structurally feasible for the residue at this position to make phosphate contacts with the DNA (Choo and Klug, 1993). Immediately N-terminal to this position is a Glu that can be changed to Lys without any

Table I. Summary of mutagenesis effects involving amino acids within the linker region joining fingers one and two from TFIIIA.

 

 

 

Fingers present

Linker Sequence Effect in protein Reference

 

T G E K P WILD-TYPE

S E Q - - Reduced affinity 1-3 Clemens et al. 1994

S - - - - Reduced affinity 1-3 Clemens et al., 1994

L - - - - Reduced affinity 1-3 Choo and Klug, 1993

- - K - - None 1-3 Choo and Klug, 1993

- - - S - Reduced affinity 1-3 Choo and Klug, 1993

- P - - - Reduced affinity 1-3 Choo and Klug, 1993

- - - E - Reduced affinity 1-9 Smith et al., 1991

- S - - - Reduced affinity 1-9 Smith et al., 1991

- - - - N Reduced affinity 1-3 Choo and Klug, 1993

K N - R - None 1-9 Zang et al., 1995

 

 

 

 

 

 

 

 

 

 

 

 

deleterious effects on binding, suggesting this position is not involved in DNA-binding (despite its strong conservation). The position N-terminal to Glu appears to require a high degree of conformational freedom, as replacement of Gly by either Ser o r Pro results in a significant reduction in affinity. Finally, the Thr at the most N-terminal position, as noted above, probably hydrogen bonds to the backbone of the linker. The Thr-to-Lys and Gly-to-Asn substitutions seen by Zang et al. (1995) did not significantly reduce binding (80% relative affinity), however, the protein containing these mutations also contained five other mutations immediately N-terminal to the linker.

One particular linker mutant suggests that perfect modularity of zinc finger domains does not exist (Choo and Klug, 1993) within the first

three fingers of TFIIIA. The linker sequence between fingers 1-2 and 2-3 are almost identical: TGEKP and TGEKN, respectively. As noted above, replacing the Asn of the second linker with a Pro results only in a minor reduction in affinity (less than t wo-fold). However, replacement of the Pro in the first linker with Asn results in more than a ten-fold reduction in affinity. Since the Pro and Asn residues are not interchangeable, the relationship of fingers 1 and 2 is unlikely to be the same as that of fingers 2 and 3.

Site-directed mutagenesis has also been used to assess the role of Cys and His zinc-ligands in TFIIIA function (Smith et al., 1991; Del Rio et al., 1993). Cys-to-Ser substitutions in fingers 1, 4, and 6 (Smith et al., 1991) and His-to-Asn substitu tions in fingers 1-9 (Del Rio et al., 1993) resulted in altered protection patterns of the Xenopus borealis 5S RNA gene as seen by DNase I footprinting. As noted above, analysis of the broken finger mutants (generated by His-to-Asn substitutions) show that not all zinc fingers are equivalent in their contribution to the DNA-binding energy or in their patterns of binding to the 5S RNA gene ICR (Del Rio et al., 1993). Each of the nine fingers independently mutated showed a structural disruption of the mutated finger with little or no effect on other fingers (Del Rio et al., 1993). DNase I footprinting of these "broken finger" proteins demonstrated alterations in the pattern of protection that were highly localized. These localized contacts sugg est that there is a remarkable degree of functional independence between different fingers in terms of their capacity for sequence-specific binding. In addition, it was shown that while each finger contributes to the DNA-binding energy, they contribute t o widely varying extents. For example, a His-to-Asn substitution in fingers 3 or 4 resulted in 26-fold and 9-fold reductions in affinity, respectively. In contrast, the same mutation in finger 1 caused only a 2.5-fold reduction in affinity. Mutations i n fingers 7-9 also decrease transcriptional activity (Del Rio and Setzer, 1993).

The differences in the contribution to the free energy of DNA binding by individual zinc fingers seen in the broken finger analysis was supported by a series of "finger-swap" experiments. In order to identify amino acid residues of TFIIIA that are important for DNA-binding, a series of finger-swap, scanning, and single amino acid substitution mutants were constructed and binding activities were determined using a nitrocellulose filter-binding assay (Zang et al., 1995). Finger-swap mutants were co nstructed by subcloning cDNA sequences from either finger 31 of Xfin or finger 3 of p43 into the Xenopus TFIIIA gene so that a chimeric TFIIIA molecule was produced. Both Xfin and p43 exhibit RNA-binding activity with little or no DNA-binding acti vity ( Andreazzoli et al., 1993; Darby and Joho, 1992; Joho et al., 1990). Both contain C2H2 zinc fingers and, apart fro m the conserved zinc-ligands and hydrophobic residues, the amino acid sequences are vastly different from the TFIIIA zinc fingers. The finger-swap mutants showed that substitution of finger 1 or 7 had little or no effect on the interaction of TFIIIA with its binding site. In contrast, substitutions of fingers 2, 3, and 4-6 significantly reduced the chimeric TFIIIA's affinity for its binding site. Substitution of finger 3 with the p43 finger led to a 100-fold reduction in its DNA-binding activity. Fing er 3 was previously shown to make a large contribution to TFIIIA's binding energy through analysis of 'broken finger' mutants (Del Rio et al., 1993).

Scanning and single substitution mutations were also made within fingers 1-3 that identified amino acids, other than zinc-ligands and linker residues, important in TFIIIA function (Zang et al., 1995). These studies identified an arginine in finger 2 and a lysine in finger 3, both at position +6 within the alpha helix, as being important for binding (alanine substitutions significantly reduced affinity). Apparently, a positive charge at position +6 in finger 3 is sufficient for DNA-binding, as a L ys-to-Arg substitution in this position does not reduce binding. In addition, several other residues were identified in the third finger as being important for DNA-binding, suggesting that finger 3 makes extensive contact with the DNA. An Ala-to-Lys sub stitution at position +2, a Lys-to-Ser substitution at position +5, a Phe-to-Lys substitution at position +8, and an Arg-to-Tyr substitution at position +10 all resulted in significant reductions in affinity. Such extensive putative contacts in finger 3 suggest that it interacts with the DNA in a manner more analogous to that exhibited by fingers 4 and 5 from GLI than that of the three fingers of Zif268.