                         SEQUENCE LISTING

<110>  Currie, Devin
       McBride, John
       Guss, Adam
 
<120>  MODIFIED CIPA GENE FROM CLOSTRIDIUM THERMOCELLUM FOR ENHANCED 
       GENETIC STABILITY

<130>  498888

<140>  To be assigned
<141>  2010-04-21

<160>  3     

<170>  PatentIn version 3.4

<210>  1
<211>  5562
<212>  DNA
<213>  Clostridium thermocellum

<400>  1
atgagaaaag tcatcagtat gctcttagtt gtggctatgc tgacgacgat ttttgcggcg     60

atgataccgc agacagtatc ggcggccaca atgacagtcg agatcggcaa agttacagca    120

gccgttggat caaaagtaga aatacctata accctgaaag gagtgccatc caaaggaatg    180

gccaattgcg acttcgtatt gggttatgat ccaaatgtgc tggaagtaac agaagtaaaa    240

ccaggaagca taataaaaga tccggatcct agcaagagct ttgatagcgc aatatatccg    300

gatcgaaaga tgattgtatt tctgtttgca gaagacagtg gaagaggaac gtatgcaata    360

actcaggatg gagtatttgc aacaattgta gccactgtca aatcagctgc agcggcaccg    420

attactttgc ttgaagtagg tgcatttgcg gacaacgatt tagtagaaat aagcacaact    480

tttgtcgcgg gcggagtaaa tcttggtagt tccgtaccga caacacagcc aaatgttccg    540

tcagacggtg tggtagtaga aattggcaaa gttacgggat ctgttggaac tacagttgaa    600

atacctgtat atttcagagg agttccatcc aaaggaatag caaactgcga ctttgtgttc    660

agatatgatc cgaatgtatt ggaaattata gggatagatc ccggagacat aatagttgac    720

ccgaatccta ccaagagctt tgatactgca atatatcctg acagaaagat aatagtattc    780

ctgtttgcgg aagacagcgg aacaggagcg tatgcaataa ctaaagacgg agtatttgca    840

aaaataagag caactgtaaa atcaagtgct ccgggctata ttactttcga cgaagtaggt    900

ggatttgcag ataatgacct ggtagaacag aaggtatcat ttatagacgg tggtgttaac    960

gttggcaatg caacaccgac caagggagca acaccaacaa atacagctac gccgacaaaa   1020

tcagctacgg ctacgcccac caggccatcg gtaccgacaa acacaccgac aaacacaccg   1080

gcaaatacac cggtatcagg caatttgaag gttgaattct acaacagcaa tccttcagat   1140

actactaact caatcaatcc tcagttcaag gttactaata ccggaagcag tgcaattgat   1200

ttgtccaaac tcacattgag atattattat acagtagacg gacagaaaga tcagaccttc   1260

tggtgtgacc atgctgcaat aatcggcagt aacggcagct acaacggaat tacttcaaat   1320

gtaaaaggaa catttgtaaa aatgagttcc tcaacaaata acgcagacac ctaccttgaa   1380

ataagcttta caggcggaac tcttgaaccg ggtgcacatg ttcagataca aggtagattt   1440

gcaaagaatg actggagtaa ctatacacag tcaaatgact actcattcaa gtctgcttca   1500

cagtttgttg aatgggatca ggtaacagca tacttgaacg gtgttcttgt atggggtaaa   1560

gaacccggtg gcagtgtagt accatcaaca cagcctgtaa caacaccacc tgcaacaaca   1620

aaaccacctg caacaacaaa accacctgca acaacaatac cgccgtcaga tgatccgaat   1680

gcaataaaga ttaaggtgga cacagtaaat gcaaaaccgg gagacacagt aaatatacct   1740

gtaagattca gtggtatacc atccaaggga atagcaaact gtgactttgt atacagctat   1800

gacccgaatg tacttgagat aatagagata aaaccgggag aattgatagt tgacccgaat   1860

cctgacaaga gctttgatac tgcagtatat cctgacagaa agataatagt attcctgttt   1920

gcagaagaca gcggaacagg agcgtatgca ataactaaag acggagtatt tgctacgata   1980

gtagcgaaag taaaatccgg agcacctaac ggactcagtg taatcaaatt tgtagaagta   2040

ggcggatttg cgaacaatga ccttgtagaa cagaggacac agttctttga cggtggagta   2100

aatgttggag atacaacagt acctacaaca cctacaacac ctgtaacaac accgacagat   2160

gattcgaatg cagtaaggat taaggtggac acagtaaatg caaaaccggg agacacagta   2220

agaatacctg taagattcag cggtatacca tccaagggaa tagcaaactg tgactttgta   2280

tacagctatg acccgaatgt acttgagata atagagatag aaccgggaga cataatagtt   2340

gacccgaatc ctgacaagag ctttgatact gcagtatatc ctgacagaaa gataatagta   2400

ttcctgtttg cggaagacag cggaacagga gcgtatgcaa taactaaaga cggagtattt   2460

gctacgatag tagcgaaagt aaaatccgga gcacctaacg gactcagtgt aatcaaattt   2520

gtagaagtag gcggatttgc gaacaatgac cttgtagaac agaagacaca gttctttgac   2580

ggtggagtaa atgttggaga tacaacagaa cctgcaacac ctacaacacc tgtaacaaca   2640

ccgacaacaa cagatgatct ggatgcagta aggattaaag tggacacagt aaatgcaaaa   2700

ccgggagaca cagtaagaat acctgtaaga ttcagcggta taccatccaa gggaatagca   2760

aactgtgact ttgtatacag ctatgacccg aatgtacttg agataataga gatagaaccg   2820

ggagacataa tagttgaccc gaatcctgac aagagctttg atactgcagt atatcctgac   2880

agaaagataa tagtattcct gtttgcggaa gacagcggaa caggagcgta tgcaataact   2940

aaagacggag tatttgctac gatagtagcg aaagtaaaat ccggagcacc taacggactc   3000

agtgtaatca aatttgtaga agtaggcgga tttgcgaaca atgaccttgt agaacagaag   3060

acacagttct ttgacggtgg agtaaatgtt ggagatacaa cagaacctgc aacacctaca   3120

acacctgtaa caacaccgac aacaacagat gatctggatg cagtaaggat taaagtggac   3180

acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca   3240

tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata   3300

atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact   3360

gcagtatatc ctgacagaaa gataatagta ttcctgtttg cagaagacag cggaacagga   3420

gcgtatgcaa taactaaaga cggagtattt gctacgatag tagcgaaagt aaaagaagga   3480

gcacctaacg gactcagtgt aatcaaattt gtagaagtag gcggatttgc gaacaatgac   3540

cttgtagaac agaagacaca gttctttgac ggtggagtaa atgttggaga tacaacagaa   3600

cctgcaacac ctacaacacc tgtaacaaca ccgacaacaa cagatgatct ggatgcagta   3660

aggattaaag tggacacagt aaatgcaaaa ccgggagaca cagtaagaat acctgtaaga   3720

ttcagcggta taccatccaa gggaatagca aactgtgact ttgtatacag ctatgacccg   3780

aatgtacttg agataataga gatagaaccg ggagaattga tagttgaccc gaatcctacc   3840

aagagctttg atactgcagt atatcctgac agaaagatga tagtattcct gtttgcggaa   3900

gacagcggaa caggagcgta tgcaataact gaagatggag tatttgctac gatagtagcg   3960

aaagtaaaat ccggagcacc taacggactc agtgtaatca aatttgtaga agtaggcgga   4020

tttgcgaaca atgaccttgt agaacagaag acacagttct ttgacggtgg agtaaatgtt   4080

ggagatacaa cagaacctgc aacacctaca acacctgtaa caacaccgac aacaacagat   4140

gatctggatg cagtaaggat taaagtggac acagtaaatg caaaaccggg agacacagta   4200

agaatacctg taagattcag cggtatacca tccaagggaa tagcaaactg tgactttgta   4260

tacagctatg acccgaatgt acttgagata atagagatag aaccgggaga cataatagtt   4320

gacccgaatc ctgacaagag ctttgatact gcagtatatc ctgacagaaa gataatagta   4380

ttcctgtttg cagaagacag cggaacggga gcgtatgcaa taactaaaga cggagtattt   4440

gctacgatag tagcgaaagt aaaagaagga gcacctaacg gactcagtgt aatcaaattt   4500

gtagaagtag gcggatttgc gaacaatgac cttgtagaac agaagacaca gttctttgac   4560

ggtggagtaa atgttggaga tacaacagta cctacaacat cgccgacaac aacaccgcca   4620

gagccgacga taactccgaa caagttgaca cttaagatag gcagagcaga aggaagacct   4680

ggagacacgg tggaaatacc ggttaacttg tatggagtac ctcaaaaagg aatagcaagc   4740

ggtgacttcg tagtaagcta tgacccgaat gtacttgaga taatagagat agaaccggga   4800

gaattgatag ttgacccgaa tcctaccaag agctttgata ctgcagtata tcctgacaga   4860

aagatgatag tattcctgtt tgcggaagac agcggaacag gagcgtatgc aataactgaa   4920

gatggagtat ttgctacgat agtagcgaaa gtaaaagaag gagcacctga aggattcagt   4980

gcaatagaaa tttctgagtt tggtgcattt gcagataatg atctggtaga agtggaaact   5040

gaccttatca atggtggagt acttgtaact aataaacctg taatagaagg atataaagta   5100

tccggataca ttttgccaga cttctccttc gacgctactg ttgcaccact tgtaaaggcc   5160

ggattcaaag ttgaaatagt aggaacagaa ttgtatgcag taacagatgc aaacggatac   5220

tttgaaataa ccggagtacc tgcaaatgca agcggatata cattgaagat ttcaagagca   5280

acttacttgg acagagtaat tgcaaatgtt gtagtaacgg gagatacttc agtttcaact   5340

tcacaggctc caataatgat gtgggtagga gacatagtga aagacaattc tatcaacctg   5400

ttggacgttg cagaagttat ccgttgcttc aacgctacta aaggaagcgc aaactacgta   5460

gaagaacttg acattaatag aaacggcgca attaacatgc aagacataat gattgttcat   5520

aagcactttg gagctacatc aagtgattac gacgcacagt aa                      5562


<210>  2
<211>  5562
<212>  DNA
<213>  Artificial

<220>
<223>  synthetic DNA

<400>  2
atgaggaagg tgatcagtat gttgttagtg gttgctatgt tgacgactat ctttgccgct     60

atgatccctc aaacggttag tgcagctact atgacagtag aaatcggaaa ggtcactgct    120

gccgtaggat ctaaagtaga aatcccgatt acattaaagg gcgttccgtc taaaggaatg    180

gctaattgtg attttgtact tggctatgat ccgaatgttc ttgaggttac tgaggtaaag    240

cctggttcta taattaaaga tcccgatcca agcaagagtt ttgactctgc aatttaccca    300

gatagaaaaa tgattgtttt tttattcgct gaagactctg gaagaggtac ttatgccatt    360

acacaagatg gggtgtttgc gactatcgtt gcgactgtga agagcgccgc tgccgcaccc    420

attacattac ttgaggtcgg ggcatttgcc gataatgacc ttgttgaaat atctacgact    480

tttgttgcag gcggtgttaa tcttggcagt tctgtgccta cgacgcaacc caatgttccg    540

tctgatggcg ttgtcgttga aataggaaag gtcactgggt ctgtcggaac gactgttgaa    600

attccagtat attttagagg cgtcccttca aagggtatag caaattgtga ctttgttttt    660

aggtatgatc cgaatgtatt agaaataata ggaatcgatc cgggagatat tatagtggat    720

cctaatccga ctaagagttt tgacactgct atatatccgg atagaaaaat tatagtcttt    780

cttttcgccg aagatagtgg aacaggggct tatgcaatta caaaggatgg ggtatttgcc    840

aagattaggg ctacggttaa gtcttcagcc ccgggatata tcacttttga tgaggttggg    900

ggctttgctg acaatgattt ggtggaacag aaggtatcat ttattgacgg tggggtgaat    960

gtgggaaacg ctactccaac taaaggagcc actccaacaa atacagctac accgactaaa   1020

tctgcaactg caactccgac aagaccttct gtgccaacta atactcctac taacacacca   1080

gcaaacactc cagtttcagg aaaccttaaa gttgagtttt acaattcaaa cccttctgat   1140

actactaatt ctatcaatcc acaattcaaa gtgacaaaca ctggttcatc agctatcgat   1200

ttgtcaaaac ttactcttag gtattactat acagtggatg gtcaaaagga tcaaacattt   1260

tggtgcgatc acgctgcaat catcggatct aatggatctt ataacggaat cacttcaaat   1320

gtgaaaggga ctttcgtgaa gatgagtagt agtacgaaca acgccgacac gtacttagag   1380

attagtttca ctggcggtac attggagcct ggagcccatg tacagattca ggggaggttt   1440

gccaagaacg actggagtaa ctatacacag agtaatgact acagtttcaa aagtgctagt   1500

caattcgttg agtgggacca ggtgactgcg tatttaaacg gagtgttagt ctggggaaag   1560

gagcctggtg ggagcgtcgt gccttctaca caaccagtta caacgccgcc agctactaca   1620

aagccaccgg cgacaactaa gcctccagcc acgacaattc cgccatctga tgatcctaat   1680

gctatcaaga taaaggtcga cactgtcaac gcaaaacctg gtgacacggt taacattccc   1740

gttaggttta gcggaatacc tagcaagggc attgcgaatt gcgattttgt ttatagttat   1800

gacccgaacg ttcttgagat aattgaaatc aagccgggag aacttatagt ggacccgaac   1860

ccagacaaat ctttcgatac agccgtttac ccagacagaa aaataatcgt cttcttgttt   1920

gcagaggatt caggcactgg cgcgtacgcg ataacaaaag acggtgtgtt cgcaacaata   1980

gttgcaaaag tcaaaagtgg tgcccccaac gggttaagtg taataaagtt cgttgaagtt   2040

ggcggcttcg ccaacaacga tcttgtcgag cagaggacgc agttttttga tggtggcgta   2100

aatgtggggg acactacggt cccaactaca ccgacgacac ctgtcacgac acctacggac   2160

gattcaaacg ccgtaaggat taaggttgat actgtgaacg ccaaaccggg tgatacggtt   2220

agaatcccag tgagattcag cggcatacca tctaaaggaa tcgcgaactg cgatttcgtt   2280

tactcttatg atccaaacgt gcttgaaatt atcgaaatag agcccggaga tatcatagtc   2340

gatcctaacc ccgataaatc tttcgatact gctgtgtatc cagataggaa gatcattgtg   2400

tttttgtttg cagaagacag cggcacgggc gcgtacgcaa tcacgaaaga cggagtgttc   2460

gcgacgatcg tcgcaaaggt gaagtcagga gcaccgaatg gcttaagtgt catcaaattc   2520

gttgaagttg gaggtttcgc aaataatgac cttgtagagc agaaaactca gtttttcgat   2580

ggtggggtaa acgtagggga cactacggag ccagctacgc ccacgacgcc tgttacaacg   2640

cccactacaa cggacgattt agacgctgtg aggataaagg ttgatacagt gaatgccaaa   2700

ccaggtgaca cagtcaggat cccagtgaga ttttctggaa ttccttctaa gggaattgct   2760

aactgcgact tcgtgtactc atacgaccca aatgtattgg agattataga gattgagccg   2820

ggcgatatta tcgtggatcc gaaccccgat aagtctttcg atacagcggt gtacccggac   2880

aggaaaatta tagtgttttt gttcgcggag gactcaggta cgggcgcgta tgctattact   2940

aaagacggag tattcgctac aatagtagcc aaagtcaaat ctggtgcccc caacggattg   3000

agtgtaatca agtttgttga agttggagga tttgcaaaca acgacttagt cgagcaaaaa   3060

actcagtttt ttgacggggg tgttaacgta ggtgatacga cggagcctgc aacacctaca   3120

actcccgtta ctacgccaac tactactgac gaccttgacg ccgtaagaat caaagtggat   3180

actgttaacg cgaagcctgg agatacagtt aggatacctg ttagattctc agggattcca   3240

tcaaaaggta tagccaactg tgacttcgtc tacagttatg atccaaacgt cttagaaatt   3300

atcgagatag agcctggtga cataattgtg gaccctaacc cggacaagag cttcgacaca   3360

gcggtatatc ctgataggaa aataatcgtt ttccttttcg cagaggattc aggcacagga   3420

gcatatgcaa taactaagga cggggtgttt gctacgatcg ttgcaaaagt gaaggaagga   3480

gctcccaacg gattaagtgt gattaagttc gtcgaggtcg gcgggttcgc taacaatgac   3540

ttggtagagc agaaaacaca gttttttgat ggaggagtta atgttggaga cacgacggag   3600

ccagctactc caacaacacc ggtcacgact ccaacgacaa ctgacgattt agatgctgtg   3660

aggataaaag ttgacacagt taacgccaag ccaggggaca ctgtgaggat ccctgttagg   3720

ttcagtggga taccgagtaa ggggatagcc aattgtgact ttgtttacag ttatgatccc   3780

aacgtattag agataataga aatcgagccc ggagagctta tcgtggaccc taaccccaca   3840

aagtcattcg acactgcggt gtacccggat aggaaaatga ttgtgttctt atttgccgag   3900

gatagcggaa ctggagcata cgcaatcacg gaagatggtg tatttgcaac tatagttgcc   3960

aaggtcaaga gtggtgctcc gaatggactt agtgtaataa aatttgtgga ggttggtggg   4020

ttcgcgaata acgatttagt ggagcagaag actcaattct tcgatggagg cgttaacgtc   4080

ggagacacga ctgagcctgc cacgccaact acgccagtta caacgccaac aactacggac   4140

gacttagacg ctgtgagaat aaaggttgac acagtcaacg cgaagcctgg tgacacggtc   4200

aggattccag tcagatttag cgggattccc agtaaaggaa ttgcaaactg cgactttgtg   4260

tatagttacg atccaaacgt cttagagatt attgagatag agcctggcga cattatcgtc   4320

gaccctaacc ctgacaagtc atttgacact gcagtttacc ctgacagaaa aattatcgtc   4380

ttcttattcg cggaggacag cggtacgggt gcgtacgcga tcacgaaaga cggcgttttt   4440

gcaacaatcg tcgccaaagt caaagagggg gcgccgaacg gtttatcagt tatcaagttc   4500

gtagaggttg gcggcttcgc gaataacgat cttgttgaac agaaaacgca attctttgac   4560

ggaggtgtca atgtaggaga tacgacggta cccacaacat cacctacaac gacacctccc   4620

gagcctacga tcactccgaa taaacttaca ttaaaaatag gcagggcgga gggaagaccg   4680

ggagacacag tggaaatccc tgtgaatttg tatggtgtcc cccagaaggg tatcgcctca   4740

ggagacttcg ttgtatctta cgatccaaac gttttggaga ttatagaaat agaaccgggc   4800

gagttaatag tggatccaaa tccaactaaa agtttcgaca cagcagtcta ccctgacagg   4860

aagatgatag tgtttctttt cgccgaggat agcggcacag gggcatatgc aataacggag   4920

gatggtgtct tcgccacgat agtggctaaa gtgaaggagg gagcaccgga gggattctct   4980

gctattgaaa tttctgaatt tggagcattc gctgacaacg accttgtgga ggtggagaca   5040

gacttgatca acggaggagt tcttgttact aataaacctg ttattgaagg ttataaagtt   5100

tcaggatata ttcttcctga ctttagtttt gacgccacgg tcgcacctct tgtcaaagct   5160

ggtttcaagg ttgagatagt agggacagaa ctttacgcgg taacggacgc gaatggatac   5220

ttcgaaatca caggagttcc tgcgaacgcc agtggataca cgttgaaaat ttctagagct   5280

acttaccttg acagggtcat agcgaacgtt gttgtgacgg gggacacttc tgtgagtacg   5340

agtcaggctc cgatcatgat gtgggttggg gacattgtca aggacaacag tatcaattta   5400

ttagacgttg cagaggtgat tagatgcttc aatgccacta agggtagtgc aaactacgta   5460

gaagagttag atatcaacag aaacggagca ataaacatgc aggatatcat gatagttcat   5520

aagcattttg gagctacgtc atctgattac gatgcacaat aa                      5562


<210>  3
<211>  1853
<212>  PRT
<213>  Clostridium thermocellum

<400>  3

Met Arg Lys Val Ile Ser Met Leu Leu Val Val Ala Met Leu Thr Thr 
1               5                   10                  15      


Ile Phe Ala Ala Met Ile Pro Gln Thr Val Ser Ala Ala Thr Met Thr 
            20                  25                  30          


Val Glu Ile Gly Lys Val Thr Ala Ala Val Gly Ser Lys Val Glu Ile 
        35                  40                  45              


Pro Ile Thr Leu Lys Gly Val Pro Ser Lys Gly Met Ala Asn Cys Asp 
    50                  55                  60                  


Phe Val Leu Gly Tyr Asp Pro Asn Val Leu Glu Val Thr Glu Val Lys 
65                  70                  75                  80  


Pro Gly Ser Ile Ile Lys Asp Pro Asp Pro Ser Lys Ser Phe Asp Ser 
                85                  90                  95      


Ala Ile Tyr Pro Asp Arg Lys Met Ile Val Phe Leu Phe Ala Glu Asp 
            100                 105                 110         


Ser Gly Arg Gly Thr Tyr Ala Ile Thr Gln Asp Gly Val Phe Ala Thr 
        115                 120                 125             


Ile Val Ala Thr Val Lys Ser Ala Ala Ala Ala Pro Ile Thr Leu Leu 
    130                 135                 140                 


Glu Val Gly Ala Phe Ala Asp Asn Asp Leu Val Glu Ile Ser Thr Thr 
145                 150                 155                 160 


Phe Val Ala Gly Gly Val Asn Leu Gly Ser Ser Val Pro Thr Thr Gln 
                165                 170                 175     


Pro Asn Val Pro Ser Asp Gly Val Val Val Glu Ile Gly Lys Val Thr 
            180                 185                 190         


Gly Ser Val Gly Thr Thr Val Glu Ile Pro Val Tyr Phe Arg Gly Val 
        195                 200                 205             


Pro Ser Lys Gly Ile Ala Asn Cys Asp Phe Val Phe Arg Tyr Asp Pro 
    210                 215                 220                 


Asn Val Leu Glu Ile Ile Gly Ile Asp Pro Gly Asp Ile Ile Val Asp 
225                 230                 235                 240 


Pro Asn Pro Thr Lys Ser Phe Asp Thr Ala Ile Tyr Pro Asp Arg Lys 
                245                 250                 255     


Ile Ile Val Phe Leu Phe Ala Glu Asp Ser Gly Thr Gly Ala Tyr Ala 
            260                 265                 270         


Ile Thr Lys Asp Gly Val Phe Ala Lys Ile Arg Ala Thr Val Lys Ser 
        275                 280                 285             


Ser Ala Pro Gly Tyr Ile Thr Phe Asp Glu Val Gly Gly Phe Ala Asp 
    290                 295                 300                 


Asn Asp Leu Val Glu Gln Lys Val Ser Phe Ile Asp Gly Gly Val Asn 
305                 310                 315                 320 


Val Gly Asn Ala Thr Pro Thr Lys Gly Ala Thr Pro Thr Asn Thr Ala 
                325                 330                 335     


Thr Pro Thr Lys Ser Ala Thr Ala Thr Pro Thr Arg Pro Ser Val Pro 
            340                 345                 350         


Thr Asn Thr Pro Thr Asn Thr Pro Ala Asn Thr Pro Val Ser Gly Asn 
        355                 360                 365             


Leu Lys Val Glu Phe Tyr Asn Ser Asn Pro Ser Asp Thr Thr Asn Ser 
    370                 375                 380                 


Ile Asn Pro Gln Phe Lys Val Thr Asn Thr Gly Ser Ser Ala Ile Asp 
385                 390                 395                 400 


Leu Ser Lys Leu Thr Leu Arg Tyr Tyr Tyr Thr Val Asp Gly Gln Lys 
                405                 410                 415     


Asp Gln Thr Phe Trp Cys Asp His Ala Ala Ile Ile Gly Ser Asn Gly 
            420                 425                 430         


Ser Tyr Asn Gly Ile Thr Ser Asn Val Lys Gly Thr Phe Val Lys Met 
        435                 440                 445             


Ser Ser Ser Thr Asn Asn Ala Asp Thr Tyr Leu Glu Ile Ser Phe Thr 
    450                 455                 460                 


Gly Gly Thr Leu Glu Pro Gly Ala His Val Gln Ile Gln Gly Arg Phe 
465                 470                 475                 480 


Ala Lys Asn Asp Trp Ser Asn Tyr Thr Gln Ser Asn Asp Tyr Ser Phe 
                485                 490                 495     


Lys Ser Ala Ser Gln Phe Val Glu Trp Asp Gln Val Thr Ala Tyr Leu 
            500                 505                 510         


Asn Gly Val Leu Val Trp Gly Lys Glu Pro Gly Gly Ser Val Val Pro 
        515                 520                 525             


Ser Thr Gln Pro Val Thr Thr Pro Pro Ala Thr Thr Lys Pro Pro Ala 
    530                 535                 540                 


Thr Thr Lys Pro Pro Ala Thr Thr Ile Pro Pro Ser Asp Asp Pro Asn 
545                 550                 555                 560 


Ala Ile Lys Ile Lys Val Asp Thr Val Asn Ala Lys Pro Gly Asp Thr 
                565                 570                 575     


Val Asn Ile Pro Val Arg Phe Ser Gly Ile Pro Ser Lys Gly Ile Ala 
            580                 585                 590         


Asn Cys Asp Phe Val Tyr Ser Tyr Asp Pro Asn Val Leu Glu Ile Ile 
        595                 600                 605             


Glu Ile Lys Pro Gly Glu Leu Ile Val Asp Pro Asn Pro Asp Lys Ser 
    610                 615                 620                 


Phe Asp Thr Ala Val Tyr Pro Asp Arg Lys Ile Ile Val Phe Leu Phe 
625                 630                 635                 640 


Ala Glu Asp Ser Gly Thr Gly Ala Tyr Ala Ile Thr Lys Asp Gly Val 
                645                 650                 655     


Phe Ala Thr Ile Val Ala Lys Val Lys Ser Gly Ala Pro Asn Gly Leu 
            660                 665                 670         


Ser Val Ile Lys Phe Val Glu Val Gly Gly Phe Ala Asn Asn Asp Leu 
        675                 680                 685             


Val Glu Gln Arg Thr Gln Phe Phe Asp Gly Gly Val Asn Val Gly Asp 
    690                 695                 700                 


Thr Thr Val Pro Thr Thr Pro Thr Thr Pro Val Thr Thr Pro Thr Asp 
705                 710                 715                 720 


Asp Ser Asn Ala Val Arg Ile Lys Val Asp Thr Val Asn Ala Lys Pro 
                725                 730                 735     


Gly Asp Thr Val Arg Ile Pro Val Arg Phe Ser Gly Ile Pro Ser Lys 
            740                 745                 750         


Gly Ile Ala Asn Cys Asp Phe Val Tyr Ser Tyr Asp Pro Asn Val Leu 
        755                 760                 765             


Glu Ile Ile Glu Ile Glu Pro Gly Asp Ile Ile Val Asp Pro Asn Pro 
    770                 775                 780                 


Asp Lys Ser Phe Asp Thr Ala Val Tyr Pro Asp Arg Lys Ile Ile Val 
785                 790                 795                 800 


Phe Leu Phe Ala Glu Asp Ser Gly Thr Gly Ala Tyr Ala Ile Thr Lys 
                805                 810                 815     


Asp Gly Val Phe Ala Thr Ile Val Ala Lys Val Lys Ser Gly Ala Pro 
            820                 825                 830         


Asn Gly Leu Ser Val Ile Lys Phe Val Glu Val Gly Gly Phe Ala Asn 
        835                 840                 845             


Asn Asp Leu Val Glu Gln Lys Thr Gln Phe Phe Asp Gly Gly Val Asn 
    850                 855                 860                 


Val Gly Asp Thr Thr Glu Pro Ala Thr Pro Thr Thr Pro Val Thr Thr 
865                 870                 875                 880 


Pro Thr Thr Thr Asp Asp Leu Asp Ala Val Arg Ile Lys Val Asp Thr 
                885                 890                 895     


Val Asn Ala Lys Pro Gly Asp Thr Val Arg Ile Pro Val Arg Phe Ser 
            900                 905                 910         


Gly Ile Pro Ser Lys Gly Ile Ala Asn Cys Asp Phe Val Tyr Ser Tyr 
        915                 920                 925             


Asp Pro Asn Val Leu Glu Ile Ile Glu Ile Glu Pro Gly Asp Ile Ile 
    930                 935                 940                 


Val Asp Pro Asn Pro Asp Lys Ser Phe Asp Thr Ala Val Tyr Pro Asp 
945                 950                 955                 960 


Arg Lys Ile Ile Val Phe Leu Phe Ala Glu Asp Ser Gly Thr Gly Ala 
                965                 970                 975     


Tyr Ala Ile Thr Lys Asp Gly Val Phe Ala Thr Ile Val Ala Lys Val 
            980                 985                 990         


Lys Ser Gly Ala Pro Asn Gly Leu  Ser Val Ile Lys Phe  Val Glu Val 
        995                 1000                 1005             


Gly Gly  Phe Ala Asn Asn Asp  Leu Val Glu Gln Lys  Thr Gln Phe 
    1010                 1015                 1020             


Phe Asp  Gly Gly Val Asn Val  Gly Asp Thr Thr Glu  Pro Ala Thr 
    1025                 1030                 1035             


Pro Thr  Thr Pro Val Thr Thr  Pro Thr Thr Thr Asp  Asp Leu Asp 
    1040                 1045                 1050             


Ala Val  Arg Ile Lys Val Asp  Thr Val Asn Ala Lys  Pro Gly Asp 
    1055                 1060                 1065             


Thr Val  Arg Ile Pro Val Arg  Phe Ser Gly Ile Pro  Ser Lys Gly 
    1070                 1075                 1080             


Ile Ala  Asn Cys Asp Phe Val  Tyr Ser Tyr Asp Pro  Asn Val Leu 
    1085                 1090                 1095             


Glu Ile  Ile Glu Ile Glu Pro  Gly Asp Ile Ile Val  Asp Pro Asn 
    1100                 1105                 1110             


Pro Asp  Lys Ser Phe Asp Thr  Ala Val Tyr Pro Asp  Arg Lys Ile 
    1115                 1120                 1125             


Ile Val  Phe Leu Phe Ala Glu  Asp Ser Gly Thr Gly  Ala Tyr Ala 
    1130                 1135                 1140             


Ile Thr  Lys Asp Gly Val Phe  Ala Thr Ile Val Ala  Lys Val Lys 
    1145                 1150                 1155             


Glu Gly  Ala Pro Asn Gly Leu  Ser Val Ile Lys Phe  Val Glu Val 
    1160                 1165                 1170             


Gly Gly  Phe Ala Asn Asn Asp  Leu Val Glu Gln Lys  Thr Gln Phe 
    1175                 1180                 1185             


Phe Asp  Gly Gly Val Asn Val  Gly Asp Thr Thr Glu  Pro Ala Thr 
    1190                 1195                 1200             


Pro Thr  Thr Pro Val Thr Thr  Pro Thr Thr Thr Asp  Asp Leu Asp 
    1205                 1210                 1215             


Ala Val  Arg Ile Lys Val Asp  Thr Val Asn Ala Lys  Pro Gly Asp 
    1220                 1225                 1230             


Thr Val  Arg Ile Pro Val Arg  Phe Ser Gly Ile Pro  Ser Lys Gly 
    1235                 1240                 1245             


Ile Ala  Asn Cys Asp Phe Val  Tyr Ser Tyr Asp Pro  Asn Val Leu 
    1250                 1255                 1260             


Glu Ile  Ile Glu Ile Glu Pro  Gly Glu Leu Ile Val  Asp Pro Asn 
    1265                 1270                 1275             


Pro Thr  Lys Ser Phe Asp Thr  Ala Val Tyr Pro Asp  Arg Lys Met 
    1280                 1285                 1290             


Ile Val  Phe Leu Phe Ala Glu  Asp Ser Gly Thr Gly  Ala Tyr Ala 
    1295                 1300                 1305             


Ile Thr  Glu Asp Gly Val Phe  Ala Thr Ile Val Ala  Lys Val Lys 
    1310                 1315                 1320             


Ser Gly  Ala Pro Asn Gly Leu  Ser Val Ile Lys Phe  Val Glu Val 
    1325                 1330                 1335             


Gly Gly  Phe Ala Asn Asn Asp  Leu Val Glu Gln Lys  Thr Gln Phe 
    1340                 1345                 1350             


Phe Asp  Gly Gly Val Asn Val  Gly Asp Thr Thr Glu  Pro Ala Thr 
    1355                 1360                 1365             


Pro Thr  Thr Pro Val Thr Thr  Pro Thr Thr Thr Asp  Asp Leu Asp 
    1370                 1375                 1380             


Ala Val  Arg Ile Lys Val Asp  Thr Val Asn Ala Lys  Pro Gly Asp 
    1385                 1390                 1395             


Thr Val  Arg Ile Pro Val Arg  Phe Ser Gly Ile Pro  Ser Lys Gly 
    1400                 1405                 1410             


Ile Ala  Asn Cys Asp Phe Val  Tyr Ser Tyr Asp Pro  Asn Val Leu 
    1415                 1420                 1425             


Glu Ile  Ile Glu Ile Glu Pro  Gly Asp Ile Ile Val  Asp Pro Asn 
    1430                 1435                 1440             


Pro Asp  Lys Ser Phe Asp Thr  Ala Val Tyr Pro Asp  Arg Lys Ile 
    1445                 1450                 1455             


Ile Val  Phe Leu Phe Ala Glu  Asp Ser Gly Thr Gly  Ala Tyr Ala 
    1460                 1465                 1470             


Ile Thr  Lys Asp Gly Val Phe  Ala Thr Ile Val Ala  Lys Val Lys 
    1475                 1480                 1485             


Glu Gly  Ala Pro Asn Gly Leu  Ser Val Ile Lys Phe  Val Glu Val 
    1490                 1495                 1500             


Gly Gly  Phe Ala Asn Asn Asp  Leu Val Glu Gln Lys  Thr Gln Phe 
    1505                 1510                 1515             


Phe Asp  Gly Gly Val Asn Val  Gly Asp Thr Thr Val  Pro Thr Thr 
    1520                 1525                 1530             


Ser Pro  Thr Thr Thr Pro Pro  Glu Pro Thr Ile Thr  Pro Asn Lys 
    1535                 1540                 1545             


Leu Thr  Leu Lys Ile Gly Arg  Ala Glu Gly Arg Pro  Gly Asp Thr 
    1550                 1555                 1560             


Val Glu  Ile Pro Val Asn Leu  Tyr Gly Val Pro Gln  Lys Gly Ile 
    1565                 1570                 1575             


Ala Ser  Gly Asp Phe Val Val  Ser Tyr Asp Pro Asn  Val Leu Glu 
    1580                 1585                 1590             


Ile Ile  Glu Ile Glu Pro Gly  Glu Leu Ile Val Asp  Pro Asn Pro 
    1595                 1600                 1605             


Thr Lys  Ser Phe Asp Thr Ala  Val Tyr Pro Asp Arg  Lys Met Ile 
    1610                 1615                 1620             


Val Phe  Leu Phe Ala Glu Asp  Ser Gly Thr Gly Ala  Tyr Ala Ile 
    1625                 1630                 1635             


Thr Glu  Asp Gly Val Phe Ala  Thr Ile Val Ala Lys  Val Lys Glu 
    1640                 1645                 1650             


Gly Ala  Pro Glu Gly Phe Ser  Ala Ile Glu Ile Ser  Glu Phe Gly 
    1655                 1660                 1665             


Ala Phe  Ala Asp Asn Asp Leu  Val Glu Val Glu Thr  Asp Leu Ile 
    1670                 1675                 1680             


Asn Gly  Gly Val Leu Val Thr  Asn Lys Pro Val Ile  Glu Gly Tyr 
    1685                 1690                 1695             


Lys Val  Ser Gly Tyr Ile Leu  Pro Asp Phe Ser Phe  Asp Ala Thr 
    1700                 1705                 1710             


Val Ala  Pro Leu Val Lys Ala  Gly Phe Lys Val Glu  Ile Val Gly 
    1715                 1720                 1725             


Thr Glu  Leu Tyr Ala Val Thr  Asp Ala Asn Gly Tyr  Phe Glu Ile 
    1730                 1735                 1740             


Thr Gly  Val Pro Ala Asn Ala  Ser Gly Tyr Thr Leu  Lys Ile Ser 
    1745                 1750                 1755             


Arg Ala  Thr Tyr Leu Asp Arg  Val Ile Ala Asn Val  Val Val Thr 
    1760                 1765                 1770             


Gly Asp  Thr Ser Val Ser Thr  Ser Gln Ala Pro Ile  Met Met Trp 
    1775                 1780                 1785             


Val Gly  Asp Ile Val Lys Asp  Asn Ser Ile Asn Leu  Leu Asp Val 
    1790                 1795                 1800             


Ala Glu  Val Ile Arg Cys Phe  Asn Ala Thr Lys Gly  Ser Ala Asn 
    1805                 1810                 1815             


Tyr Val  Glu Glu Leu Asp Ile  Asn Arg Asn Gly Ala  Ile Asn Met 
    1820                 1825                 1830             


Gln Asp  Ile Met Ile Val His  Lys His Phe Gly Ala  Thr Ser Ser 
    1835                 1840                 1845             


Asp Tyr  Asp Ala Gln 
    1850             


