                         б

<110>  廪ѧ
 
<120>  ϺͰжDNA-RNAӺ˫ķ

<130>  FPCH12160040P

<150>  CN 201210021004.9
<151>  2012-01-04

<160>  20    

<170>  PatentIn version 3.3

<210>  1
<211>  2883
<212>  DNA
<213>  ˹

<220>
<223>  dHax3 DNA

<400>  1
atggacccaa tacgaagcag aacgccatca ccagctaggg aacttctctc tggaccacag     60

cctgatggag ttcagccaac tgcagatcga ggtgtttctc cgccagccgg tggcccttta    120

gatggtctcc cagcaagaag aacaatgtcc cgtaccagac tcccaagtcc ccctgccccg    180

tcgccagcct tttcagctga ctccttctct gatcttctta ggcaatttga cccttctctt    240

ttcaatacat cccttttcga ttcacttcct cctttcggcg cacatcatac tgaggcagcc    300

accggcgaat gggacgaagt ccaaagtggt ttaagggcag ctgatgctcc accaccgacg    360

atgagagtcg ctgttaccgc cgcacgtcct cctagagcca agccagcccc tagaagacga    420

gctgcgcaac cctccgatgc aagccctgca gctcaagtag accttcgaac actaggttac    480

tcccagcaac aacaagaaaa aataaagcca aaggttagat ctacagttgc acaacatcac    540

gaagccctag tcggacacgg atttacacat gctcatatcg tggctctttc acaacatcct    600

gcagctcttg gaacagtcgc tgtcaaatat caggatatga ttgctgcatt gccagaagct    660

actcacgaag ctatcgtcgg agttgggaaa caatggtcag gcgcaagagc attagaggcg    720

cttctcaccg tagctggtga attacgaggt cctccactcc aattggatac tgggcaatta    780

ttaaaaatcg ctaaacgagg tggagtcact gctgtcgaag ccgttcatgc atggcgtaac    840

gctctcacgg gcgcaccact aaaccttact cctgaacagg ttgtcgcaat agcttcacat    900

gatggcggaa aacaagctct tgaaacagtg caacgtctcc ttcccgtcct ctgtcaggct    960

cacggattga ctcctcagca ggtcgtcgca attgcatcac atgatggagg caaacaagct   1020

ttagaaacag tacaaagact attgcccgtt ctttgccaag cgcatgggtt aactcccgaa   1080

caagtcgttg ccattgcaag tcacgacgga ggtaaacaag ctctcgaaac ggttcaagca   1140

cttttacccg ttctctgtca agcacatgga ctcacacctg aacaagtagt tgctatcgca   1200

tcgaatggag gtggaaaaca agcactggaa actgtacaaa gacttttgcc agttttatgt   1260

caagcgcacg gtcttactcc tcaacaagtt gtcgccattg cctctaacgg tggtggaaaa   1320

caagctcttg aaactgtcca gagacttctg cccgttctat gtcaggctca tgggctaacc   1380

cctcaacagg ttgttgcaat cgcatctaat ggaggaggaa aacaagcttt agaaactgtc   1440

caacgactac tgcccgttct ctgccaagca cacggactta ccccacaaca agttgtggca   1500

atagcttcta attctggtgg taaacaagcc cttgagacgg ttcaaagact tctaccagtt   1560

ctttgtcagg cacatggatt gaccccacaa caggtcgtag caatcgcatc taatggaggt   1620

ggtaagcaag ctctagaaac ggtacaaaga ttacttcccg tgctttgtca agctcatgga   1680

ctcactcctc aacaagtggt cgctattgca agtcatgatg gtggaaagca agcactagaa   1740

accgtccaac gactccttcc tgttctctgt caagcacatg gtcttacgcc cgaacaagtt   1800

gttgctatag cttcgaacgg aggtggaaaa caagctctcg aaaccgtcca aaggctcctc   1860

ccagtacttt gccaagcaca tggattaacc cctgagcaag tagttgcaat tgcctcgcac   1920

gacggaggaa agcaagcatt agaaactgtt cagagacttt tgcctgtcct gtgtcaagcc   1980

cacggtctaa caccacaaca agtcgtcgca atcgctagta atggaggagg tagacctgca   2040

ttggagtcga tagtcgcaca actatcacga cctgatcccg ctcttgcagc attgacaaac   2100

gatcatttag tcgcacttgc atgtttagga ggacgaccag cacttgatgc cgttaagaaa   2160

ggactaccgc acgcccctgc attgattaaa agaacaaaca gacgaatccc ggagagaact   2220

tcacatcgtg tagccgatca tgctcaagtc gtaagagttt tgggtttctt ccaatgtcat   2280

tcccacccag ctcaagcttt tgacgatgca atgactcaat ttggaatgag tagacatgga   2340

ctcctgcaat tatttcgaag ggtcggagtt acagagctcg aagccaggtc aggaacgctg   2400

ccccccgcat ctcaacgatg ggatagaatt ctccaagcct ctggaatgaa aagagctaaa   2460

ccttcaccaa cgtccacaca aacaccagac caagcttctc tccacgcttt tgccgactca   2520

ctagagagag atctagatgc accgtcacct atgcatgaag gagaccaaac aagagcctct   2580

tcaagaaaac gttctcgttc tgatagagct gtcactggac cttccgccca acaatctttc   2640

gaagtccgag ttcctgagca acgagatgcc ctacacctgc ctttgctttc ttggggagtt   2700

aagcgaccac gtactagaat tggtggacta ctcgatccag gtacaccaat ggatgctgat   2760

ctcgttgctt cctctaccgt agtatgggag caagacgcag accccttcgc tggaactgct   2820

gacgatttcc cagcctttaa cgaggaagaa ttggcttggt taatggaact tctaccgcaa   2880

tga                                                                 2883


<210>  2
<211>  977
<212>  PRT
<213>  ˹

<220>
<223>  dHax3װ

<400>  2

Met His His His His His His Ile Thr Ser Leu Tyr Lys Lys Ala Gly 
1               5                   10                  15      


Leu Met Asp Pro Ile Arg Ser Arg Thr Pro Ser Pro Ala Arg Glu Leu 
            20                  25                  30          


Leu Ser Gly Pro Gln Pro Asp Gly Val Gln Pro Thr Ala Asp Arg Gly 
        35                  40                  45              


Val Ser Pro Pro Ala Gly Gly Pro Leu Asp Gly Leu Pro Ala Arg Arg 
    50                  55                  60                  


Thr Met Ser Arg Thr Arg Leu Pro Ser Pro Pro Ala Pro Ser Pro Ala 
65                  70                  75                  80  


Phe Ser Ala Asp Ser Phe Ser Asp Leu Leu Arg Gln Phe Asp Pro Ser 
                85                  90                  95      


Leu Phe Asn Thr Ser Leu Phe Asp Ser Leu Pro Pro Phe Gly Ala His 
            100                 105                 110         


His Thr Glu Ala Ala Thr Gly Glu Trp Asp Glu Val Gln Ser Gly Leu 
        115                 120                 125             


Arg Ala Ala Asp Ala Pro Pro Pro Thr Met Arg Val Ala Val Thr Ala 
    130                 135                 140                 


Ala Arg Pro Pro Arg Ala Lys Pro Ala Pro Arg Arg Arg Ala Ala Gln 
145                 150                 155                 160 


Pro Ser Asp Ala Ser Pro Ala Ala Gln Val Asp Leu Arg Thr Leu Gly 
                165                 170                 175     


Tyr Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr 
            180                 185                 190         


Val Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala 
        195                 200                 205             


His Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala 
    210                 215                 220                 


Val Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu 
225                 230                 235                 240 


Ala Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu 
                245                 250                 255     


Ala Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu 
            260                 265                 270         


Asp Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala 
        275                 280                 285             


Val Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu 
    290                 295                 300                 


Asn Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly 
305                 310                 315                 320 


Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln 
                325                 330                 335     


Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser His Asp 
            340                 345                 350         


Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu 
        355                 360                 365             


Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser 
    370                 375                 380                 


His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro 
385                 390                 395                 400 


Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile 
                405                 410                 415     


Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu 
            420                 425                 430         


Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val 
        435                 440                 445             


Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln 
    450                 455                 460                 


Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln 
465                 470                 475                 480 


Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr 
                485                 490                 495     


Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 
            500                 505                 510         


Gln Gln Val Val Ala Ile Ala Ser Asn Ser Gly Gly Lys Gln Ala Leu 
        515                 520                 525             


Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu 
    530                 535                 540                 


Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln 
545                 550                 555                 560 


Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His 
                565                 570                 575     


Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser His Asp Gly Gly 
            580                 585                 590         


Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln 
        595                 600                 605             


Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly 
    610                 615                 620                 


Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu 
625                 630                 635                 640 


Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser 
                645                 650                 655     


His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro 
            660                 665                 670         


Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile 
        675                 680                 685             


Ala Ser Asn Gly Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln 
    690                 695                 700                 


Leu Ser Arg Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu 
705                 710                 715                 720 


Val Ala Leu Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys 
                725                 730                 735     


Lys Gly Leu Pro His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg 
            740                 745                 750         


Ile Pro Glu Arg Thr Ser His Arg Val Ala Asp His Ala Gln Val Val 
        755                 760                 765             


Arg Val Leu Gly Phe Phe Gln Cys His Ser His Pro Ala Gln Ala Phe 
    770                 775                 780                 


Asp Asp Ala Met Thr Gln Phe Gly Met Ser Arg His Gly Leu Leu Gln 
785                 790                 795                 800 


Leu Phe Arg Arg Val Gly Val Thr Glu Leu Glu Ala Arg Ser Gly Thr 
                805                 810                 815     


Leu Pro Pro Ala Ser Gln Arg Trp Asp Arg Ile Leu Gln Ala Ser Gly 
            820                 825                 830         


Met Lys Arg Ala Lys Pro Ser Pro Thr Ser Thr Gln Thr Pro Asp Gln 
        835                 840                 845             


Ala Ser Leu His Ala Phe Ala Asp Ser Leu Glu Arg Asp Leu Asp Ala 
    850                 855                 860                 


Pro Ser Pro Met His Glu Gly Asp Gln Thr Arg Ala Ser Ser Arg Lys 
865                 870                 875                 880 


Arg Ser Arg Ser Asp Arg Ala Val Thr Gly Pro Ser Ala Gln Gln Ser 
                885                 890                 895     


Phe Glu Val Arg Val Pro Glu Gln Arg Asp Ala Leu His Leu Pro Leu 
            900                 905                 910         


Leu Ser Trp Gly Val Lys Arg Pro Arg Thr Arg Ile Gly Gly Leu Leu 
        915                 920                 925             


Asp Pro Gly Thr Pro Met Asp Ala Asp Leu Val Ala Ser Ser Thr Val 
    930                 935                 940                 


Val Trp Glu Gln Asp Ala Asp Pro Phe Ala Gly Thr Ala Asp Asp Phe 
945                 950                 955                 960 


Pro Ala Phe Asn Glu Glu Glu Leu Ala Trp Leu Met Glu Leu Leu Pro 
                965                 970                 975     


Gln 
    


<210>  3
<211>  499
<212>  PRT
<213>  ˹

<220>
<223>  dHax3ض就УC˺6Hisǩ

<400>  3

Met Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala Leu Leu Thr Val Ala 
1               5                   10                  15      


Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp Thr Gly Gln Leu Leu 
            20                  25                  30          


Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val Glu Ala Val His Ala 
        35                  40                  45              


Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn Leu Thr Pro Glu Gln 
    50                  55                  60                  


Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr 
65                  70                  75                  80  


Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 
                85                  90                  95      


Gln Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu 
            100                 105                 110         


Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu 
        115                 120                 125             


Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln 
    130                 135                 140                 


Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His 
145                 150                 155                 160 


Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly 
                165                 170                 175     


Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln 
            180                 185                 190         


Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly 
        195                 200                 205             


Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu 
    210                 215                 220                 


Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser 
225                 230                 235                 240 


Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro 
                245                 250                 255     


Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile 
            260                 265                 270         


Ala Ser Asn Ser Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu 
        275                 280                 285             


Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val 
    290                 295                 300                 


Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln 
305                 310                 315                 320 


Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln 
                325                 330                 335     


Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr 
            340                 345                 350         


Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 
        355                 360                 365             


Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu 
    370                 375                 380                 


Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu 
385                 390                 395                 400 


Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln 
                405                 410                 415     


Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His 
            420                 425                 430         


Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly 
        435                 440                 445             


Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro 
    450                 455                 460                 


Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu 
465                 470                 475                 480 


Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Leu Glu His His His 
                485                 490                 495     


His His His 
            


<210>  4
<211>  499
<212>  PRT
<213>  ˹

<220>
<223>  dHax3-NIض就УC˺6Hisǩ

<400>  4

Met Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala Leu Leu Thr Val Ala 
1               5                   10                  15      


Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp Thr Gly Gln Leu Leu 
            20                  25                  30          


Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val Glu Ala Val His Ala 
        35                  40                  45              


Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn Leu Thr Pro Glu Gln 
    50                  55                  60                  


Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr 
65                  70                  75                  80  


Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 
                85                  90                  95      


Gln Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu 
            100                 105                 110         


Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu 
        115                 120                 125             


Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln 
    130                 135                 140                 


Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His 
145                 150                 155                 160 


Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly 
                165                 170                 175     


Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln 
            180                 185                 190         


Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly 
        195                 200                 205             


Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu 
    210                 215                 220                 


Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser 
225                 230                 235                 240 


Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro 
                245                 250                 255     


Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val Ala Ile 
            260                 265                 270         


Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu 
        275                 280                 285             


Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln Val Val 
    290                 295                 300                 


Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln 
305                 310                 315                 320 


Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Gln Gln 
                325                 330                 335     


Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr 
            340                 345                 350         


Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 
        355                 360                 365             


Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu 
    370                 375                 380                 


Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu 
385                 390                 395                 400 


Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln 
                405                 410                 415     


Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His 
            420                 425                 430         


Gly Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly 
        435                 440                 445             


Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro 
    450                 455                 460                 


Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu 
465                 470                 475                 480 


Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Leu Glu His His His 
                485                 490                 495     


His His His 
            


<210>  5
<211>  794
<212>  PRT
<213>  ˹

<220>
<223>  TALE24ظԪ

<400>  5

Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
1               5                   10                  15      


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala 
            20                  25                  30          


Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
        35                  40                  45              


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val 
    50                  55                  60                  


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
65                  70                  75                  80  


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala 
                85                  90                  95      


Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 
            100                 105                 110         


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
        115                 120                 125             


Pro Ala Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala 
    130                 135                 140                 


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
145                 150                 155                 160 


Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
                165                 170                 175     


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            180                 185                 190         


His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn Ile Gly 
        195                 200                 205             


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    210                 215                 220                 


Gln Ala His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn 
225                 230                 235                 240 


Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                245                 250                 255     


Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala Ile Ala 
            260                 265                 270         


Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
        275                 280                 285             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala 
    290                 295                 300                 


Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
305                 310                 315                 320 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val 
                325                 330                 335     


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
            340                 345                 350         


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala 
        355                 360                 365             


Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 
    370                 375                 380                 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
385                 390                 395                 400 


Pro Ala Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 
                405                 410                 415     


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            420                 425                 430         


Leu Thr Pro Ala Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
        435                 440                 445             


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
    450                 455                 460                 


His Gly Leu Thr Pro Ala Gln Val Val Ala Ile Ala Ser His Asp Gly 
465                 470                 475                 480 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
                485                 490                 495     


Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala Ile Ala Ser His 
            500                 505                 510         


Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
        515                 520                 525             


Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala Ile Ala 
    530                 535                 540                 


Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
545                 550                 555                 560 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Asp Gln Val Val Ala 
                565                 570                 575     


Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
            580                 585                 590         


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Asp Gln Val 
        595                 600                 605             


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
    610                 615                 620                 


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala 
625                 630                 635                 640 


Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 
                645                 650                 655     


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
            660                 665                 670         


Pro Ala Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 
        675                 680                 685             


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
    690                 695                 700                 


Leu Thr Pro Ala Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
705                 710                 715                 720 


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
                725                 730                 735     


His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn Ile Gly 
            740                 745                 750         


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
        755                 760                 765             


Gln Ala His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn 
    770                 775                 780                 


Asn Gly Gly Arg Arg Cys Tyr Lys Ala Leu 
785                 790                 


<210>  6
<211>  760
<212>  PRT
<213>  ˹

<220>
<223>  TALEHIV ظԪ

<400>  6

Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
1               5                   10                  15      


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala 
            20                  25                  30          


Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
        35                  40                  45              


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val 
    50                  55                  60                  


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
65                  70                  75                  80  


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala 
                85                  90                  95      


Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 
            100                 105                 110         


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
        115                 120                 125             


Pro Ala Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala 
    130                 135                 140                 


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
145                 150                 155                 160 


Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
                165                 170                 175     


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            180                 185                 190         


His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn Gly Gly 
        195                 200                 205             


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    210                 215                 220                 


Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala Ile Ala Ser Asn 
225                 230                 235                 240 


Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                245                 250                 255     


Leu Cys Gln Ala His Gly Leu Thr Pro Ala Gln Val Val Ala Ile Ala 
            260                 265                 270         


Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
        275                 280                 285             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Asp Gln Val Val Ala 
    290                 295                 300                 


Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
305                 310                 315                 320 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Asp Gln Val 
                325                 330                 335     


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
            340                 345                 350         


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala 
        355                 360                 365             


Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu 
    370                 375                 380                 


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
385                 390                 395                 400 


Pro Ala Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala 
                405                 410                 415     


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            420                 425                 430         


Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys 
        435                 440                 445             


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
    450                 455                 460                 


His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn Ile Gly 
465                 470                 475                 480 


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
                485                 490                 495     


Gln Ala His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Asn 
            500                 505                 510         


Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
        515                 520                 525             


Leu Cys Gln Ala His Gly Leu Thr Pro Asp Gln Val Val Ala Ile Ala 
    530                 535                 540                 


Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
545                 550                 555                 560 


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Asp Gln Val Val Ala 
                565                 570                 575     


Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
            580                 585                 590         


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Asp Gln Val 
        595                 600                 605             


Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val 
    610                 615                 620                 


Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Ala 
625                 630                 635                 640 


Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 
                645                 650                 655     


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 
            660                 665                 670         


Pro Ala Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 
        675                 680                 685             


Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
    690                 695                 700                 


Leu Thr Pro Ala Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 
705                 710                 715                 720 


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
                725                 730                 735     


His Gly Leu Thr Pro Ala Gln Val Val Ala Ile Ala Ser His Asp Gly 
            740                 745                 750         


Gly Arg Arg Cys Tyr Lys Ala Leu 
        755                 760 


<210>  7
<211>  2397
<212>  DNA
<213>  ˹

<220>
<223>  TALE24ظԪDNA

<400>  7
attctagaag acactagtca tgacggtggc aaacaggctc ttgagaccgt ccaacgcctt     60

ctaccagttc tctgtcaagc ccacggacta accccagcgc aagttgtagc gattgctagt    120

catgacggtg gcaaacaggc ccttgagaca gtccaacgcc ttctaccagt tctctgccaa    180

gcacacggac taaccccagc gcaagttgta gcgattgcta gtcatgacgg tggcaaacag    240

gctcttgaaa ccgtgcaacg actgctccca gttctctgtc aagcccacgg cctcaccccg    300

gcgcaagttg tagcgattgc tagtaatggg ggtggcaaac aggctcttga aaccgtgcaa    360

cgactgctcc cagttctctg tcaagcccac ggcctcaccc cggcgcaagt tgtagcgatt    420

gctagtaatg ggggtggcaa acaggcactt gagactgttc agcgactact accagttctc    480

tgccaagccc acggacttac cccagatcaa gttgtagcga ttgctagtaa tgggggtggc    540

aaacaggcac ttgagactgt tcagcgacta ctaccagttc tctgccaagc ccacggactt    600

accccagatc aagttgtagc gattgctagt aatattggtg gcaaacaggc acttgagacg    660

gttcagcgcc tccttccagt tctttgtcaa gctcacggac tcaccccaga tcaagttgta    720

gcgattgcta gtaatggggg tggcaaacag gctcttgaaa ccgtgcaacg actgctccca    780

gttctctgtc aagcccacgg cctcaccccg gcgcaagttg tagcgattgc tagtcatgac    840

ggtggcaaac aggctcttga aaccgtgcaa cgactgctcc cagttctctg tcaagcccac    900

ggcctcaccc cggcgcaagt tgtagcgatt gctagtaatg ggggtggcaa acaggctctt    960

gaaaccgtgc aacgactgct cccagttctc tgtcaagccc acggcctcac cccggcgcaa   1020

gttgtagcga ttgctagtca tgacggtggc aaacaggctc ttgagaccgt ccaacgcctt   1080

ctaccagttc tctgtcaagc ccacggacta accccagcgc aagttgtagc gattgctagt   1140

aatgggggtg gcaaacaggc tcttgaaacc gtgcaacgac tgctcccagt tctctgtcaa   1200

gcccacggcc tcaccccggc gcaagttgta gcgattgcta gtcatgacgg tggcaaacag   1260

gctcttgaga ccgtccaacg ccttctacca gttctctgtc aagcccacgg actaacccca   1320

gcgcaagttg tagcgattgc tagtaatggg ggtggcaaac aggctcttga aaccgtgcaa   1380

cgactgctcc cagttctctg tcaagcccac ggcctcaccc cggcgcaagt tgtagcgatt   1440

gctagtcatg acggtggcaa acaggctctt gaaaccgtgc aacgactgct cccagttctc   1500

tgtcaagccc acggcctcac cccggcgcaa gttgtagcga ttgctagtca tgacggtggc   1560

aaacaggctc ttgagaccgt ccaacgcctt ctaccagttc tctgtcaagc ccacggacta   1620

accccagcgc aagttgtagc gattgctagt aatattggtg gcaaacaggc acttgagacg   1680

gttcagcgcc tccttccagt tctttgtcaa gctcacggac tcaccccaga tcaagttgta   1740

gcgattgcta gtaacaatgg tggcaaacag gctctcgaaa ccgtacaacg actcctccca   1800

gttctctgtc aagcccacgg actaactcct gatcaagttg tagcgattgc tagtcatgac   1860

ggtggcaaac aggctcttga gaccgtccaa cgccttctac cagttctctg tcaagcccac   1920

ggactaaccc cagcgcaagt tgtagcgatt gctagtaatg ggggtggcaa acaggctctt   1980

gaaaccgtgc aacgactgct cccagttctc tgtcaagccc acggcctcac cccggcgcaa   2040

gttgtagcga ttgctagtca tgacggtggc aaacaggctc ttgaaaccgt gcaacgactg   2100

ctcccagttc tctgtcaagc ccacggcctc accccggcgc aagttgtagc gattgctagt   2160

aacaatggtg gcaaacaggc tctcgaaacc gtacaacgac tcctcccagt tctctgtcaa   2220

gcccacggac taactcctga tcaagttgta gcgattgcta gtaatattgg tggcaaacag   2280

gcacttgaga cggttcagcg cctccttcca gttctttgtc aagctcacgg actcacccca   2340

gatcaagttg tagcgattgc tagcaacaat ggcggtcgac gctgctataa agcttta      2397


<210>  8
<211>  2295
<212>  DNA
<213>  ˹

<220>
<223>  TALEHIVظԪDNA

<400>  8
attctagaag acactagtca tgacggtggc aaacaggctc ttgagaccgt ccaacgcctt     60

ctaccagttc tctgtcaagc ccacggacta accccagcgc aagttgtagc gattgctagt    120

catgacggtg gcaaacaggc tcttgagacc gtccaacgcc ttctaccagt tctctgtcaa    180

gcccacggac taaccccagc gcaagttgta gcgattgcta gtcatgacgg tggcaaacag    240

gctcttgaaa ccgtgcaacg actgctccca gttctctgtc aagcccacgg cctcaccccg    300

gcgcaagttg tagcgattgc tagtaatggg ggtggcaaac aggctcttga aaccgtgcaa    360

cgactgctcc cagttctctg tcaagcccac ggcctcaccc cggcgcaagt tgtagcgatt    420

gctagtaata ttggtggcaa acaggcactt gagacggttc agcgcctcct tccagttctt    480

tgtcaagctc acggactcac cccagatcaa gttgtagcga ttgctagtaa caatggtggc    540

aaacaggctc tcgaaaccgt acaacgactc ctcccagttc tctgtcaagc ccacggacta    600

actcctgatc aagttgtagc gattgctagt aatgggggtg gcaaacaggc tcttgaaacc    660

gtgcaacgac tgctcccagt tctctgtcaa gcccacggcc tcaccccggc gcaagttgta    720

gcgattgcta gtaatggggg tggcaaacag gctcttgaaa ccgtgcaacg actgctccca    780

gttctctgtc aagcccacgg cctcaccccg gcgcaagttg tagcgattgc tagtaatatt    840

ggtggcaaac aggcacttga gacggttcag cgcctccttc cagttctttg tcaagctcac    900

ggactcaccc cagatcaagt tgtagcgatt gctagtaaca atggtggcaa acaggctctc    960

gaaaccgtac aacgactcct cccagttctc tgtcaagccc acggactaac tcctgatcaa   1020

gttgtagcga ttgctagtca tgacggtggc aaacaggctc ttgagaccgt ccaacgcctt   1080

ctaccagttc tctgtcaagc ccacggacta accccagcgc aagttgtagc gattgctagt   1140

catgacggtg gcaaacaggc tcttgaaacc gtgcaacgac tgctcccagt tctctgtcaa   1200

gcccacggcc tcaccccggc gcaagttgta gcgattgcta gtaatattgg tggcaaacag   1260

gcacttgaga cggttcagcg cctccttcca gttctttgtc aagctcacgg actcacccca   1320

gatcaagttg tagcgattgc tagtaacaat ggtggcaaac aggctctcga aaccgtacaa   1380

cgactcctcc cagttctctg tcaagcccac ggactaactc ctgatcaagt tgtagcgatt   1440

gctagtaata ttggtggcaa acaggcactt gagacggttc agcgcctcct tccagttctt   1500

tgtcaagctc acggactcac cccagatcaa gttgtagcga ttgctagtaa caatggtggc   1560

aaacaggctc tcgaaaccgt acaacgactc ctcccagttc tctgtcaagc ccacggacta   1620

actcctgatc aagttgtagc gattgctagt aatattggtg gcaaacaggc acttgagacg   1680

gttcagcgcc tccttccagt tctttgtcaa gctcacggac tcaccccaga tcaagttgta   1740

gcgattgcta gtaacaatgg tggcaaacag gctctcgaaa ccgtacaacg actcctccca   1800

gttctctgtc aagcccacgg actaactcct gatcaagttg tagcgattgc tagtcatgac   1860

ggtggcaaac aggctcttga gaccgtccaa cgccttctac cagttctctg tcaagcccac   1920

ggactaaccc cagcgcaagt tgtagcgatt gctagtaatg ggggtggcaa acaggctctt   1980

gaaaccgtgc aacgactgct cccagttctc tgtcaagccc acggcctcac cccggcgcaa   2040

gttgtagcga ttgctagtca tgacggtggc aaacaggccc ttgagacagt ccaacgcctt   2100

ctaccagttc tctgccaagc acacggacta accccagcgc aagttgtagc gattgctagt   2160

catgacggtg gcaaacaggc ccttgagaca gtccaacgcc ttctaccagt tctctgccaa   2220

gcacacggac taaccccagc gcaagttgta gcgattgcta gccatgacgg cggtcgacgc   2280

tgctataaag cttta                                                    2295


<210>  9
<211>  17
<212>  DNA
<213>  ˹

<220>
<223>  ˹ϳɵDNA 5'3'

<400>  9
tgtcccttta tctctct                                                    17


<210>  10
<211>  17
<212>  DNA
<213>  ˹

<220>
<223>  ˹ϳɵDNA 3'5'

<400>  10
acagggaaat agagaga                                                    17


<210>  11
<211>  17
<212>  RNA
<213>  ˹

<220>
<223>  ˹ϳɵRNA 3'5'

<400>  11
acagggaaau agagaga                                                    17


<210>  12
<211>  49
<212>  DNA
<213>  ˹

<220>
<223>  ˹ϳɵDNA 5'3'

<400>  12
ccacatatgt catacgtgtc cctttatctc tctccagctc gaggaattc                 49


<210>  13
<211>  48
<212>  DNA
<213>  ˹

<220>
<223>  ˹ϳɵDNA 5'3'

<400>  13
gaattcctga gctggagaga gataaaggga cacgtatgac atatgtgg                  48


<210>  14
<211>  49
<212>  RNA
<213>  ˹

<220>
<223>  ˹ϳɵRNA 5'3'

<400>  14
gaauuccucg agcuggagag agauaaaggg acacguauga cauaugugg                 49


<210>  15
<211>  31
<212>  DNA
<213>  ˹

<220>
<223>  ˹ϳɵDNA 5'3'

<400>  15
ccacatatgt catacgtgtc cctttatctc t                                    31


<210>  16
<211>  49
<212>  RNA
<213>  ˹

<220>
<223>  ˹ϳɵRNA 5'3'

<400>  16
gaauuccucg agcuggagag agauaaaggg acacguauga cauaugugg                 49


<210>  17
<211>  43
<212>  DNA
<213>  ˹

<220>
<223>  ˹ϳɵDNA 5'3'

<400>  17
ccacatatgt catacgtgtc cctttatctc tctccagctc gag                       43


<210>  18
<211>  49
<212>  RNA
<213>  ˹

<220>
<223>  ˹ϳɵRNA 5'3'
 
<400>  18
gaauuccucg agcuggagag agauaaaggg acacguauga cauaugugg                 49


<210>  19
<211>  26
<212>  DNA
<213>  ˹

<220>
<223>  ˹ϳɵDNA

<400>  19
gtgggttccc tagccagaga gctccc                                          26


<210>  20
<211>  36
<212>  RNA
<213>  ˹

<220>
<223>  ˹ϳɵRNA

<400>  20
agaucugagc cugggagcuc ucuggcuaac uaggga                               36


