                         SEQUENCE LISTING

<110>  Ramot at Tel-Aviv University Ltd.
 
<120>  SITE SPECIFIC RECOMBINASE INTEGRASE VARIANTS AND USES THEREOF IN 
       GENE EDITING IN EUKARYOTIC CELLS

<130>  2693629

<150>  US 62/803,637
<151>  2019-02-11

<150>  US 62/803,634
<151>  2019-02-11

<150>  US 62/803,640
<151>  2019-02-11

<160>  244   

<170>  PatentIn version 3.5

<210>  1
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 204

<400>  1
attgacgtca atgggagttt gttttggc                                          28


<210>  2
<211>  27
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 469

<400>  2
gcatttaggt gacactatag aataggg                                           27


<210>  3
<211>  55
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 894

<400>  3
gatcagggtg aggaacagca cactttacca atgaaagtcg tgaccaggcc acgtt            55


<210>  4
<211>  55
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 895

<400>  4
agctaacgtg gcctggtcac gactttcatt ggtaaagtgt gctgttcctc accct            55


<210>  5
<211>  61
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 944

<400>  5
cctttttaac ccatcacata tacctgccgt tctcaggtca ctaatactat ctaagtagtt       60

g                                                                       61


<210>  6
<211>  64
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 945

<400>  6
cgtttggatt gcaactggtc tattttcctc tcgacaaatg attttatttt gactaataat       60

gacc                                                                    64


<210>  7
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1021

<400>  7
gcagcagtgc agaggcgcca gcagcagcga g                                      31


<210>  8
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1022

<400>  8
ctcgctgctg ctggcgcctc tgcactgctg c                                      31


<210>  9
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1023

<400>  9
ctgccaggct gtacggcaac cagatcggcg ac                                     32


<210>  10
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1024

<400>  10
gtcgccgatc tggttgccgt acagcctggc ag                                     32


<210>  11
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1025

<400>  11
ctgggccaca agagcgtgag catggccgcc ag                                     32


<210>  12
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1026

<400>  12
ctggcggcca tgctcacgct cttgtggccc ag                                     32


<210>  13
<211>  357
<212>  PRT
<213>  Bacteriophage HK022


<220>
<221>  MISC_FEATURE
<223>  wt HK022 integrase

<400>  13

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  14
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E174K mutant of the HK022 integrase

<400>  14

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Lys Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  15
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E174K mutant of the HK022 integrase

<400>  15
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagca aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  16
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Consensus sequence of B


<220>
<221>  misc_feature
<222>  (4)..(4)
<223>  w is a or t

<220>
<221>  misc_feature
<222>  (5)..(10)
<223>  n is null

<400>  16
cttwnnnnnn                                                              10


<210>  17
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Consensus sequence of B'


<220>
<221>  misc_feature
<222>  (5)..(10)
<223>  n is null

<400>  17
aaagnnnnnn                                                              10


<210>  18
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Tay-Sachs Hexa3 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  18
accaatgnnn                                                              10


<210>  19
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Tay-Sachs Hexa7 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  19
taaaaatnnn                                                              10


<210>  20
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ataxia ATM4 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  20
gactcagnnn                                                              10


<210>  21
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ataxia ATM8 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  21
gtgaggtnnn                                                              10


<210>  22
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sickle cell anemia haem1 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  22
tctgaacnnn                                                              10


<210>  23
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sickle cell anemia haem13 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  23
gactaggnnn                                                              10


<210>  24
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lesch-Nyhan syndrome hgprt1 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  24
tatccctnnn                                                              10


<210>  25
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lesch-Nyhan syndrome hgprt13 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  25
cttttagnnn                                                              10


<210>  26
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Tay-Sachs Hexa3

<400>  26
acactttacc aatgaaagtc g                                                 21


<210>  27
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Tay-Sachs Hexa7

<400>  27
gaacttttaa aaataaaggg c                                                 21


<210>  28
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ataxia ATM4

<400>  28
tttctttgac tcagaaaggg a                                                 21


<210>  29
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ataxia ATM8

<400>  29
tgacttagtg aggtaaagta a                                                 21


<210>  30
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sickle cell anemia haem1 or hbb1

<400>  30
gtacttatct gaacaaagga g                                                 21


<210>  31
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sickle cell anemia haem13 or hbb13

<400>  31
tttctttgac taggaaaggg a                                                 21


<210>  32
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lesch-Nyhan syndrome hgprt1

<400>  32
agtcttttat ccctaaagga g                                                 21


<210>  33
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Lesch-Nyhan syndrome hgprt13

<400>  33
aaactttctt ttagaaaggt g                                                 21


<210>  34
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1030

<400>  34
ggaccgccaa gagcaaagtg cggcggagca gg                                     32


<210>  35
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1031

<400>  35
cctgctccgc cgcactttgc tcttggcggt cc                                     32


<210>  36
<211>  40
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1032

<400>  36
ctgggccggg acaggcggtt cgccatcacc gaggccatcc                             40


<210>  37
<211>  40
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1033

<400>  37
ggatggcctc ggtgatggcg aaccgcctgt cccggcccag                             40


<210>  38
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1051

<400>  38
atgtatttag aaaaataaac aaataggggt cgtgaggctc cggtgcccgt c                51


<210>  39
<211>  52
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1052

<400>  39
atctcccgat ccgtcgacgt caggtggcac acctagccag cttgggtctc cc               52


<210>  40
<211>  52
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1064

<400>  40
tcgagtctag agggcccgtt taaacccgct atggtgagca agggcgagga gg               52


<210>  41
<211>  56
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1065

<400>  41
gtcaaggaag gcacggggga ggggcaaaca ggacaaacca caactagaat gcagtg           56


<210>  42
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  I43F mutant of the HK022 integrase

<400>  42

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Phe Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  43
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  I43F mutant of the HK022 integrase

<400>  43
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggttcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  44
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E264G mutant of the HK022 integrase

<400>  44

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Gly Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  45
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E264G mutant of the HK022 integrase

<400>  45
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag gcgccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  46
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  R319G mutant of the HK022 integrase

<400>  46

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Gly Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  47
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  R319G mutant of the HK022 integrase

<400>  47
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtacggcaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  48
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  D336V mutant of the HK022 integrase

<400>  48

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Val 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  49
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  D336V mutant of the HK022 integrase

<400>  49
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgtgag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  50
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ataxia ATM2

<400>  50
gaacttatac cacgaaaggt a                                                 21


<210>  51
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ataxia ATM2 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  51
taccacgnnn                                                              10


<210>  52
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS SOD-1

<400>  52
taacttacat gctgaaagga a                                                 21


<210>  53
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS SOD-2

<400>  53
aatctttact gataaaaggt a                                                 21


<210>  54
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS SOD-1 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  54
catgctgnnn                                                              10


<210>  55
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS SOD-2 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  55
actgatannn                                                              10


<210>  56
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS TARDBP4

<400>  56
caccttagcc tcccaaagtg c                                                 21


<210>  57
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS TARDBP5

<400>  57
gtccttagta ggaaaaagta g                                                 21


<210>  58
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS TARDBP4 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  58
gcctcccnnn                                                              10


<210>  59
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS TARDBP5 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  59
gtaggaannn                                                              10


<210>  60
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS VAPB5

<400>  60
tgcctttctc ttccaaagca a                                                 21


<210>  61
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS VAPB6

<400>  61
ttactttgtg ggagaaagct a                                                 21


<210>  62
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS VAPB5 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  62
ctcttccnnn                                                              10


<210>  63
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS VAPB6 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  63
gtgggagnnn                                                              10


<210>  64
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS c9ORF 71-1

<400>  64
ctacttagag agtgaaagct g                                                 21


<210>  65
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS c9ORF 71-2

<400>  65
acactttcat ctgcaaagct a                                                 21


<210>  66
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS c9ORF 71-1, O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  66
gagagtgnnn                                                              10


<210>  67
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ALS c9ORF 71-2, O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  67
catctgcnnn                                                              10


<210>  68
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystinosis CTNS2

<400>  68
gagcttacta agcaaaagga g                                                 21


<210>  69
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystinosis CTNS3

<400>  69
gaacttttac tacaaaagca c                                                 21


<210>  70
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystinosis CTNS2 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  70
ctaagcannn                                                              10


<210>  71
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystinosis CTNS3 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  71
tactacannn                                                              10


<210>  72
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystinosis CTNS4

<400>  72
atacttatga gtgaaaagta t                                                 21


<210>  73
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Cystinosis CTNS4 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  73
tgagtgannn                                                              10


<210>  74
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1069

<400>  74
gaaagcaggt agcttgcagt gggc                                              24


<210>  75
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1070

<400>  75
ggcgacacgg aaatgttgaa tactcatac                                         29


<210>  76
<211>  78
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1143

<400>  76
tcaggttact catatatact ttagattgat gaattccagg atatccgaca aatgatttta       60

ttttgactaa taatgacc                                                     78


<210>  77
<211>  75
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1144

<400>  77
acggggtctg acgctcagtg gaacgaaaac ccgcggcagc ccgggctcag gtcactaata       60

ctatctaagt agttg                                                        75


<210>  78
<211>  58
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1167

<400>  78
caggttactc atatatactt tagattgatg aattccgcga tgtacgggcc agatatac         58


<210>  79
<211>  58
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1169

<400>  79
cattattagt caaaataaaa tcatttgtcg gatatcgcag tgggttctct agttagcc         58


<210>  80
<211>  8867
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  docking plasmid EF1alfa-attBHEXA3-PuroR-attBATM4-mCherry

<400>  80
gacggatcgg gagatcaggg tgaggaacag cacactttac caatgaaagt cgtgaccagg       60

cctcgttagc ttggtaccga gctcggatcc gaattcgtcg acctcgaaat tctaccgggt      120

aggggaggcg cttttcccaa ggcagtctgg agcatgcgct ttagcagccc cgctgggcac      180

ttggcgctac acaagtggcc tctggcctcg cacacattcc acatccaccg gtaggcgcca      240

accggctccg ttctttggtg gccccttcgc gccaccttct actcctcccc tagtcaggaa      300

gttccccccc gccccgcagc tcgcgtcgtg caggacgtga caaatggaag tagcacgtct      360

cactagtctc gtgcagatgg acagcaccgc tgagcaatgg aagcgggtag gcctttgggg      420

cagcggccaa tagcagcttt gctccttcgc tttctgggct cagaggctgg gaaggggtgg      480

gtccgggggc gggctcaggg gcgggctcag gggcggggcg ggcgcccgaa ggtcctccgg      540

aggcccggca ttctgcacgc ttcaaaagcg cacgtctgcc gcgctgttct cctcttcctc      600

atctccgggc ctttcgacct gcatccatct agatctcgag cagctgaagc ttaccatgac      660

cgagtacaag cccacggtgc gcctcgccac ccgcgacgac gtccccaggg ccgtacgcac      720

cctcgccgcc gcgttcgccg actaccccgc cacgcgccac accgtcgatc cggaccgcca      780

catcgagcgg gtcaccgagc tgcaagaact cttcctcacg cgcgtcgggc tcgacatcgg      840

caaggtgtgg gtcgcggacg acggcgccgc ggtggcggtc tggaccacgc cggagagcgt      900

cgaagcgggg gcggtgttcg ccgagatcgg cccgcgcatg gccgagttga gcggttcccg      960

gctggccgcg cagcaacaga tggaaggcct cctggcgccg caccggccca aggagcccgc     1020

gtggttcctg gccaccgtcg gcgtctcgcc cgaccaccag ggcaagggtc tgggcagcgc     1080

cgtcgtgctc cccggagtgg aggcggccga gcgcgccggg gtgcccgcct tcctggagac     1140

ctccgcgccc cgcaacctcc ccttctacga gcggctcggc ttcaccgtca ccgccgacgt     1200

cgaggtgccc gaaggaccgc gcacctggtg catgacccgc aagcccggtg cctgacgccc     1260

gccccacgac ccgcagcgcc cgaccgaaag gagcgcacga ccccatgcat cgatgatatc     1320

agcttactta ccatgtcaga tccagacatg ataagataca ttgatgagtt tggacaaacc     1380

acaactagaa tgcagtgaaa aaaatgcttt atttgtgaaa tttgtgtgct attgctttat     1440

ttgtaaccat tataagctgc aataaacaag ttaacaacaa caattcattc attttatgtt     1500

tcaggttcag ggggaggtgt gggaggtttt ttaaagcaag taaacctcta caaatgtggt     1560

atggctgatt atgatctcta gtcaaggcac tatacatcaa atatccttat taaccccttt     1620

acaaattaaa aagctaaagg tacacaattt ttgagcatag ttttaatagc agacactcta     1680

tgcctgtgtg gagtaagaaa aaacagtatg ttatgattat actgttatgc ctacttataa     1740

aggttacaga atatttttcc ataattttct tgtatagcag gcagcttttt cctttgtggt     1800

gtaaatagca aagcaagcaa gagttctatt actaaacacg catgactcaa aaaacttagc     1860

aattctgaag gaaagtcctt ggggtcttct acctttcttt cttttttgga ggagtagaat     1920

gttgagagtc agcagtagcc tcatcatcac tagatggatt tcttctgagc aaaacaggtt     1980

ttcctcatta aaggcattcc accactgctc ccattctcag ttccataggt tggaatctaa     2040

aatacacaaa caattagaat cagtagttta acacatatac acttaaaaat tttatattta     2100

ccttagagct ttaaatctct gtaggtagtt tgtcaattat gtcacaccac agaagtaagg     2160

ttccttcaca aagatccctc gagaaaaaaa ataaaaagag atggaggaac gggaaaaagt     2220

tagttgtggt gataggtggc aagtggtatt cctaagaaca acaagaaaag catttcatat     2280

tatggctgaa ctgagcgaac aagtgcaaaa ttaagcatca acgacaacaa cgagaatggt     2340

tatgttcctc ctcacttaag aggaaaacca gaagtgccag aaataacatg agcaactaca     2400

ataacaacaa cggcggctac aacggtggcg tggcggtggc agcttcttta gcaacaaccg     2460

tcgtggtggt tacggcaacg gtggtttctc ggtggaaaca acggtggcag cagatctaac     2520

ggccgttctg gtggtagatg gatcgatggc aaacatgtcc cagctccaag aaacgaaaag     2580

gccgagatcg ccatatttgg tgtccccgag gatcctctag agtcgacggt atcgataaag     2640

gggtcaggga gttccctttc tgagtcaaag aaagggggga cggacggcgc ggccgcatgg     2700

tgagcaaggg cgaggaggat aacatggcca tcatcaagga gttcatgcgc ttcaaggtgc     2760

acatggaggg ctccgtgaac ggccacgagt tcgagatcga gggcgagggc gagggccgcc     2820

cctacgaggg cacccagacc gccaagctga aggtgaccaa gggtggcccc ctgcccttcg     2880

cctgggacat cctgtcccct cagttcatgt acggctccaa ggcctacgtg aagcaccccg     2940

ccgacatccc cgactacttg aagctgtcct tccccgaggg cttcaagtgg gagcgcgtga     3000

tgaacttcga ggacggcggc gtggtgaccg tgacccagga ctcctccctg caggacggcg     3060

agttcatcta caaggtgaag ctgcgcggca ccaacttccc ctccgacggc cccgtaatgc     3120

agaagaagac catgggctgg gaggcctcct ccgagcggat gtaccccgag gacggcgccc     3180

tgaagggcga gatcaagcag aggctgaagc tgaaggacgg cggccactac gacgctgagg     3240

tcaagaccac ctacaaggcc aagaagcccg tgcagctgcc cggcgcctac aacgtcaaca     3300

tcaagttgga catcacctcc cacaacgagg actacaccat cgtggaacag tacgaacgcg     3360

ccgagggccg ccactccacc ggcggcatgg acgagctgta caagtgaata agcttggccg     3420

cgactctaga tcataatcag ccataccaca tttgtagagg ttttacttgc tttaaaaaac     3480

ctcccacacc tccccctgaa cctgaaacat aaaatgaatg caattgttgt tgttaacttg     3540

tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt cacaaataaa     3600

gcattttttt cactgcattc tagttgtggt ttgtcctgtt tgcccctccc ccgtgccttc     3660

cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc     3720

gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg     3780

ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta tggcttctga     3840

ggcggaaaga accagctggg gctctagggg gtatccccac gcgccctgta gcggcgcatt     3900

aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc     3960

gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca     4020

agctctaaat cgggggtccc tttagggttc cgatttagtg ctttacggca cctcgacccc     4080

aaaaaacttg attagggtga tggttcacgt acctagaagt tcctattccg aagttcctat     4140

tctctagaaa gtataggaac ttccttggcc aaaaagcctg aactcaccgc gacgtctgtc     4200

gagaagtttc tgatcgaaaa gttcgacagc gtctccgacc tgatgcagct ctcggagggc     4260

gaagaatctc gtgctttcag cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat     4320

agctgcgccg atggtttcta caaagatcgt tatgtttatc ggcactttgc atcggccgcg     4380

ctcccgattc cggaagtgct tgacattggg gaattcagcg agagcctgac ctattgcatc     4440

tcccgccgtg cacagggtgt cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt     4500

ctgcagccgg tcgcggaggc catggatgcg atcgctgcgg ccgatcttag ccagacgagc     4560

gggttcggcc cattcggacc gcaaggaatc ggtcaataca ctacatggcg tgatttcata     4620

tgcgcgattg ctgatcccca tgtgtatcac tggcaaactg tgatggacga caccgtcagt     4680

gcgtccgtcg cgcaggctct cgatgagctg atgctttggg ccgaggactg ccccgaagtc     4740

cggcacctcg tgcacgcgga tttcggctcc aacaatgtcc tgacggacaa tggccgcata     4800

acagcggtca ttgactggag cgaggcgatg ttcggggatt cccaatacga ggtcgccaac     4860

atcttcttct ggaggccgtg gttggcttgt atggagcagc agacgcgcta cttcgagcgg     4920

aggcatccgg agcttgcagg atcgccgcgg ctccgggcgt atatgctccg cattggtctt     4980

gaccaactct atcagagctt ggttgacggc aatttcgatg atgcagcttg ggcgcagggt     5040

cgatgcgacg caatcgtccg atccggagcc gggactgtcg ggcgtacaca aatcgcccgc     5100

agaagcgcgg ccgtctggac cgatggctgt gtagaagtac tcgccgatag tggaaaccga     5160

cgccccagca ctcgtccgag ggcaaaggaa tagcacgtac tacgagattt cgattccacc     5220

gccgccttct atgaaaggtt gggcttcgga atcgttttcc gggacgccgg ctggatgatc     5280

ctccagcgcg gggatctcat gctggagttc ttcgcccacc ccaacttgtt tattgcagct     5340

tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc atttttttca     5400

ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgtataccg     5460

tcgacctcta gctagagctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt     5520

tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt     5580

gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg     5640

ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg     5700

cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg     5760

cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat     5820

aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc     5880

gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc     5940

tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga     6000

agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt     6060

ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg     6120

taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc     6180

gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg     6240

gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc     6300

ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg     6360

ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc     6420

gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct     6480

caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt     6540

taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa     6600

aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa     6660

tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc     6720

tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct     6780

gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca     6840

gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt     6900

aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt     6960

gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc     7020

ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc     7080

tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt     7140

atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact     7200

ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc     7260

ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt     7320

ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg     7380

atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct     7440

gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa     7500

tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt     7560

ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggtcgtgagg     7620

ctccggtgcc cgtcagtggg cagagcgcac atcgcccaca gtccccgaga agttgggggg     7680

aggggtcggc aattgaaccg gtgcctagag aaggtggcgc ggggtaaact gggaaagtga     7740

tgtcgtgtac tggctccgcc tttttcccga gggtggggga gaaccgtata taagtgcagt     7800

agtcgccgtg aacgttcttt ttcgcaacgg gtttgccgcc agaacacagg taagtgccgt     7860

gtgtggttcc cgcgggcctg gcctctttac gggttatggc ccttgcgtgc cttgaattac     7920

ttccacctgg ctgcagtacg tgattcttga tcccgagctt cgggttggaa gtgggtggga     7980

gagttcgagg ccttgcgctt aaggagcccc ttcgcctcgt gcttgagttg aggcctggcc     8040

tgggcgctgg ggccgccgcg tgcgaatctg gtggcacctt cgcgcctgtc tcgctgcttt     8100

cgataagtct ctagccattt aaaatttttg atgacctgct gcgacgcttt ttttctggca     8160

agatagtctt gtaaatgcgg gccaagatct gcacactggt atttcggttt ttggggccgc     8220

gggcggcgac ggggcccgtg cgtcccagcg cacatgttcg gcgaggcggg gcctgcgagc     8280

gcggccaccg agaatcggac gggggtagtc tcaagctggc cggcctgctc tggtgcctgg     8340

cctcgcgccg ccgtgtatcg ccccgccctg ggcggcaagg ctggcccggt cggcaccagt     8400

tgcgtgagcg gaaagatggc cgcttcccgg ccctgctgca gggagctcaa aatggaggac     8460

gcggcgctcg ggagagcggg cgggtgagtc acccacacaa aggaaaaggg cctttccgtc     8520

ctcagccgtc gcttcatgtg actccacgga gtaccgggcg ccgtccaggc acctcgatta     8580

gttctcgagc ttttggagta cgtcgtcttt aggttggggg gaggggtttt atgcgatgga     8640

gtttccccac actgagtggg tggagactga agttaggcca gcttggcact tgatgtaatt     8700

ctccttggaa tttgcccttt ttgagtttgg atcttggttc attctcaagc ctcagacagt     8760

ggttcaaagt ttttttcttc catttcaggt gtcgtgagga attagcttgg tactaatacg     8820

actcactata gggagaccca agctggctag gtgtgccacc tgacgtc                   8867


<210>  81
<211>  6392
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  incoming plasmid attPHEXA5-GFP(ORF)-NeoR-CMV promoter-attPATM4

<400>  81
tagttattag atctcgagct caagcttaag cttacttacc atgtcagatc cagacatgat       60

aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa aatgctttat      120

ttgtgaaatt tgtgtgctat tgctttattt gtaaccatta taagctgcaa taaacaagtt      180

aacaacaaca attcattcat tttatgtttc aggttcaggg ggaggtgtgg gaggtttttt      240

aaagcaagta aacctctaca aatgtggtat ggctgattat gatctctagt caaggcacta      300

tacatcaaat atccttatta acccctttac aaattaaaaa gctaaaggta cacaattttt      360

gagcatagtt ttaatagcag acactctatg cctgtgtgga gtaagaaaaa acagtatgtt      420

atgattatac tgttatgcct acttataaag gttacagaat atttttccat aattttcttg      480

tatagcaggc agctttttcc tttgtggtgt aaatagcaaa gcaagcaaga gttctattac      540

taaacacgca tgactcaaaa aacttagcaa ttctgaagga aagtccttgg ggtcttctac      600

ctttctttct tttttggagg agtagaatgt tgagagtcag cagtagcctc atcatcacta      660

gatggatttc ttctgagcaa aacaggtttt cctcattaaa ggcattccac cactgctccc      720

attctcagtt ccataggttg gaatctaaaa tacacaaaca attagaatca gtagtttaac      780

acatatacac ttaaaaattt tatatttacc ttagagcttt aaatctctgt aggtagtttg      840

tcaattatgt cacaccacag aagtaaggtt ccttcacaaa gatccctcga gaaaaaaaat      900

aaaaagagat ggaggaacgg gaaaaagtta gttgtggtga taggtggcaa gtggtattcc      960

taagaacaac aagaaaagca tttcatatta tggctgaact gagcgaacaa gtgcaaaatt     1020

aagcatcaac gacaacaacg agaatggtta tgttcctcct cacttaagag gaaaaccaga     1080

agtgccagaa ataacatgag caactacaat aacaacaacg gcggctacaa cggtggcgtg     1140

gcggtggcag cttctttagc aacaaccgtc gtggtggtta cggcaacggt ggtttctcgg     1200

tggaaacaac ggtggcagca gatctaacgg atcctctaga gtcgacggta tcgataagct     1260

taagcttgca tgcctgcaga ggtcactaat actatctaag tagttgattc atagtgactg     1320

gatatgttgc gttttgtcgc attatgtagt ctatcattta accacagatt agtgtaatgc     1380

gatgattttt aagtgattaa tgttattttg tcatccttta ccaatgtaag ttgtatattt     1440

aaaatctctt taattatcag taaattaatg taagtaggtc attattagtc aaaataaaat     1500

catttgaccg gtcgccacca tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc     1560

catcctggtc gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg     1620

cgagggcgat gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct     1680

gcccgtgccc tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg     1740

ctaccccgac cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt     1800

ccaggagcgc accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa     1860

gttcgagggc gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga     1920

cggcaacatc ctggggcaca agctggagta caactacaac agccacaacg tctatatcat     1980

ggccgacaag cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga     2040

cggcagcgtg cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt     2100

gctgctgccc gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga     2160

gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat     2220

ggacgagctg tacaagtaaa gcggccgcga ctctagatca taatcagcca taccacattt     2280

gtagaggttt tacttgcttt aaaaaacctc ccacacctcc ccctgaacct gaaacataaa     2340

atgaatgcaa ttgttgttgt taacttgttt attgcagctt ataatggtta caaataaagc     2400

aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg     2460

tccaaactca tcaatgtatc ttaaggcgta aattgtaagc gttaatattt tgttaaaatt     2520

cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa taggccgaaa tcggcaaaat     2580

cccttataaa tcaaaagaat agaccgagat agggttgagt gttgttccag tttggaacaa     2640

gagtccacta ttaaagaacg tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg     2700

cgatggccca ctacgtgaac catcacccta atcaagtttt ttggggtcga ggtgccgtaa     2760

agcactaaat cggaacccta aagggagccc ccgatttaga gcttgacggg gaaagccggc     2820

gaacgtggcg agaaaggaag ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag     2880

tgtagcggtc acgctgcgcg taaccaccac acccgccgcg cttaatgcgc cgctacaggg     2940

cgcgtcaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta     3000

aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata     3060

ttgaaaaagg aagagtcctg aggcggaaag aaccagctgt ggaatgtgtg tcagttaggg     3120

tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag     3180

tcagcaacca ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg     3240

catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact     3300

ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag     3360

gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc     3420

ctaggctttt gcaaagatcg atcaagagac aggatgagga tcgtttcgca tgattgaaca     3480

agatggattg cacgcaggtt ctccggccgc ttgggtggag aggctattcg gctatgactg     3540

ggcacaacag acaatcggct gctctgatgc cgccgtgttc cggctgtcag cgcaggggcg     3600

cccggttctt tttgtcaaga ccgacctgtc cggtgccctg aatgaactgc aagacgaggc     3660

agcgcggcta tcgtggctgg ccacgacggg cgttccttgc gcagctgtgc tcgacgttgt     3720

cactgaagcg ggaagggact ggctgctatt gggcgaagtg ccggggcagg atctcctgtc     3780

atctcacctt gctcctgccg agaaagtatc catcatggct gatgcaatgc ggcggctgca     3840

tacgcttgat ccggctacct gcccattcga ccaccaagcg aaacatcgca tcgagcgagc     3900

acgtactcgg atggaagccg gtcttgtcga tcaggatgat ctggacgaag agcatcaggg     3960

gctcgcgcca gccgaactgt tcgccaggct caaggcgagc atgcccgacg gcgaggatct     4020

cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg gtggaaaatg gccgcttttc     4080

tggattcatc gactgtggcc ggctgggtgt ggcggaccgc tatcaggaca tagcgttggc     4140

tacccgtgat attgctgaag agcttggcgg cgaatgggct gaccgcttcc tcgtgcttta     4200

cggtatcgcc gctcccgatt cgcagcgcat cgccttctat cgccttcttg acgagttctt     4260

ctgagcggga ctctggggtt cgaaatgacc gaccaagcga cgcccaacct gccatcacga     4320

gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt tttccgggac     4380

gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc ccaccctagg     4440

gggaggctaa ctgaaacacg gaaggagaca ataccggaag gaacccgcgc tatgacggca     4500

ataaaaagac agaataaaac gcacggtgtt gggtcgtttg ttcataaacg cggggttcgg     4560

tcccagggct ggcactctgt cgatacccca ccgagacccc attggggcca atacgcccgc     4620

gtttcttcct tttccccacc ccacccccca agttcgggtg aaggcccagg gctcgcagcc     4680

aacgtcgggg cggcaggccc tgccatagcc tcaggttact catatatact ttagattgat     4740

gaattccgcg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa     4800

tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa     4860

cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata     4920

atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac     4980

tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc     5040

cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta     5100

tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg     5160

cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt     5220

ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca     5280

aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag     5340

gtctatataa gcagagctct ctggctaact agagaaccca ctgcgatatc cgacaaatga     5400

ttttattttg actaataatg acctacttac attaatttac tgataattaa agagatttta     5460

aatatacaac ttactgagtc aaaggatgac aaaataacat taatcactta aaaatcatcg     5520

cattacacta atctgtggtt aaatgataga ctacataatg cgacaaaacg caacatatcc     5580

agtcactatg aatcaactac ttagatagta ttagtgacct gagcccgggc tgccgcgggt     5640

tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt     5700

tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt     5760

gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc     5820

agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg     5880

tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg     5940

ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt     6000

cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac     6060

tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg     6120

acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg     6180

gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat     6240

ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt     6300

tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg     6360

attctgtgga taaccgtatt accgccatgc at                                   6392


<210>  82
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/I43F

<400>  82
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggttcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagca aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  83
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/I43F

<400>  83

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Phe Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Lys Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  84
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/R319G

<400>  84
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagca aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtacggcaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  85
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/R319G

<400>  85

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Lys Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Gly Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  86
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/E264G

<400>  86
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagca aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag gcgccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  87
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/E264G

<400>  87

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Lys Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Gly Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  88
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/D336V

<400>  88
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagca aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgtgag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  89
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Double mutant E174K/D336V

<400>  89

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Lys Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Val 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  90
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 958

<400>  90
ggccagctgt cccaaacgtc cag                                               23


<210>  91
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1080

<400>  91
cctggcgcag ttgcaaacgc tgcccc                                            26


<210>  92
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD2 attE1

<400>  92
ttgcttaatg gagaaaaggt a                                                 21


<210>  93
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD3 attE2

<400>  93
gtgctttaaa aagaaaaggg g                                                 21


<210>  94
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD2 O1


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  94
atggagannn                                                              10


<210>  95
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD3 O2


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  95
aaaaagannn                                                              10


<210>  96
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR10 attE1

<400>  96
ctacttttaa aaacaaagtc t                                                 21


<210>  97
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR12 attE2

<400>  97
acgctttccc cttcaaaggt g                                                 21


<210>  98
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR10 O1


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  98
taaaaacnnn                                                              10


<210>  99
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR12 O2


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  99
ccccttcnnn                                                              10


<210>  100
<211>  142
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Int-HK022 attP1 P

<400>  100
tcaggtcact aatactatct aagtagttga ttcatagtga ctggatatgt tgcgttttgt       60

cgcattatgt agtctatcat ttaaccacag attagtgtaa tgcgatgatt tttaagtgat      120

taatgttatt ttgtcatcct tt                                               142


<210>  101
<211>  81
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Int-HK022 attP2 P'

<400>  101
taagttgtat atttaaaatc tctttaatta tcagtaaatt aatgtaagta ggtcattatt       60

agtcaaaata aaatcatttg t                                                 81


<210>  102
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NPC1 O1


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  102
agatgccnnn                                                              10


<210>  103
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NPC1 O2


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  103
acactggnnn                                                              10


<210>  104
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SCN1A4 O1


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  104
gcactgtnnn                                                              10


<210>  105
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SCN1A3 O2


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  105
acagtgcnnn                                                              10


<210>  106
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL3A1 O1


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  106
aaaacagnnn                                                              10


<210>  107
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL3A1 O2


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  107
tttaaaannn                                                              10


<210>  108
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD4

<400>  108
atactttttg cctaaaagca g                                                 21


<210>  109
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD4 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  109
ttgcctannn                                                              10


<210>  110
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD5

<400>  110
tttcttttgt aaacaaaggt a                                                 21


<210>  111
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD5 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  111
tgtaaacnnn                                                              10


<210>  112
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD6

<400>  112
cttctttatg ttttaaagta t                                                 21


<210>  113
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD6 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  113
atgttttnnn                                                              10


<210>  114
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD7

<400>  114
actctttcct gacaaaagta g                                                 21


<210>  115
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD7 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  115
cctgacannn                                                              10


<210>  116
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS1

<400>  116
tcactttggt acagaaaggt a                                                 21


<210>  117
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS1 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  117
ggtacagnnn                                                              10


<210>  118
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NPC1 attE1

<400>  118
tggcttaaga tgccaaaggt g                                                 21


<210>  119
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NPC1 attE2

<400>  119
tcacttaaca ctggaaaggc a                                                 21


<210>  120
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SCN1A4 attE1

<400>  120
atactttgca ctgtaaagtg t                                                 21


<210>  121
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SCN1A3 attE2

<400>  121
atactttaca gtgcaaagta t                                                 21


<210>  122
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL3A1 attE1

<400>  122
aaacttaaaa acagaaagtg t                                                 21


<210>  123
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  COL3A1 attE2

<400>  123
tgacttattt aaaaaaaggt a                                                 21


<210>  124
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 788

<400>  124
gggaagctta ttccgctttg cgactcaacc                                        30


<210>  125
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR13

<400>  125
cagcttttct taataaagca a                                                 21


<210>  126
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR14

<400>  126
gtactttgtt agcaaaagct g                                                 21


<210>  127
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR13 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  127
tcttaatnnn                                                              10


<210>  128
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CFTR14 O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  128
gttagcannn                                                              10


<210>  129
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS a

<400>  129
atactttagc cccgaaaggc a                                                 21


<210>  130
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS d

<400>  130
gcccttaagg caaaaaagtc c                                                 21


<210>  131
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS a o


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  131
agccccgnnn                                                              10


<210>  132
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS d o


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  132
aggcaaannn                                                              10


<210>  133
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY400

<400>  133
cgggatccga tgtacgggcc agatatac                                          28


<210>  134
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY416

<400>  134
gcggatccgg gtctccctat agtgagtcg                                         29


<210>  135
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY606

<400>  135
gggagatcta cttaccatgt cagatccag                                         29


<210>  136
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY674

<400>  136
ggaccggtca aatgatttta ttttgactaa taatgacc                               38


<210>  137
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY675

<400>  137
ggggctgcag aggtcactaa tactatctaa gtagttg                                37


<210>  138
<211>  42
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY736

<400>  138
aggtcactaa tactatctaa gtagttgatt catagtgact gg                          42


<210>  139
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY931

<400>  139
cgtgccagct gcattaatga atcggccaac gaattccaga agctt                       45


<210>  140
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1192

<400>  140
gtagcggtca cgctgcgcgt aaccaccaca                                        30


<210>  141
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1201

<400>  141
cccggatcct tagggttccg atttagtgct ttacggcac                              39


<210>  142
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1202

<400>  142
gggtctagac aaatgatttt attttgacta ataatgacc                              39


<210>  143
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1203

<400>  143
cccggatcca ggtcactaat actatctaag tagttgattc atagtgactg g                51


<210>  144
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1215

<400>  144
gggccgcggc tcaggtcact aatactatct aagtagttg                              39


<210>  145
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1216

<400>  145
gggccgcggc tcaaaggcgg taatacggtt atccacag                               38


<210>  146
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1217

<400>  146
cccgaattcg ttggccgatt cattaatgca gctgg                                  35


<210>  147
<211>  42
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1237

<400>  147
cccggatccc aaatgatttt attttgacta ataatgacct ac                          42


<210>  148
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1238

<400>  148
ccctctagaa ggtcactaat actatctaag tagttgattc atagtgactg g                51


<210>  149
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1240

<400>  149
ctacttagat agtattagtg acctggatcc ctctgcaaat gcaggaaact atcagag          57


<210>  150
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1241

<400>  150
ttcgcgcgct caacagatct gtcaaatcgc ct                                     32


<210>  151
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1242

<400>  151
tgttgagcgc gcgaaacgcg g                                                 21


<210>  152
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1243

<400>  152
gctcaccata ggtccagggt tctcctcc                                          28


<210>  153
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1244

<400>  153
ctggacctat ggtgagcaag ggcgag                                            26


<210>  154
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1245

<400>  154
aaatcatttg tcgaagcttc tggaattcgg acaaaccaca actagaatgc agt              53


<210>  155
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1246

<400>  155
gggtctagag ctgccaccgt tgtttccacc gag                                    33


<210>  156
<211>  46
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1254

<400>  156
tattagtcaa aataaaatca tttgggatcc atggtgagca agggcg                      46


<210>  157
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1255

<400>  157
ttcgcgcgct tgtacagctc gtccatgcc                                         29


<210>  158
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1256

<400>  158
gtacaagcgc gcgaaacgcg g                                                 21


<210>  159
<211>  56
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  oEY1257

<400>  159
atttgtcgaa gcttctggaa ttcaacttac cacatttagg tccagggttc tcctcc           56


<210>  160
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 206

<400>  160
cgtcgccgtc cagctcgacc ag                                                22


<210>  161
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  attB of coliphage HK022

<400>  161
gcactttagg tgaaaaaggt t                                                 21


<210>  162
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  O of attB of coliphage HK022


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  162
aggtgaannn                                                              10


<210>  163
<211>  17
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  consensus sequence of an active attB


<220>
<221>  misc_feature
<222>  (6)..(12)
<223>  n is a, c, g, or t

<400>  163
actttnnnnn nnaaagg                                                      17


<210>  164
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1265

<400>  164
cagcaagcac cacaaacccc tgagcccc                                          28


<210>  165
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1266

<400>  165
ggggctcagg ggtttgtggt gcttgctgg                                         29


<210>  166
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1280

<400>  166
agctttgata gtttatgcct ctacttttaa aaacaaagtc taacagattt ttctcag          57


<210>  167
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1281

<400>  167
aattctgaga aaaatctgtt agactttgtt tttaaaagta gaggcataaa ctatcaa          57


<210>  168
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1282

<400>  168
agctttgaga tgatggaaac acgctttccc cttcaaaggt gctgctagtt ccaaagg          57


<210>  169
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1283

<400>  169
aattcctttg gaactagcag cacctttgaa ggggaaagcg tgtttccatc atctcaa          57


<210>  170
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 143

<400>  170
gcaaaatcaa aagtaaggcg ttc                                               23


<210>  171
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 144

<400>  171
gaacgcctta cttttgattt tgc                                               23


<210>  172
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 203

<400>  172
gctagttatt gctcagcgg                                                    19


<210>  173
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 513

<400>  173
aagaggatca catatggg                                                     18


<210>  174
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1351

<400>  174
tttgacagat ctgttgagga gagccaagag aggctctgg                              39


<210>  175
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1352

<400>  175
gagcctctct tggctctcct caacagatct gtcaaatcgc c                           41


<210>  176
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1353

<400>  176
cttaagcttg gactcacctg acgaggtcca gggttctcct c                           41


<210>  177
<211>  63
<212>  PRT
<213>  Bacteriophage HK022


<220>
<221>  MISC_FEATURE
<223>  ND domain of HK022 Integrase

<400>  177

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser 
    50                  55                  60              


<210>  178
<211>  101
<212>  PRT
<213>  Bacteriophage HK022


<220>
<221>  MISC_FEATURE
<223>  CB domain of HK022 Integrase

<400>  178

Thr Leu His Ala Trp Leu Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg 
1               5                   10                  15      


Gly Ile Arg Pro Lys Thr Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala 
            20                  25                  30          


Ile Arg Arg Lys Leu Pro Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys 
        35                  40                  45              


Glu Val Ala Ala Met Leu Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala 
    50                  55                  60                  


Ser Ala Lys Leu Ile Arg Ser Thr Leu Val Asp Val Phe Arg Glu Ala 
65                  70                  75                  80  


Ile Ala Glu Gly His Val Ala Thr Asn Pro Val Thr Ala Thr Arg Thr 
                85                  90                  95      


Ala Lys Ser Glu Val 
            100     


<210>  179
<211>  181
<212>  PRT
<213>  Bacteriophage HK022


<220>
<221>  MISC_FEATURE
<223>  CD domain of HK022 Integrase

<400>  179

Arg Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala 
1               5                   10                  15      


Ala Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val 
            20                  25                  30          


Val Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp 
        35                  40                  45              


Ile Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys 
    50                  55                  60                  


Leu Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu 
65                  70                  75                  80  


Ala Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile 
                85                  90                  95      


Ile Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys 
            100                 105                 110         


Tyr Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn 
        115                 120                 125             


Pro Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg 
    130                 135                 140                 


Asn Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser 
145                 150                 155                 160 


Asp Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp 
                165                 170                 175     


Lys Ile Glu Ile Asp 
            180     


<210>  180
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E134K mutant of the HK022 integrase

<400>  180

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Lys Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  181
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E134K mutant of the HK022 integrase

<400>  181
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggcca aaggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  182
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  D278K mutant of the HK022 integrase

<400>  182

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Lys Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  183
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  D278K mutant of the HK022 integrase

<400>  183
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca caaacccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  184
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E174K/D278K double mutant of the HK022 integrase

<400>  184

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Lys Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Lys Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  185
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E174K/I43F/R319G triple mutant of the HK022 integrase

<400>  185

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Phe Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Lys Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Gly Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  186
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E174K/D278K double mutant of the HK022 integrase

<400>  186
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagca aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca caaacccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  187
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E174K/I43F/R319G triple mutant of the HK022 integrase

<400>  187
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggttcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagca aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtacggcaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  188
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  D149K mutant of the HK022 integrase

<400>  188

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Lys Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  189
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  D149K mutant of the HK022 integrase

<400>  189
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtgaaagtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  190
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  D215K mutant of the HK022 integrase

<400>  190

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Lys Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  191
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  D215K mutant of the HK022 integrase

<400>  191
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcaaactgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  192
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E309K mutant of the HK022 integrase

<400>  192

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Asn Pro 
    290                 295                 300                 


Pro Thr Phe His Lys Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  193
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  E309K mutant of the HK022 integrase

<400>  193
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaacc cccccacctt ccacaaactg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  194
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mCF1 attE

<400>  194
actctttgaa aattaaagtc c                                                 21


<210>  195
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mCF1 attE O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  195
gaaaattnnn                                                              10


<210>  196
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mCF2 attE

<400>  196
tcacttaacc atgaaaagct t                                                 21


<210>  197
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mCF2 attE O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  197
accatgannn                                                              10


<210>  198
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mCF3 attE

<400>  198
tttcttttgc cagtaaagtc a                                                 21


<210>  199
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mCF3 attE O


<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is null

<400>  199
tgccagtnnn                                                              10


<210>  200
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 635

<400>  200
ggcgccgtcc aggcacctcg                                                   20


<210>  201
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1185

<400>  201
gaactgaggg gacaggatgt cccagg                                            26


<210>  202
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 421

<400>  202
gaggccgcct ctgcctctga gc                                                22


<210>  203
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1016

<400>  203
cgtgacaccc tgtgcacggc gggagatg                                          28


<210>  204
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 834

<400>  204
ccaccccatt gacgtcaatg ggagtttgtt ttggc                                  35


<210>  205
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1191

<400>  205
gccgtccgtc ccccctttct ttgactcag                                         29


<210>  206
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 432

<400>  206
tcgttgggcg gtcagccagg cgggc                                             25


<210>  207
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1298

<400>  207
tacatccacg tcgaatcctc gcgc                                              24


<210>  208
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1015

<400>  208
ccgccgccgg gatcactctc gg                                                22


<210>  209
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1300

<400>  209
atcttcctgc cttggcctcc caaagc                                            26


<210>  210
<211>  27
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1279

<400>  210
gccgttctcc agctttacga caggagg                                           27


<210>  211
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1232

<400>  211
gaggcttttc tgttacagcg tcctcctcc                                         29


<210>  212
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 1236

<400>  212
ggtctcttag gaagaccttg tcctgtagtc agtgg                                  35


<210>  213
<211>  140
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P for attP donor cassette

<400>  213
aggtcactaa tactatctaa gtagttgatt catagtgact ggatatgttg cgttttgtcg       60

cattatgtag tctatcattt aaccacagat tagtgtaatg cgatgatttt taagtgatta      120

atgttatttt gtcatccttt                                                  140


<210>  214
<211>  77
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P' for attP donor cassette

<400>  214
taagttgtat atttaaaatc tctttaatta tcagtaaatt aatgtaagta ggtcattatt       60

agtcaaaata aaatcat                                                      77


<210>  215
<211>  3014
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CF Native replacement sequence for exon 3 mutations recovery 
       using CF10 and CF12

<400>  215
ctacttttaa aaacaaagtc taacagattt ttctcatgtt aaatcacaga aaaagccacc       60

tgacatttta acttgttttt gatttgacag tgaaatctta taaatctgcc acagttctaa      120

accaataaag atcaaggtat aagggaaaaa tgtagaatgt ttgtgtgttt attttttcca      180

ccttgttcta agcacagcaa tgagcattcg taaaagcctt actttatttg tccacccttt      240

tcattgtttt ttagaagccc aacacttttc tttaacacat acaatgtggc cttttcatga      300

aatcaattcc ctgcacagtg atatatggca gagcattgaa ttctgccaaa tatctggctg      360

agtgtttggt gttgtatggt ctccatgaga ttttgtctct ataatacttg ggttaatctc      420

cttggatata cttgtgtgaa tcaaactatg ttaagggaaa taggacaact aaaatatttg      480

cacatgcaac ttattggtcc cactttttat tcttttgcag agaatgggat agagagctgg      540

cttcaaagaa aaatcctaaa ctcattaatg cccttcggcg atgttttttc tggagattta      600

tgttctatgg aatcttttta tatttagggg taaggatctc atttgtacat tcattatgta      660

tcacataact atattcattt ttgtgattat gaaaagacta cgaaatctgg tgaataggtg      720

taaaaatata aaggatgaat ccaactccaa acactaagaa accacctaaa actctagtaa      780

ggataagtaa aaatcctttg gaactaaaat gtcctggaac acgggtggca atttacaatc      840

tcaatgggct cagcaaaata aattgcttgc ttaaaaaatt attttctgtt atgattccaa      900

atcacattat cttactagta catgagatta ctggtgcctt tattttgctg tattcaacag      960

gagagtgtca ggagacaatg tcagcagaat taggtcaaat gcagctaatt acatatatga     1020

atgtttgtaa tattttgaaa tcatatctgc atggtgaatt gtttcaaaga aaaacactaa     1080

aaatttaaag tatagcagct ttaaatacta aataaataat actaaaaatt taaagttctc     1140

ttgcaatata ttttcttaat atcttacatc tcatcagtgt gaaaagttgc acatctgaaa     1200

atccaggctt tgtggtgttt aagtgccttg tatgttcccc agttgctgtc caatgtgact     1260

ctgatttatt attttctaca tcatgaaagc attatttgaa tccttggttg taacctataa     1320

aaggagacag attcaagact tgtttaatct tcttgttaaa gctgtgcaca atatttgctt     1380

tggggcgttt acttatcata tggattgact tgtgtttata ttggtcttta tgcctcaggg     1440

agttaaacag tgtctcccag agaaatgcca tttgtgttac attgcttgaa aaatttcagt     1500

tcatacaccc ccatgaaaaa tacatttaaa acttatctta acaaagatga gtacacttag     1560

gcccagaatg ttctctaatg ctcttgataa tttcctagaa gaaatttttc tgacttttga     1620

aataatagat ccataatata tattcttatg gaaatctgaa accatttggg catttggggg     1680

taaaaagtat tttattagta aatttaaatg aggtagctgg ataattaaat tacttttaag     1740

ttacctttga gatgattttt ctcaatcaga gcaccaccca gagctttgag aaacaatttt     1800

attcacagct tctgattcta tttgatgtaa tttttagaaa ataagttttg ctggttgctt     1860

tgaatcaggg tatggagtac agttcactct gatcctatca tataaatcat gtaagtatat     1920

aacattttca ataagtgatt gttggattga agtgaatgat atttcaagta attgttatgt     1980

catggccaag atttcagtga aactcaaaat ttctcctggt tgtgttctcc attgcatgct     2040

gcttctattg attaacctaa gcactactga gtagaagctg gaagaggggt ctaattagaa     2100

ggcccctttc tatgctctgc ttggcttgta aaataattta tttctctaga tcccaccaac     2160

atagtagttt catgtatgca aaaacaccca cctaaatgtc aaagtttgta tgatacatgg     2220

acatatctat agaatttttt ttggtctggt gcatgccaaa aaataaacat gatatagaag     2280

aatttaatat ttattgagta cctaatctgt tccagttcaa tatgaaggtc tttatgcaga     2340

ttattttact taattttcct agtaactcca tggagcaaaa attatctcta atttatataa     2400

caggaagttg agcgtgaggc aaattaagta actttcccaa agttacacat atggtaagtt     2460

tgagagatat cccagtctct ttagctccaa agcctttgac cctttcacca taccagatta     2520

tgattgctat taatatataa ttataattat aatgattgta tttaggtact caacagaatg     2580

gtgactctag taaccagcct tggttctgct gagcttctct gcgtcttctc aggagacaca     2640

ggctacagag cttgaaggct gaggattctt ccagggtcac ttcaggggca aatctgaaac     2700

tttcttcagg acaggaatca acgagatctt ctcacttact tatacctggg ggaggaactg     2760

tatgaaatcc acccaagaac cagtcatgct aagggccaaa cctatagaca aaaaaaggga     2820

taggagaatg gagtatgtat ggagaaagac taaattgttc ttaaacttct caagcttaaa     2880

aatatcccag caaaagagat cgtaaaagcc cttcatggcg tattaattat ccatgcatgg     2940

gggtgagtgg aaaggtactc ctgagcccga ggctacagct ttggaactag cagcaccttt     3000

gaaggggaaa gcgt                                                       3014


<210>  216
<211>  4443
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CF Universal replacement cassette with cDNA for any mutations 
       recovery

<400>  216
atgcagaggt cgcctctgga aaaggccagc gttgtctcca aacttttttt cagctggacc       60

agaccaattt tgaggaaagg atacagacag cgcctggaat tgtcagacat ataccaaatc      120

ccttctgttg attctgctga caatctatct gaaaaattgg aaagagaatg ggatagagag      180

ctggcttcaa agaaaaatcc taaactcatt aatgcccttc ggcgatgttt tttctggaga      240

tttatgttct atggaatctt tttatattta ggggaagtca ccaaagcagt acagcctctc      300

ttactgggaa gaatcatagc ttcctatgac ccggataaca aggaggaacg ctctatcgcg      360

atttatctag gcataggctt atgccttctc tttattgtga ggacactgct cctacaccca      420

gccatttttg gccttcatca cattggaatg cagatgagaa tagctatgtt tagtttgatt      480

tataagaaga ctttaaagct gtcaagccgt gttctagata aaataagtat tggacaactt      540

gttagtctcc tttccaacaa cctgaacaaa tttgatgaag gacttgcatt ggcacatttc      600

gtgtggatcg ctcctttgca agtggcactc ctcatggggc taatctggga gttgttacag      660

gcgtctgcct tctgtggact tggtttcctg atagtccttg ccctttttca ggctgggcta      720

gggagaatga tgatgaagta cagagatcag agagctggga agatcagtga aagacttgtg      780

attacctcag aaatgattga aaatatccaa tctgttaagg catactgctg ggaagaagca      840

atggaaaaaa tgattgaaaa cttaagacaa acagaactga aactgactcg gaaggcagcc      900

tatgtgagat acttcaatag ctcagccttc ttcttctcag ggttctttgt ggtgttttta      960

tctgtgcttc cctatgcact aatcaaagga atcatcctcc ggaaaatatt caccaccatc     1020

tcattctgca ttgttctgcg catggcggtc actcggcaat ttccctgggc tgtacaaaca     1080

tggtatgact ctcttggagc aataaacaaa atacaggatt tcttacaaaa gcaagaatat     1140

aagacattgg aatataactt aacgactaca gaagtagtga tggagaatgt aacagccttc     1200

tgggaggagg gatttgggga attatttgag aaagcaaaac aaaacaataa caatagaaaa     1260

acttctaatg gtgatgacag cctcttcttc agtaatttct cacttcttgg tactcctgtc     1320

ctgaaagata ttaatttcaa gatagaaaga ggacagttgt tggcggttgc tggatccact     1380

ggagcaggca agacttcact tctaatggtg attatgggag aactggagcc ttcagagggt     1440

aaaattaagc acagtggaag aatttcattc tgttctcagt tttcctggat tatgcctggc     1500

accattaaag aaaatatcat ctttggtgtt tcctatgatg aatatagata cagaagcgtc     1560

atcaaagcat gccaactaga agaggacatc tccaagtttg cagagaaaga caatatagtt     1620

cttggagaag gtggaatcac actgagtgga ggtcaacgag caagaatttc tttagcaaga     1680

gcagtataca aagatgctga tttgtattta ttagactctc cttttggata cctagatgtt     1740

ttaacagaaa aagaaatatt tgaaagctgt gtctgtaaac tgatggctaa caaaactagg     1800

attttggtca cttctaaaat ggaacattta aagaaagctg acaaaatatt aattttgcat     1860

gaaggtagca gctattttta tgggacattt tcagaactcc aaaatctaca gccagacttt     1920

agctcaaaac tcatgggatg tgattctttc gaccaattta gtgcagaaag aagaaattca     1980

atcctaactg agaccttaca ccgtttctca ttagaaggag atgctcctgt ctcctggaca     2040

gaaacaaaaa aacaatcttt taaacagact ggagagtttg gggaaaaaag gaagaattct     2100

attctcaatc caatcaactc tatacgaaaa ttttccattg tgcaaaagac tcccttacaa     2160

atgaatggca tcgaagagga ttctgatgag cctttagaga gaaggctgtc cttagtacca     2220

gattctgagc agggagaggc gatactgcct cgcatcagcg tgatcagcac tggccccacg     2280

cttcaggcac gaaggaggca gtctgtcctg aacctgatga cacactcagt taaccaaggt     2340

cagaacattc accgaaagac aacagcatcc acacgaaaag tgtcactggc ccctcaggca     2400

aacttgactg aactggatat atattcaaga aggttatctc aagaaactgg cttggaaata     2460

agtgaagaaa ttaacgaaga agacttaaag gagtgctttt ttgatgatat ggagagcata     2520

ccagcagtga ctacatggaa cacatacctt cgatatatta ctgtccacaa gagcttaatt     2580

tttgtgctaa tttggtgctt agtaattttt ctggcagagg tggctgcttc tttggttgtg     2640

ctgtggctcc ttggaaacac tcctcttcaa gacaaaggga atagtactca tagtagaaat     2700

aacagctatg cagtgattat caccagcacc agttcgtatt atgtgtttta catttacgtg     2760

ggagtagccg acactttgct tgctatggga ttcttcagag gtctaccact ggtgcatact     2820

ctaatcacag tgtcgaaaat tttacaccac aaaatgttac attctgttct tcaagcacct     2880

atgtcaaccc tcaacacgtt gaaagcaggt gggattctta atagattctc caaagatata     2940

gcaattttgg atgaccttct gcctcttacc atatttgact tcatccagtt gttattaatt     3000

gtgattggag ctatagcagt tgtcgcagtt ttacaaccct acatctttgt tgcaacagtg     3060

ccagtgatag tggcttttat tatgttgaga gcatatttcc tccaaacctc acagcaactc     3120

aaacaactgg aatctgaagg caggagtcca attttcactc atcttgttac aagcttaaaa     3180

ggactatgga cacttcgtgc cttcggacgg cagccttact ttgaaactct gttccacaaa     3240

gctctgaatt tacatactgc caactggttc ttgtacctgt caacactgcg ctggttccaa     3300

atgagaatag aaatgatttt tgtcatcttc ttcattgctg ttaccttcat ttccatttta     3360

acaacaggag aaggagaagg aagagttggt attatcctga ctttagccat gaatatcatg     3420

agtacattgc agtgggctgt aaactccagc atagatgtgg atagcttgat gcgatctgtg     3480

agccgagtct ttaagttcat tgacatgcca acagaaggta aacctaccaa gtcaaccaaa     3540

ccatacaaga atggccaact ctcgaaagtt atgattattg agaattcaca cgtgaagaaa     3600

gatgacatct ggccctcagg gggccaaatg actgtcaaag atctcacagc aaaatacaca     3660

gaaggtggaa atgccatatt agagaacatt tccttctcaa taagtcctgg ccagagggtg     3720

ggcctcttgg gaagaactgg atcagggaag agtactttgt tatcagcttt tttgagacta     3780

ctgaacactg aaggagaaat ccagatcgat ggtgtgtctt gggattcaat aactttgcaa     3840

cagtggagga aagcctttgg agtgatacca cagaaagtat ttattttttc tggaacattt     3900

agaaaaaact tggatcccta tgaacagtgg agtgatcaag aaatatggaa agttgcagat     3960

gaggttgggc tcagatctgt gatagaacag tttcctggga agcttgactt tgtccttgtg     4020

gatgggggct gtgtcctaag ccatggccac aagcagttga tgtgcttggc tagatctgtt     4080

ctcagtaagg cgaagatctt gctgcttgat gaacccagtg ctcatttgga tccagtaaca     4140

taccaaataa ttagaagaac tctaaaacaa gcatttgctg attgcacagt aattctctgt     4200

gaacacagga tagaagcaat gctggaatgc caacaatttt tggtcataga agagaacaaa     4260

gtgcggcagt acgattccat ccagaaactg ctgaacgaga ggagcctctt ccggcaagcc     4320

atcagcccct ccgacagggt gaagctcttt ccccaccgga actcaagcaa gtgcaagtct     4380

aagccccaga ttgctgctct gaaagaggag acagaagaag aggtgcaaga tacaaggctt     4440

tag                                                                   4443


<210>  217
<211>  22964
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD Native replacement sequence for exon 44 mutations recovery 
       using DMD2 and DMD3

<400>  217
taccttttct ccattaagca atttcctatc cttcgccccc atcccaccct ctcgcccttc       60

tgagtctcca gtgtctatta ttccacactc tgtgcgcatg tgtacacatt atttagcttc      120

cacttgtaag tgagaacatg caatatttga ctttctgttt ttgagttatt ccacttaaga      180

tgaccaccag ttccatccat gttgctgcaa aagacatgat ttcattcttt actatggctt      240

tgtagtattt ttcattgtgt atatgaaatt gtttattcca tacgcaattt gtgtgtgtgt      300

acatatatat atatatatat atatatatat atatatatat atatatatat gcttagactt      360

agaagctagg atagacacac aatggaatac tacacaatgg aatacattca ttcacacaca      420

tataaataaa agaatatgtg gagatatatc tccacatatt ctttatccaa tcatctgttt      480

ttaaataatg ctattgactt ctttagggtg aattttatca atattgtttt ggtttaaaac      540

actcacctta aaagagtcac agtccctaaa tgtgcatcct catatttaaa ttaggtctca      600

gtaaatttgt gcaaagtgta ttctttttag gatggtgttg aacttgctaa attatttatc      660

tttaagaatc atcattttgt gtcttttatt aatgaaaaca acaattatgt gattgctgat      720

atatttggaa aatgatttct gatgtagatt gattttttta ttctaaattc tgtgtcggta      780

ttaaaaattt atagattact aactgtatta atatcgataa tactaaattt tattgctatt      840

tataacttgg agtgtacttt catcctcctg aaaaagctga atgaggtagg cagtattatt      900

ctgggtttat gtgtgagata actgagactc agaggtaaaa tagtgtatcc aagcattcat      960

ggctcttaaa tggaagatat aaggggtttg tgaaattact catggacttt tttattcatt     1020

cattcagtta ttaaaatgta ttcaacattt atcatgtacc aggaacagcg cttagtacca     1080

ggaattcaaa ggtgcataaa acatcttcct tattctaaga ggtacatagt gtactggaac     1140

aaacagcctt gtaaatacat aattagaaca tgaagtagta tgttaataga ggttttcaca     1200

aagctgtgga agcttgtctt atgaagtaac taattccaag ggagagaagc cttatggaat     1260

agtgacattt tagatagggt gtcattctaa aatacagcaa aaggcccaca gtaaaaaagg     1320

aattttggtt gttatgaaaa ttttcagatt ttctatgttt tcagtacagt atacatggtg     1380

ggctatgtga atgtttgtat agggaccaaa gtaggaagtg aggttgtctg ttagagagcg     1440

ctgagaaacc gaaaataggg agagatgagt tggaatatgc tgaggaaaag ttattaggag     1500

ttttcaagaa aggccacgac agtggggcta gagagaagag gctaaattaa agagtcattt     1560

ctggtttaga attgataaaa tatagagaca agcatgataa gaaagaagtc gagaagtaaa     1620

cgatggtctc aagatttcta gcttggaaat cattgactaa aattaaaact aaggactgga     1680

ttaggccatt cttgcattgc tataaagaaa tacctgagac tgggtgttta taaagtaaag     1740

aggtttaatt ggctgacgat tctgcaggct ctacaggaag catagcaaca tctgtttctg     1800

gggaggcctc agggagcttt tactcatggt ggaaggcaga gcaggtgtag gcatttcaca     1860

tggcgaaagc agagagagag agttggtggt gggggtgggt ggctacctac ttttaaacaa     1920

ccagatcttg gagaactcac tcattttcat gaggacagta ccaagaggat ggtattaaac     1980

cgtgagaaac caccctgatg atccagtcac ctctcaccag gccccacctc caacattggg     2040

gattacaatt taatatgaga tttgggtggg gacacagatc caaatcatat caaagacttg     2100

catgggaaaa taaggaattg ttgacataac atctttgagg ttcacatcaa atgttctgat     2160

gaggatagtc caagtagcag ttggctatat acctcagata agggctgaaa tttggagcta     2220

tgtcataatc agcctagatt aagagtcaat aatctcctgc ccatgggcca attacaccca     2280

ccacttgttt ttgtaaagta gtattgaatc ccagccatat ccatttgctt atgctccatg     2340

tatacctttt ttttgaactt caaggcagag ttgagtagtt gtaacaaaaa ccatacggcc     2400

cacaaagcct gaaatatttg ttctcaagat ctttatctat aaagtttgcc aatacctgct     2460

gtagatgtta gttgaagctt tgaaagcaaa tgaggtttca taaggcagtg tccatacaag     2520

acatttaaca agtttaccta taaaaactag aattcctttg aggggaacac atcctagtct     2580

ccattaagca cagtagaaga gtcccctata atgggaaaga ggtcacttta ggtgttgatg     2640

ttggtggtac aggtcaaaga aaatttatct ttgctgttta ttcagaatgc aataagtgaa     2700

gttatgagaa ataagggaaa aaatgtgtag aatttcaaca gcgaagagag gggataaagg     2760

catgagaatg agttcctaag ctcaagtatt ataaacactg tgagaaactt aaaatcaaag     2820

tatgactcca aacgtatttg aagcctgaga acaaggctca caacctaggg aggattaggg     2880

atcaataaaa tagagtgtta caaagtataa tgtcaatcca gagttgtaaa aatatcagca     2940

ttgaatatat tgaaagcagt aaaactgaat gaggagacta tcattttata tcactgtgtt     3000

tatttctttg ccttgttcta taaatattta aaattataaa atttttatta acagtgagag     3060

cagaactacc agagtgagca gatcaaaatt gggacagatg cttttcactg cacacacttt     3120

tatttttctg ctgttcatgc attatcttgt acagtgcaca tgttttacct aaaaaattaa     3180

aatggagtct cctgcttagg aaaaaagtat atattctgtt tcaaactata tacaaaaata     3240

aaatcccagg tgactaaaaa ctgacatgag aaaaaaacaa attgataaag cttttacagt     3300

aaaatagagg agaatatgtt aattaatata gggtaagaaa aaattgctta cacaaatgat     3360

gaagcactaa tcatgaataa aaataataaa gtggactacc ttgtatatta ataacatcta     3420

tacatcaaaa gacagcactg agagagtaaa aatgaaaccc acagagtagg ataaattatt     3480

tggaatacac acataatgga tgaaatgtgt gtattcataa ttataaagaa ttcctacaaa     3540

tctttcagaa aagaacagat aatccaatag aaaaatggga aaagttcttg aaaagtgaac     3600

catggcacaa aaagggcttg tggcctgctg gcaatattct gtatcttgac ctggatggca     3660

tttttaaggt gatcacttta tagtaaataa ctaatgtgtt ttatgcatca tagtaacgtt     3720

aagatttttg tcatctttac aaaataagaa atccaaacgg ccaataaata tataaagaat     3780

ttctaagtcc cattaatggt ccaggccatg caaattaaaa ctaaaatgaa atatcactgc     3840

ttaccaacca gaatcattga aatttataag tctgacaatt ccatgtggtg gtgagaatat     3900

acagcaatta gaaatttcac acaatgttac ttggtctgtg aattgtaaat agaagtgtaa     3960

aattacacta ctgcttcttg gagtgaaatc catttggcac tatttagtaa attcaaagat     4020

ctgcataacc tatagcccac caatttcact tctatatata cactctacag aaatgcatat     4080

gttcatattc caggagacat gtttgggaat gtcatagcag catagtaata gccccaaacc     4140

aaaactactt cagtatttat taatagtaaa atttgctata gtttgaatgt gtctctttcc     4200

aaattcaggt gtcgataatg tgctagtact aagaggtagg gtgtttaagt ggtgattagg     4260

ccatgagggc tccttctttg ttaataaaaa taagaccctt ataaacaagg cttcacgcag     4320

cattcagtca gcttgctctc ttgcccttct accttctgcc ttgtgaagat acagcaggaa     4380

ggccctcacc agacaccaaa tgccagagcc tttatcttgg acttcccagc ctccagaact     4440

gtgagtgaat acattggtat tatttgtaaa ttacccagtc tcaggcattt tgttataaca     4500

gcacaaacag actaagacaa tcatacagtg agaaattaat caacaactaa taagcaaaga     4560

ggtagattaa tcttgaaact atgatataga gtgttccatt tggctgctgg aagttttatt     4620

tcttggtctg ggtgatggtc accatgggtt tatatgaatg gttccctata ttatgtttca     4680

caacaaaaag catttaaaaa gtaaatatat gtaatgtact cagggatagg catggccaac     4740

catggattct atgctgaaat aatgattcag atttcatcag caggctaatg acactgccta     4800

tttaaatact ttaagtcctg aaattaaaga aggtaatttc tcaagaagga atttctaatt     4860

tatgggtggg tctattcccc accagagaga cactagcatg gctcagattc tatgttggtc     4920

attttatttg catttaaagt cttaagccaa atagaggtac actaataatg acaacaacta     4980

ctactactca tacttgtgga acactgccag atgctgtttt aagaaatttg cattttcatt     5040

tgtaactgag cttacttgaa tcttctctct ttttttcttg gttaatctaa ctactggtct     5100

atcaatttta cttatctttt caaagaatca acattttgtt tcattgatct tttatatttt     5160

tgtttcaatt tcatttagtt ctgctctgat ctttgttatt tcttttcttc tggagctttg     5220

tgttggcttt gttgttgatt ctctagttcc ttcaggtgtg atgttaggta gtcagactgt     5280

gaactttcag gctctttgat gtaggcattt ggtgctagaa aatttcctct tagccttgct     5340

tttgctgtat cccagaggtt ttgaatagat tttgttgtga atgtgatgaa aacggaacat     5400

ttgtacactg ctggtgattg taaattagta caacctacat ggaaaacagt atgaagattt     5460

cttaaagaac taaaagtaga tctaacattt gatctggaaa tctcactacc gattatgtac     5520

ctagaggaag agaattcatt atatcaaaaa gacacttgca cgcatatgtt tatagcagca     5580

caattcacag ttgcaaagat atggaaccat cctaagtgcc agccgaccaa tgagtggata     5640

aagaaaatgt ggcatatatt ttcatatacc gtgaaatact attcagccac ataccatgca     5700

atactactca gccgtagaaa ataatgaaat aatgtctttt gcagcaactt tgatggagct     5760

ggatgccatt attctaagtg aagtaattca ggaatggaaa accaaatact gtatgttctc     5820

acttataagt gggagctacg ctgtaggtac acaaaggcag acagagtggt agaatggact     5880

ttgaagactc agaaggggca gagtgggaag gtagtgaggg ataaaaaatt acctttgggg     5940

tgtaatgtac actacttggg tgacacgtgc actaaaatat ctgattttac ttctatacaa     6000

ttcattcatg taaccaaaaa tcacttgtat tccaaagact attgaatttg aattttttaa     6060

aaacattaat aaaataaaag atgtaaaaaa agaaatttat atatactcat ttattgagct     6120

cccacaatta accttaggag gtaagtactt cataattggt agtatactta tcttttacta     6180

aatatttgta ttacttggga agttgagggt tggggagaag tagcaaggta ctatgatttg     6240

gggcagataa ctaacttatt tattcgcaca tacagtttgg accatgagac acgagctcag     6300

gtccctcctc ctcacctaat caaagatgaa atatgtggga tgggatgaaa taatcagcag     6360

tccaatgctg agtttccaga ccgaagtata aagcaacaat ggatatgtca gaagtctact     6420

agggtgttat ttatttaaat ctatttcatg gaatttacta ccaccttaat ggcccgaaag     6480

tgttaaagta tgccccagag taccgaatta ctccctaaat gtaatttatg cttgagaata     6540

atctgactaa cttgatttag aacatcagaa aataagttat gctgcacata aatgaagcag     6600

cagtgtaatt ttaaataccg gttgcacggt gaatgagaat tttaatattt gcaaaattct     6660

aaaatcactt gatttattat ccttatgttt atactgacat ttttttgccc tttgttaagt     6720

tccatccata tttcttctta ctgccaagaa aaaaaacttt ttttcctaga aatattacag     6780

aaggcaaaaa ttatatttgt ttccctgaat gctatttttg atgtctctac ttgtttctca     6840

ttgttaccat ttgcttcatt catgggcagc ccaattaatg gagcgagaca aatttaggga     6900

gcacagtgac taattagata ttaaattggt aaatctaact ttgtaaaacc agaaaaaata     6960

tatatatatt tttttcattt ggaattttcc ttggtggaaa agagtttaaa agtagtcatg     7020

ataaaaaatg taattttacg tagtaaattc aagaatagat ttagactgtg ctattaacag     7080

cacctattaa atactgaaaa gtgtatttta aaattttatg tgaggcttga aatggagtct     7140

aaagtattat tactcacatt aagtgtcatc acatgtaaag cccatgattt tattctttaa     7200

tattttgttt gaatagttac ttatttcaac agtaatttca ataataaaat taaatcaact     7260

ttacagtttt caaaggttta gcagttgcat gctgtaataa atacttcata tttatatatt     7320

tataaagtga cagcataagt catttttatt aggtccttga ggatgcaaaa gtttggatta     7380

tacgaggaga cgagagaaaa agggaagaag ggcatttcag aaatatgcta ccgatatgca     7440

aattcacaag tcctaagaca gtagcagggg tcgggcagaa agtccatcct gcctccctct     7500

tgtgggcctg gaacaatggt gtaagtggaa ggcctgttcc ccttctcttc ctacctccag     7560

ctctgtctta cagagctacg gataccatga gcaagtgtat gaacccttac ggttttcttc     7620

tcttgggaga atgtaaagga aagataactt gtagaaactt gtagataact tgtaaaaagg     7680

aaaagaattc agggtgagag ggggatttgt tgaatttgat agaggatggc aattaccaat     7740

atgatgagtg attgagaaac aagtctgtgc aacaggtttg aaatcgaaaa tctttgaggt     7800

gtacaggatc ctgaaatgaa gaatgggcat ttatagcagt atgtcagaga aacagtcacc     7860

tcctagtagc taaaagtgtt ggcaaaagta tagttcaagt gattgggtag gaaaaacagc     7920

aaaccaagag tggagactga tggttgctac aaaggtggag tggtaagtcg tgaccaactg     7980

gtacttctct gtgctctggt tagctgctga ctgtttctca gactgtggta gcaggaggag     8040

ggttggagtt agcagtcatt tgcatatgag actgccattt aaaaaaaaat tttaaattat     8100

ttcatttttc tgactctcaa tatgaaaagc acattgtaga caaattgaaa aatatagaaa     8160

aattatataa gaaaatatag tctcaccagt atggaacaat gctaactatg ttgcatagat     8220

ttttagattc tcattcaaaa gcaactcttt gactccagtg atgcaaatgc atgtaacata     8280

tgcaatgtgc aattcatttt taaagggaat aaacttacga tatattcata ggtcatttat     8340

tgtgtgttat ataccattga aaatatatga atgctaaatt attagtaaac atgcaaaaac     8400

attggcaaga tcattttgtt gtggaaggat atattgtatc tgaataactc tagaatacca     8460

taaatcatca aaggcaacat tcttattttt cactaactac agttagagaa tacctcttcg     8520

gctaccttcg gttgcctttt ttatgctacc aaaatgctgt ctgttttaca agattttaaa     8580

ggttaagcat ataattattc attaaataca atgagtgcaa tgtacatgta gatacattat     8640

taaattttgg gtagttaata aaaataaggg gaaaaaacct ctagaactat cacttttaat     8700

tgtttaactg ataaagtgaa gcttcatctt ggaaaaataa tttcacaaga gagcatgtgc     8760

actggtagaa aagtgccatt gaaacaagag atatttgggt tagaagcctc tctctactat     8820

ttaataccat tttcaccttt tggcaaatta cttggcctct gttttctcca atggaaaatg     8880

ggaataataa ttgttatgct gcagggttat tgtaggtgtc aatgaaatga tgtgtctggc     8940

actataaaag cacagagccc ggtgcctggc tattagtaac tgtttaataa atgttaattc     9000

ctttctctgc ccaggacatc agtaggcaga tgtagcaatt taaaacttct agtgttactt     9060

taaattcctg aatgaaggta gaggactgaa aagatatcat ggtattcaaa agtatgatcc     9120

attgcttctt aagaatagag ttcagaaaag cttgacagat tcctgtactc tgaggcagca     9180

ccatagccgg taatctgtag gatggctatt ggttttgtgc tcacaaatgc ttgcttgggc     9240

aggccccagg aaatctggta gactgtaagc ccagtaagat ttcaaatctt actttacggc     9300

agtgtttttc accttgactg tacattgaaa tcacctggat gctttgaaaa ataacagcgt     9360

cagtgtccaa cctccagaaa tactgattaa gttggtctgg aatggagccc caggatcact     9420

gtttggttat tgttgttgct gtgttttaaa tgccccagtt gattcttatg tgcaactgtc     9480

ttaggtaaac atacagccct ggttcatatt atttctgcct cagtctcttt tatgactgga     9540

aggtgaccaa atgcttgttt cctaatattc tttccatgtg tagtattaac acatttgact     9600

tgtactaagt tcctgcagta ttccaatcta aaattttagt gactacaata aaataagaag     9660

gattaaagaa ggcatcgcat agtttagtat atcggttatt taatgcttac atgtgagcct     9720

acaatatgaa ttatatctgt catcttattt taaatattga cagaatcttt aatgatagtg     9780

acgaattatt gatttattgg tgtgataatg gtattttagt tatattttta aagttttatt     9840

tgtaataact atatgtattt atggggtaca gtgtgacgtt tcagtgtaat gtttcattgt     9900

gtaatgatca aatcaggttt cttggcagat ccatagcctc aaacatttat aatttctctg     9960

tggtgagaaa atttaaaatt ctctttcact attttgaaat atacagcaca atattggtaa    10020

ctttgttcat attactatgc aatagaacac tagaacttat tactcctttc agttgatgaa    10080

caggcagttt tggatcaaga ataatattga aagtgataga atttatgaag taatttttat    10140

ccaaaaatat tttgaaaggg aatatattgc ttccaaataa tttattacaa tgttaagata    10200

tttgtaaatt tctagaatta aaaaaatata tttttaggaa agaaaatgcc aatagtccaa    10260

aatagttgct ttatctttct tttaatcaat aaatatattc attttaaagg gaaaaattgc    10320

aaccttccat ttaaaatcag cttttatatt gagtattttt ttaaaatgtt gtgtgtacat    10380

gctaggtgtg tatattaatt tttatttgtt acttgaaact aaactctgca aatgcaggaa    10440

actatcagag tgatatcttt gtcagtataa ccaaaaaata tacgctatat ctctataatc    10500

tgttttacat aatccatcta tttttcttga tccatatgct tttacctgca ggcgatttga    10560

cagatctgtt gagaaatggc ggcgttttca ttatgatata aagatattta atcagtggct    10620

aacagaagct gaacagtttc tcagaaagac acaaattcct gagaattggg aacatgctaa    10680

atacaaatgg tatcttaagg taagtctttg atttgttttt tcgaaattgt atttatcttc    10740

agcacatctg gactctttaa cttcttaaag atcaggttct gaagggtgat ggaaattact    10800

tttgactgtt gttgtcatca ttatattact agaaagaaaa ttatcataat gataatatta    10860

gagcacggtg ctatggactt tttgtgtcag gatgagagag tttgcctgga cggagctggt    10920

ttatctgata aactgcaaaa tataattgaa tctgtgacag agggaagcat cgtaacagca    10980

aggtgttttg tggctttggg gcagtgtgta tttcggcttt atgttggaac ctttccagaa    11040

ggagaacttg tggcatactt agctaaaatg aagttgctag aaatatccat catgataaaa    11100

ttacagttct gttttcctaa agacaatttt gtagtgctgt agcaatattt ctatatattc    11160

tattgacaaa atgccttctg aaatagtcca gaggccaaaa caatgcagag ttaattgttg    11220

gtacttattg acattttatg gtttatgtta atagggaaac agcatatgga tgataaccag    11280

tgtgtagttt aatttcaact tgtggtgtcc tttgaatatg caggtaaaga tagattagat    11340

tgtccaggat ataatttggt tgctaaatta catagtttag gcataagaaa cactgtgttt    11400

attacacgaa gacttaatta tttttgcatc ttttttagct caaattgttc atgttgcaat    11460

agtcaatcaa gtggatttga attgtagcca atttttaatg ccagaaaata ctgattaaga    11520

cagatgaggg caaaaaacac ccagtagttt attaaatact ttagatattt caaaatgctg    11580

gattcacaaa agcagtatca catttgactt tacaagtctt cattctcaaa tatgtttcca    11640

tagtaaatat gccctttaat attaaggagt taagcattta aacacctatt tatatgataa    11700

gctatttaaa cacagaaaat atttttaaaa ccttgtgtaa ttatatgtgt atcaatcaaa    11760

cttgcatgca caccagcgtt ggcatttgta tagagaggaa atgtatggat tcccaatctg    11820

ctttaatata gaagatacat tttaaaaata gcactgaagt gaattttggg ctaatgtagc    11880

ataatggggt ttctgcctga gaggcagaaa catattagag ttatataaaa tgttttgggg    11940

tagatataga aaccacttgc cattttcaat gatatccaac ccaaggtagt tatatatttc    12000

aatttatatt ttattatcaa attagtactt attgtgaaaa aaatcaagta acatagaaat    12060

ttgtaaaagt acctccattc tactctttgg aggatagttg ttcagtatga attttgctac    12120

atatttcagg ctgggtttct tggaaagcca ttgtaaaatg gagatttgta tgtagaaggt    12180

taactaggga gtacttttac gatgaagcaa tttgttttga tgtaacttgg tgtagttttc    12240

ttcatgtttc ttgttcttga agtcagttaa gctcttgaat ctgtgcattt aacatttcat    12300

caaatttaga aacctttcaa ccattttttt aaaaaaaatg gaactccaat tgtacattta    12360

ttaggctcct taaagtgccc cactactcac tgatgttatg ttcattgtct gtttggtctc    12420

tcttttctct gtaatttgtt ttatataatc tctattgtca aattgactaa tctttttcaa    12480

agtctaatct atggctaatc ccatgtagta tatattttta acatcagaca ttttcatctc    12540

ttagaagtaa aagttgggtc tttttatttc ttccatgtgt ctactcaaca tgttcagtct    12600

ttactttctt gactatatgg aatacagata taataactgt tagaatattc ttctctacta    12660

attttatcat ctgtgtctat tctgggttaa tttaaattga tttatttttc tcctcattaa    12720

gtgtgttgtt taactgcttc tttggatgac tggtaatttt tgactatatg ccagacattg    12780

tgaattttaa cttagcgcgt gcttgatact tcaaataaat tcaaatatat tgaaataaat    12840

attctcaaac ctcgttctgg aacacagtta attcacttgg aaacaatttg atcttttgag    12900

aatcttcctt ttatgctttg ttatgaccag aacagtgtaa gtttagggct actttttccc    12960

cactactgag gcaaaaccct tctgagtact ctctctgatg tcctgtgaat gataaaattt    13020

ttcactgggg ctcgtgggaa caggtggtat tactagccac gtgtgagctc tggtgattgt    13080

ttcctttaat tcttttgtga agttctttcc ttagctttga gtggttttct tgcatacatg    13140

aactgatcaa gactcagatg aagaataaaa taaagctttc tacaaatctc caaaatttcc    13200

tctgtgtata tatcacctct ctggtatttt gccctgtgat cactagtcag ccttgggctg    13260

ctgaaactct cagcttcatc ttttaacaaa agcctcctgg caaggatcac tgtccttcaa    13320

tgtctgatgt tcaatgtgtt gaaaaccgtt gtagcatata ttttgtcttt tttttttttt    13380

tttttttttt aagtgtttca ggtgtttcag gcaggagatt aagttcagcc tcctttactc    13440

caacttgaaa acaagtccaa aacaaactat tttgatgtaa tttgatcttt taatacatta    13500

acattacaca attttgtgaa tatatcataa tttaaaattt tcagagaatg tctaatggtc    13560

ctcatttctt gacagtgtgg tttagttgaa actgatgaac attttatcaa aacttttccc    13620

ctcaattgga tacttttttt tttttgagat ggaattttgc ttttgtcacc caggctggag    13680

tggcatgatc tcagctcact gcaacctctg cctccaggct tcaagcaatt ctcctgcctt    13740

agcctcccga gtagctggga ttacaggtgc ccacccccac acctggctaa tttttgtatt    13800

tttagtagag acgagatttc accatgttgg tcaggctggt ctagatctcc gacctcaggt    13860

ggtctgcctg tctcagcctc ccaaagtgct gggattgcag acgtgagcca ccatgcctgg    13920

ccaactggat aattttaaaa agaccatttt atttagtcta ttttttctca atctatagat    13980

gagataagaa aaatcattct agatgtccaa ggaaaaattc tttcagaaaa gagctgtgaa    14040

tgatatcaca aaccccccaa acagttaagg tatttctttc ctggttattt tatgtccaaa    14100

atcatgcata tgaacatgtg cacacacatg agcgtgcaca cacacatgaa tacatataca    14160

cgcacataat gtaccttagg ttatctttcc attctgagta attatcgtaa aatgggtaaa    14220

atcaaccccg taagatacct tcatcgataa ggcaaatcaa agctttggta atttctgcta    14280

tcttggcctt tgttgattga ctaataatga ataagagaat gagtttcaat atttactatg    14340

aaattatttt agaagacagg atgtagacag tggctgttag caggcaattg tttggcatga    14400

gccagtaatg gttactgtga aaaaaatcaa ccaagcagcc catatattaa acaaacacac    14460

gcagaagcac gttggagtct gaagcctcat atgtacaatt ttcagtaaag aaataacttt    14520

tagatatgaa ataaacaaat agatatatgt tgtaaacttg tccctatgta ttttgatcaa    14580

attgcatcat atttttttca ctttaaagaa gagaatttag tgctttaact gagacttagt    14640

gttatcattc aaaatatact gactgccaat agcagtagaa agataatctg gttccatgca    14700

actctatttt ttttcctctg tcgcaagtaa aagacaaaat taagtacatg aattagtgct    14760

ttttgaagat attccagagc aatataccat gccactatgg agaacctctc taaaaatatc    14820

ccattttttt acctgagaaa aatattgatc atgttatatg ccactcaaat tggtttatta    14880

aattcgttga atgatatcag catctcttaa tgcattcact aaacaagcag taattgagtg    14940

catatacaaa gttttatcat ccaccaaaac agtgacaatc cacatgaggc tctaatagaa    15000

gtttagaaag ggggttaagt ggttaaatgc tggactcaga aagattggat tcaaatccca    15060

ggtcctttag cttaatagtt gtagaatctt gtgaaaatat cttaattctt ttcatgtctc    15120

tgatttctct tctctaaaat ggaaatataa atgagatgtg tataaagcca cttggaatag    15180

cattttgcac aaaataatta ctcattaaat gtaagcccct attataacta atcactcttt    15240

ataagtgatt agttcatatc aatacaaact aagacttatt tactgaatta tcgtctctaa    15300

acatccacac tgcagaaaaa ccaacctgga aatttcataa aaccttattt ttatgtagta    15360

taatttcttc tcaaagcata agggctcttg gattaggaat tgaggaaaat tccaattcag    15420

ccaaacgcat ctgtttcaga tagctgacac ttctgcctac tcatttccta gctaacaaga    15480

agaaatgtta atgggagttt tcaaaggaaa agctgaacac catgaaggaa agtgacacaa    15540

ataatgttag ctcatatatt gacagggtga atttgtgtgc tttcaagtcc cttcagtgaa    15600

aataggaaag tagaaattat aaaatgccct aacatttaaa gctagcatgt tcttggagac    15660

taggaaaaaa taagttttaa aacatgggct atgatagaat gagatggaaa atgtttgtag    15720

ttgccagtag aaacaataac aattaccatt agattaagta tttaaaccag ctgaatattt    15780

ttattaatgg aaatggcatc tgttttatga aataatgctg ctgaatgaac catattaaaa    15840

atgaccagta tttcctgcag aacgttgtcg cagacataca agcctgagac cctaaaatct    15900

taaggtattc catttgaaat cgaccttaag acattaacag tagtggtatt gtttagatga    15960

aattttttag gctttaaatc aacaaatgtt aagcagacat ggggagcgaa acaccagtgt    16020

gttattctga catgaataaa ctgctgtttt tagggaaaaa atatagtctt gttaaggtta    16080

agctaattgg ttttctggta tcttttgcaa tgttagtgtg ttttactgct ccataaccta    16140

tgttatatgg taaatgtgca atatatttat atatgttgct gtaaagaaat gtaataaaaa    16200

actgtttact ttgtgatatg aaagtaaaaa tttattcatt gtcattgagc atacagaagt    16260

aaatatggat tacatatgtc atattttaat gttcacatgg tcccaccatc aaatgttgaa    16320

aaacttatag tttaacgtca tattctattg aagaaaaata cactcccttt tctcaaatgt    16380

gaaatgtcca gagagaatgg aaaattacat ataaagcatg tagttatagc atggtgaccc    16440

tgctgtgatc tctcagatga ggaacaaaag ggagaaagaa agagcacact ggtgctttgg    16500

agttgagaga aggcaaaaaa agagtacaaa aatgtcaaag ccaagtttag ctgctcttca    16560

gctctccctt tagctgctct tcagctttac cttaccatgg ttattagtga ttgaagaaaa    16620

ttctaaagca ctttttaaag gacccaattc tgaagagttt agattcagag agcacaatgg    16680

agttggagtg actcctgctc aaaagtttga gacaagcgag tccatgaaaa gaccgtcctc    16740

ctcttaatgg aaatacccag gttttctcat tcttctcgcc ttgctttcag cactcgcagc    16800

ccagaaagcc cttatctaac aggtactgcc gttgaaaggt cattgacttg tacaaaaatg    16860

atgagtgctg aatagatgtg cataggtcac tgacagtatc tgctacagag aatgagtttt    16920

cgtattttta ttaggataca cctaacatgg caatctactg cctcaaagaa ctctatagga    16980

ggtaagtgaa tttatattaa tacagattga attaaaggat aatctagaaa aaggcatatg    17040

atgtaaaaaa atcagacaca agtatatttt ctgtatagtc agtttttaca ttgtgatttc    17100

accagctggc tgctgagttt gacggcttct taacagccac actgctgaga ttcaaatgct    17160

gatagaaact ttgatggaaa aatcactgga gtaaatattt ctaccatctg ttgcccttca    17220

ctgggaccct aacgttaaga ataattcata ccattgcttg tcctttatat ttccccagca    17280

gtaataaaat ttcataagat tttgttttgt ggtcacaaag ctatcctggt ttctgtaact    17340

agaagacata cactagcata agggaatcag ccggaaaatt tactgctaag agaatttgtc    17400

tctagtcact tactttaagg ttacagcaat gtgtaagtgt gggaatacat tttaaaatga    17460

gcttttcaaa gttattagct ggtagtggca tgagagttaa gtctcttaat acagttaaac    17520

agttgggcac ttcatccttg cgtaaatatt gttacccttt tattgctgct tggaaactcc    17580

tctgcaactt tttggcccct atccatcttt tcagaagtag taaataacca atttactggg    17640

agtgtggtac caggcagaaa ttccgagagg ggctttcaat ccttgcccat caagtgtatc    17700

tttcagaaat aagtatatta aaataattgg ataatttcag tggcttgtta ttagacttcc    17760

gttgtccagc atggcatgtt taagaagatg acagattttc atacattatt ggaaagaagc    17820

aagaacaaaa aaacataact tactgtagta accacggtaa agaactgctt aaaatgcagg    17880

ataaacatgt catccctaag ggattcccat tcttagagca tgaaattatc aagagagtaa    17940

gagactacaa aaaatgagaa gaatgctgat tgcaaattcc aaatagaaaa aatcaaaaca    18000

aaactgcgca ccatcattct ggaagcaatg agaagcagaa attgtcattt aatgaaatgt    18060

aagattaaag ttaatagaag taattttcat gaaataatat tttgcaagga cgatgttcca    18120

gccatattga tcttcgtgtt ttcttttcac atcccttctt actgttccct agaatgcttg    18180

tttctacctt taaatttgct tttctctcta ccagagggct ctaccctatc tccagtttct    18240

caccatgtcc caatctactc cctctcagaa tttttgtaca cttcccttta tatatatttg    18300

tgctctaatt ttatattcac agatatgcct tttgtaactc ccccatctta aagaaagcac    18360

acacgtacgc acacatgcac acacacaaaa ttgaactctt tctgggagat ctgcttaact    18420

ttcttcataa ctctgtcact tgctgaaact gtagtatgtg ttttcatgtt tattatcttt    18480

tccattagaa tgaacatatt ttgggtactt ggtctttctc gatcaccaat atacctcggt    18540

acgtagaaaa attgattcat atattgaaaa tgtaatattc agtagaacga ataaatacat    18600

aaataaattt aaaaatgata cttttattgt attacctgag acaaatgatc cccaagtttg    18660

tccttgcttt tcatagccaa aacattctct cttacattga gcttccttca cctcttctgt    18720

gtacagagca cttaaaattt tcacattgcc tgatacttta acaatatgat ggccctgttc    18780

tcttacccat tggagcatat gttaaatacc agaacccatg taacaaacat atattgtgat    18840

cctactgtgt gcaaagcaga tactgcttgc tgctaggaat acagagctga ctaagagctc    18900

cttttctctt tatgagctca cagtctcatg agttcaacgt cttaaggcac aacgtctaaa    18960

gcaaagggca gtaagtaaac actccagaaa gtactggatc tggcctagga caaatggtgg    19020

gttgtttttc cagctgttat ttttcctgcc ccctaattga cagtcctcca ttacacctct    19080

gggataccta gtctgacttg ggaaaacctg actttgggaa tcagaggcag tctctcttgc    19140

ttatatatga ggaactctaa tggatactta ctgtcattag agaaactctg cttctagcct    19200

ggctcctttt gtaaagaagg ttgagtcccc ttggagagcc tgcagaacat aaccatttgc    19260

atgtaatgaa cagtttgtaa tactttgaga ttgatgtgca atttctattt gacaagggaa    19320

aaacaattag gattaaccgt ggtcgtatat cccagaatac caacgttgtt tccacactct    19380

aagtgttgtt gggtcattat atgagattca taattttgtc ctgttgtacc cacgtttgca    19440

ttaccattca gtcttaattt attataccct attaaaagtt tttttggtaa tttgttctta    19500

ttgctactca ggcattaaaa tgtctgcagg ctgtgaaaat gaataaattt aatgtggcag    19560

catagttctc aaaatcctgg ctttacaact catagtacag gcttgtattg taaatcctag    19620

ttaacatgga tttatttgaa aatccaattt tactgctaat cttaaataac acatttttca    19680

aacattttat ccttgaattt ctattttttt ataatttatg gctgttgtat gtatttacaa    19740

aaggacaatg tgtgtacttt taaatactag taatggattg ctgaaacaac tgtaacttta    19800

aaacaatgca attgttaaaa aaataaactg tgcagcctgg cttaatggag gcttatgaac    19860

atatgattaa gatatatgct ataataagca aattcactca actgatagtt cataggaact    19920

ttcaaattta atctcataac cagtgctatc cttcaaagaa tggtcagggc aatttaacga    19980

gtacatgacc acgcaagata atttcattga agagtggctg aactgttgaa atattttcta    20040

gtctccttgg gatatcatta agagcagaaa ttttgaaatg gaattgtaat gatgttcaga    20100

aaagataagt aggtaactct cttaatacgt tttgtgctgc tgtaacaaag tacctaagac    20160

taggtaataa tttgtaatga acaaaaatgt attggctcac agttctggag actaggaagt    20220

ctaacattaa ggtgtcagcc tctggcgagg gcctacttga tatgtcatca catgatggac    20280

gattagaggg caagaaagat caaaaggggg ctgaactccc acttttataa gggaaccaaa    20340

cccactcgtg agggtggagc cctcaatcct taatcacctc ctaaagctcc caccccttaa    20400

tactgtcaca atggcaatta aatttcaaca tcagttttgg agggaaaaac attgaaacca    20460

tagtagtgat actgactact accacacagg gcttgggagg ctaccctagc tgttgcaccc    20520

aagagatgaa tcttctaatg tgattacctt tatcattttt tttactttat taaaatactt    20580

ttattttaca tgtatacttt tgtctaccca ccatttccat gtctgaccac tgctactact    20640

atgtcctagc ataacattcc atacatcctt aaaaccaagc aaagggtgga gttccatctt    20700

taaaaactaa acaggcattt tggacaacac attcttggca atggaatctg gacaacattt    20760

atcaaacatg gtagggaagg ttctcactct gcattatcaa aacgacagcc agatatcaac    20820

tgttacagaa acgaaatcag atggaaaatt tttaacaaat tgtttaaact attttcttag    20880

agagacttcc tccactgcca gagatcttga atagcctctg gtcagtcatc tggaagcaat    20940

tcttcacata attcatgaac ttggcttcca ctttaggaag agaaccacct ttttctatac    21000

ttgcttgcat ttttgcttta atgtcttcta cagaactagg tcctttgggt gttttaggag    21060

tttttccttg ttttgaagga ttcttgtcct tttgatcttg gtgttgacgg ttttgagtct    21120

tttccattcc gatttgactt ttgtgcattt ttggctggag tatctcatat agatttcttc    21180

actggcgctt tttcttcagt ttcctcatca tcaaaatcat catcatcatc aaaatcatca    21240

tcttcatcag cagcaagttt tacttttttc tgtggaacct tgctaccacc tccaggagca    21300

gatcgctttc cagatatact tatgagtttc acatcctcct cctgttcgtc ttctgactct    21360

gtatcttcct ccccagctac taaatgctgt ccactcacat gcactggccc tgaaccacac    21420

ttcaaccgta agaccactga tggtgttatt tcaaagccct caagggaaac catgggctgt    21480

acagacattt tcaaagctgc cagtgttact ttaattggac tgcctttgta actcattgcc    21540

tctgcttcaa caatgtgcaa tttatccttt gccccagccc ctaaactgac cgttcttaaa    21600

gataactgtt gctcaatttc attattatcc accttaaagt gatcatcttt gtcggccttt    21660

agttcacaac caaaaagata gttttggggc ctcagaggac tcatgtccat catcgtccat    21720

caggtggcag gacgcactta ggtgggagag aaggcagatg atgataaagg accactgctc    21780

aagagaacag ctgtgcagga cagaatcaca ccagggagat tacctttatc ttagaaaacc    21840

tgaacatctt gtgtactttg acacttctct acatttcacc taacctttaa catcaacaca    21900

tttattcaga aaacttttac ttttggagct gctctgtgtc aggctctatg ctaggtgctc    21960

aggatattga aattgataca atcctaacct attcacatat aatccaaggt ttgctgaaat    22020

tgatggacat ttaaacaatt gaaacattta agtggtataa ttagcaaatg gacatttaag    22080

ccataaaaat agcatctaat agatataata gaggtcggta caccattgat gagtcagagc    22140

agaggcaacc caaagagtaa ctagccagaa gaattgggaa agcttcatag agagagcgat    22200

atgaaaataa gggagagaat tgtaaatcca tgaaaatgag aaaaagttga aaagtgatgg    22260

tgtcagaaaa acttgtggta tgataatgac aagatgagag gaactcttgg taagcgtgtt    22320

ggatgcatgg aaagaaatgg cacaaaataa tgctgaggac attttttatt ttattgttgg    22380

ttttgttttg gttaatttca ttttttaaat ctagtatgct agtgttcatt gtccaaactg    22440

tgaatcataa actcagtttg tggatcaaca ccggcctttg atttttagtg aaacaaaata    22500

gaaaatatca gcattcatca caaatagatg tttcacagat tttttgtttt aattgcgact    22560

gtgtgtgtgt gggtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtatgtga gagagagaga    22620

gagagagaga gagagagatg gcttggatgt ttatcacctc cgaatcttat attgaaatgt    22680

gatttccaat gttggaggca gggcctggta ggtgtgattg gatcatgtgg gtggatcctt    22740

catgaatgat ccctttggtg acaagttagt tcatgctata tgtggttgtt taaaagagta    22800

tgagacctca acccccacct gtttcctgct ctcccctttg ccttccacca tggttggttg    22860

taaacttcct gaggctctca ccagaagtag atgccagtga catgcttcct gtacagcctg    22920

cagaaccgta agtcaaaaga aaaccccttt tctttttaaa gcac                     22964


<210>  218
<211>  11058
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DMD Universal replacement cassette with cDNA for any mutations 
       recovery

<400>  218
atgctttggt gggaagaagt agaggactgt tatgaaagag aagatgttca aaagaaaaca       60

ttcacaaaat gggtaaatgc acaattttct aagtttggga agcagcatat tgagaacctc      120

ttcagtgacc tacaggatgg gaggcgcctc ctagacctcc tcgaaggcct gacagggcaa      180

aaactgccaa aagaaaaagg atccacaaga gttcatgccc tgaacaatgt caacaaggca      240

ctgcgggttt tgcagaacaa taatgttgat ttagtgaata ttggaagtac tgacatcgta      300

gatggaaatc ataaactgac tcttggtttg atttggaata taatcctcca ctggcaggtc      360

aaaaatgtaa tgaaaaatat catggctgga ttgcaacaaa ccaacagtga aaagattctc      420

ctgagctggg tccgacaatc aactcgtaat tatccacagg ttaatgtaat caacttcacc      480

accagctggt ctgatggcct ggctttgaat gctctcatcc atagtcatag gccagaccta      540

tttgactgga atagtgtggt ttgccagcag tcagccacac aacgactgga acatgcattc      600

aacatcgcca gatatcaatt aggcatagag aaactactcg atcctgaaga tgttgatacc      660

acctatccag ataagaagtc catcttaatg tacatcacat cactcttcca agttttgcct      720

caacaagtga gcattgaagc catccaggaa gtggaaatgt tgccaaggcc acctaaagtg      780

actaaagaag aacattttca gttacatcat caaatgcact attctcaaca gatcacggtc      840

agtctagcac agggatatga gagaacttct tcccctaagc ctcgattcaa gagctatgcc      900

tacacacagg ctgcttatgt caccacctct gaccctacac ggagcccatt tccttcacag      960

catttggaag ctcctgaaga caagtcattt ggcagttcat tgatggagag tgaagtaaac     1020

ctggaccgtt atcaaacagc tttagaagaa gtattatcgt ggcttctttc tgctgaggac     1080

acattgcaag cacaaggaga gatttctaat gatgtggaag tggtgaaaga ccagtttcat     1140

actcatgagg ggtacatgat ggatttgaca gcccatcagg gccgggttgg taatattcta     1200

caattgggaa gtaagctgat tggaacagga aaattatcag aagatgaaga aactgaagta     1260

caagagcaga tgaatctcct aaattcaaga tgggaatgcc tcagggtagc tagcatggaa     1320

aaacaaagca atttacatag agttttaatg gatctccaga atcagaaact gaaagagttg     1380

aatgactggc taacaaaaac agaagaaaga acaaggaaaa tggaggaaga gcctcttgga     1440

cctgatcttg aagacctaaa acgccaagta caacaacata aggtgcttca agaagatcta     1500

gaacaagaac aagtcagggt caattctctc actcacatgg tggtggtagt tgatgaatct     1560

agtggagatc acgcaactgc tgctttggaa gaacaactta aggtattggg agatcgatgg     1620

gcaaacatct gtagatggac agaagaccgc tgggttcttt tacaagacat ccttctcaaa     1680

tggcaacgtc ttactgaaga acagtgcctt tttagtgcat ggctttcaga aaaagaagat     1740

gcagtgaaca agattcacac aactggcttt aaagatcaaa atgaaatgtt atcaagtctt     1800

caaaaactgg ccgttttaaa agcggatcta gaaaagaaaa agcaatccat gggcaaactg     1860

tattcactca aacaagatct tctttcaaca ctgaagaata agtcagtgac ccagaagacg     1920

gaagcatggc tggataactt tgcccggtgt tgggataatt tagtccaaaa acttgaaaag     1980

agtacagcac agatttcaca ggctgtcacc accactcagc catcactaac acagacaact     2040

gtaatggaaa cagtaactac ggtgaccaca agggaacaga tcctggtaaa gcatgctcaa     2100

gaggaacttc caccaccacc tccccaaaag aagaggcaga ttactgtgga ttctgaaatt     2160

aggaaaaggt tggatgttga tataactgaa cttcacagct ggattactcg ctcagaagct     2220

gtgttgcaga gtcctgaatt tgcaatcttt cggaaggaag gcaacttctc agacttaaaa     2280

gaaaaagtca atgccataga gcgagaaaaa gctgagaagt tcagaaaact gcaagatgcc     2340

agcagatcag ctcaggccct ggtggaacag atggtgaatg agggtgttaa tgcagatagc     2400

atcaaacaag cctcagaaca actgaacagc cggtggatcg aattctgcca gttgctaagt     2460

gagagactta actggctgga gtatcagaac aacatcatcg ctttctataa tcagctacaa     2520

caattggagc agatgacaac tactgctgaa aactggttga aaatccaacc caccacccca     2580

tcagagccaa cagcaattaa aagtcagtta aaaatttgta aggatgaagt caaccggcta     2640

tcagatcttc aacctcaaat tgaacgatta aaaattcaaa gcatagccct gaaagagaaa     2700

ggacaaggac ccatgttcct ggatgcagac tttgtggcct ttacaaatca ttttaagcaa     2760

gtcttttctg atgtgcaggc cagagagaaa gagctacaga caatttttga cactttgcca     2820

ccaatgcgct atcaggagac catgagtgcc atcaggacat gggtccagca gtcagaaacc     2880

aaactctcca tacctcaact tagtgtcacc gactatgaaa tcatggagca gagactcggg     2940

gaattgcagg ctttacaaag ttctctgcaa gagcaacaaa gtggcctata ctatctcagc     3000

accactgtga aagagatgtc gaagaaagcg ccctctgaaa ttagccggaa atatcaatca     3060

gaatttgaag aaattgaggg acgctggaag aagctctcct cccagctggt tgagcattgt     3120

caaaagctag aggagcaaat gaataaactc cgaaaaattc agaatcacat acaaaccctg     3180

aagaaatgga tggctgaagt tgatgttttt ctgaaggagg aatggcctgc ccttggggat     3240

tcagaaattc taaaaaagca gctgaaacag tgcagacttt tagtcagtga tattcagaca     3300

attcagccca gtctaaacag tgtcaatgaa ggtgggcaga agataaagaa tgaagcagag     3360

ccagagtttg cttcgagact tgagacagaa ctcaaagaac ttaacactca gtgggatcac     3420

atgtgccaac aggtctatgc cagaaaggag gccttgaagg gaggtttgga gaaaactgta     3480

agcctccaga aagatctatc agagatgcac gaatggatga cacaagctga agaagagtat     3540

cttgagagag attttgaata taaaactcca gatgaattac agaaagcagt tgaagagatg     3600

aagagagcta aagaagaggc ccaacaaaaa gaagcgaaag tgaaactcct tactgagtct     3660

gtaaatagtg tcatagctca agctccacct gtagcacaag aggccttaaa aaaggaactt     3720

gaaactctaa ccaccaacta ccagtggctc tgcactaggc tgaatgggaa atgcaagact     3780

ttggaagaag tttgggcatg ttggcatgag ttattgtcat acttggagaa agcaaacaag     3840

tggctaaatg aagtagaatt taaacttaaa accactgaaa acattcctgg cggagctgag     3900

gaaatctctg aggtgctaga ttcacttgaa aatttgatgc gacattcaga ggataaccca     3960

aatcagattc gcatattggc acagacccta acagatggcg gagtcatgga tgagctaatc     4020

aatgaggaac ttgagacatt taattctcgt tggagggaac tacatgaaga ggctgtaagg     4080

aggcaaaagt tgcttgaaca gagcatccag tctgcccagg agactgaaaa atccttacac     4140

ttaatccagg agtccctcac attcattgac aagcagttgg cagcttatat tgcagacaag     4200

gtggacgcag ctcaaatgcc tcaggaagcc cagaaaatcc aatctgattt gacaagtcat     4260

gagatcagtt tagaagaaat gaagaaacat aatcagggga aggaggctgc ccaaagagtc     4320

ctgtctcaga ttgatgttgc acagaaaaaa ttacaagatg tctccatgaa gtttcgatta     4380

ttccagaaac cagccaattt tgagcagcgt ctacaagaaa gtaagatgat tttagatgaa     4440

gtgaagatgc acttgcctgc attggaaaca aagagtgtgg aacaggaagt agtacagtca     4500

cagctaaatc attgtgtgaa cttgtataaa agtctgagtg aagtgaagtc tgaagtggaa     4560

atggtgataa agactggacg tcagattgta cagaaaaagc agacggaaaa tcccaaagaa     4620

cttgatgaaa gagtaacagc tttgaaattg cattataatg agctgggagc aaaggtaaca     4680

gaaagaaagc aacagttgga gaaatgcttg aaattgtccc gtaagatgcg aaaggaaatg     4740

aatgtcttga cagaatggct ggcagctaca gatatggaat tgacaaagag atcagcagtt     4800

gaaggaatgc ctagtaattt ggattctgaa gttgcctggg gaaaggctac tcaaaaagag     4860

attgagaaac agaaggtgca cctgaagagt atcacagagg taggagaggc cttgaaaaca     4920

gttttgggca agaaggagac gttggtggaa gataaactca gtcttctgaa tagtaactgg     4980

atagctgtca cctcccgagc agaagagtgg ttaaatcttt tgttggaata ccagaaacac     5040

atggaaactt ttgaccagaa tgtggaccac atcacaaagt ggatcattca ggctgacaca     5100

cttttggatg aatcagagaa aaagaaaccc cagcaaaaag aagacgtgct taagcgttta     5160

aaggcagaac tgaatgacat acgcccaaag gtggactcta cacgtgacca agcagcaaac     5220

ttgatggcaa accgcggtga ccactgcagg aaattagtag agccccaaat ctcagagctc     5280

aaccatcgat ttgcagccat ttcacacaga attaagactg gaaaggcctc cattcctttg     5340

aaggaattgg agcagtttaa ctcagatata caaaaattgc ttgaaccact ggaggctgaa     5400

attcagcagg gggtgaatct gaaagaggaa gacttcaata aagatatgaa tgaagacaat     5460

gagggtactg taaaagaatt gttgcaaaga ggagacaact tacaacaaag aatcacagat     5520

gagagaaagc gagaggaaat aaagataaaa cagcagctgt tacagacaaa acataatgct     5580

ctcaaggatt tgaggtctca aagaagaaaa aaggctctag aaatttctca tcagtggtat     5640

cagtacaaga ggcaggctga tgatctcctg aaatgcttgg atgacattga aaaaaaatta     5700

gccagcctac ctgagcccag agatgaaagg aaaataaagg aaattgatcg ggaattgcag     5760

aagaagaaag aggagctgaa tgcagtgcgt aggcaagctg agggcttgtc tgaggatggg     5820

gccgcaatgg cagtggagcc aactcagatc cagctcagca agcgctggcg ggaaattgag     5880

agcaaatttg ctcagtttcg aagactcaac tttgcacaaa ttcacactgt ccgtgaagaa     5940

acgatgatgg tgatgactga agacatgcct ttggaaattt cttatgtgcc ttctacttat     6000

ttgactgaaa tcactcatgt ctcacaagcc ctattagaag tggaacaact tctcaatgct     6060

cctgacctct gtgctaagga ctttgaagat ctctttaagc aagaggagtc tctgaagaat     6120

ataaaagata gtctacaaca aagctcaggt cggattgaca ttattcatag caagaagaca     6180

gcagcattgc aaagtgcaac gcctgtggaa agggtgaagc tacaggaagc tctctcccag     6240

cttgatttcc aatgggaaaa agttaacaaa atgtacaagg accgacaagg gcgatttgac     6300

agatctgttg agaaatggcg gcgttttcat tatgatataa agatatttaa tcagtggcta     6360

acagaagctg aacagtttct cagaaagaca caaattcctg agaattggga acatgctaaa     6420

tacaaatggt atcttaagga actccaggat ggcattgggc agcggcaaac tgttgtcaga     6480

acattgaatg caactgggga agaaataatt cagcaatcct caaaaacaga tgccagtatt     6540

ctacaggaaa aattgggaag cctgaatctg cggtggcagg aggtctgcaa acagctgtca     6600

gacagaaaaa agaggctaga agaacaaaag aatatcttgt cagaatttca aagagattta     6660

aatgaatttg ttttatggtt ggaggaagca gataacattg ctagtatccc acttgaacct     6720

ggaaaagagc agcaactaaa agaaaagctt gagcaagtca agttactggt ggaagagttg     6780

cccctgcgcc agggaattct caaacaatta aatgaaactg gaggacccgt gcttgtaagt     6840

gctcccataa gcccagaaga gcaagataaa cttgaaaata agctcaagca gacaaatctc     6900

cagtggataa aggtttccag agctttacct gagaaacaag gagaaattga agctcaaata     6960

aaagaccttg ggcagcttga aaaaaagctt gaagaccttg aagagcagtt aaatcatctg     7020

ctgctgtggt tatctcctat taggaatcag ttggaaattt ataaccaacc aaaccaagaa     7080

ggaccatttg acgttaagga aactgaaata gcagttcaag ctaaacaacc ggatgtggaa     7140

gagattttgt ctaaagggca gcatttgtac aaggaaaaac cagccactca gccagtgaag     7200

aggaagttag aagatctgag ctctgagtgg aaggcggtaa accgtttact tcaagagctg     7260

agggcaaagc agcctgacct agctcctgga ctgaccacta ttggagcctc tcctactcag     7320

actgttactc tggtgacaca acctgtggtt actaaggaaa ctgccatctc caaactagaa     7380

atgccatctt ccttgatgtt ggaggtacct gctctggcag atttcaaccg ggcttggaca     7440

gaacttaccg actggctttc tctgcttgat caagttataa aatcacagag ggtgatggtg     7500

ggtgaccttg aggatatcaa cgagatgatc atcaagcaga aggcaacaat gcaggatttg     7560

gaacagaggc gtccccagtt ggaagaactc attaccgctg cccaaaattt gaaaaacaag     7620

accagcaatc aagaggctag aacaatcatt acggatcgaa ttgaaagaat tcagaatcag     7680

tgggatgaag tacaagaaca ccttcagaac cggaggcaac agttgaatga aatgttaaag     7740

gattcaacac aatggctgga agctaaggaa gaagctgagc aggtcttagg acaggccaga     7800

gccaagcttg agtcatggaa ggagggtccc tatacagtag atgcaatcca aaagaaaatc     7860

acagaaacca agcagttggc caaagacctc cgccagtggc agacaaatgt agatgtggca     7920

aatgacttgg ccctgaaact tctccgggat tattctgcag atgataccag aaaagtccac     7980

atgataacag agaatatcaa tgcctcttgg agaagcattc ataaaagggt gagtgagcga     8040

gaggctgctt tggaagaaac tcatagatta ctgcaacagt tccccctgga cctggaaaag     8100

tttcttgcct ggcttacaga agctgaaaca actgccaatg tcctacagga tgctacccgt     8160

aaggaaaggc tcctagaaga ctccaaggga gtaaaagagc tgatgaaaca atggcaagac     8220

ctccaaggtg aaattgaagc tcacacagat gtttatcaca acctggatga aaacagccaa     8280

aaaatcctga gatccctgga aggttccgat gatgcagtcc tgttacaaag acgtttggat     8340

aacatgaact tcaagtggag tgaacttcgg aaaaagtctc tcaacattag gtcccatttg     8400

gaagccagtt ctgaccagtg gaagcgtctg cacctttctc tgcaggaact tctggtgtgg     8460

ctacagctga aagatgatga attaagccgg caggcaccta ttggaggcga ctttccagca     8520

gttcagaagc agaacgatgt acatagggcc ttcaagaggg aattgaaaac taaagaacct     8580

gtaatcatga gtactcttga gactgtacga atatttctga cagagcagcc tttggaagga     8640

ctagagaaac tctaccagga gcccagagag ctgcctcctg aggagagagc ccagaatgtc     8700

actcggcttc tacgaaagca ggctgaggag gtcaatactg agtgggaaaa attgaacctg     8760

cactccgctg actggcagag aaaaatagat gagacccttg aaagactccg ggaacttcaa     8820

gaggccacgg atgagctgga cctcaagctg cgccaagctg aggtgatcaa gggatcctgg     8880

cagcccgtgg gcgatctcct cattgactct ctccaagatc acctcgagaa agtcaaggca     8940

cttcgaggag aaattgcgcc tctgaaagag aacgtgagcc acgtcaatga ccttgctcgc     9000

cagcttacca ctttgggcat tcagctctca ccgtataacc tcagcactct ggaagacctg     9060

aacaccagat ggaagcttct gcaggtggcc gtcgaggacc gagtcaggca gctgcatgaa     9120

gcccacaggg actttggtcc agcatctcag cactttcttt ccacgtctgt ccagggtccc     9180

tgggagagag ccatctcgcc aaacaaagtg ccctactata tcaaccacga gactcaaaca     9240

acttgctggg accatcccaa aatgacagag ctctaccagt ctttagctga cctgaataat     9300

gtcagattct cagcttatag gactgccatg aaactccgaa gactgcagaa ggccctttgc     9360

ttggatctct tgagcctgtc agctgcatgt gatgccttgg accagcacaa cctcaagcaa     9420

aatgaccagc ccatggatat cctgcagatt attaattgtt tgaccactat ttatgaccgc     9480

ctggagcaag agcacaacaa tttggtcaac gtccctctct gcgtggatat gtgtctgaac     9540

tggctgctga atgtttatga tacgggacga acagggagga tccgtgtcct gtcttttaaa     9600

actggcatca tttccctgtg taaagcacat ttggaagaca agtacagata ccttttcaag     9660

caagtggcaa gttcaacagg attttgtgac cagcgcaggc tgggcctcct tctgcatgat     9720

tctatccaaa ttccaagaca gttgggtgaa gttgcatcct ttgggggcag taacattgag     9780

ccaagtgtcc ggagctgctt ccaatttgct aataataagc cagagatcga agcggccctc     9840

ttcctagact ggatgagact ggaaccccag tccatggtgt ggctgcccgt cctgcacaga     9900

gtggctgctg cagaaactgc caagcatcag gccaaatgta acatctgcaa agagtgtcca     9960

atcattggat tcaggtacag gagtctaaag cactttaatt atgacatctg ccaaagctgc    10020

tttttttctg gtcgagttgc aaaaggccat aaaatgcact atcccatggt ggaatattgc    10080

actccgacta catcaggaga agatgttcga gactttgcca aggtactaaa aaacaaattt    10140

cgaaccaaaa ggtattttgc gaagcatccc cgaatgggct acctgccagt gcagactgtc    10200

ttagaggggg acaacatgga aactcccgtt actctgatca acttctggcc agtagattct    10260

gcgcctgcct cgtcccctca gctttcacac gatgatactc attcacgcat tgaacattat    10320

gctagcaggc tagcagaaat ggaaaacagc aatggatctt atctaaatga tagcatctct    10380

cctaatgaga gcatagatga tgaacatttg ttaatccagc attactgcca aagtttgaac    10440

caggactccc ccctgagcca gcctcgtagt cctgcccaga tcttgatttc cttagagagt    10500

gaggaaagag gggagctaga gagaatccta gcagatcttg aggaagaaaa caggaatctg    10560

caagcagaat atgaccgtct aaagcagcag cacgaacata aaggcctgtc cccactgccg    10620

tcccctcctg aaatgatgcc cacctctccc cagagtcccc gggatgctga gctcattgct    10680

gaggccaagc tactgcgtca acacaaaggc cgcctggaag ccaggatgca aatcctggaa    10740

gaccacaata aacagctgga gtcacagtta cacaggctaa ggcagctgct ggagcaaccc    10800

caggcagagg ccaaagtgaa tggcacaacg gtgtcctctc cttctacctc tctacagagg    10860

tccgacagca gtcagcctat gctgctccga gtggttggca gtcaaacttc ggactccatg    10920

ggtgaggaag atcttctcag tcctccccag gacacaagca cagggttaga ggaggtgatg    10980

gagcaactca acaactcctt ccctagttca agaggaagaa atacccctgg aaagccaatg    11040

agagaggaca caatgtag                                                  11058


<210>  219
<211>  7607
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS Native replacement sequence for Promoter, exons 1-3 
       mutations recovery using CTNS4 and CTNS1

<400>  219
atacttatga gtgaaaagta tgaacttgag gaaagaacac agccagcaga tattactttt       60

tttttttttt tttttttttt ggagacagag tcttactctg ttgcccaggc tggagtgcag      120

tggtatgatc tgggctcact gcaacctctg cctcccgagt tcaagcaatt ctcctgcctc      180

agcctcccaa gtagctggga ttacaagcac gcatcaccac gcccggctaa tttttgttat      240

tttgtagtag agacagggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca      300

agtgatccac ccacctccgc ctcccaaagt gctgggatta caggcaagag ccaccgcgcc      360

cggccacaga tatgactata gatcactggt tcctactcgg ggtggtcttg tcacctaggg      420

aacatttggc aacatggaga catttttggt tgtcacatct ggggaagagg ggcaagcgtg      480

gctggcatct agtgggccag agatgttgct aaacattcta caacatgcag gacacccctc      540

acacaacaaa aactatgcag cccaaaatgt cagcagcacc aaggttgaga aaccctgcta      600

tatagactaa ctcacagcag tgctgtttgt cccagagcac gattcatatg tggtgtgggg      660

gggttaatga ctggcctccg ctaagcactt cattaaatag gtgtgacaca ctgggtgagc      720

ctgtaagcac agaacagcct gctgaaagct ggggagggag ggcagaaaag ttttcaagaa      780

gtggccgtgc tgccgcccct actgggaagt gaggagcccc tctgcccggc caccaccccg      840

tctgggtagt gtacccaaca gctcattgag aatgggccat gatgacaatg gcggttttgt      900

ggaatagaaa agggggaaag gtggggaaaa gattgagaaa tcggatggtt gctgtgtctg      960

tgtagaaaga agtagacatg ggagactttt cattttgttc cgtactaaga aaaattcttc     1020

tgccttggga tcctgttgat ctgtgacctt acccccaacc ctgtgctctc tcaaacatgt     1080

gctgtgtcca ctcagggtta aatggattaa gggcggtgca agatgtgctt tgttaaacag     1140

atgcttgaag gcagcatgcc cgttaagagt catcaccact ccctaatctc aagtacccag     1200

ggacacaaac actgcggaag gccgcagggt cctctgccta ggaaaaccag agacctttgt     1260

tcacttgttt atctgctgtc cttccctcca ctattgtcct atgaccctgc caaatccccc     1320

tctgcgagaa acacccaaga gtgatcaatt aaaaaaaaaa aaaaagtggc catgctgggt     1380

gcggtggctc acacctgtaa tcccagcact ttgggaaacc gaggcaggca gatcagttga     1440

ggtcaggagt ttgagaccag ccttgccaac atggtgaaac cccatctcta ccaaaaatac     1500

aaaaaaattc tccaagcatg gtggcgcaca cctgtaatcc cagctactcg ggaaactgag     1560

gcacgaaaat cacttgaacc cgggaggcag aggtttcagt gagcagagat tgcaccactg     1620

cactccagcc tgggtgacag agcgagaccc tgtctcaaaa aaaaaaaaaa aaaaaaaaga     1680

agtgctctat ttcaggagaa actggcactt tctgagccta ctctccccta atgccagctc     1740

tcctgctcac cccaccaggg tcagagccaa ctttgcctcc aattcatagt cctttaagta     1800

agaatccttt taatatgccc taatgtccca accaaactaa tcttgaaagc ttctatgtag     1860

atacaaagtg ctcctgaaat ccctatcctc agaaatgctt ctgagccaaa tgggctctga     1920

accctaaaca accgtgtcca tgtatgtggc aagagcttgt gaaaaacaaa gctgggccag     1980

gcgcagtgac tcacaactgt aatcctagca ctttgggagg ctgaagtggg cagatcactt     2040

gaggtcagga gttcaagacc agtctggcga acatggcgaa accctgtctc tactaaaaat     2100

acaaaaagta gccgggcgcg gtggctcaca cctgtagtcc cagctactcg ggaggctgaa     2160

gcaggagaat cacttgaatc cagttggcgg aggttgcagt gagcccagat cacgccactg     2220

tactccagcc tgggcaacag agcgagactt ggtaagaaag agaaagaaag gaaagaatga     2280

aggaaggaag gaaggaagga aggaaggaag gaaggaagga aggaaggaag ggaaggaagg     2340

gaaggagtct cgctctgtca cccaggctgg agtgcaacgg agcgatctcg actcactgca     2400

agctccgcct cccgggttcg cgccattctc ctgcctcagc ctcccgagta gctgggacta     2460

caggcgcccg ccaccacgcc ccgctaattt tttgtatttt tagtacagac ggggtttcac     2520

cgtgttagcc aggatggtct cgatctcctg acctcgtgat ccgcccgcct cggcctccca     2580

aagcgctggg attacaggcg tgagccaccg cgcccggctg accaaaggtt tcttggtccg     2640

cattctgctt ctgtggaatg agccaggagc cagttaggcc tgatttgaca tctgatttcc     2700

ggaggaaaac ccagactctg ccctgggcaa caaactgaat cctgaacttg aggtcacagg     2760

gcaggtgtga ggagcggaga gcagcaagag tgaaagggag gcctgtggtc attccataca     2820

cacaagagat cagttcctcc aaggtcaggg gacagagagc acagggatcc agcgccaagc     2880

gcaaggcccc cagaagaagc cagagagtcg gggagggggc gggggggaat cggtcccagc     2940

aggtgggaag gattctggga ccagacctaa gggatcatga gcacagctgc tgcaggcaga     3000

cgggcccctg gagaagctgg ggacaagctg gaatagagac ttcattgcgg gaagggctgt     3060

cagggaggcc tcctggggtg gaaaagggtg gtcaggaggc tcctggaggc ggcgcggccc     3120

cgggggtcca actcacctgg ggcccggcca ccgcgctctc gaccgccgcc tctgcccgcg     3180

cagcacgggc acagctcgcc agcactgcga acccggatgg gtcgtcgggc gcggccctca     3240

gcagagctgc cttcacagat gtggtgccca ggtcaatgcc gagggtgatc ggccgcgcag     3300

ccattatctc cctgacccgc gcagctccag tctgcagcca gcggccccac aagtccgcgc     3360

tcttcgccca ggggggcggg gcaggggcgg ggagtcgcct gccaatcttt cagccacacc     3420

caacatggag gcttctcgtc ttcccactgg ccggggaagg cgagcttcca cgcaacctct     3480

cggcgggccc cggctatagg cggagaggcg gcggaaggcg ggacctaaag ggggccccgc     3540

cccacgggct ctgatttccg cccaatggag ggcggtctga gcttcgctca cgaaaggagc     3600

cgggaggcgc tggcggctcc aagagtctct gtgtccctgg cagcggacct catcttccct     3660

cacgccggag ccccgatctc tgcgccccgg cccgacccag ctgcgctctg tccgtctaag     3720

acgcgcggaa actacaactc ccagagctca tctcgccgag atccggcccc acgagtcagg     3780

tggcggaggt caggtgacag cggacccgcc tctcccaaag tctagccggg caggggaacg     3840

cggtgcattc ctgaccggca cctggcgagg ctcatgcgtc ccgtgagggc ggttcctcga     3900

gcctgggggc gctcaggtga gagcggacgc ggcctcccct gtttcccagg cggacccctt     3960

gaggcacagc aggtcagcgg ggcagcctgc cgggggtcca gcgccctcag ccgcggcggg     4020

ctcctttccc cgccaccagt gctggcctcg cgacacggga caacccccgg gtggaagggc     4080

ccgagcggtg gtcagccgag gcaggggcag cgggctgccg gggtgggtgc cgttcccagc     4140

cccttacctt ctgctcagtt gccgcctggg tctcggttgg ggaatttgca gattgctttg     4200

gagacgctga gagaaccttt gcgagagcgc cggttgacgt gcggagtgcg gggctccggg     4260

ggactgagca gcacgagacc ccatcctccc ctccgggttt tcacactggg cgaagggagg     4320

actcctgagc tctgcctctt ccagtaacat tgaggattac tgtgttttgt gagagctcgc     4380

taggcgccct aagcaacaga ggtaaccact ttatatcctt gtttctcaac ctcgttattc     4440

ctacctaccc ccttcccata aaatttaata ccactagtac gctgtgtatt tgtttctgtg     4500

gccacaaacc attgtaatag ctagatttct tcactaccac cccaagccaa tttttttttt     4560

ttttttgaga tggagtctgc agcctctgtc acccaggctg gagtgcagtg gcgcgatctc     4620

ggctcactgc aacctccgcc tccggggttc aagcgattct cctacctcag ccttccgagt     4680

agctgggact acaggcctga gccaccatgc ccagctaatt tttgtatttt tagtagagat     4740

ggggattcac catgttggcc aggctggtct cgaactcctg acctcaggtg atgcgctcac     4800

ctcggcctcc caaagtgctg ggatgacagg cgtgagccac cgcgcccagc ctacccccag     4860

ccaattttag tcccacttga caatgcgtgc tttacatctc ctcatttaag tcctgtgagg     4920

tagttaccac ctccttgttt ggcaccacaa ggtcgcataa gtaataaata ggtcaagcct     4980

gtctccagtg cacacagccc ttgccactat ttgtgtaccc tctccaaaag caggagaccc     5040

agggagttcc aggtcgtaga acagaggaca ggaccaactc atacctggca gacaggagct     5100

gccacactag acccctagcc ccaggttgct cctgggaagg gactgaatgg gtgaggagcc     5160

ttcttgaaac atgtgacatc tgaatgaggc ctggacaata gttagaactt acataggaag     5220

ggcacgccag acagagccca ttgtcaggag atacttcatt tctatcttgt agctttcaca     5280

agccactagt tgtatgtaat tatcaatctg gttttttttt tgtttttttt ttttaatttg     5340

agacggagtt tcactcttat cactcaggct ggagtgcaat ggtgcaatct cggctcactg     5400

caacctccac ctcccgggtt caagcgattc tcctgcctca gcctcctgag tagctgggac     5460

tacaggcaca tgccaccacg cctggctaat ttttgtattt ttagtagaga cggggattca     5520

ccatgttggc caggctggtc tcgaactcct gacttcaagt gatccaactg cctcggcctc     5580

ccaaagtgct ggaattacac acacgagcca ctgcgctcag cctaatctga tgttttttaa     5640

cattttaatt gacttacctc tcaatgtcgt tttgtctctg ctggcatcgt tcctccaggg     5700

gtctcagcct ttgaggcttg ggaatgtttg ctgaccaagt ctgtgagttt gagaagctgg     5760

ttaggcctga ttctgcatct aatttctgga gaaaaaccag actctgtcct gggcaacaaa     5820

ctgaatcctg aacttgaggc cacagggcag gtgtgaggag cggagggcag caagagtgag     5880

agggaggcct gtggtcattc catacacgca ggagggcaat tcctccaagg tcaggggaca     5940

gagcacaggg atccagcgcc aagagcaagg cccccagagg aggccagaga gtaggtacgg     6000

ggtcattccc ggccggtgag aagggtctca gatgaggcag acctgcagca ggcaaagaga     6060

gaaccctgga ggagacgggc caacagaggt cagacagctg gagcagccag ggagacttct     6120

tgaggagtgt gtaagggaga tgtccggaga tgctggaggc cttggggaaa ctgaaatcag     6180

agtgggaaca gggatgtctc cacacagacc ttacccagag ctccccacag tctgcaggag     6240

gcccgtgaga ctgtgtactg aggcagcacg gagaccaagc tacagaaatc catgccggcc     6300

tggctgctct tgacccactg ttcacctgct gtgtcttggg tttacaggaa tgcagctccc     6360

catcttccac actaaaccaa ggacttgctc tggggctcat ccctccccga gtcctccttg     6420

tgaatgaccc cagccagtcc tggaatggtg acacttgtca aataaagtct tgacaggcgc     6480

ggtggctcct acctgtaacc ccagcacttt gggaggctga ggcgggcgga tcactcgagg     6540

tcaggagttt gagaccaggc tggccaacat ggtgaaaccc catctctact aaaaatacaa     6600

aagttagccg ggcatggtgg ggggcacctg taatcccagc tactcaggag gctgaggcac     6660

aagaattgct tgaacccagg gggtggaggt ttcagtgaac agagttcgca ccactgcact     6720

ccagcctggg caacagagca agactctgtc tcaaaaaaaa aaaaatttaa atatgtatat     6780

taaaaaaaaa tgttttttta agtcttaagg gtcagttggt gtcatcagcc cttagactct     6840

tatcccagga caggaaagga aattaatttc cttgaggttt ataggttcac aatgtcaaat     6900

atctgaccac agttttaaca acttttggag aaaaagaatc tcaagccagt aaaattgcat     6960

tctttctttc tgctaactaa gtttttacaa aaagcaattg aagagggaaa aattctggtc     7020

tttgttcact tcctcagggg ggcactttac acaacccatt tatctgctcg gagcccgttt     7080

cccctgtata tcaaagaaag ataagtcctc tctagggtgt ccctctgagg ccgtgatgca     7140

aagccctgag gtcacagctg tcaggtggca gtcctttatg agccatccat gctccagagg     7200

gcagattgtc tacagggagc tgagctgatt caacattccc ctgaacttct ctcttgctgt     7260

ttttcttcct agttctgaga aatcgagaaa catgataagg aattggctga ctatttttat     7320

cctttttccc ctgaagctcg tagagaaatg tggtaagttt agaaatgaca cgtcaacttt     7380

gtaaagaggg aaatggtggc tagaggaagg agtaatctga tctgtttgtt gccaagggtt     7440

tagaatcatt cagaccacat gtctctgtct gcctcttggc catgtggcca ctggggtggt     7500

ggagcagacc caggtctggg atccaggtgt tctgcaaaga gccagatagt tccacatata     7560

attggccttc tgccctggta tctctgtacc tttctgtacc aaagtga                   7607


<210>  220
<211>  1899
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CTNS Universal replacement cassette with Promoter-cDNA for any 
       mutations recovery

<400>  220
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc       60

catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca      120

acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga      180

ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc      240

aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct      300

ggcattatgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat      360

tagtcatcgc tattaccatg gtgatgcggt tttggcagta catcaatggg cgtggatagc      420

ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt      480

ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa      540

tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctctctgg ctaactagag      600

aacccactgc ttactggctt atcgaaatta atacgactca ctatagggag acccaagctg      660

gctagcgttt aaacttaagc ttggtaccga gctcggatcc tgagctctgc ctcttccagt      720

aacattgagg attactgtgt tttgtgagag ctcgctaggc gccctaagca acagagttct      780

gagaaatcga gaaacatgat aaggaattgg ctgactattt ttatcctttt tcccctgaag      840

ctcgtagaga aatgtgagtc aagcgtcagc ctcactgttc ctcctgtcgt aaagctggag      900

aacggcagct cgaccaacgt cagcctcacc ctgcggccac cattaaatgc aaccctggtg      960

atcacttttg aaatcacatt tcgttccaaa aatattacta tccttgagct ccccgatgaa     1020

gttgtggtgc ctcctggagt gacaaactcc tcttttcaag tgacatctca aaatgttgga     1080

caacttactg tttatctaca tggaaatcac tccaatcaga ccggcccgag gatacgcttt     1140

cttgtgatcc gcagcagcgc cattagcatc ataaaccagg tgattggctg gatctacttt     1200

gtggcctggt ccatctcctt ctaccctcag gtgatcatga attggaggcg gaaaagtgtc     1260

attggtctga gcttcgactt cgtggctctg aacctgacgg gcttcgtggc ctacagtgta     1320

ttcaacatcg gcctcctctg ggtgccctac atcaaggagc agtttctcct caaatacccc     1380

aacggagtga accccgtgaa cagcaacgac gtcttcttca gcctgcacgc ggttgtcctc     1440

acgctgatca tcatcgtgca gtgctgcctg tatgagcgcg gtggccagcg cgtgtcctgg     1500

cctgccatcg gcttcctggt gctcgcgtgg ctcttcgcat ttgtcaccat gatcgtggct     1560

gcagtgggag tgatcacgtg gctgcagttt ctcttctgct tctcctacat caagctcgca     1620

gtcacgctgg tcaagtattt tccacaggcc tacatgaact tttactacaa aagcactgag     1680

ggctggagca ttggcaacgt gctcctggac ttcaccgggg gcagcttcag cctcctgcag     1740

atgttcctcc agtcctacaa caacgaccag tggacgctga tcttcggaga cccaaccaag     1800

tttggactcg gggtcttctc catcgtcttc gacgtcgtct tcttcatcca gcacttctgt     1860

ttgtacagaa agagaccggg gtatgaccag ctgaactag                            1899


<210>  221
<211>  96
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SCN1A Native replacement sequence for intron 6 mutations 
       recovery using SCN1A3 and SCN1A4

<400>  221
atactttgca ctgtaaagtg tctaaagtat ctttgcactg tatctaatct aatgtcattt       60

cttcataatg aagaaatact ttgcactgta aagtat                                 96


<210>  222
<211>  5997
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SCN1A Universal replacement cassette with cDNA for any mutations 
       recovery

<400>  222
atggagcaaa cagtgcttgt accaccagga cctgacagct tcaacttctt caccagagaa       60

tctcttgcgg ctattgaaag acgcattgca gaagaaaagg caaagaatcc caaaccagac      120

aaaaaagatg acgacgaaaa tggcccaaag ccaaatagtg acttggaagc tggaaagaac      180

cttccattta tttatggaga cattcctcca gagatggtgt cagagcccct ggaggacctg      240

gacccctact atatcaataa gaaaactttt atagtattga ataaagggaa ggccatcttc      300

cggttcagtg ccacctctgc cctgtacatt ttaactccct tcaatcctct taggaaaata      360

gctattaaga ttttggtaca ttcattattc agcatgctaa ttatgtgcac tattttgaca      420

aactgtgtgt ttatgacaat gagtaaccct cctgattgga caaagaatgt agaatacacc      480

ttcacaggaa tatatacttt tgaatcactt ataaaaatta ttgcaagggg attctgttta      540

gaagatttta ctttccttcg ggatccatgg aactggctcg atttcactgt cattacattt      600

gcgtacgtca cagagtttgt ggacctgggc aatgtctcgg cattgagaac attcagagtt      660

ctccgagcat tgaagacgat ttcagtcatt ccaggcctga aaaccattgt gggagccctg      720

atccagtctg tgaagaagct ctcagatgta atgatcctga ctgtgttctg tctgagcgta      780

tttgctctaa ttgggctgca gctgttcatg ggcaacctga ggaataaatg tatacaatgg      840

cctcccacca atgcttcctt ggaggaacat agtatagaaa agaatataac tgtgaattat      900

aatggtacac ttataaatga aactgtcttt gagtttgact ggaagtcata tattcaagat      960

tcaagatatc attatttcct ggagggtttt ttagatgcac tactatgtgg aaatagctct     1020

gatgcaggcc aatgtccaga gggatatatg tgtgtgaaag ctggtagaaa tcccaattat     1080

ggctacacaa gctttgatac cttcagttgg gcttttttgt ccttgtttcg actaatgact     1140

caggacttct gggaaaatct ttatcaactg acattacgtg ctgctgggaa aacgtacatg     1200

atattttttg tattggtcat tttcttgggc tcattctacc taataaattt gatcctggct     1260

gtggtggcca tggcctacga ggaacagaat caggccacct tggaagaagc agaacagaaa     1320

gaggccgaat ttcagcagat gattgaacag cttaaaaagc aacaggaggc agctcagcag     1380

gcagcaacgg caactgcctc agaacattcc agagagccca gtgcagcagg caggctctca     1440

gacagctcat ctgaagcctc taagttgagt tccaagagtg ctaaggaaag aagaaatcgg     1500

aggaagaaaa gaaaacagaa agagcagtct ggtggggaag agaaagatga ggatgaattc     1560

caaaaatctg aatctgagga cagcatcagg aggaaaggtt ttcgcttctc cattgaaggg     1620

aaccgattga catatgaaaa gaggtactcc tccccacacc agtctttgtt gagcatccgt     1680

ggctccctat tttcaccaag gcgaaatagc agaacaagcc ttttcagctt tagagggcga     1740

gcaaaggatg tgggatctga gaacgacttc gcagatgatg agcacagcac ctttgaggat     1800

aacgagagcc gtagagattc cttgtttgtg ccccgacgac acggagagag acgcaacagc     1860

aacctgagtc agaccagtag gtcatcccgg atgctggcag tgtttccagc gaatgggaag     1920

atgcacagca ctgtggattg caatggtgtg gtttccttgg ttggtggacc ttcagttcct     1980

acatcgcctg ttggacagct tctgccagag ggaacaacca ctgaaactga aatgagaaag     2040

agaaggtcaa gttctttcca cgtttccatg gactttctag aagatccttc ccaaaggcaa     2100

cgagcaatga gtatagccag cattctaaca aatacagtag aagaacttga agaatccagg     2160

cagaaatgcc caccctgttg gtataaattt tccaacatat tcttaatctg ggactgttct     2220

ccatattggt taaaagtgaa acatgttgtc aacctggttg tgatggaccc atttgttgac     2280

ctggccatca ccatctgtat tgtcttaaat actcttttca tggccatgga gcactatcca     2340

atgacggacc atttcaataa tgtgcttaca gtaggaaact tggttttcac tgggatcttt     2400

acagcagaaa tgtttctgaa aattattgcc atggatcctt actattattt ccaagaaggc     2460

tggaatatct ttgacggttt tattgtgacg cttagcctgg tagaacttgg actcgccaat     2520

gtggaaggat tatctgttct ccgttcattt cgattgctgc gagttttcaa gttggcaaaa     2580

tcttggccaa cgttaaatat gctaataaag atcatcggca attccgtggg ggctctggga     2640

aatttaaccc tcgtcttggc catcatcgtc ttcatttttg ccgtggtcgg catgcagctc     2700

tttggtaaaa gctacaaaga ttgtgtctgc aagatcgcca gtgattgtca actcccacgc     2760

tggcacatga atgacttctt ccactccttc ctgattgtgt tccgcgtgct gtgtggggag     2820

tggatagaga ccatgtggga ctgtatggag gttgctggtc aagccatgtg ccttactgtc     2880

ttcatgatgg tcatggtgat tggaaaccta gtggtcctga atctctttct ggccttgctt     2940

ctgagctcat ttagtgcaga caaccttgca gccactgatg atgataatga aatgaataat     3000

ctccaaattg ctgtggatag gatgcacaaa ggagtagctt atgtgaaaag aaaaatatat     3060

gaatttattc aacagtcctt cattaggaaa caaaagattt tagatgaaat taaaccactt     3120

gatgatctaa acaacaagaa agacagttgt atgtccaatc atacagcaga aattgggaaa     3180

gatcttgact atcttaaaga tgtaaatgga actacaagtg gtataggaac tggcagcagt     3240

gttgaaaaat acattattga tgaaagtgat tacatgtcat tcataaacaa ccccagtctt     3300

actgtgactg taccaattgc tgtaggagaa tctgactttg aaaatttaaa cacggaagac     3360

tttagtagtg aatcggatct ggaagaaagc aaagagaaac tgaatgaaag cagtagctca     3420

tcagaaggta gcactgtgga catcggcgca cctgtagaag aacagcccgt agtggaacct     3480

gaagaaactc ttgaaccaga agcttgtttc actgaaggct gtgtacaaag attcaagtgt     3540

tgtcaaatca atgtggaaga aggcagagga aaacaatggt ggaacctgag aaggacgtgt     3600

ttccgaatag ttgaacataa ctggtttgag accttcattg ttttcatgat tctccttagt     3660

agtggtgctc tggcatttga agatatatat attgatcagc gaaagacgat taagacgatg     3720

ttggaatatg ctgacaaggt tttcacttac attttcattc tggaaatgct tctaaaatgg     3780

gtggcatatg gctatcaaac atatttcacc aatgcctggt gttggctgga cttcttaatt     3840

gttgatgttt cattggtcag tttaacagca aatgccttgg gttactcaga acttggagcc     3900

atcaaatctc tcaggacact aagagctctg agacctctaa gagccttatc tcgatttgaa     3960

gggatgaggg tggttgtgaa tgccctttta ggagcaattc catccatcat gaatgtgctt     4020

ctggtttgtc ttatattctg gctaattttc agcatcatgg gcgtaaattt gtttgctggc     4080

aaattctacc actgtattaa caccacaact ggtgacaggt ttgacatcga agacgtgaat     4140

aatcatactg attgcctaaa actaatagaa agaaatgaga ctgctcgatg gaaaaatgtg     4200

aaagtaaact ttgataatgt aggatttggg tatctctctt tgcttcaagt tgccacattc     4260

aaaggatgga tggatataat gtatgcagca gttgattcca gaaatgtgga actccagcct     4320

aagtatgaag aaagtctgta catgtatctt tactttgtta ttttcatcat ctttgggtcc     4380

ttcttcacct tgaacctgtt tattggtgtc atcatagata atttcaacca gcagaaaaag     4440

aagtttggag gtcaagacat ctttatgaca gaagaacaga agaaatacta taatgcaatg     4500

aaaaaattag gatcgaaaaa accgcaaaag cctatacctc gaccaggaaa caaatttcaa     4560

ggaatggtct ttgacttcgt aaccagacaa gtttttgaca taagcatcat gattctcatc     4620

tgtcttaaca tggtcacaat gatggtggaa acagatgacc agagtgaata tgtgactacc     4680

attttgtcac gcatcaatct ggtgttcatt gtgctattta ctggagagtg tgtactgaaa     4740

ctcatctctc tacgccatta ttattttacc attggatgga atatttttga ttttgtggtt     4800

gtcattctct ccattgtagg tatgtttctt gccgagctga tagaaaagta tttcgtgtcc     4860

cctaccctgt tccgagtgat ccgtcttgct aggattggcc gaatcctacg tctgatcaaa     4920

ggagcaaagg ggatccgcac gctgctcttt gctttgatga tgtcccttcc tgcgttgttt     4980

aacatcggcc tcctactctt cctagtcatg ttcatctacg ccatctttgg gatgtccaac     5040

tttgcctatg ttaagaggga agttgggatc gatgacatgt tcaactttga gacctttggc     5100

aacagcatga tctgcctatt ccaaattaca acctctgctg gctgggatgg attgctagca     5160

cccattctca acagtaagcc acccgactgt gaccctaata aagttaaccc tggaagctca     5220

gttaagggag actgtgggaa cccatctgtt ggaattttct tttttgtcag ttacatcatc     5280

atatccttcc tggttgtggt gaacatgtac atcgcggtca tcctggagaa cttcagtgtt     5340

gctactgaag aaagtgcaga gcctctgagt gaggatgact ttgagatgtt ctatgaggtt     5400

tgggagaagt ttgatcccga tgcaactcag ttcatggaat ttgaaaaatt atctcagttt     5460

gcagctgcgc ttgaaccgcc tctcaatctg ccacaaccaa acaaactcca gctcattgcc     5520

atggatttgc ccatggtgag tggtgaccgg atccactgtc ttgatatctt atttgctttt     5580

acaaagcggg ttctaggaga gagtggagag atggatgctc tacgaataca gatggaagag     5640

cgattcatgg cttccaatcc ttccaaggtc tcctatcagc caatcactac tactttaaaa     5700

cgaaaacaag aggaagtatc tgctgtcatt attcagcgtg cttacagacg ccacctttta     5760

aagcgaactg taaaacaagc ttcctttacg tacaataaaa acaaaatcaa aggtggggct     5820

aatcttctta taaaagaaga catgataatt gacagaataa atgaaaactc tattacagaa     5880

aaaactgatc tgaccatgtc cactgcagct tgtccacctt cctatgaccg ggtgacaaag     5940

ccaattgtgg aaaaacatga gcaagaaggc aaagatgaaa aagccaaagg gaaataa        5997


<210>  223
<211>  357
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  N303K mutant of the HK022 integrase

<400>  223

Met Gly Arg Arg Arg Ser His Glu Arg Arg Asp Leu Pro Pro Asn Leu 
1               5                   10                  15      


Tyr Ile Arg Asn Asn Gly Tyr Tyr Cys Tyr Arg Asp Pro Arg Thr Gly 
            20                  25                  30          


Lys Glu Phe Gly Leu Gly Arg Asp Arg Arg Ile Ala Ile Thr Glu Ala 
        35                  40                  45              


Ile Gln Ala Asn Ile Glu Leu Leu Ser Gly Asn Arg Arg Glu Ser Leu 
    50                  55                  60                  


Ile Asp Arg Ile Lys Gly Ala Asp Ala Ile Thr Leu His Ala Trp Leu 
65                  70                  75                  80  


Asp Arg Tyr Glu Thr Ile Leu Ser Glu Arg Gly Ile Arg Pro Lys Thr 
                85                  90                  95      


Leu Leu Asp Tyr Ala Ser Lys Ile Arg Ala Ile Arg Arg Lys Leu Pro 
            100                 105                 110         


Asp Lys Pro Leu Ala Asp Ile Ser Thr Lys Glu Val Ala Ala Met Leu 
        115                 120                 125             


Asn Thr Tyr Val Ala Glu Gly Lys Ser Ala Ser Ala Lys Leu Ile Arg 
    130                 135                 140                 


Ser Thr Leu Val Asp Val Phe Arg Glu Ala Ile Ala Glu Gly His Val 
145                 150                 155                 160 


Ala Thr Asn Pro Val Thr Ala Thr Arg Thr Ala Lys Ser Glu Val Arg 
                165                 170                 175     


Arg Ser Arg Leu Thr Ala Asn Glu Tyr Val Ala Ile Tyr His Ala Ala 
            180                 185                 190         


Glu Pro Leu Pro Ile Trp Leu Arg Leu Ala Met Asp Leu Ala Val Val 
        195                 200                 205             


Thr Gly Gln Arg Val Gly Asp Leu Cys Arg Met Lys Trp Ser Asp Ile 
    210                 215                 220                 


Asn Asp Asn His Leu His Ile Glu Gln Ser Lys Thr Gly Ala Lys Leu 
225                 230                 235                 240 


Ala Ile Pro Leu Thr Leu Thr Ile Asp Ala Leu Asn Ile Ser Leu Ala 
                245                 250                 255     


Asp Thr Leu Gln Gln Cys Arg Glu Ala Ser Ser Ser Glu Thr Ile Ile 
            260                 265                 270         


Ala Ser Lys His His Asp Pro Leu Ser Pro Lys Thr Val Ser Lys Tyr 
        275                 280                 285             


Phe Thr Lys Ala Arg Asn Ala Ser Gly Leu Ser Phe Asp Gly Lys Pro 
    290                 295                 300                 


Pro Thr Phe His Glu Leu Arg Ser Leu Ser Ala Arg Leu Tyr Arg Asn 
305                 310                 315                 320 


Gln Ile Gly Asp Lys Phe Ala Gln Arg Leu Leu Gly His Lys Ser Asp 
                325                 330                 335     


Ser Met Ala Ala Arg Tyr Arg Asp Ser Arg Gly Arg Glu Trp Asp Lys 
            340                 345                 350         


Ile Glu Ile Asp Lys 
        355         


<210>  224
<211>  1071
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  N303K mutant of the HK022 integrase

<400>  224
atgggcaggc ggcggagcca cgagcggaga gacctgcccc ccaacctgta catccggaac       60

aacggctact actgctaccg ggacccccgg accggcaaag agttcggcct gggccgggac      120

aggcggatcg ccatcaccga ggccatccag gccaacatcg agctgctgtc cggcaaccgg      180

cgggagagcc tgatcgaccg gatcaagggc gccgacgcca tcaccctgca cgcctggctg      240

gacagatacg agaccatcct gagcgagcgg ggcatccggc ccaagaccct gctggactac      300

gcctctaaga tccgggccat cagacggaag ctgcccgaca agcccctggc cgacatcagc      360

accaaagaag tggccgccat gctgaacacc tacgtggccg agggcaagag cgccagcgcc      420

aagctgatcc ggtccaccct ggtggacgtg ttccgggagg ccatcgccga gggccacgtc      480

gccaccaacc ccgtgaccgc cacccggacc gccaagagcg aagtgcggcg gagcaggctg      540

accgccaacg agtacgtggc catctaccat gccgctgagc ccctgcccat ctggctgcgg      600

ctggccatgg acctggccgt ggtgaccggc cagagagtgg gcgacctgtg ccggatgaag      660

tggagcgaca tcaacgacaa ccacctgcac atcgagcaga gcaagaccgg cgccaaactg      720

gccatccccc tgaccctgac catcgacgcc ctgaacatca gcctggccga taccctgcag      780

cagtgcagag aggccagcag cagcgagacc atcatcgcca gcaagcacca cgaccccctg      840

agccccaaga ccgtgagcaa gtacttcacc aaggcccgga acgccagcgg cctgagcttc      900

gacggcaaac cccccacctt ccacgagctg cggagcctgt ctgccaggct gtaccggaac      960

cagatcggcg acaagttcgc tcagcggctc ctgggccaca agagcgacag catggccgcc     1020

agataccggg acagccgggg acgggagtgg gacaagatcg agatcgacaa g              1071


<210>  225
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 1288

<400>  225
gctctctccc agcttgattt ccaatggg                                          28


<210>  226
<211>  3685
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  dystrophin (DMD), transcript variant Dp427m, isoform Dp427m, 
       accession number: NP_003997.2

<400>  226

Met Leu Trp Trp Glu Glu Val Glu Asp Cys Tyr Glu Arg Glu Asp Val 
1               5                   10                  15      


Gln Lys Lys Thr Phe Thr Lys Trp Val Asn Ala Gln Phe Ser Lys Phe 
            20                  25                  30          


Gly Lys Gln His Ile Glu Asn Leu Phe Ser Asp Leu Gln Asp Gly Arg 
        35                  40                  45              


Arg Leu Leu Asp Leu Leu Glu Gly Leu Thr Gly Gln Lys Leu Pro Lys 
    50                  55                  60                  


Glu Lys Gly Ser Thr Arg Val His Ala Leu Asn Asn Val Asn Lys Ala 
65                  70                  75                  80  


Leu Arg Val Leu Gln Asn Asn Asn Val Asp Leu Val Asn Ile Gly Ser 
                85                  90                  95      


Thr Asp Ile Val Asp Gly Asn His Lys Leu Thr Leu Gly Leu Ile Trp 
            100                 105                 110         


Asn Ile Ile Leu His Trp Gln Val Lys Asn Val Met Lys Asn Ile Met 
        115                 120                 125             


Ala Gly Leu Gln Gln Thr Asn Ser Glu Lys Ile Leu Leu Ser Trp Val 
    130                 135                 140                 


Arg Gln Ser Thr Arg Asn Tyr Pro Gln Val Asn Val Ile Asn Phe Thr 
145                 150                 155                 160 


Thr Ser Trp Ser Asp Gly Leu Ala Leu Asn Ala Leu Ile His Ser His 
                165                 170                 175     


Arg Pro Asp Leu Phe Asp Trp Asn Ser Val Val Cys Gln Gln Ser Ala 
            180                 185                 190         


Thr Gln Arg Leu Glu His Ala Phe Asn Ile Ala Arg Tyr Gln Leu Gly 
        195                 200                 205             


Ile Glu Lys Leu Leu Asp Pro Glu Asp Val Asp Thr Thr Tyr Pro Asp 
    210                 215                 220                 


Lys Lys Ser Ile Leu Met Tyr Ile Thr Ser Leu Phe Gln Val Leu Pro 
225                 230                 235                 240 


Gln Gln Val Ser Ile Glu Ala Ile Gln Glu Val Glu Met Leu Pro Arg 
                245                 250                 255     


Pro Pro Lys Val Thr Lys Glu Glu His Phe Gln Leu His His Gln Met 
            260                 265                 270         


His Tyr Ser Gln Gln Ile Thr Val Ser Leu Ala Gln Gly Tyr Glu Arg 
        275                 280                 285             


Thr Ser Ser Pro Lys Pro Arg Phe Lys Ser Tyr Ala Tyr Thr Gln Ala 
    290                 295                 300                 


Ala Tyr Val Thr Thr Ser Asp Pro Thr Arg Ser Pro Phe Pro Ser Gln 
305                 310                 315                 320 


His Leu Glu Ala Pro Glu Asp Lys Ser Phe Gly Ser Ser Leu Met Glu 
                325                 330                 335     


Ser Glu Val Asn Leu Asp Arg Tyr Gln Thr Ala Leu Glu Glu Val Leu 
            340                 345                 350         


Ser Trp Leu Leu Ser Ala Glu Asp Thr Leu Gln Ala Gln Gly Glu Ile 
        355                 360                 365             


Ser Asn Asp Val Glu Val Val Lys Asp Gln Phe His Thr His Glu Gly 
    370                 375                 380                 


Tyr Met Met Asp Leu Thr Ala His Gln Gly Arg Val Gly Asn Ile Leu 
385                 390                 395                 400 


Gln Leu Gly Ser Lys Leu Ile Gly Thr Gly Lys Leu Ser Glu Asp Glu 
                405                 410                 415     


Glu Thr Glu Val Gln Glu Gln Met Asn Leu Leu Asn Ser Arg Trp Glu 
            420                 425                 430         


Cys Leu Arg Val Ala Ser Met Glu Lys Gln Ser Asn Leu His Arg Val 
        435                 440                 445             


Leu Met Asp Leu Gln Asn Gln Lys Leu Lys Glu Leu Asn Asp Trp Leu 
    450                 455                 460                 


Thr Lys Thr Glu Glu Arg Thr Arg Lys Met Glu Glu Glu Pro Leu Gly 
465                 470                 475                 480 


Pro Asp Leu Glu Asp Leu Lys Arg Gln Val Gln Gln His Lys Val Leu 
                485                 490                 495     


Gln Glu Asp Leu Glu Gln Glu Gln Val Arg Val Asn Ser Leu Thr His 
            500                 505                 510         


Met Val Val Val Val Asp Glu Ser Ser Gly Asp His Ala Thr Ala Ala 
        515                 520                 525             


Leu Glu Glu Gln Leu Lys Val Leu Gly Asp Arg Trp Ala Asn Ile Cys 
    530                 535                 540                 


Arg Trp Thr Glu Asp Arg Trp Val Leu Leu Gln Asp Ile Leu Leu Lys 
545                 550                 555                 560 


Trp Gln Arg Leu Thr Glu Glu Gln Cys Leu Phe Ser Ala Trp Leu Ser 
                565                 570                 575     


Glu Lys Glu Asp Ala Val Asn Lys Ile His Thr Thr Gly Phe Lys Asp 
            580                 585                 590         


Gln Asn Glu Met Leu Ser Ser Leu Gln Lys Leu Ala Val Leu Lys Ala 
        595                 600                 605             


Asp Leu Glu Lys Lys Lys Gln Ser Met Gly Lys Leu Tyr Ser Leu Lys 
    610                 615                 620                 


Gln Asp Leu Leu Ser Thr Leu Lys Asn Lys Ser Val Thr Gln Lys Thr 
625                 630                 635                 640 


Glu Ala Trp Leu Asp Asn Phe Ala Arg Cys Trp Asp Asn Leu Val Gln 
                645                 650                 655     


Lys Leu Glu Lys Ser Thr Ala Gln Ile Ser Gln Ala Val Thr Thr Thr 
            660                 665                 670         


Gln Pro Ser Leu Thr Gln Thr Thr Val Met Glu Thr Val Thr Thr Val 
        675                 680                 685             


Thr Thr Arg Glu Gln Ile Leu Val Lys His Ala Gln Glu Glu Leu Pro 
    690                 695                 700                 


Pro Pro Pro Pro Gln Lys Lys Arg Gln Ile Thr Val Asp Ser Glu Ile 
705                 710                 715                 720 


Arg Lys Arg Leu Asp Val Asp Ile Thr Glu Leu His Ser Trp Ile Thr 
                725                 730                 735     


Arg Ser Glu Ala Val Leu Gln Ser Pro Glu Phe Ala Ile Phe Arg Lys 
            740                 745                 750         


Glu Gly Asn Phe Ser Asp Leu Lys Glu Lys Val Asn Ala Ile Glu Arg 
        755                 760                 765             


Glu Lys Ala Glu Lys Phe Arg Lys Leu Gln Asp Ala Ser Arg Ser Ala 
    770                 775                 780                 


Gln Ala Leu Val Glu Gln Met Val Asn Glu Gly Val Asn Ala Asp Ser 
785                 790                 795                 800 


Ile Lys Gln Ala Ser Glu Gln Leu Asn Ser Arg Trp Ile Glu Phe Cys 
                805                 810                 815     


Gln Leu Leu Ser Glu Arg Leu Asn Trp Leu Glu Tyr Gln Asn Asn Ile 
            820                 825                 830         


Ile Ala Phe Tyr Asn Gln Leu Gln Gln Leu Glu Gln Met Thr Thr Thr 
        835                 840                 845             


Ala Glu Asn Trp Leu Lys Ile Gln Pro Thr Thr Pro Ser Glu Pro Thr 
    850                 855                 860                 


Ala Ile Lys Ser Gln Leu Lys Ile Cys Lys Asp Glu Val Asn Arg Leu 
865                 870                 875                 880 


Ser Asp Leu Gln Pro Gln Ile Glu Arg Leu Lys Ile Gln Ser Ile Ala 
                885                 890                 895     


Leu Lys Glu Lys Gly Gln Gly Pro Met Phe Leu Asp Ala Asp Phe Val 
            900                 905                 910         


Ala Phe Thr Asn His Phe Lys Gln Val Phe Ser Asp Val Gln Ala Arg 
        915                 920                 925             


Glu Lys Glu Leu Gln Thr Ile Phe Asp Thr Leu Pro Pro Met Arg Tyr 
    930                 935                 940                 


Gln Glu Thr Met Ser Ala Ile Arg Thr Trp Val Gln Gln Ser Glu Thr 
945                 950                 955                 960 


Lys Leu Ser Ile Pro Gln Leu Ser Val Thr Asp Tyr Glu Ile Met Glu 
                965                 970                 975     


Gln Arg Leu Gly Glu Leu Gln Ala Leu Gln Ser Ser Leu Gln Glu Gln 
            980                 985                 990         


Gln Ser Gly Leu Tyr Tyr Leu Ser  Thr Thr Val Lys Glu  Met Ser Lys 
        995                 1000                 1005             


Lys Ala  Pro Ser Glu Ile Ser  Arg Lys Tyr Gln Ser  Glu Phe Glu 
    1010                 1015                 1020             


Glu Ile  Glu Gly Arg Trp Lys  Lys Leu Ser Ser Gln  Leu Val Glu 
    1025                 1030                 1035             


His Cys  Gln Lys Leu Glu Glu  Gln Met Asn Lys Leu  Arg Lys Ile 
    1040                 1045                 1050             


Gln Asn  His Ile Gln Thr Leu  Lys Lys Trp Met Ala  Glu Val Asp 
    1055                 1060                 1065             


Val Phe  Leu Lys Glu Glu Trp  Pro Ala Leu Gly Asp  Ser Glu Ile 
    1070                 1075                 1080             


Leu Lys  Lys Gln Leu Lys Gln  Cys Arg Leu Leu Val  Ser Asp Ile 
    1085                 1090                 1095             


Gln Thr  Ile Gln Pro Ser Leu  Asn Ser Val Asn Glu  Gly Gly Gln 
    1100                 1105                 1110             


Lys Ile  Lys Asn Glu Ala Glu  Pro Glu Phe Ala Ser  Arg Leu Glu 
    1115                 1120                 1125             


Thr Glu  Leu Lys Glu Leu Asn  Thr Gln Trp Asp His  Met Cys Gln 
    1130                 1135                 1140             


Gln Val  Tyr Ala Arg Lys Glu  Ala Leu Lys Gly Gly  Leu Glu Lys 
    1145                 1150                 1155             


Thr Val  Ser Leu Gln Lys Asp  Leu Ser Glu Met His  Glu Trp Met 
    1160                 1165                 1170             


Thr Gln  Ala Glu Glu Glu Tyr  Leu Glu Arg Asp Phe  Glu Tyr Lys 
    1175                 1180                 1185             


Thr Pro  Asp Glu Leu Gln Lys  Ala Val Glu Glu Met  Lys Arg Ala 
    1190                 1195                 1200             


Lys Glu  Glu Ala Gln Gln Lys  Glu Ala Lys Val Lys  Leu Leu Thr 
    1205                 1210                 1215             


Glu Ser  Val Asn Ser Val Ile  Ala Gln Ala Pro Pro  Val Ala Gln 
    1220                 1225                 1230             


Glu Ala  Leu Lys Lys Glu Leu  Glu Thr Leu Thr Thr  Asn Tyr Gln 
    1235                 1240                 1245             


Trp Leu  Cys Thr Arg Leu Asn  Gly Lys Cys Lys Thr  Leu Glu Glu 
    1250                 1255                 1260             


Val Trp  Ala Cys Trp His Glu  Leu Leu Ser Tyr Leu  Glu Lys Ala 
    1265                 1270                 1275             


Asn Lys  Trp Leu Asn Glu Val  Glu Phe Lys Leu Lys  Thr Thr Glu 
    1280                 1285                 1290             


Asn Ile  Pro Gly Gly Ala Glu  Glu Ile Ser Glu Val  Leu Asp Ser 
    1295                 1300                 1305             


Leu Glu  Asn Leu Met Arg His  Ser Glu Asp Asn Pro  Asn Gln Ile 
    1310                 1315                 1320             


Arg Ile  Leu Ala Gln Thr Leu  Thr Asp Gly Gly Val  Met Asp Glu 
    1325                 1330                 1335             


Leu Ile  Asn Glu Glu Leu Glu  Thr Phe Asn Ser Arg  Trp Arg Glu 
    1340                 1345                 1350             


Leu His  Glu Glu Ala Val Arg  Arg Gln Lys Leu Leu  Glu Gln Ser 
    1355                 1360                 1365             


Ile Gln  Ser Ala Gln Glu Thr  Glu Lys Ser Leu His  Leu Ile Gln 
    1370                 1375                 1380             


Glu Ser  Leu Thr Phe Ile Asp  Lys Gln Leu Ala Ala  Tyr Ile Ala 
    1385                 1390                 1395             


Asp Lys  Val Asp Ala Ala Gln  Met Pro Gln Glu Ala  Gln Lys Ile 
    1400                 1405                 1410             


Gln Ser  Asp Leu Thr Ser His  Glu Ile Ser Leu Glu  Glu Met Lys 
    1415                 1420                 1425             


Lys His  Asn Gln Gly Lys Glu  Ala Ala Gln Arg Val  Leu Ser Gln 
    1430                 1435                 1440             


Ile Asp  Val Ala Gln Lys Lys  Leu Gln Asp Val Ser  Met Lys Phe 
    1445                 1450                 1455             


Arg Leu  Phe Gln Lys Pro Ala  Asn Phe Glu Gln Arg  Leu Gln Glu 
    1460                 1465                 1470             


Ser Lys  Met Ile Leu Asp Glu  Val Lys Met His Leu  Pro Ala Leu 
    1475                 1480                 1485             


Glu Thr  Lys Ser Val Glu Gln  Glu Val Val Gln Ser  Gln Leu Asn 
    1490                 1495                 1500             


His Cys  Val Asn Leu Tyr Lys  Ser Leu Ser Glu Val  Lys Ser Glu 
    1505                 1510                 1515             


Val Glu  Met Val Ile Lys Thr  Gly Arg Gln Ile Val  Gln Lys Lys 
    1520                 1525                 1530             


Gln Thr  Glu Asn Pro Lys Glu  Leu Asp Glu Arg Val  Thr Ala Leu 
    1535                 1540                 1545             


Lys Leu  His Tyr Asn Glu Leu  Gly Ala Lys Val Thr  Glu Arg Lys 
    1550                 1555                 1560             


Gln Gln  Leu Glu Lys Cys Leu  Lys Leu Ser Arg Lys  Met Arg Lys 
    1565                 1570                 1575             


Glu Met  Asn Val Leu Thr Glu  Trp Leu Ala Ala Thr  Asp Met Glu 
    1580                 1585                 1590             


Leu Thr  Lys Arg Ser Ala Val  Glu Gly Met Pro Ser  Asn Leu Asp 
    1595                 1600                 1605             


Ser Glu  Val Ala Trp Gly Lys  Ala Thr Gln Lys Glu  Ile Glu Lys 
    1610                 1615                 1620             


Gln Lys  Val His Leu Lys Ser  Ile Thr Glu Val Gly  Glu Ala Leu 
    1625                 1630                 1635             


Lys Thr  Val Leu Gly Lys Lys  Glu Thr Leu Val Glu  Asp Lys Leu 
    1640                 1645                 1650             


Ser Leu  Leu Asn Ser Asn Trp  Ile Ala Val Thr Ser  Arg Ala Glu 
    1655                 1660                 1665             


Glu Trp  Leu Asn Leu Leu Leu  Glu Tyr Gln Lys His  Met Glu Thr 
    1670                 1675                 1680             


Phe Asp  Gln Asn Val Asp His  Ile Thr Lys Trp Ile  Ile Gln Ala 
    1685                 1690                 1695             


Asp Thr  Leu Leu Asp Glu Ser  Glu Lys Lys Lys Pro  Gln Gln Lys 
    1700                 1705                 1710             


Glu Asp  Val Leu Lys Arg Leu  Lys Ala Glu Leu Asn  Asp Ile Arg 
    1715                 1720                 1725             


Pro Lys  Val Asp Ser Thr Arg  Asp Gln Ala Ala Asn  Leu Met Ala 
    1730                 1735                 1740             


Asn Arg  Gly Asp His Cys Arg  Lys Leu Val Glu Pro  Gln Ile Ser 
    1745                 1750                 1755             


Glu Leu  Asn His Arg Phe Ala  Ala Ile Ser His Arg  Ile Lys Thr 
    1760                 1765                 1770             


Gly Lys  Ala Ser Ile Pro Leu  Lys Glu Leu Glu Gln  Phe Asn Ser 
    1775                 1780                 1785             


Asp Ile  Gln Lys Leu Leu Glu  Pro Leu Glu Ala Glu  Ile Gln Gln 
    1790                 1795                 1800             


Gly Val  Asn Leu Lys Glu Glu  Asp Phe Asn Lys Asp  Met Asn Glu 
    1805                 1810                 1815             


Asp Asn  Glu Gly Thr Val Lys  Glu Leu Leu Gln Arg  Gly Asp Asn 
    1820                 1825                 1830             


Leu Gln  Gln Arg Ile Thr Asp  Glu Arg Lys Arg Glu  Glu Ile Lys 
    1835                 1840                 1845             


Ile Lys  Gln Gln Leu Leu Gln  Thr Lys His Asn Ala  Leu Lys Asp 
    1850                 1855                 1860             


Leu Arg  Ser Gln Arg Arg Lys  Lys Ala Leu Glu Ile  Ser His Gln 
    1865                 1870                 1875             


Trp Tyr  Gln Tyr Lys Arg Gln  Ala Asp Asp Leu Leu  Lys Cys Leu 
    1880                 1885                 1890             


Asp Asp  Ile Glu Lys Lys Leu  Ala Ser Leu Pro Glu  Pro Arg Asp 
    1895                 1900                 1905             


Glu Arg  Lys Ile Lys Glu Ile  Asp Arg Glu Leu Gln  Lys Lys Lys 
    1910                 1915                 1920             


Glu Glu  Leu Asn Ala Val Arg  Arg Gln Ala Glu Gly  Leu Ser Glu 
    1925                 1930                 1935             


Asp Gly  Ala Ala Met Ala Val  Glu Pro Thr Gln Ile  Gln Leu Ser 
    1940                 1945                 1950             


Lys Arg  Trp Arg Glu Ile Glu  Ser Lys Phe Ala Gln  Phe Arg Arg 
    1955                 1960                 1965             


Leu Asn  Phe Ala Gln Ile His  Thr Val Arg Glu Glu  Thr Met Met 
    1970                 1975                 1980             


Val Met  Thr Glu Asp Met Pro  Leu Glu Ile Ser Tyr  Val Pro Ser 
    1985                 1990                 1995             


Thr Tyr  Leu Thr Glu Ile Thr  His Val Ser Gln Ala  Leu Leu Glu 
    2000                 2005                 2010             


Val Glu  Gln Leu Leu Asn Ala  Pro Asp Leu Cys Ala  Lys Asp Phe 
    2015                 2020                 2025             


Glu Asp  Leu Phe Lys Gln Glu  Glu Ser Leu Lys Asn  Ile Lys Asp 
    2030                 2035                 2040             


Ser Leu  Gln Gln Ser Ser Gly  Arg Ile Asp Ile Ile  His Ser Lys 
    2045                 2050                 2055             


Lys Thr  Ala Ala Leu Gln Ser  Ala Thr Pro Val Glu  Arg Val Lys 
    2060                 2065                 2070             


Leu Gln  Glu Ala Leu Ser Gln  Leu Asp Phe Gln Trp  Glu Lys Val 
    2075                 2080                 2085             


Asn Lys  Met Tyr Lys Asp Arg  Gln Gly Arg Phe Asp  Arg Ser Val 
    2090                 2095                 2100             


Glu Lys  Trp Arg Arg Phe His  Tyr Asp Ile Lys Ile  Phe Asn Gln 
    2105                 2110                 2115             


Trp Leu  Thr Glu Ala Glu Gln  Phe Leu Arg Lys Thr  Gln Ile Pro 
    2120                 2125                 2130             


Glu Asn  Trp Glu His Ala Lys  Tyr Lys Trp Tyr Leu  Lys Glu Leu 
    2135                 2140                 2145             


Gln Asp  Gly Ile Gly Gln Arg  Gln Thr Val Val Arg  Thr Leu Asn 
    2150                 2155                 2160             


Ala Thr  Gly Glu Glu Ile Ile  Gln Gln Ser Ser Lys  Thr Asp Ala 
    2165                 2170                 2175             


Ser Ile  Leu Gln Glu Lys Leu  Gly Ser Leu Asn Leu  Arg Trp Gln 
    2180                 2185                 2190             


Glu Val  Cys Lys Gln Leu Ser  Asp Arg Lys Lys Arg  Leu Glu Glu 
    2195                 2200                 2205             


Gln Lys  Asn Ile Leu Ser Glu  Phe Gln Arg Asp Leu  Asn Glu Phe 
    2210                 2215                 2220             


Val Leu  Trp Leu Glu Glu Ala  Asp Asn Ile Ala Ser  Ile Pro Leu 
    2225                 2230                 2235             


Glu Pro  Gly Lys Glu Gln Gln  Leu Lys Glu Lys Leu  Glu Gln Val 
    2240                 2245                 2250             


Lys Leu  Leu Val Glu Glu Leu  Pro Leu Arg Gln Gly  Ile Leu Lys 
    2255                 2260                 2265             


Gln Leu  Asn Glu Thr Gly Gly  Pro Val Leu Val Ser  Ala Pro Ile 
    2270                 2275                 2280             


Ser Pro  Glu Glu Gln Asp Lys  Leu Glu Asn Lys Leu  Lys Gln Thr 
    2285                 2290                 2295             


Asn Leu  Gln Trp Ile Lys Val  Ser Arg Ala Leu Pro  Glu Lys Gln 
    2300                 2305                 2310             


Gly Glu  Ile Glu Ala Gln Ile  Lys Asp Leu Gly Gln  Leu Glu Lys 
    2315                 2320                 2325             


Lys Leu  Glu Asp Leu Glu Glu  Gln Leu Asn His Leu  Leu Leu Trp 
    2330                 2335                 2340             


Leu Ser  Pro Ile Arg Asn Gln  Leu Glu Ile Tyr Asn  Gln Pro Asn 
    2345                 2350                 2355             


Gln Glu  Gly Pro Phe Asp Val  Lys Glu Thr Glu Ile  Ala Val Gln 
    2360                 2365                 2370             


Ala Lys  Gln Pro Asp Val Glu  Glu Ile Leu Ser Lys  Gly Gln His 
    2375                 2380                 2385             


Leu Tyr  Lys Glu Lys Pro Ala  Thr Gln Pro Val Lys  Arg Lys Leu 
    2390                 2395                 2400             


Glu Asp  Leu Ser Ser Glu Trp  Lys Ala Val Asn Arg  Leu Leu Gln 
    2405                 2410                 2415             


Glu Leu  Arg Ala Lys Gln Pro  Asp Leu Ala Pro Gly  Leu Thr Thr 
    2420                 2425                 2430             


Ile Gly  Ala Ser Pro Thr Gln  Thr Val Thr Leu Val  Thr Gln Pro 
    2435                 2440                 2445             


Val Val  Thr Lys Glu Thr Ala  Ile Ser Lys Leu Glu  Met Pro Ser 
    2450                 2455                 2460             


Ser Leu  Met Leu Glu Val Pro  Ala Leu Ala Asp Phe  Asn Arg Ala 
    2465                 2470                 2475             


Trp Thr  Glu Leu Thr Asp Trp  Leu Ser Leu Leu Asp  Gln Val Ile 
    2480                 2485                 2490             


Lys Ser  Gln Arg Val Met Val  Gly Asp Leu Glu Asp  Ile Asn Glu 
    2495                 2500                 2505             


Met Ile  Ile Lys Gln Lys Ala  Thr Met Gln Asp Leu  Glu Gln Arg 
    2510                 2515                 2520             


Arg Pro  Gln Leu Glu Glu Leu  Ile Thr Ala Ala Gln  Asn Leu Lys 
    2525                 2530                 2535             


Asn Lys  Thr Ser Asn Gln Glu  Ala Arg Thr Ile Ile  Thr Asp Arg 
    2540                 2545                 2550             


Ile Glu  Arg Ile Gln Asn Gln  Trp Asp Glu Val Gln  Glu His Leu 
    2555                 2560                 2565             


Gln Asn  Arg Arg Gln Gln Leu  Asn Glu Met Leu Lys  Asp Ser Thr 
    2570                 2575                 2580             


Gln Trp  Leu Glu Ala Lys Glu  Glu Ala Glu Gln Val  Leu Gly Gln 
    2585                 2590                 2595             


Ala Arg  Ala Lys Leu Glu Ser  Trp Lys Glu Gly Pro  Tyr Thr Val 
    2600                 2605                 2610             


Asp Ala  Ile Gln Lys Lys Ile  Thr Glu Thr Lys Gln  Leu Ala Lys 
    2615                 2620                 2625             


Asp Leu  Arg Gln Trp Gln Thr  Asn Val Asp Val Ala  Asn Asp Leu 
    2630                 2635                 2640             


Ala Leu  Lys Leu Leu Arg Asp  Tyr Ser Ala Asp Asp  Thr Arg Lys 
    2645                 2650                 2655             


Val His  Met Ile Thr Glu Asn  Ile Asn Ala Ser Trp  Arg Ser Ile 
    2660                 2665                 2670             


His Lys  Arg Val Ser Glu Arg  Glu Ala Ala Leu Glu  Glu Thr His 
    2675                 2680                 2685             


Arg Leu  Leu Gln Gln Phe Pro  Leu Asp Leu Glu Lys  Phe Leu Ala 
    2690                 2695                 2700             


Trp Leu  Thr Glu Ala Glu Thr  Thr Ala Asn Val Leu  Gln Asp Ala 
    2705                 2710                 2715             


Thr Arg  Lys Glu Arg Leu Leu  Glu Asp Ser Lys Gly  Val Lys Glu 
    2720                 2725                 2730             


Leu Met  Lys Gln Trp Gln Asp  Leu Gln Gly Glu Ile  Glu Ala His 
    2735                 2740                 2745             


Thr Asp  Val Tyr His Asn Leu  Asp Glu Asn Ser Gln  Lys Ile Leu 
    2750                 2755                 2760             


Arg Ser  Leu Glu Gly Ser Asp  Asp Ala Val Leu Leu  Gln Arg Arg 
    2765                 2770                 2775             


Leu Asp  Asn Met Asn Phe Lys  Trp Ser Glu Leu Arg  Lys Lys Ser 
    2780                 2785                 2790             


Leu Asn  Ile Arg Ser His Leu  Glu Ala Ser Ser Asp  Gln Trp Lys 
    2795                 2800                 2805             


Arg Leu  His Leu Ser Leu Gln  Glu Leu Leu Val Trp  Leu Gln Leu 
    2810                 2815                 2820             


Lys Asp  Asp Glu Leu Ser Arg  Gln Ala Pro Ile Gly  Gly Asp Phe 
    2825                 2830                 2835             


Pro Ala  Val Gln Lys Gln Asn  Asp Val His Arg Ala  Phe Lys Arg 
    2840                 2845                 2850             


Glu Leu  Lys Thr Lys Glu Pro  Val Ile Met Ser Thr  Leu Glu Thr 
    2855                 2860                 2865             


Val Arg  Ile Phe Leu Thr Glu  Gln Pro Leu Glu Gly  Leu Glu Lys 
    2870                 2875                 2880             


Leu Tyr  Gln Glu Pro Arg Glu  Leu Pro Pro Glu Glu  Arg Ala Gln 
    2885                 2890                 2895             


Asn Val  Thr Arg Leu Leu Arg  Lys Gln Ala Glu Glu  Val Asn Thr 
    2900                 2905                 2910             


Glu Trp  Glu Lys Leu Asn Leu  His Ser Ala Asp Trp  Gln Arg Lys 
    2915                 2920                 2925             


Ile Asp  Glu Thr Leu Glu Arg  Leu Arg Glu Leu Gln  Glu Ala Thr 
    2930                 2935                 2940             


Asp Glu  Leu Asp Leu Lys Leu  Arg Gln Ala Glu Val  Ile Lys Gly 
    2945                 2950                 2955             


Ser Trp  Gln Pro Val Gly Asp  Leu Leu Ile Asp Ser  Leu Gln Asp 
    2960                 2965                 2970             


His Leu  Glu Lys Val Lys Ala  Leu Arg Gly Glu Ile  Ala Pro Leu 
    2975                 2980                 2985             


Lys Glu  Asn Val Ser His Val  Asn Asp Leu Ala Arg  Gln Leu Thr 
    2990                 2995                 3000             


Thr Leu  Gly Ile Gln Leu Ser  Pro Tyr Asn Leu Ser  Thr Leu Glu 
    3005                 3010                 3015             


Asp Leu  Asn Thr Arg Trp Lys  Leu Leu Gln Val Ala  Val Glu Asp 
    3020                 3025                 3030             


Arg Val  Arg Gln Leu His Glu  Ala His Arg Asp Phe  Gly Pro Ala 
    3035                 3040                 3045             


Ser Gln  His Phe Leu Ser Thr  Ser Val Gln Gly Pro  Trp Glu Arg 
    3050                 3055                 3060             


Ala Ile  Ser Pro Asn Lys Val  Pro Tyr Tyr Ile Asn  His Glu Thr 
    3065                 3070                 3075             


Gln Thr  Thr Cys Trp Asp His  Pro Lys Met Thr Glu  Leu Tyr Gln 
    3080                 3085                 3090             


Ser Leu  Ala Asp Leu Asn Asn  Val Arg Phe Ser Ala  Tyr Arg Thr 
    3095                 3100                 3105             


Ala Met  Lys Leu Arg Arg Leu  Gln Lys Ala Leu Cys  Leu Asp Leu 
    3110                 3115                 3120             


Leu Ser  Leu Ser Ala Ala Cys  Asp Ala Leu Asp Gln  His Asn Leu 
    3125                 3130                 3135             


Lys Gln  Asn Asp Gln Pro Met  Asp Ile Leu Gln Ile  Ile Asn Cys 
    3140                 3145                 3150             


Leu Thr  Thr Ile Tyr Asp Arg  Leu Glu Gln Glu His  Asn Asn Leu 
    3155                 3160                 3165             


Val Asn  Val Pro Leu Cys Val  Asp Met Cys Leu Asn  Trp Leu Leu 
    3170                 3175                 3180             


Asn Val  Tyr Asp Thr Gly Arg  Thr Gly Arg Ile Arg  Val Leu Ser 
    3185                 3190                 3195             


Phe Lys  Thr Gly Ile Ile Ser  Leu Cys Lys Ala His  Leu Glu Asp 
    3200                 3205                 3210             


Lys Tyr  Arg Tyr Leu Phe Lys  Gln Val Ala Ser Ser  Thr Gly Phe 
    3215                 3220                 3225             


Cys Asp  Gln Arg Arg Leu Gly  Leu Leu Leu His Asp  Ser Ile Gln 
    3230                 3235                 3240             


Ile Pro  Arg Gln Leu Gly Glu  Val Ala Ser Phe Gly  Gly Ser Asn 
    3245                 3250                 3255             


Ile Glu  Pro Ser Val Arg Ser  Cys Phe Gln Phe Ala  Asn Asn Lys 
    3260                 3265                 3270             


Pro Glu  Ile Glu Ala Ala Leu  Phe Leu Asp Trp Met  Arg Leu Glu 
    3275                 3280                 3285             


Pro Gln  Ser Met Val Trp Leu  Pro Val Leu His Arg  Val Ala Ala 
    3290                 3295                 3300             


Ala Glu  Thr Ala Lys His Gln  Ala Lys Cys Asn Ile  Cys Lys Glu 
    3305                 3310                 3315             


Cys Pro  Ile Ile Gly Phe Arg  Tyr Arg Ser Leu Lys  His Phe Asn 
    3320                 3325                 3330             


Tyr Asp  Ile Cys Gln Ser Cys  Phe Phe Ser Gly Arg  Val Ala Lys 
    3335                 3340                 3345             


Gly His  Lys Met His Tyr Pro  Met Val Glu Tyr Cys  Thr Pro Thr 
    3350                 3355                 3360             


Thr Ser  Gly Glu Asp Val Arg  Asp Phe Ala Lys Val  Leu Lys Asn 
    3365                 3370                 3375             


Lys Phe  Arg Thr Lys Arg Tyr  Phe Ala Lys His Pro  Arg Met Gly 
    3380                 3385                 3390             


Tyr Leu  Pro Val Gln Thr Val  Leu Glu Gly Asp Asn  Met Glu Thr 
    3395                 3400                 3405             


Pro Val  Thr Leu Ile Asn Phe  Trp Pro Val Asp Ser  Ala Pro Ala 
    3410                 3415                 3420             


Ser Ser  Pro Gln Leu Ser His  Asp Asp Thr His Ser  Arg Ile Glu 
    3425                 3430                 3435             


His Tyr  Ala Ser Arg Leu Ala  Glu Met Glu Asn Ser  Asn Gly Ser 
    3440                 3445                 3450             


Tyr Leu  Asn Asp Ser Ile Ser  Pro Asn Glu Ser Ile  Asp Asp Glu 
    3455                 3460                 3465             


His Leu  Leu Ile Gln His Tyr  Cys Gln Ser Leu Asn  Gln Asp Ser 
    3470                 3475                 3480             


Pro Leu  Ser Gln Pro Arg Ser  Pro Ala Gln Ile Leu  Ile Ser Leu 
    3485                 3490                 3495             


Glu Ser  Glu Glu Arg Gly Glu  Leu Glu Arg Ile Leu  Ala Asp Leu 
    3500                 3505                 3510             


Glu Glu  Glu Asn Arg Asn Leu  Gln Ala Glu Tyr Asp  Arg Leu Lys 
    3515                 3520                 3525             


Gln Gln  His Glu His Lys Gly  Leu Ser Pro Leu Pro  Ser Pro Pro 
    3530                 3535                 3540             


Glu Met  Met Pro Thr Ser Pro  Gln Ser Pro Arg Asp  Ala Glu Leu 
    3545                 3550                 3555             


Ile Ala  Glu Ala Lys Leu Leu  Arg Gln His Lys Gly  Arg Leu Glu 
    3560                 3565                 3570             


Ala Arg  Met Gln Ile Leu Glu  Asp His Asn Lys Gln  Leu Glu Ser 
    3575                 3580                 3585             


Gln Leu  His Arg Leu Arg Gln  Leu Leu Glu Gln Pro  Gln Ala Glu 
    3590                 3595                 3600             


Ala Lys  Val Asn Gly Thr Thr  Val Ser Ser Pro Ser  Thr Ser Leu 
    3605                 3610                 3615             


Gln Arg  Ser Asp Ser Ser Gln  Pro Met Leu Leu Arg  Val Val Gly 
    3620                 3625                 3630             


Ser Gln  Thr Ser Asp Ser Met  Gly Glu Glu Asp Leu  Leu Ser Pro 
    3635                 3640                 3645             


Pro Gln  Asp Thr Ser Thr Gly  Leu Glu Glu Val Met  Glu Gln Leu 
    3650                 3655                 3660             


Asn Asn  Ser Phe Pro Ser Ser  Arg Gly Arg Asn Thr  Pro Gly Lys 
    3665                 3670                 3675             


Pro Met  Arg Glu Asp Thr Met  
    3680                 3685 


<210>  227
<211>  1480
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  cystic fibrosis transmembrane conductance regulator (CFTR), 
       accession number NP_000483.3

<400>  227

Met Gln Arg Ser Pro Leu Glu Lys Ala Ser Val Val Ser Lys Leu Phe 
1               5                   10                  15      


Phe Ser Trp Thr Arg Pro Ile Leu Arg Lys Gly Tyr Arg Gln Arg Leu 
            20                  25                  30          


Glu Leu Ser Asp Ile Tyr Gln Ile Pro Ser Val Asp Ser Ala Asp Asn 
        35                  40                  45              


Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 
    50                  55                  60                  


Lys Asn Pro Lys Leu Ile Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 
65                  70                  75                  80  


Phe Met Phe Tyr Gly Ile Phe Leu Tyr Leu Gly Glu Val Thr Lys Ala 
                85                  90                  95      


Val Gln Pro Leu Leu Leu Gly Arg Ile Ile Ala Ser Tyr Asp Pro Asp 
            100                 105                 110         


Asn Lys Glu Glu Arg Ser Ile Ala Ile Tyr Leu Gly Ile Gly Leu Cys 
        115                 120                 125             


Leu Leu Phe Ile Val Arg Thr Leu Leu Leu His Pro Ala Ile Phe Gly 
    130                 135                 140                 


Leu His His Ile Gly Met Gln Met Arg Ile Ala Met Phe Ser Leu Ile 
145                 150                 155                 160 


Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys Ile Ser 
                165                 170                 175     


Ile Gly Gln Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
            180                 185                 190         


Glu Gly Leu Ala Leu Ala His Phe Val Trp Ile Ala Pro Leu Gln Val 
        195                 200                 205             


Ala Leu Leu Met Gly Leu Ile Trp Glu Leu Leu Gln Ala Ser Ala Phe 
    210                 215                 220                 


Cys Gly Leu Gly Phe Leu Ile Val Leu Ala Leu Phe Gln Ala Gly Leu 
225                 230                 235                 240 


Gly Arg Met Met Met Lys Tyr Arg Asp Gln Arg Ala Gly Lys Ile Ser 
                245                 250                 255     


Glu Arg Leu Val Ile Thr Ser Glu Met Ile Glu Asn Ile Gln Ser Val 
            260                 265                 270         


Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met Ile Glu Asn Leu 
        275                 280                 285             


Arg Gln Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
    290                 295                 300                 


Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 
305                 310                 315                 320 


Ser Val Leu Pro Tyr Ala Leu Ile Lys Gly Ile Ile Leu Arg Lys Ile 
                325                 330                 335     


Phe Thr Thr Ile Ser Phe Cys Ile Val Leu Arg Met Ala Val Thr Arg 
            340                 345                 350         


Gln Phe Pro Trp Ala Val Gln Thr Trp Tyr Asp Ser Leu Gly Ala Ile 
        355                 360                 365             


Asn Lys Ile Gln Asp Phe Leu Gln Lys Gln Glu Tyr Lys Thr Leu Glu 
    370                 375                 380                 


Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
385                 390                 395                 400 


Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gln Asn Asn 
                405                 410                 415     


Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
            420                 425                 430         


Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp Ile Asn Phe Lys Ile 
        435                 440                 445             


Glu Arg Gly Gln Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
    450                 455                 460                 


Thr Ser Leu Leu Met Val Ile Met Gly Glu Leu Glu Pro Ser Glu Gly 
465                 470                 475                 480 


Lys Ile Lys His Ser Gly Arg Ile Ser Phe Cys Ser Gln Phe Ser Trp 
                485                 490                 495     


Ile Met Pro Gly Thr Ile Lys Glu Asn Ile Ile Phe Gly Val Ser Tyr 
            500                 505                 510         


Asp Glu Tyr Arg Tyr Arg Ser Val Ile Lys Ala Cys Gln Leu Glu Glu 
        515                 520                 525             


Asp Ile Ser Lys Phe Ala Glu Lys Asp Asn Ile Val Leu Gly Glu Gly 
    530                 535                 540                 


Gly Ile Thr Leu Ser Gly Gly Gln Arg Ala Arg Ile Ser Leu Ala Arg 
545                 550                 555                 560 


Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly 
                565                 570                 575     


Tyr Leu Asp Val Leu Thr Glu Lys Glu Ile Phe Glu Ser Cys Val Cys 
            580                 585                 590         


Lys Leu Met Ala Asn Lys Thr Arg Ile Leu Val Thr Ser Lys Met Glu 
        595                 600                 605             


His Leu Lys Lys Ala Asp Lys Ile Leu Ile Leu His Glu Gly Ser Ser 
    610                 615                 620                 


Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gln Asn Leu Gln Pro Asp Phe 
625                 630                 635                 640 


Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gln Phe Ser Ala Glu 
                645                 650                 655     


Arg Arg Asn Ser Ile Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
            660                 665                 670         


Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gln Ser Phe Lys 
        675                 680                 685             


Gln Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser Ile Leu Asn Pro 
    690                 695                 700                 


Ile Asn Ser Ile Arg Lys Phe Ser Ile Val Gln Lys Thr Pro Leu Gln 
705                 710                 715                 720 


Met Asn Gly Ile Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu 
                725                 730                 735     


Ser Leu Val Pro Asp Ser Glu Gln Gly Glu Ala Ile Leu Pro Arg Ile 
            740                 745                 750         


Ser Val Ile Ser Thr Gly Pro Thr Leu Gln Ala Arg Arg Arg Gln Ser 
        755                 760                 765             


Val Leu Asn Leu Met Thr His Ser Val Asn Gln Gly Gln Asn Ile His 
    770                 775                 780                 


Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gln Ala 
785                 790                 795                 800 


Asn Leu Thr Glu Leu Asp Ile Tyr Ser Arg Arg Leu Ser Gln Glu Thr 
                805                 810                 815     


Gly Leu Glu Ile Ser Glu Glu Ile Asn Glu Glu Asp Leu Lys Glu Cys 
            820                 825                 830         


Phe Phe Asp Asp Met Glu Ser Ile Pro Ala Val Thr Thr Trp Asn Thr 
        835                 840                 845             


Tyr Leu Arg Tyr Ile Thr Val His Lys Ser Leu Ile Phe Val Leu Ile 
    850                 855                 860                 


Trp Cys Leu Val Ile Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val 
865                 870                 875                 880 


Leu Trp Leu Leu Gly Asn Thr Pro Leu Gln Asp Lys Gly Asn Ser Thr 
                885                 890                 895     


His Ser Arg Asn Asn Ser Tyr Ala Val Ile Ile Thr Ser Thr Ser Ser 
            900                 905                 910         


Tyr Tyr Val Phe Tyr Ile Tyr Val Gly Val Ala Asp Thr Leu Leu Ala 
        915                 920                 925             


Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu Ile Thr Val 
    930                 935                 940                 


Ser Lys Ile Leu His His Lys Met Leu His Ser Val Leu Gln Ala Pro 
945                 950                 955                 960 


Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly Ile Leu Asn Arg Phe 
                965                 970                 975     


Ser Lys Asp Ile Ala Ile Leu Asp Asp Leu Leu Pro Leu Thr Ile Phe 
            980                 985                 990         


Asp Phe Ile Gln Leu Leu Leu Ile  Val Ile Gly Ala Ile  Ala Val Val 
        995                 1000                 1005             


Ala Val  Leu Gln Pro Tyr Ile  Phe Val Ala Thr Val  Pro Val Ile 
    1010                 1015                 1020             


Val Ala  Phe Ile Met Leu Arg  Ala Tyr Phe Leu Gln  Thr Ser Gln 
    1025                 1030                 1035             


Gln Leu  Lys Gln Leu Glu Ser  Glu Gly Arg Ser Pro  Ile Phe Thr 
    1040                 1045                 1050             


His Leu  Val Thr Ser Leu Lys  Gly Leu Trp Thr Leu  Arg Ala Phe 
    1055                 1060                 1065             


Gly Arg  Gln Pro Tyr Phe Glu  Thr Leu Phe His Lys  Ala Leu Asn 
    1070                 1075                 1080             


Leu His  Thr Ala Asn Trp Phe  Leu Tyr Leu Ser Thr  Leu Arg Trp 
    1085                 1090                 1095             


Phe Gln  Met Arg Ile Glu Met  Ile Phe Val Ile Phe  Phe Ile Ala 
    1100                 1105                 1110             


Val Thr  Phe Ile Ser Ile Leu  Thr Thr Gly Glu Gly  Glu Gly Arg 
    1115                 1120                 1125             


Val Gly  Ile Ile Leu Thr Leu  Ala Met Asn Ile Met  Ser Thr Leu 
    1130                 1135                 1140             


Gln Trp  Ala Val Asn Ser Ser  Ile Asp Val Asp Ser  Leu Met Arg 
    1145                 1150                 1155             


Ser Val  Ser Arg Val Phe Lys  Phe Ile Asp Met Pro  Thr Glu Gly 
    1160                 1165                 1170             


Lys Pro  Thr Lys Ser Thr Lys  Pro Tyr Lys Asn Gly  Gln Leu Ser 
    1175                 1180                 1185             


Lys Val  Met Ile Ile Glu Asn  Ser His Val Lys Lys  Asp Asp Ile 
    1190                 1195                 1200             


Trp Pro  Ser Gly Gly Gln Met  Thr Val Lys Asp Leu  Thr Ala Lys 
    1205                 1210                 1215             


Tyr Thr  Glu Gly Gly Asn Ala  Ile Leu Glu Asn Ile  Ser Phe Ser 
    1220                 1225                 1230             


Ile Ser  Pro Gly Gln Arg Val  Gly Leu Leu Gly Arg  Thr Gly Ser 
    1235                 1240                 1245             


Gly Lys  Ser Thr Leu Leu Ser  Ala Phe Leu Arg Leu  Leu Asn Thr 
    1250                 1255                 1260             


Glu Gly  Glu Ile Gln Ile Asp  Gly Val Ser Trp Asp  Ser Ile Thr 
    1265                 1270                 1275             


Leu Gln  Gln Trp Arg Lys Ala  Phe Gly Val Ile Pro  Gln Lys Val 
    1280                 1285                 1290             


Phe Ile  Phe Ser Gly Thr Phe  Arg Lys Asn Leu Asp  Pro Tyr Glu 
    1295                 1300                 1305             


Gln Trp  Ser Asp Gln Glu Ile  Trp Lys Val Ala Asp  Glu Val Gly 
    1310                 1315                 1320             


Leu Arg  Ser Val Ile Glu Gln  Phe Pro Gly Lys Leu  Asp Phe Val 
    1325                 1330                 1335             


Leu Val  Asp Gly Gly Cys Val  Leu Ser His Gly His  Lys Gln Leu 
    1340                 1345                 1350             


Met Cys  Leu Ala Arg Ser Val  Leu Ser Lys Ala Lys  Ile Leu Leu 
    1355                 1360                 1365             


Leu Asp  Glu Pro Ser Ala His  Leu Asp Pro Val Thr  Tyr Gln Ile 
    1370                 1375                 1380             


Ile Arg  Arg Thr Leu Lys Gln  Ala Phe Ala Asp Cys  Thr Val Ile 
    1385                 1390                 1395             


Leu Cys  Glu His Arg Ile Glu  Ala Met Leu Glu Cys  Gln Gln Phe 
    1400                 1405                 1410             


Leu Val  Ile Glu Glu Asn Lys  Val Arg Gln Tyr Asp  Ser Ile Gln 
    1415                 1420                 1425             


Lys Leu  Leu Asn Glu Arg Ser  Leu Phe Arg Gln Ala  Ile Ser Pro 
    1430                 1435                 1440             


Ser Asp  Arg Val Lys Leu Phe  Pro His Arg Asn Ser  Ser Lys Cys 
    1445                 1450                 1455             


Lys Ser  Lys Pro Gln Ile Ala  Ala Leu Lys Glu Glu  Thr Glu Glu 
    1460                 1465                 1470             


Glu Val  Gln Asp Thr Arg Leu  
    1475                 1480 


<210>  228
<211>  367
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  cystinosin lysosomal cystine transporter (CTNS) isoform 2 
       precursor, accession number NP_004928.2

<400>  228

Met Ile Arg Asn Trp Leu Thr Ile Phe Ile Leu Phe Pro Leu Lys Leu 
1               5                   10                  15      


Val Glu Lys Cys Glu Ser Ser Val Ser Leu Thr Val Pro Pro Val Val 
            20                  25                  30          


Lys Leu Glu Asn Gly Ser Ser Thr Asn Val Ser Leu Thr Leu Arg Pro 
        35                  40                  45              


Pro Leu Asn Ala Thr Leu Val Ile Thr Phe Glu Ile Thr Phe Arg Ser 
    50                  55                  60                  


Lys Asn Ile Thr Ile Leu Glu Leu Pro Asp Glu Val Val Val Pro Pro 
65                  70                  75                  80  


Gly Val Thr Asn Ser Ser Phe Gln Val Thr Ser Gln Asn Val Gly Gln 
                85                  90                  95      


Leu Thr Val Tyr Leu His Gly Asn His Ser Asn Gln Thr Gly Pro Arg 
            100                 105                 110         


Ile Arg Phe Leu Val Ile Arg Ser Ser Ala Ile Ser Ile Ile Asn Gln 
        115                 120                 125             


Val Ile Gly Trp Ile Tyr Phe Val Ala Trp Ser Ile Ser Phe Tyr Pro 
    130                 135                 140                 


Gln Val Ile Met Asn Trp Arg Arg Lys Ser Val Ile Gly Leu Ser Phe 
145                 150                 155                 160 


Asp Phe Val Ala Leu Asn Leu Thr Gly Phe Val Ala Tyr Ser Val Phe 
                165                 170                 175     


Asn Ile Gly Leu Leu Trp Val Pro Tyr Ile Lys Glu Gln Phe Leu Leu 
            180                 185                 190         


Lys Tyr Pro Asn Gly Val Asn Pro Val Asn Ser Asn Asp Val Phe Phe 
        195                 200                 205             


Ser Leu His Ala Val Val Leu Thr Leu Ile Ile Ile Val Gln Cys Cys 
    210                 215                 220                 


Leu Tyr Glu Arg Gly Gly Gln Arg Val Ser Trp Pro Ala Ile Gly Phe 
225                 230                 235                 240 


Leu Val Leu Ala Trp Leu Phe Ala Phe Val Thr Met Ile Val Ala Ala 
                245                 250                 255     


Val Gly Val Thr Thr Trp Leu Gln Phe Leu Phe Cys Phe Ser Tyr Ile 
            260                 265                 270         


Lys Leu Ala Val Thr Leu Val Lys Tyr Phe Pro Gln Ala Tyr Met Asn 
        275                 280                 285             


Phe Tyr Tyr Lys Ser Thr Glu Gly Trp Ser Ile Gly Asn Val Leu Leu 
    290                 295                 300                 


Asp Phe Thr Gly Gly Ser Phe Ser Leu Leu Gln Met Phe Leu Gln Ser 
305                 310                 315                 320 


Tyr Asn Asn Asp Gln Trp Thr Leu Ile Phe Gly Asp Pro Thr Lys Phe 
                325                 330                 335     


Gly Leu Gly Val Phe Ser Ile Val Phe Asp Val Val Phe Phe Ile Gln 
            340                 345                 350         


His Phe Cys Leu Tyr Arg Lys Arg Pro Gly Tyr Asp Gln Leu Asn 
        355                 360                 365         


<210>  229
<211>  529
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  beta-hexosaminidase subunit alpha (HEXA) isoform 2 preproprotein,
       accession number NP_000511.2

<400>  229

Met Thr Ser Ser Arg Leu Trp Phe Ser Leu Leu Leu Ala Ala Ala Phe 
1               5                   10                  15      


Ala Gly Arg Ala Thr Ala Leu Trp Pro Trp Pro Gln Asn Phe Gln Thr 
            20                  25                  30          


Ser Asp Gln Arg Tyr Val Leu Tyr Pro Asn Asn Phe Gln Phe Gln Tyr 
        35                  40                  45              


Asp Val Ser Ser Ala Ala Gln Pro Gly Cys Ser Val Leu Asp Glu Ala 
    50                  55                  60                  


Phe Gln Arg Tyr Arg Asp Leu Leu Phe Gly Ser Gly Ser Trp Pro Arg 
65                  70                  75                  80  


Pro Tyr Leu Thr Gly Lys Arg His Thr Leu Glu Lys Asn Val Leu Val 
                85                  90                  95      


Val Ser Val Val Thr Pro Gly Cys Asn Gln Leu Pro Thr Leu Glu Ser 
            100                 105                 110         


Val Glu Asn Tyr Thr Leu Thr Ile Asn Asp Asp Gln Cys Leu Leu Leu 
        115                 120                 125             


Ser Glu Thr Val Trp Gly Ala Leu Arg Gly Leu Glu Thr Phe Ser Gln 
    130                 135                 140                 


Leu Val Trp Lys Ser Ala Glu Gly Thr Phe Phe Ile Asn Lys Thr Glu 
145                 150                 155                 160 


Ile Glu Asp Phe Pro Arg Phe Pro His Arg Gly Leu Leu Leu Asp Thr 
                165                 170                 175     


Ser Arg His Tyr Leu Pro Leu Ser Ser Ile Leu Asp Thr Leu Asp Val 
            180                 185                 190         


Met Ala Tyr Asn Lys Leu Asn Val Phe His Trp His Leu Val Asp Asp 
        195                 200                 205             


Pro Ser Phe Pro Tyr Glu Ser Phe Thr Phe Pro Glu Leu Met Arg Lys 
    210                 215                 220                 


Gly Ser Tyr Asn Pro Val Thr His Ile Tyr Thr Ala Gln Asp Val Lys 
225                 230                 235                 240 


Glu Val Ile Glu Tyr Ala Arg Leu Arg Gly Ile Arg Val Leu Ala Glu 
                245                 250                 255     


Phe Asp Thr Pro Gly His Thr Leu Ser Trp Gly Pro Gly Ile Pro Gly 
            260                 265                 270         


Leu Leu Thr Pro Cys Tyr Ser Gly Ser Glu Pro Ser Gly Thr Phe Gly 
        275                 280                 285             


Pro Val Asn Pro Ser Leu Asn Asn Thr Tyr Glu Phe Met Ser Thr Phe 
    290                 295                 300                 


Phe Leu Glu Val Ser Ser Val Phe Pro Asp Phe Tyr Leu His Leu Gly 
305                 310                 315                 320 


Gly Asp Glu Val Asp Phe Thr Cys Trp Lys Ser Asn Pro Glu Ile Gln 
                325                 330                 335     


Asp Phe Met Arg Lys Lys Gly Phe Gly Glu Asp Phe Lys Gln Leu Glu 
            340                 345                 350         


Ser Phe Tyr Ile Gln Thr Leu Leu Asp Ile Val Ser Ser Tyr Gly Lys 
        355                 360                 365             


Gly Tyr Val Val Trp Gln Glu Val Phe Asp Asn Lys Val Lys Ile Gln 
    370                 375                 380                 


Pro Asp Thr Ile Ile Gln Val Trp Arg Glu Asp Ile Pro Val Asn Tyr 
385                 390                 395                 400 


Met Lys Glu Leu Glu Leu Val Thr Lys Ala Gly Phe Arg Ala Leu Leu 
                405                 410                 415     


Ser Ala Pro Trp Tyr Leu Asn Arg Ile Ser Tyr Gly Pro Asp Trp Lys 
            420                 425                 430         


Asp Phe Tyr Ile Val Glu Pro Leu Ala Phe Glu Gly Thr Pro Glu Gln 
        435                 440                 445             


Lys Ala Leu Val Ile Gly Gly Glu Ala Cys Met Trp Gly Glu Tyr Val 
    450                 455                 460                 


Asp Asn Thr Asn Leu Val Pro Arg Leu Trp Pro Arg Ala Gly Ala Val 
465                 470                 475                 480 


Ala Glu Arg Leu Trp Ser Asn Lys Leu Thr Ser Asp Leu Thr Phe Ala 
                485                 490                 495     


Tyr Glu Arg Leu Ser His Phe Arg Cys Glu Leu Leu Arg Arg Gly Val 
            500                 505                 510         


Gln Ala Gln Pro Leu Asn Val Gly Phe Cys Glu Gln Glu Phe Glu Gln 
        515                 520                 525             


Thr 
    


<210>  230
<211>  3056
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  serine/threonine kinase (ATM) isoform a, accession number 
       NP_000042.3

<400>  230

Met Ser Leu Val Leu Asn Asp Leu Leu Ile Cys Cys Arg Gln Leu Glu 
1               5                   10                  15      


His Asp Arg Ala Thr Glu Arg Lys Lys Glu Val Glu Lys Phe Lys Arg 
            20                  25                  30          


Leu Ile Arg Asp Pro Glu Thr Ile Lys His Leu Asp Arg His Ser Asp 
        35                  40                  45              


Ser Lys Gln Gly Lys Tyr Leu Asn Trp Asp Ala Val Phe Arg Phe Leu 
    50                  55                  60                  


Gln Lys Tyr Ile Gln Lys Glu Thr Glu Cys Leu Arg Ile Ala Lys Pro 
65                  70                  75                  80  


Asn Val Ser Ala Ser Thr Gln Ala Ser Arg Gln Lys Lys Met Gln Glu 
                85                  90                  95      


Ile Ser Ser Leu Val Lys Tyr Phe Ile Lys Cys Ala Asn Arg Arg Ala 
            100                 105                 110         


Pro Arg Leu Lys Cys Gln Glu Leu Leu Asn Tyr Ile Met Asp Thr Val 
        115                 120                 125             


Lys Asp Ser Ser Asn Gly Ala Ile Tyr Gly Ala Asp Cys Ser Asn Ile 
    130                 135                 140                 


Leu Leu Lys Asp Ile Leu Ser Val Arg Lys Tyr Trp Cys Glu Ile Ser 
145                 150                 155                 160 


Gln Gln Gln Trp Leu Glu Leu Phe Ser Val Tyr Phe Arg Leu Tyr Leu 
                165                 170                 175     


Lys Pro Ser Gln Asp Val His Arg Val Leu Val Ala Arg Ile Ile His 
            180                 185                 190         


Ala Val Thr Lys Gly Cys Cys Ser Gln Thr Asp Gly Leu Asn Ser Lys 
        195                 200                 205             


Phe Leu Asp Phe Phe Ser Lys Ala Ile Gln Cys Ala Arg Gln Glu Lys 
    210                 215                 220                 


Ser Ser Ser Gly Leu Asn His Ile Leu Ala Ala Leu Thr Ile Phe Leu 
225                 230                 235                 240 


Lys Thr Leu Ala Val Asn Phe Arg Ile Arg Val Cys Glu Leu Gly Asp 
                245                 250                 255     


Glu Ile Leu Pro Thr Leu Leu Tyr Ile Trp Thr Gln His Arg Leu Asn 
            260                 265                 270         


Asp Ser Leu Lys Glu Val Ile Ile Glu Leu Phe Gln Leu Gln Ile Tyr 
        275                 280                 285             


Ile His His Pro Lys Gly Ala Lys Thr Gln Glu Lys Gly Ala Tyr Glu 
    290                 295                 300                 


Ser Thr Lys Trp Arg Ser Ile Leu Tyr Asn Leu Tyr Asp Leu Leu Val 
305                 310                 315                 320 


Asn Glu Ile Ser His Ile Gly Ser Arg Gly Lys Tyr Ser Ser Gly Phe 
                325                 330                 335     


Arg Asn Ile Ala Val Lys Glu Asn Leu Ile Glu Leu Met Ala Asp Ile 
            340                 345                 350         


Cys His Gln Val Phe Asn Glu Asp Thr Arg Ser Leu Glu Ile Ser Gln 
        355                 360                 365             


Ser Tyr Thr Thr Thr Gln Arg Glu Ser Ser Asp Tyr Ser Val Pro Cys 
    370                 375                 380                 


Lys Arg Lys Lys Ile Glu Leu Gly Trp Glu Val Ile Lys Asp His Leu 
385                 390                 395                 400 


Gln Lys Ser Gln Asn Asp Phe Asp Leu Val Pro Trp Leu Gln Ile Ala 
                405                 410                 415     


Thr Gln Leu Ile Ser Lys Tyr Pro Ala Ser Leu Pro Asn Cys Glu Leu 
            420                 425                 430         


Ser Pro Leu Leu Met Ile Leu Ser Gln Leu Leu Pro Gln Gln Arg His 
        435                 440                 445             


Gly Glu Arg Thr Pro Tyr Val Leu Arg Cys Leu Thr Glu Val Ala Leu 
    450                 455                 460                 


Cys Gln Asp Lys Arg Ser Asn Leu Glu Ser Ser Gln Lys Ser Asp Leu 
465                 470                 475                 480 


Leu Lys Leu Trp Asn Lys Ile Trp Cys Ile Thr Phe Arg Gly Ile Ser 
                485                 490                 495     


Ser Glu Gln Ile Gln Ala Glu Asn Phe Gly Leu Leu Gly Ala Ile Ile 
            500                 505                 510         


Gln Gly Ser Leu Val Glu Val Asp Arg Glu Phe Trp Lys Leu Phe Thr 
        515                 520                 525             


Gly Ser Ala Cys Arg Pro Ser Cys Pro Ala Val Cys Cys Leu Thr Leu 
    530                 535                 540                 


Ala Leu Thr Thr Ser Ile Val Pro Gly Thr Val Lys Met Gly Ile Glu 
545                 550                 555                 560 


Gln Asn Met Cys Glu Val Asn Arg Ser Phe Ser Leu Lys Glu Ser Ile 
                565                 570                 575     


Met Lys Trp Leu Leu Phe Tyr Gln Leu Glu Gly Asp Leu Glu Asn Ser 
            580                 585                 590         


Thr Glu Val Pro Pro Ile Leu His Ser Asn Phe Pro His Leu Val Leu 
        595                 600                 605             


Glu Lys Ile Leu Val Ser Leu Thr Met Lys Asn Cys Lys Ala Ala Met 
    610                 615                 620                 


Asn Phe Phe Gln Ser Val Pro Glu Cys Glu His His Gln Lys Asp Lys 
625                 630                 635                 640 


Glu Glu Leu Ser Phe Ser Glu Val Glu Glu Leu Phe Leu Gln Thr Thr 
                645                 650                 655     


Phe Asp Lys Met Asp Phe Leu Thr Ile Val Arg Glu Cys Gly Ile Glu 
            660                 665                 670         


Lys His Gln Ser Ser Ile Gly Phe Ser Val His Gln Asn Leu Lys Glu 
        675                 680                 685             


Ser Leu Asp Arg Cys Leu Leu Gly Leu Ser Glu Gln Leu Leu Asn Asn 
    690                 695                 700                 


Tyr Ser Ser Glu Ile Thr Asn Ser Glu Thr Leu Val Arg Cys Ser Arg 
705                 710                 715                 720 


Leu Leu Val Gly Val Leu Gly Cys Tyr Cys Tyr Met Gly Val Ile Ala 
                725                 730                 735     


Glu Glu Glu Ala Tyr Lys Ser Glu Leu Phe Gln Lys Ala Lys Ser Leu 
            740                 745                 750         


Met Gln Cys Ala Gly Glu Ser Ile Thr Leu Phe Lys Asn Lys Thr Asn 
        755                 760                 765             


Glu Glu Phe Arg Ile Gly Ser Leu Arg Asn Met Met Gln Leu Cys Thr 
    770                 775                 780                 


Arg Cys Leu Ser Asn Cys Thr Lys Lys Ser Pro Asn Lys Ile Ala Ser 
785                 790                 795                 800 


Gly Phe Phe Leu Arg Leu Leu Thr Ser Lys Leu Met Asn Asp Ile Ala 
                805                 810                 815     


Asp Ile Cys Lys Ser Leu Ala Ser Phe Ile Lys Lys Pro Phe Asp Arg 
            820                 825                 830         


Gly Glu Val Glu Ser Met Glu Asp Asp Thr Asn Gly Asn Leu Met Glu 
        835                 840                 845             


Val Glu Asp Gln Ser Ser Met Asn Leu Phe Asn Asp Tyr Pro Asp Ser 
    850                 855                 860                 


Ser Val Ser Asp Ala Asn Glu Pro Gly Glu Ser Gln Ser Thr Ile Gly 
865                 870                 875                 880 


Ala Ile Asn Pro Leu Ala Glu Glu Tyr Leu Ser Lys Gln Asp Leu Leu 
                885                 890                 895     


Phe Leu Asp Met Leu Lys Phe Leu Cys Leu Cys Val Thr Thr Ala Gln 
            900                 905                 910         


Thr Asn Thr Val Ser Phe Arg Ala Ala Asp Ile Arg Arg Lys Leu Leu 
        915                 920                 925             


Met Leu Ile Asp Ser Ser Thr Leu Glu Pro Thr Lys Ser Leu His Leu 
    930                 935                 940                 


His Met Tyr Leu Met Leu Leu Lys Glu Leu Pro Gly Glu Glu Tyr Pro 
945                 950                 955                 960 


Leu Pro Met Glu Asp Val Leu Glu Leu Leu Lys Pro Leu Ser Asn Val 
                965                 970                 975     


Cys Ser Leu Tyr Arg Arg Asp Gln Asp Val Cys Lys Thr Ile Leu Asn 
            980                 985                 990         


His Val Leu His Val Val Lys Asn  Leu Gly Gln Ser Asn  Met Asp Ser 
        995                 1000                 1005             


Glu Asn  Thr Arg Asp Ala Gln  Gly Gln Phe Leu Thr  Val Ile Gly 
    1010                 1015                 1020             


Ala Phe  Trp His Leu Thr Lys  Glu Arg Lys Tyr Ile  Phe Ser Val 
    1025                 1030                 1035             


Arg Met  Ala Leu Val Asn Cys  Leu Lys Thr Leu Leu  Glu Ala Asp 
    1040                 1045                 1050             


Pro Tyr  Ser Lys Trp Ala Ile  Leu Asn Val Met Gly  Lys Asp Phe 
    1055                 1060                 1065             


Pro Val  Asn Glu Val Phe Thr  Gln Phe Leu Ala Asp  Asn His His 
    1070                 1075                 1080             


Gln Val  Arg Met Leu Ala Ala  Glu Ser Ile Asn Arg  Leu Phe Gln 
    1085                 1090                 1095             


Asp Thr  Lys Gly Asp Ser Ser  Arg Leu Leu Lys Ala  Leu Pro Leu 
    1100                 1105                 1110             


Lys Leu  Gln Gln Thr Ala Phe  Glu Asn Ala Tyr Leu  Lys Ala Gln 
    1115                 1120                 1125             


Glu Gly  Met Arg Glu Met Ser  His Ser Ala Glu Asn  Pro Glu Thr 
    1130                 1135                 1140             


Leu Asp  Glu Ile Tyr Asn Arg  Lys Ser Val Leu Leu  Thr Leu Ile 
    1145                 1150                 1155             


Ala Val  Val Leu Ser Cys Ser  Pro Ile Cys Glu Lys  Gln Ala Leu 
    1160                 1165                 1170             


Phe Ala  Leu Cys Lys Ser Val  Lys Glu Asn Gly Leu  Glu Pro His 
    1175                 1180                 1185             


Leu Val  Lys Lys Val Leu Glu  Lys Val Ser Glu Thr  Phe Gly Tyr 
    1190                 1195                 1200             


Arg Arg  Leu Glu Asp Phe Met  Ala Ser His Leu Asp  Tyr Leu Val 
    1205                 1210                 1215             


Leu Glu  Trp Leu Asn Leu Gln  Asp Thr Glu Tyr Asn  Leu Ser Ser 
    1220                 1225                 1230             


Phe Pro  Phe Ile Leu Leu Asn  Tyr Thr Asn Ile Glu  Asp Phe Tyr 
    1235                 1240                 1245             


Arg Ser  Cys Tyr Lys Val Leu  Ile Pro His Leu Val  Ile Arg Ser 
    1250                 1255                 1260             


His Phe  Asp Glu Val Lys Ser  Ile Ala Asn Gln Ile  Gln Glu Asp 
    1265                 1270                 1275             


Trp Lys  Ser Leu Leu Thr Asp  Cys Phe Pro Lys Ile  Leu Val Asn 
    1280                 1285                 1290             


Ile Leu  Pro Tyr Phe Ala Tyr  Glu Gly Thr Arg Asp  Ser Gly Met 
    1295                 1300                 1305             


Ala Gln  Gln Arg Glu Thr Ala  Thr Lys Val Tyr Asp  Met Leu Lys 
    1310                 1315                 1320             


Ser Glu  Asn Leu Leu Gly Lys  Gln Ile Asp His Leu  Phe Ile Ser 
    1325                 1330                 1335             


Asn Leu  Pro Glu Ile Val Val  Glu Leu Leu Met Thr  Leu His Glu 
    1340                 1345                 1350             


Pro Ala  Asn Ser Ser Ala Ser  Gln Ser Thr Asp Leu  Cys Asp Phe 
    1355                 1360                 1365             


Ser Gly  Asp Leu Asp Pro Ala  Pro Asn Pro Pro His  Phe Pro Ser 
    1370                 1375                 1380             


His Val  Ile Lys Ala Thr Phe  Ala Tyr Ile Ser Asn  Cys His Lys 
    1385                 1390                 1395             


Thr Lys  Leu Lys Ser Ile Leu  Glu Ile Leu Ser Lys  Ser Pro Asp 
    1400                 1405                 1410             


Ser Tyr  Gln Lys Ile Leu Leu  Ala Ile Cys Glu Gln  Ala Ala Glu 
    1415                 1420                 1425             


Thr Asn  Asn Val Tyr Lys Lys  His Arg Ile Leu Lys  Ile Tyr His 
    1430                 1435                 1440             


Leu Phe  Val Ser Leu Leu Leu  Lys Asp Ile Lys Ser  Gly Leu Gly 
    1445                 1450                 1455             


Gly Ala  Trp Ala Phe Val Leu  Arg Asp Val Ile Tyr  Thr Leu Ile 
    1460                 1465                 1470             


His Tyr  Ile Asn Gln Arg Pro  Ser Cys Ile Met Asp  Val Ser Leu 
    1475                 1480                 1485             


Arg Ser  Phe Ser Leu Cys Cys  Asp Leu Leu Ser Gln  Val Cys Gln 
    1490                 1495                 1500             


Thr Ala  Val Thr Tyr Cys Lys  Asp Ala Leu Glu Asn  His Leu His 
    1505                 1510                 1515             


Val Ile  Val Gly Thr Leu Ile  Pro Leu Val Tyr Glu  Gln Val Glu 
    1520                 1525                 1530             


Val Gln  Lys Gln Val Leu Asp  Leu Leu Lys Tyr Leu  Val Ile Asp 
    1535                 1540                 1545             


Asn Lys  Asp Asn Glu Asn Leu  Tyr Ile Thr Ile Lys  Leu Leu Asp 
    1550                 1555                 1560             


Pro Phe  Pro Asp His Val Val  Phe Lys Asp Leu Arg  Ile Thr Gln 
    1565                 1570                 1575             


Gln Lys  Ile Lys Tyr Ser Arg  Gly Pro Phe Ser Leu  Leu Glu Glu 
    1580                 1585                 1590             


Ile Asn  His Phe Leu Ser Val  Ser Val Tyr Asp Ala  Leu Pro Leu 
    1595                 1600                 1605             


Thr Arg  Leu Glu Gly Leu Lys  Asp Leu Arg Arg Gln  Leu Glu Leu 
    1610                 1615                 1620             


His Lys  Asp Gln Met Val Asp  Ile Met Arg Ala Ser  Gln Asp Asn 
    1625                 1630                 1635             


Pro Gln  Asp Gly Ile Met Val  Lys Leu Val Val Asn  Leu Leu Gln 
    1640                 1645                 1650             


Leu Ser  Lys Met Ala Ile Asn  His Thr Gly Glu Lys  Glu Val Leu 
    1655                 1660                 1665             


Glu Ala  Val Gly Ser Cys Leu  Gly Glu Val Gly Pro  Ile Asp Phe 
    1670                 1675                 1680             


Ser Thr  Ile Ala Ile Gln His  Ser Lys Asp Ala Ser  Tyr Thr Lys 
    1685                 1690                 1695             


Ala Leu  Lys Leu Phe Glu Asp  Lys Glu Leu Gln Trp  Thr Phe Ile 
    1700                 1705                 1710             


Met Leu  Thr Tyr Leu Asn Asn  Thr Leu Val Glu Asp  Cys Val Lys 
    1715                 1720                 1725             


Val Arg  Ser Ala Ala Val Thr  Cys Leu Lys Asn Ile  Leu Ala Thr 
    1730                 1735                 1740             


Lys Thr  Gly His Ser Phe Trp  Glu Ile Tyr Lys Met  Thr Thr Asp 
    1745                 1750                 1755             


Pro Met  Leu Ala Tyr Leu Gln  Pro Phe Arg Thr Ser  Arg Lys Lys 
    1760                 1765                 1770             


Phe Leu  Glu Val Pro Arg Phe  Asp Lys Glu Asn Pro  Phe Glu Gly 
    1775                 1780                 1785             


Leu Asp  Asp Ile Asn Leu Trp  Ile Pro Leu Ser Glu  Asn His Asp 
    1790                 1795                 1800             


Ile Trp  Ile Lys Thr Leu Thr  Cys Ala Phe Leu Asp  Ser Gly Gly 
    1805                 1810                 1815             


Thr Lys  Cys Glu Ile Leu Gln  Leu Leu Lys Pro Met  Cys Glu Val 
    1820                 1825                 1830             


Lys Thr  Asp Phe Cys Gln Thr  Val Leu Pro Tyr Leu  Ile His Asp 
    1835                 1840                 1845             


Ile Leu  Leu Gln Asp Thr Asn  Glu Ser Trp Arg Asn  Leu Leu Ser 
    1850                 1855                 1860             


Thr His  Val Gln Gly Phe Phe  Thr Ser Cys Leu Arg  His Phe Ser 
    1865                 1870                 1875             


Gln Thr  Ser Arg Ser Thr Thr  Pro Ala Asn Leu Asp  Ser Glu Ser 
    1880                 1885                 1890             


Glu His  Phe Phe Arg Cys Cys  Leu Asp Lys Lys Ser  Gln Arg Thr 
    1895                 1900                 1905             


Met Leu  Ala Val Val Asp Tyr  Met Arg Arg Gln Lys  Arg Pro Ser 
    1910                 1915                 1920             


Ser Gly  Thr Ile Phe Asn Asp  Ala Phe Trp Leu Asp  Leu Asn Tyr 
    1925                 1930                 1935             


Leu Glu  Val Ala Lys Val Ala  Gln Ser Cys Ala Ala  His Phe Thr 
    1940                 1945                 1950             


Ala Leu  Leu Tyr Ala Glu Ile  Tyr Ala Asp Lys Lys  Ser Met Asp 
    1955                 1960                 1965             


Asp Gln  Glu Lys Arg Ser Leu  Ala Phe Glu Glu Gly  Ser Gln Ser 
    1970                 1975                 1980             


Thr Thr  Ile Ser Ser Leu Ser  Glu Lys Ser Lys Glu  Glu Thr Gly 
    1985                 1990                 1995             


Ile Ser  Leu Gln Asp Leu Leu  Leu Glu Ile Tyr Arg  Ser Ile Gly 
    2000                 2005                 2010             


Glu Pro  Asp Ser Leu Tyr Gly  Cys Gly Gly Gly Lys  Met Leu Gln 
    2015                 2020                 2025             


Pro Ile  Thr Arg Leu Arg Thr  Tyr Glu His Glu Ala  Met Trp Gly 
    2030                 2035                 2040             


Lys Ala  Leu Val Thr Tyr Asp  Leu Glu Thr Ala Ile  Pro Ser Ser 
    2045                 2050                 2055             


Thr Arg  Gln Ala Gly Ile Ile  Gln Ala Leu Gln Asn  Leu Gly Leu 
    2060                 2065                 2070             


Cys His  Ile Leu Ser Val Tyr  Leu Lys Gly Leu Asp  Tyr Glu Asn 
    2075                 2080                 2085             


Lys Asp  Trp Cys Pro Glu Leu  Glu Glu Leu His Tyr  Gln Ala Ala 
    2090                 2095                 2100             


Trp Arg  Asn Met Gln Trp Asp  His Cys Thr Ser Val  Ser Lys Glu 
    2105                 2110                 2115             


Val Glu  Gly Thr Ser Tyr His  Glu Ser Leu Tyr Asn  Ala Leu Gln 
    2120                 2125                 2130             


Ser Leu  Arg Asp Arg Glu Phe  Ser Thr Phe Tyr Glu  Ser Leu Lys 
    2135                 2140                 2145             


Tyr Ala  Arg Val Lys Glu Val  Glu Glu Met Cys Lys  Arg Ser Leu 
    2150                 2155                 2160             


Glu Ser  Val Tyr Ser Leu Tyr  Pro Thr Leu Ser Arg  Leu Gln Ala 
    2165                 2170                 2175             


Ile Gly  Glu Leu Glu Ser Ile  Gly Glu Leu Phe Ser  Arg Ser Val 
    2180                 2185                 2190             


Thr His  Arg Gln Leu Ser Glu  Val Tyr Ile Lys Trp  Gln Lys His 
    2195                 2200                 2205             


Ser Gln  Leu Leu Lys Asp Ser  Asp Phe Ser Phe Gln  Glu Pro Ile 
    2210                 2215                 2220             


Met Ala  Leu Arg Thr Val Ile  Leu Glu Ile Leu Met  Glu Lys Glu 
    2225                 2230                 2235             


Met Asp  Asn Ser Gln Arg Glu  Cys Ile Lys Asp Ile  Leu Thr Lys 
    2240                 2245                 2250             


His Leu  Val Glu Leu Ser Ile  Leu Ala Arg Thr Phe  Lys Asn Thr 
    2255                 2260                 2265             


Gln Leu  Pro Glu Arg Ala Ile  Phe Gln Ile Lys Gln  Tyr Asn Ser 
    2270                 2275                 2280             


Val Ser  Cys Gly Val Ser Glu  Trp Gln Leu Glu Glu  Ala Gln Val 
    2285                 2290                 2295             


Phe Trp  Ala Lys Lys Glu Gln  Ser Leu Ala Leu Ser  Ile Leu Lys 
    2300                 2305                 2310             


Gln Met  Ile Lys Lys Leu Asp  Ala Ser Cys Ala Ala  Asn Asn Pro 
    2315                 2320                 2325             


Ser Leu  Lys Leu Thr Tyr Thr  Glu Cys Leu Arg Val  Cys Gly Asn 
    2330                 2335                 2340             


Trp Leu  Ala Glu Thr Cys Leu  Glu Asn Pro Ala Val  Ile Met Gln 
    2345                 2350                 2355             


Thr Tyr  Leu Glu Lys Ala Val  Glu Val Ala Gly Asn  Tyr Asp Gly 
    2360                 2365                 2370             


Glu Ser  Ser Asp Glu Leu Arg  Asn Gly Lys Met Lys  Ala Phe Leu 
    2375                 2380                 2385             


Ser Leu  Ala Arg Phe Ser Asp  Thr Gln Tyr Gln Arg  Ile Glu Asn 
    2390                 2395                 2400             


Tyr Met  Lys Ser Ser Glu Phe  Glu Asn Lys Gln Ala  Leu Leu Lys 
    2405                 2410                 2415             


Arg Ala  Lys Glu Glu Val Gly  Leu Leu Arg Glu His  Lys Ile Gln 
    2420                 2425                 2430             


Thr Asn  Arg Tyr Thr Val Lys  Val Gln Arg Glu Leu  Glu Leu Asp 
    2435                 2440                 2445             


Glu Leu  Ala Leu Arg Ala Leu  Lys Glu Asp Arg Lys  Arg Phe Leu 
    2450                 2455                 2460             


Cys Lys  Ala Val Glu Asn Tyr  Ile Asn Cys Leu Leu  Ser Gly Glu 
    2465                 2470                 2475             


Glu His  Asp Met Trp Val Phe  Arg Leu Cys Ser Leu  Trp Leu Glu 
    2480                 2485                 2490             


Asn Ser  Gly Val Ser Glu Val  Asn Gly Met Met Lys  Arg Asp Gly 
    2495                 2500                 2505             


Met Lys  Ile Pro Thr Tyr Lys  Phe Leu Pro Leu Met  Tyr Gln Leu 
    2510                 2515                 2520             


Ala Ala  Arg Met Gly Thr Lys  Met Met Gly Gly Leu  Gly Phe His 
    2525                 2530                 2535             


Glu Val  Leu Asn Asn Leu Ile  Ser Arg Ile Ser Met  Asp His Pro 
    2540                 2545                 2550             


His His  Thr Leu Phe Ile Ile  Leu Ala Leu Ala Asn  Ala Asn Arg 
    2555                 2560                 2565             


Asp Glu  Phe Leu Thr Lys Pro  Glu Val Ala Arg Arg  Ser Arg Ile 
    2570                 2575                 2580             


Thr Lys  Asn Val Pro Lys Gln  Ser Ser Gln Leu Asp  Glu Asp Arg 
    2585                 2590                 2595             


Thr Glu  Ala Ala Asn Arg Ile  Ile Cys Thr Ile Arg  Ser Arg Arg 
    2600                 2605                 2610             


Pro Gln  Met Val Arg Ser Val  Glu Ala Leu Cys Asp  Ala Tyr Ile 
    2615                 2620                 2625             


Ile Leu  Ala Asn Leu Asp Ala  Thr Gln Trp Lys Thr  Gln Arg Lys 
    2630                 2635                 2640             


Gly Ile  Asn Ile Pro Ala Asp  Gln Pro Ile Thr Lys  Leu Lys Asn 
    2645                 2650                 2655             


Leu Glu  Asp Val Val Val Pro  Thr Met Glu Ile Lys  Val Asp His 
    2660                 2665                 2670             


Thr Gly  Glu Tyr Gly Asn Leu  Val Thr Ile Gln Ser  Phe Lys Ala 
    2675                 2680                 2685             


Glu Phe  Arg Leu Ala Gly Gly  Val Asn Leu Pro Lys  Ile Ile Asp 
    2690                 2695                 2700             


Cys Val  Gly Ser Asp Gly Lys  Glu Arg Arg Gln Leu  Val Lys Gly 
    2705                 2710                 2715             


Arg Asp  Asp Leu Arg Gln Asp  Ala Val Met Gln Gln  Val Phe Gln 
    2720                 2725                 2730             


Met Cys  Asn Thr Leu Leu Gln  Arg Asn Thr Glu Thr  Arg Lys Arg 
    2735                 2740                 2745             


Lys Leu  Thr Ile Cys Thr Tyr  Lys Val Val Pro Leu  Ser Gln Arg 
    2750                 2755                 2760             


Ser Gly  Val Leu Glu Trp Cys  Thr Gly Thr Val Pro  Ile Gly Glu 
    2765                 2770                 2775             


Phe Leu  Val Asn Asn Glu Asp  Gly Ala His Lys Arg  Tyr Arg Pro 
    2780                 2785                 2790             


Asn Asp  Phe Ser Ala Phe Gln  Cys Gln Lys Lys Met  Met Glu Val 
    2795                 2800                 2805             


Gln Lys  Lys Ser Phe Glu Glu  Lys Tyr Glu Val Phe  Met Asp Val 
    2810                 2815                 2820             


Cys Gln  Asn Phe Gln Pro Val  Phe Arg Tyr Phe Cys  Met Glu Lys 
    2825                 2830                 2835             


Phe Leu  Asp Pro Ala Ile Trp  Phe Glu Lys Arg Leu  Ala Tyr Thr 
    2840                 2845                 2850             


Arg Ser  Val Ala Thr Ser Ser  Ile Val Gly Tyr Ile  Leu Gly Leu 
    2855                 2860                 2865             


Gly Asp  Arg His Val Gln Asn  Ile Leu Ile Asn Glu  Gln Ser Ala 
    2870                 2875                 2880             


Glu Leu  Val His Ile Asp Leu  Gly Val Ala Phe Glu  Gln Gly Lys 
    2885                 2890                 2895             


Ile Leu  Pro Thr Pro Glu Thr  Val Pro Phe Arg Leu  Thr Arg Asp 
    2900                 2905                 2910             


Ile Val  Asp Gly Met Gly Ile  Thr Gly Val Glu Gly  Val Phe Arg 
    2915                 2920                 2925             


Arg Cys  Cys Glu Lys Thr Met  Glu Val Met Arg Asn  Ser Gln Glu 
    2930                 2935                 2940             


Thr Leu  Leu Thr Ile Val Glu  Val Leu Leu Tyr Asp  Pro Leu Phe 
    2945                 2950                 2955             


Asp Trp  Thr Met Asn Pro Leu  Lys Ala Leu Tyr Leu  Gln Gln Arg 
    2960                 2965                 2970             


Pro Glu  Asp Glu Thr Glu Leu  His Pro Thr Leu Asn  Ala Asp Asp 
    2975                 2980                 2985             


Gln Glu  Cys Lys Arg Asn Leu  Ser Asp Ile Asp Gln  Ser Phe Asn 
    2990                 2995                 3000             


Lys Val  Ala Glu Arg Val Leu  Met Arg Leu Gln Glu  Lys Leu Lys 
    3005                 3010                 3015             


Gly Val  Glu Glu Gly Thr Val  Leu Ser Val Gly Gly  Gln Val Asn 
    3020                 3025                 3030             


Leu Leu  Ile Gln Gln Ala Ile  Asp Pro Lys Asn Leu  Ser Arg Leu 
    3035                 3040                 3045             


Phe Pro  Gly Trp Lys Ala Trp  Val 
    3050                 3055     


<210>  231
<211>  218
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  hypoxanthine-guanine phosphoribosyltransferase 1 (HPRT1), 
       accession number NP_000185.1

<400>  231

Met Ala Thr Arg Ser Pro Gly Val Val Ile Ser Asp Asp Glu Pro Gly 
1               5                   10                  15      


Tyr Asp Leu Asp Leu Phe Cys Ile Pro Asn His Tyr Ala Glu Asp Leu 
            20                  25                  30          


Glu Arg Val Phe Ile Pro His Gly Leu Ile Met Asp Arg Thr Glu Arg 
        35                  40                  45              


Leu Ala Arg Asp Val Met Lys Glu Met Gly Gly His His Ile Val Ala 
    50                  55                  60                  


Leu Cys Val Leu Lys Gly Gly Tyr Lys Phe Phe Ala Asp Leu Leu Asp 
65                  70                  75                  80  


Tyr Ile Lys Ala Leu Asn Arg Asn Ser Asp Arg Ser Ile Pro Met Thr 
                85                  90                  95      


Val Asp Phe Ile Arg Leu Lys Ser Tyr Cys Asn Asp Gln Ser Thr Gly 
            100                 105                 110         


Asp Ile Lys Val Ile Gly Gly Asp Asp Leu Ser Thr Leu Thr Gly Lys 
        115                 120                 125             


Asn Val Leu Ile Val Glu Asp Ile Ile Asp Thr Gly Lys Thr Met Gln 
    130                 135                 140                 


Thr Leu Leu Ser Leu Val Arg Gln Tyr Asn Pro Lys Met Val Lys Val 
145                 150                 155                 160 


Ala Ser Leu Leu Val Lys Arg Thr Pro Arg Ser Val Gly Tyr Lys Pro 
                165                 170                 175     


Asp Phe Val Gly Phe Glu Ile Pro Asp Lys Phe Val Val Gly Tyr Ala 
            180                 185                 190         


Leu Asp Tyr Asn Glu Tyr Phe Arg Asp Leu Asn His Val Cys Val Ile 
        195                 200                 205             


Ser Glu Thr Gly Lys Ala Lys Tyr Lys Ala 
    210                 215             


<210>  232
<211>  154
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  superoxide dismutase 1 [Cu-Zn], (SOD1), accession number 
       NP_000445.1

<400>  232

Met Ala Thr Lys Ala Val Cys Val Leu Lys Gly Asp Gly Pro Val Gln 
1               5                   10                  15      


Gly Ile Ile Asn Phe Glu Gln Lys Glu Ser Asn Gly Pro Val Lys Val 
            20                  25                  30          


Trp Gly Ser Ile Lys Gly Leu Thr Glu Gly Leu His Gly Phe His Val 
        35                  40                  45              


His Glu Phe Gly Asp Asn Thr Ala Gly Cys Thr Ser Ala Gly Pro His 
    50                  55                  60                  


Phe Asn Pro Leu Ser Arg Lys His Gly Gly Pro Lys Asp Glu Glu Arg 
65                  70                  75                  80  


His Val Gly Asp Leu Gly Asn Val Thr Ala Asp Lys Asp Gly Val Ala 
                85                  90                  95      


Asp Val Ser Ile Glu Asp Ser Val Ile Ser Leu Ser Gly Asp His Cys 
            100                 105                 110         


Ile Ile Gly Arg Thr Leu Val Val His Glu Lys Ala Asp Asp Leu Gly 
        115                 120                 125             


Lys Gly Gly Asn Glu Glu Ser Thr Lys Thr Gly Asn Ala Gly Ser Arg 
    130                 135                 140                 


Leu Ala Cys Gly Val Ile Gly Ile Ala Gln 
145                 150                 


<210>  233
<211>  414
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  TAR DNA-binding protein (TARDBP) 43, accession number NP_031401.1

<400>  233

Met Ser Glu Tyr Ile Arg Val Thr Glu Asp Glu Asn Asp Glu Pro Ile 
1               5                   10                  15      


Glu Ile Pro Ser Glu Asp Asp Gly Thr Val Leu Leu Ser Thr Val Thr 
            20                  25                  30          


Ala Gln Phe Pro Gly Ala Cys Gly Leu Arg Tyr Arg Asn Pro Val Ser 
        35                  40                  45              


Gln Cys Met Arg Gly Val Arg Leu Val Glu Gly Ile Leu His Ala Pro 
    50                  55                  60                  


Asp Ala Gly Trp Gly Asn Leu Val Tyr Val Val Asn Tyr Pro Lys Asp 
65                  70                  75                  80  


Asn Lys Arg Lys Met Asp Glu Thr Asp Ala Ser Ser Ala Val Lys Val 
                85                  90                  95      


Lys Arg Ala Val Gln Lys Thr Ser Asp Leu Ile Val Leu Gly Leu Pro 
            100                 105                 110         


Trp Lys Thr Thr Glu Gln Asp Leu Lys Glu Tyr Phe Ser Thr Phe Gly 
        115                 120                 125             


Glu Val Leu Met Val Gln Val Lys Lys Asp Leu Lys Thr Gly His Ser 
    130                 135                 140                 


Lys Gly Phe Gly Phe Val Arg Phe Thr Glu Tyr Glu Thr Gln Val Lys 
145                 150                 155                 160 


Val Met Ser Gln Arg His Met Ile Asp Gly Arg Trp Cys Asp Cys Lys 
                165                 170                 175     


Leu Pro Asn Ser Lys Gln Ser Gln Asp Glu Pro Leu Arg Ser Arg Lys 
            180                 185                 190         


Val Phe Val Gly Arg Cys Thr Glu Asp Met Thr Glu Asp Glu Leu Arg 
        195                 200                 205             


Glu Phe Phe Ser Gln Tyr Gly Asp Val Met Asp Val Phe Ile Pro Lys 
    210                 215                 220                 


Pro Phe Arg Ala Phe Ala Phe Val Thr Phe Ala Asp Asp Gln Ile Ala 
225                 230                 235                 240 


Gln Ser Leu Cys Gly Glu Asp Leu Ile Ile Lys Gly Ile Ser Val His 
                245                 250                 255     


Ile Ser Asn Ala Glu Pro Lys His Asn Ser Asn Arg Gln Leu Glu Arg 
            260                 265                 270         


Ser Gly Arg Phe Gly Gly Asn Pro Gly Gly Phe Gly Asn Gln Gly Gly 
        275                 280                 285             


Phe Gly Asn Ser Arg Gly Gly Gly Ala Gly Leu Gly Asn Asn Gln Gly 
    290                 295                 300                 


Ser Asn Met Gly Gly Gly Met Asn Phe Gly Ala Phe Ser Ile Asn Pro 
305                 310                 315                 320 


Ala Met Met Ala Ala Ala Gln Ala Ala Leu Gln Ser Ser Trp Gly Met 
                325                 330                 335     


Met Gly Met Leu Ala Ser Gln Gln Asn Gln Ser Gly Pro Ser Gly Asn 
            340                 345                 350         


Asn Gln Asn Gln Gly Asn Met Gln Arg Glu Pro Asn Gln Ala Phe Gly 
        355                 360                 365             


Ser Gly Asn Asn Ser Tyr Ser Gly Ser Asn Ser Gly Ala Ala Ile Gly 
    370                 375                 380                 


Trp Gly Ser Ala Ser Asn Ala Gly Ser Gly Ser Gly Phe Asn Gly Gly 
385                 390                 395                 400 


Phe Gly Ser Ser Met Asp Ser Lys Ser Ser Gly Trp Gly Met 
                405                 410                 


<210>  234
<211>  243
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  vesicle-associated membrane protein-associated protein B/C 
       (VAPB), accession number NP_004729.1

<400>  234

Met Ala Lys Val Glu Gln Val Leu Ser Leu Glu Pro Gln His Glu Leu 
1               5                   10                  15      


Lys Phe Arg Gly Pro Phe Thr Asp Val Val Thr Thr Asn Leu Lys Leu 
            20                  25                  30          


Gly Asn Pro Thr Asp Arg Asn Val Cys Phe Lys Val Lys Thr Thr Ala 
        35                  40                  45              


Pro Arg Arg Tyr Cys Val Arg Pro Asn Ser Gly Ile Ile Asp Ala Gly 
    50                  55                  60                  


Ala Ser Ile Asn Val Ser Val Met Leu Gln Pro Phe Asp Tyr Asp Pro 
65                  70                  75                  80  


Asn Glu Lys Ser Lys His Lys Phe Met Val Gln Ser Met Phe Ala Pro 
                85                  90                  95      


Thr Asp Thr Ser Asp Met Glu Ala Val Trp Lys Glu Ala Lys Pro Glu 
            100                 105                 110         


Asp Leu Met Asp Ser Lys Leu Arg Cys Val Phe Glu Leu Pro Ala Glu 
        115                 120                 125             


Asn Asp Lys Pro His Asp Val Glu Ile Asn Lys Ile Ile Ser Thr Thr 
    130                 135                 140                 


Ala Ser Lys Thr Glu Thr Pro Ile Val Ser Lys Ser Leu Ser Ser Ser 
145                 150                 155                 160 


Leu Asp Asp Thr Glu Val Lys Lys Val Met Glu Glu Cys Lys Arg Leu 
                165                 170                 175     


Gln Gly Glu Val Gln Arg Leu Arg Glu Glu Asn Lys Gln Phe Lys Glu 
            180                 185                 190         


Glu Asp Gly Leu Arg Met Arg Lys Thr Val Gln Ser Asn Ser Pro Ile 
        195                 200                 205             


Ser Ala Leu Ala Pro Thr Gly Lys Glu Glu Gly Leu Ser Thr Arg Leu 
    210                 215                 220                 


Leu Ala Leu Val Val Leu Phe Phe Ile Val Gly Val Ile Ile Gly Lys 
225                 230                 235                 240 


Ile Ala Leu 
            


<210>  235
<211>  1278
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  NPC intracellular cholesterol transporter 1 precursor (NPC1), 
       accession number NP_000262.2

<400>  235

Met Thr Ala Arg Gly Leu Ala Leu Gly Leu Leu Leu Leu Leu Leu Cys 
1               5                   10                  15      


Pro Ala Gln Val Phe Ser Gln Ser Cys Val Trp Tyr Gly Glu Cys Gly 
            20                  25                  30          


Ile Ala Tyr Gly Asp Lys Arg Tyr Asn Cys Glu Tyr Ser Gly Pro Pro 
        35                  40                  45              


Lys Pro Leu Pro Lys Asp Gly Tyr Asp Leu Val Gln Glu Leu Cys Pro 
    50                  55                  60                  


Gly Phe Phe Phe Gly Asn Val Ser Leu Cys Cys Asp Val Arg Gln Leu 
65                  70                  75                  80  


Gln Thr Leu Lys Asp Asn Leu Gln Leu Pro Leu Gln Phe Leu Ser Arg 
                85                  90                  95      


Cys Pro Ser Cys Phe Tyr Asn Leu Leu Asn Leu Phe Cys Glu Leu Thr 
            100                 105                 110         


Cys Ser Pro Arg Gln Ser Gln Phe Leu Asn Val Thr Ala Thr Glu Asp 
        115                 120                 125             


Tyr Val Asp Pro Val Thr Asn Gln Thr Lys Thr Asn Val Lys Glu Leu 
    130                 135                 140                 


Gln Tyr Tyr Val Gly Gln Ser Phe Ala Asn Ala Met Tyr Asn Ala Cys 
145                 150                 155                 160 


Arg Asp Val Glu Ala Pro Ser Ser Asn Asp Lys Ala Leu Gly Leu Leu 
                165                 170                 175     


Cys Gly Lys Asp Ala Asp Ala Cys Asn Ala Thr Asn Trp Ile Glu Tyr 
            180                 185                 190         


Met Phe Asn Lys Asp Asn Gly Gln Ala Pro Phe Thr Ile Thr Pro Val 
        195                 200                 205             


Phe Ser Asp Phe Pro Val His Gly Met Glu Pro Met Asn Asn Ala Thr 
    210                 215                 220                 


Lys Gly Cys Asp Glu Ser Val Asp Glu Val Thr Ala Pro Cys Ser Cys 
225                 230                 235                 240 


Gln Asp Cys Ser Ile Val Cys Gly Pro Lys Pro Gln Pro Pro Pro Pro 
                245                 250                 255     


Pro Ala Pro Trp Thr Ile Leu Gly Leu Asp Ala Met Tyr Val Ile Met 
            260                 265                 270         


Trp Ile Thr Tyr Met Ala Phe Leu Leu Val Phe Phe Gly Ala Phe Phe 
        275                 280                 285             


Ala Val Trp Cys Tyr Arg Lys Arg Tyr Phe Val Ser Glu Tyr Thr Pro 
    290                 295                 300                 


Ile Asp Ser Asn Ile Ala Phe Ser Val Asn Ala Ser Asp Lys Gly Glu 
305                 310                 315                 320 


Ala Ser Cys Cys Asp Pro Val Ser Ala Ala Phe Glu Gly Cys Leu Arg 
                325                 330                 335     


Arg Leu Phe Thr Arg Trp Gly Ser Phe Cys Val Arg Asn Pro Gly Cys 
            340                 345                 350         


Val Ile Phe Phe Ser Leu Val Phe Ile Thr Ala Cys Ser Ser Gly Leu 
        355                 360                 365             


Val Phe Val Arg Val Thr Thr Asn Pro Val Asp Leu Trp Ser Ala Pro 
    370                 375                 380                 


Ser Ser Gln Ala Arg Leu Glu Lys Glu Tyr Phe Asp Gln His Phe Gly 
385                 390                 395                 400 


Pro Phe Phe Arg Thr Glu Gln Leu Ile Ile Arg Ala Pro Leu Thr Asp 
                405                 410                 415     


Lys His Ile Tyr Gln Pro Tyr Pro Ser Gly Ala Asp Val Pro Phe Gly 
            420                 425                 430         


Pro Pro Leu Asp Ile Gln Ile Leu His Gln Val Leu Asp Leu Gln Ile 
        435                 440                 445             


Ala Ile Glu Asn Ile Thr Ala Ser Tyr Asp Asn Glu Thr Val Thr Leu 
    450                 455                 460                 


Gln Asp Ile Cys Leu Ala Pro Leu Ser Pro Tyr Asn Thr Asn Cys Thr 
465                 470                 475                 480 


Ile Leu Ser Val Leu Asn Tyr Phe Gln Asn Ser His Ser Val Leu Asp 
                485                 490                 495     


His Lys Lys Gly Asp Asp Phe Phe Val Tyr Ala Asp Tyr His Thr His 
            500                 505                 510         


Phe Leu Tyr Cys Val Arg Ala Pro Ala Ser Leu Asn Asp Thr Ser Leu 
        515                 520                 525             


Leu His Asp Pro Cys Leu Gly Thr Phe Gly Gly Pro Val Phe Pro Trp 
    530                 535                 540                 


Leu Val Leu Gly Gly Tyr Asp Asp Gln Asn Tyr Asn Asn Ala Thr Ala 
545                 550                 555                 560 


Leu Val Ile Thr Phe Pro Val Asn Asn Tyr Tyr Asn Asp Thr Glu Lys 
                565                 570                 575     


Leu Gln Arg Ala Gln Ala Trp Glu Lys Glu Phe Ile Asn Phe Val Lys 
            580                 585                 590         


Asn Tyr Lys Asn Pro Asn Leu Thr Ile Ser Phe Thr Ala Glu Arg Ser 
        595                 600                 605             


Ile Glu Asp Glu Leu Asn Arg Glu Ser Asp Ser Asp Val Phe Thr Val 
    610                 615                 620                 


Val Ile Ser Tyr Ala Ile Met Phe Leu Tyr Ile Ser Leu Ala Leu Gly 
625                 630                 635                 640 


His Met Lys Ser Cys Arg Arg Leu Leu Val Asp Ser Lys Val Ser Leu 
                645                 650                 655     


Gly Ile Ala Gly Ile Leu Ile Val Leu Ser Ser Val Ala Cys Ser Leu 
            660                 665                 670         


Gly Val Phe Ser Tyr Ile Gly Leu Pro Leu Thr Leu Ile Val Ile Glu 
        675                 680                 685             


Val Ile Pro Phe Leu Val Leu Ala Val Gly Val Asp Asn Ile Phe Ile 
    690                 695                 700                 


Leu Val Gln Ala Tyr Gln Arg Asp Glu Arg Leu Gln Gly Glu Thr Leu 
705                 710                 715                 720 


Asp Gln Gln Leu Gly Arg Val Leu Gly Glu Val Ala Pro Ser Met Phe 
                725                 730                 735     


Leu Ser Ser Phe Ser Glu Thr Val Ala Phe Phe Leu Gly Ala Leu Ser 
            740                 745                 750         


Val Met Pro Ala Val His Thr Phe Ser Leu Phe Ala Gly Leu Ala Val 
        755                 760                 765             


Phe Ile Asp Phe Leu Leu Gln Ile Thr Cys Phe Val Ser Leu Leu Gly 
    770                 775                 780                 


Leu Asp Ile Lys Arg Gln Glu Lys Asn Arg Leu Asp Ile Phe Cys Cys 
785                 790                 795                 800 


Val Arg Gly Ala Glu Asp Gly Thr Ser Val Gln Ala Ser Glu Ser Cys 
                805                 810                 815     


Leu Phe Arg Phe Phe Lys Asn Ser Tyr Ser Pro Leu Leu Leu Lys Asp 
            820                 825                 830         


Trp Met Arg Pro Ile Val Ile Ala Ile Phe Val Gly Val Leu Ser Phe 
        835                 840                 845             


Ser Ile Ala Val Leu Asn Lys Val Asp Ile Gly Leu Asp Gln Ser Leu 
    850                 855                 860                 


Ser Met Pro Asp Asp Ser Tyr Met Val Asp Tyr Phe Lys Ser Ile Ser 
865                 870                 875                 880 


Gln Tyr Leu His Ala Gly Pro Pro Val Tyr Phe Val Leu Glu Glu Gly 
                885                 890                 895     


His Asp Tyr Thr Ser Ser Lys Gly Gln Asn Met Val Cys Gly Gly Met 
            900                 905                 910         


Gly Cys Asn Asn Asp Ser Leu Val Gln Gln Ile Phe Asn Ala Ala Gln 
        915                 920                 925             


Leu Asp Asn Tyr Thr Arg Ile Gly Phe Ala Pro Ser Ser Trp Ile Asp 
    930                 935                 940                 


Asp Tyr Phe Asp Trp Val Lys Pro Gln Ser Ser Cys Cys Arg Val Asp 
945                 950                 955                 960 


Asn Ile Thr Asp Gln Phe Cys Asn Ala Ser Val Val Asp Pro Ala Cys 
                965                 970                 975     


Val Arg Cys Arg Pro Leu Thr Pro Glu Gly Lys Gln Arg Pro Gln Gly 
            980                 985                 990         


Gly Asp Phe Met Arg Phe Leu Pro  Met Phe Leu Ser Asp  Asn Pro Asn 
        995                 1000                 1005             


Pro Lys  Cys Gly Lys Gly Gly  His Ala Ala Tyr Ser  Ser Ala Val 
    1010                 1015                 1020             


Asn Ile  Leu Leu Gly His Gly  Thr Arg Val Gly Ala  Thr Tyr Phe 
    1025                 1030                 1035             


Met Thr  Tyr His Thr Val Leu  Gln Thr Ser Ala Asp  Phe Ile Asp 
    1040                 1045                 1050             


Ala Leu  Lys Lys Ala Arg Leu  Ile Ala Ser Asn Val  Thr Glu Thr 
    1055                 1060                 1065             


Met Gly  Ile Asn Gly Ser Ala  Tyr Arg Val Phe Pro  Tyr Ser Val 
    1070                 1075                 1080             


Phe Tyr  Val Phe Tyr Glu Gln  Tyr Leu Thr Ile Ile  Asp Asp Thr 
    1085                 1090                 1095             


Ile Phe  Asn Leu Gly Val Ser  Leu Gly Ala Ile Phe  Leu Val Thr 
    1100                 1105                 1110             


Met Val  Leu Leu Gly Cys Glu  Leu Trp Ser Ala Val  Ile Met Cys 
    1115                 1120                 1125             


Ala Thr  Ile Ala Met Val Leu  Val Asn Met Phe Gly  Val Met Trp 
    1130                 1135                 1140             


Leu Trp  Gly Ile Ser Leu Asn  Ala Val Ser Leu Val  Asn Leu Val 
    1145                 1150                 1155             


Met Ser  Cys Gly Ile Ser Val  Glu Phe Cys Ser His  Ile Thr Arg 
    1160                 1165                 1170             


Ala Phe  Thr Val Ser Met Lys  Gly Ser Arg Val Glu  Arg Ala Glu 
    1175                 1180                 1185             


Glu Ala  Leu Ala His Met Gly  Ser Ser Val Phe Ser  Gly Ile Thr 
    1190                 1195                 1200             


Leu Thr  Lys Phe Gly Gly Ile  Val Val Leu Ala Phe  Ala Lys Ser 
    1205                 1210                 1215             


Gln Ile  Phe Gln Ile Phe Tyr  Phe Arg Met Tyr Leu  Ala Met Val 
    1220                 1225                 1230             


Leu Leu  Gly Ala Thr His Gly  Leu Ile Phe Leu Pro  Val Leu Leu 
    1235                 1240                 1245             


Ser Tyr  Ile Gly Pro Ser Val  Asn Lys Ala Lys Ser  Cys Ala Thr 
    1250                 1255                 1260             


Glu Glu  Arg Tyr Lys Gly Thr  Glu Arg Glu Arg Leu  Leu Asn Phe 
    1265                 1270                 1275             


<210>  236
<211>  1998
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  sodium channel protein type 1 subunit alpha 1 (SCN1A), isoform 2,
       accession number NP_008851.3

<400>  236

Met Glu Gln Thr Val Leu Val Pro Pro Gly Pro Asp Ser Phe Asn Phe 
1               5                   10                  15      


Phe Thr Arg Glu Ser Leu Ala Ala Ile Glu Arg Arg Ile Ala Glu Glu 
            20                  25                  30          


Lys Ala Lys Asn Pro Lys Pro Asp Lys Lys Asp Asp Asp Glu Asn Gly 
        35                  40                  45              


Pro Lys Pro Asn Ser Asp Leu Glu Ala Gly Lys Asn Leu Pro Phe Ile 
    50                  55                  60                  


Tyr Gly Asp Ile Pro Pro Glu Met Val Ser Glu Pro Leu Glu Asp Leu 
65                  70                  75                  80  


Asp Pro Tyr Tyr Ile Asn Lys Lys Thr Phe Ile Val Leu Asn Lys Gly 
                85                  90                  95      


Lys Ala Ile Phe Arg Phe Ser Ala Thr Ser Ala Leu Tyr Ile Leu Thr 
            100                 105                 110         


Pro Phe Asn Pro Leu Arg Lys Ile Ala Ile Lys Ile Leu Val His Ser 
        115                 120                 125             


Leu Phe Ser Met Leu Ile Met Cys Thr Ile Leu Thr Asn Cys Val Phe 
    130                 135                 140                 


Met Thr Met Ser Asn Pro Pro Asp Trp Thr Lys Asn Val Glu Tyr Thr 
145                 150                 155                 160 


Phe Thr Gly Ile Tyr Thr Phe Glu Ser Leu Ile Lys Ile Ile Ala Arg 
                165                 170                 175     


Gly Phe Cys Leu Glu Asp Phe Thr Phe Leu Arg Asp Pro Trp Asn Trp 
            180                 185                 190         


Leu Asp Phe Thr Val Ile Thr Phe Ala Tyr Val Thr Glu Phe Val Asp 
        195                 200                 205             


Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu Arg Ala Leu 
    210                 215                 220                 


Lys Thr Ile Ser Val Ile Pro Gly Leu Lys Thr Ile Val Gly Ala Leu 
225                 230                 235                 240 


Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met Ile Leu Thr Val Phe 
                245                 250                 255     


Cys Leu Ser Val Phe Ala Leu Ile Gly Leu Gln Leu Phe Met Gly Asn 
            260                 265                 270         


Leu Arg Asn Lys Cys Ile Gln Trp Pro Pro Thr Asn Ala Ser Leu Glu 
        275                 280                 285             


Glu His Ser Ile Glu Lys Asn Ile Thr Val Asn Tyr Asn Gly Thr Leu 
    290                 295                 300                 


Ile Asn Glu Thr Val Phe Glu Phe Asp Trp Lys Ser Tyr Ile Gln Asp 
305                 310                 315                 320 


Ser Arg Tyr His Tyr Phe Leu Glu Gly Phe Leu Asp Ala Leu Leu Cys 
                325                 330                 335     


Gly Asn Ser Ser Asp Ala Gly Gln Cys Pro Glu Gly Tyr Met Cys Val 
            340                 345                 350         


Lys Ala Gly Arg Asn Pro Asn Tyr Gly Tyr Thr Ser Phe Asp Thr Phe 
        355                 360                 365             


Ser Trp Ala Phe Leu Ser Leu Phe Arg Leu Met Thr Gln Asp Phe Trp 
    370                 375                 380                 


Glu Asn Leu Tyr Gln Leu Thr Leu Arg Ala Ala Gly Lys Thr Tyr Met 
385                 390                 395                 400 


Ile Phe Phe Val Leu Val Ile Phe Leu Gly Ser Phe Tyr Leu Ile Asn 
                405                 410                 415     


Leu Ile Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gln Asn Gln Ala 
            420                 425                 430         


Thr Leu Glu Glu Ala Glu Gln Lys Glu Ala Glu Phe Gln Gln Met Ile 
        435                 440                 445             


Glu Gln Leu Lys Lys Gln Gln Glu Ala Ala Gln Gln Ala Ala Thr Ala 
    450                 455                 460                 


Thr Ala Ser Glu His Ser Arg Glu Pro Ser Ala Ala Gly Arg Leu Ser 
465                 470                 475                 480 


Asp Ser Ser Ser Glu Ala Ser Lys Leu Ser Ser Lys Ser Ala Lys Glu 
                485                 490                 495     


Arg Arg Asn Arg Arg Lys Lys Arg Lys Gln Lys Glu Gln Ser Gly Gly 
            500                 505                 510         


Glu Glu Lys Asp Glu Asp Glu Phe Gln Lys Ser Glu Ser Glu Asp Ser 
        515                 520                 525             


Ile Arg Arg Lys Gly Phe Arg Phe Ser Ile Glu Gly Asn Arg Leu Thr 
    530                 535                 540                 


Tyr Glu Lys Arg Tyr Ser Ser Pro His Gln Ser Leu Leu Ser Ile Arg 
545                 550                 555                 560 


Gly Ser Leu Phe Ser Pro Arg Arg Asn Ser Arg Thr Ser Leu Phe Ser 
                565                 570                 575     


Phe Arg Gly Arg Ala Lys Asp Val Gly Ser Glu Asn Asp Phe Ala Asp 
            580                 585                 590         


Asp Glu His Ser Thr Phe Glu Asp Asn Glu Ser Arg Arg Asp Ser Leu 
        595                 600                 605             


Phe Val Pro Arg Arg His Gly Glu Arg Arg Asn Ser Asn Leu Ser Gln 
    610                 615                 620                 


Thr Ser Arg Ser Ser Arg Met Leu Ala Val Phe Pro Ala Asn Gly Lys 
625                 630                 635                 640 


Met His Ser Thr Val Asp Cys Asn Gly Val Val Ser Leu Val Gly Gly 
                645                 650                 655     


Pro Ser Val Pro Thr Ser Pro Val Gly Gln Leu Leu Pro Glu Gly Thr 
            660                 665                 670         


Thr Thr Glu Thr Glu Met Arg Lys Arg Arg Ser Ser Ser Phe His Val 
        675                 680                 685             


Ser Met Asp Phe Leu Glu Asp Pro Ser Gln Arg Gln Arg Ala Met Ser 
    690                 695                 700                 


Ile Ala Ser Ile Leu Thr Asn Thr Val Glu Glu Leu Glu Glu Ser Arg 
705                 710                 715                 720 


Gln Lys Cys Pro Pro Cys Trp Tyr Lys Phe Ser Asn Ile Phe Leu Ile 
                725                 730                 735     


Trp Asp Cys Ser Pro Tyr Trp Leu Lys Val Lys His Val Val Asn Leu 
            740                 745                 750         


Val Val Met Asp Pro Phe Val Asp Leu Ala Ile Thr Ile Cys Ile Val 
        755                 760                 765             


Leu Asn Thr Leu Phe Met Ala Met Glu His Tyr Pro Met Thr Asp His 
    770                 775                 780                 


Phe Asn Asn Val Leu Thr Val Gly Asn Leu Val Phe Thr Gly Ile Phe 
785                 790                 795                 800 


Thr Ala Glu Met Phe Leu Lys Ile Ile Ala Met Asp Pro Tyr Tyr Tyr 
                805                 810                 815     


Phe Gln Glu Gly Trp Asn Ile Phe Asp Gly Phe Ile Val Thr Leu Ser 
            820                 825                 830         


Leu Val Glu Leu Gly Leu Ala Asn Val Glu Gly Leu Ser Val Leu Arg 
        835                 840                 845             


Ser Phe Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr 
    850                 855                 860                 


Leu Asn Met Leu Ile Lys Ile Ile Gly Asn Ser Val Gly Ala Leu Gly 
865                 870                 875                 880 


Asn Leu Thr Leu Val Leu Ala Ile Ile Val Phe Ile Phe Ala Val Val 
                885                 890                 895     


Gly Met Gln Leu Phe Gly Lys Ser Tyr Lys Asp Cys Val Cys Lys Ile 
            900                 905                 910         


Ala Ser Asp Cys Gln Leu Pro Arg Trp His Met Asn Asp Phe Phe His 
        915                 920                 925             


Ser Phe Leu Ile Val Phe Arg Val Leu Cys Gly Glu Trp Ile Glu Thr 
    930                 935                 940                 


Met Trp Asp Cys Met Glu Val Ala Gly Gln Ala Met Cys Leu Thr Val 
945                 950                 955                 960 


Phe Met Met Val Met Val Ile Gly Asn Leu Val Val Leu Asn Leu Phe 
                965                 970                 975     


Leu Ala Leu Leu Leu Ser Ser Phe Ser Ala Asp Asn Leu Ala Ala Thr 
            980                 985                 990         


Asp Asp Asp Asn Glu Met Asn Asn  Leu Gln Ile Ala Val  Asp Arg Met 
        995                 1000                 1005             


His Lys  Gly Val Ala Tyr Val  Lys Arg Lys Ile Tyr  Glu Phe Ile 
    1010                 1015                 1020             


Gln Gln  Ser Phe Ile Arg Lys  Gln Lys Ile Leu Asp  Glu Ile Lys 
    1025                 1030                 1035             


Pro Leu  Asp Asp Leu Asn Asn  Lys Lys Asp Ser Cys  Met Ser Asn 
    1040                 1045                 1050             


His Thr  Ala Glu Ile Gly Lys  Asp Leu Asp Tyr Leu  Lys Asp Val 
    1055                 1060                 1065             


Asn Gly  Thr Thr Ser Gly Ile  Gly Thr Gly Ser Ser  Val Glu Lys 
    1070                 1075                 1080             


Tyr Ile  Ile Asp Glu Ser Asp  Tyr Met Ser Phe Ile  Asn Asn Pro 
    1085                 1090                 1095             


Ser Leu  Thr Val Thr Val Pro  Ile Ala Val Gly Glu  Ser Asp Phe 
    1100                 1105                 1110             


Glu Asn  Leu Asn Thr Glu Asp  Phe Ser Ser Glu Ser  Asp Leu Glu 
    1115                 1120                 1125             


Glu Ser  Lys Glu Lys Leu Asn  Glu Ser Ser Ser Ser  Ser Glu Gly 
    1130                 1135                 1140             


Ser Thr  Val Asp Ile Gly Ala  Pro Val Glu Glu Gln  Pro Val Val 
    1145                 1150                 1155             


Glu Pro  Glu Glu Thr Leu Glu  Pro Glu Ala Cys Phe  Thr Glu Gly 
    1160                 1165                 1170             


Cys Val  Gln Arg Phe Lys Cys  Cys Gln Ile Asn Val  Glu Glu Gly 
    1175                 1180                 1185             


Arg Gly  Lys Gln Trp Trp Asn  Leu Arg Arg Thr Cys  Phe Arg Ile 
    1190                 1195                 1200             


Val Glu  His Asn Trp Phe Glu  Thr Phe Ile Val Phe  Met Ile Leu 
    1205                 1210                 1215             


Leu Ser  Ser Gly Ala Leu Ala  Phe Glu Asp Ile Tyr  Ile Asp Gln 
    1220                 1225                 1230             


Arg Lys  Thr Ile Lys Thr Met  Leu Glu Tyr Ala Asp  Lys Val Phe 
    1235                 1240                 1245             


Thr Tyr  Ile Phe Ile Leu Glu  Met Leu Leu Lys Trp  Val Ala Tyr 
    1250                 1255                 1260             


Gly Tyr  Gln Thr Tyr Phe Thr  Asn Ala Trp Cys Trp  Leu Asp Phe 
    1265                 1270                 1275             


Leu Ile  Val Asp Val Ser Leu  Val Ser Leu Thr Ala  Asn Ala Leu 
    1280                 1285                 1290             


Gly Tyr  Ser Glu Leu Gly Ala  Ile Lys Ser Leu Arg  Thr Leu Arg 
    1295                 1300                 1305             


Ala Leu  Arg Pro Leu Arg Ala  Leu Ser Arg Phe Glu  Gly Met Arg 
    1310                 1315                 1320             


Val Val  Val Asn Ala Leu Leu  Gly Ala Ile Pro Ser  Ile Met Asn 
    1325                 1330                 1335             


Val Leu  Leu Val Cys Leu Ile  Phe Trp Leu Ile Phe  Ser Ile Met 
    1340                 1345                 1350             


Gly Val  Asn Leu Phe Ala Gly  Lys Phe Tyr His Cys  Ile Asn Thr 
    1355                 1360                 1365             


Thr Thr  Gly Asp Arg Phe Asp  Ile Glu Asp Val Asn  Asn His Thr 
    1370                 1375                 1380             


Asp Cys  Leu Lys Leu Ile Glu  Arg Asn Glu Thr Ala  Arg Trp Lys 
    1385                 1390                 1395             


Asn Val  Lys Val Asn Phe Asp  Asn Val Gly Phe Gly  Tyr Leu Ser 
    1400                 1405                 1410             


Leu Leu  Gln Val Ala Thr Phe  Lys Gly Trp Met Asp  Ile Met Tyr 
    1415                 1420                 1425             


Ala Ala  Val Asp Ser Arg Asn  Val Glu Leu Gln Pro  Lys Tyr Glu 
    1430                 1435                 1440             


Glu Ser  Leu Tyr Met Tyr Leu  Tyr Phe Val Ile Phe  Ile Ile Phe 
    1445                 1450                 1455             


Gly Ser  Phe Phe Thr Leu Asn  Leu Phe Ile Gly Val  Ile Ile Asp 
    1460                 1465                 1470             


Asn Phe  Asn Gln Gln Lys Lys  Lys Phe Gly Gly Gln  Asp Ile Phe 
    1475                 1480                 1485             


Met Thr  Glu Glu Gln Lys Lys  Tyr Tyr Asn Ala Met  Lys Lys Leu 
    1490                 1495                 1500             


Gly Ser  Lys Lys Pro Gln Lys  Pro Ile Pro Arg Pro  Gly Asn Lys 
    1505                 1510                 1515             


Phe Gln  Gly Met Val Phe Asp  Phe Val Thr Arg Gln  Val Phe Asp 
    1520                 1525                 1530             


Ile Ser  Ile Met Ile Leu Ile  Cys Leu Asn Met Val  Thr Met Met 
    1535                 1540                 1545             


Val Glu  Thr Asp Asp Gln Ser  Glu Tyr Val Thr Thr  Ile Leu Ser 
    1550                 1555                 1560             


Arg Ile  Asn Leu Val Phe Ile  Val Leu Phe Thr Gly  Glu Cys Val 
    1565                 1570                 1575             


Leu Lys  Leu Ile Ser Leu Arg  His Tyr Tyr Phe Thr  Ile Gly Trp 
    1580                 1585                 1590             


Asn Ile  Phe Asp Phe Val Val  Val Ile Leu Ser Ile  Val Gly Met 
    1595                 1600                 1605             


Phe Leu  Ala Glu Leu Ile Glu  Lys Tyr Phe Val Ser  Pro Thr Leu 
    1610                 1615                 1620             


Phe Arg  Val Ile Arg Leu Ala  Arg Ile Gly Arg Ile  Leu Arg Leu 
    1625                 1630                 1635             


Ile Lys  Gly Ala Lys Gly Ile  Arg Thr Leu Leu Phe  Ala Leu Met 
    1640                 1645                 1650             


Met Ser  Leu Pro Ala Leu Phe  Asn Ile Gly Leu Leu  Leu Phe Leu 
    1655                 1660                 1665             


Val Met  Phe Ile Tyr Ala Ile  Phe Gly Met Ser Asn  Phe Ala Tyr 
    1670                 1675                 1680             


Val Lys  Arg Glu Val Gly Ile  Asp Asp Met Phe Asn  Phe Glu Thr 
    1685                 1690                 1695             


Phe Gly  Asn Ser Met Ile Cys  Leu Phe Gln Ile Thr  Thr Ser Ala 
    1700                 1705                 1710             


Gly Trp  Asp Gly Leu Leu Ala  Pro Ile Leu Asn Ser  Lys Pro Pro 
    1715                 1720                 1725             


Asp Cys  Asp Pro Asn Lys Val  Asn Pro Gly Ser Ser  Val Lys Gly 
    1730                 1735                 1740             


Asp Cys  Gly Asn Pro Ser Val  Gly Ile Phe Phe Phe  Val Ser Tyr 
    1745                 1750                 1755             


Ile Ile  Ile Ser Phe Leu Val  Val Val Asn Met Tyr  Ile Ala Val 
    1760                 1765                 1770             


Ile Leu  Glu Asn Phe Ser Val  Ala Thr Glu Glu Ser  Ala Glu Pro 
    1775                 1780                 1785             


Leu Ser  Glu Asp Asp Phe Glu  Met Phe Tyr Glu Val  Trp Glu Lys 
    1790                 1795                 1800             


Phe Asp  Pro Asp Ala Thr Gln  Phe Met Glu Phe Glu  Lys Leu Ser 
    1805                 1810                 1815             


Gln Phe  Ala Ala Ala Leu Glu  Pro Pro Leu Asn Leu  Pro Gln Pro 
    1820                 1825                 1830             


Asn Lys  Leu Gln Leu Ile Ala  Met Asp Leu Pro Met  Val Ser Gly 
    1835                 1840                 1845             


Asp Arg  Ile His Cys Leu Asp  Ile Leu Phe Ala Phe  Thr Lys Arg 
    1850                 1855                 1860             


Val Leu  Gly Glu Ser Gly Glu  Met Asp Ala Leu Arg  Ile Gln Met 
    1865                 1870                 1875             


Glu Glu  Arg Phe Met Ala Ser  Asn Pro Ser Lys Val  Ser Tyr Gln 
    1880                 1885                 1890             


Pro Ile  Thr Thr Thr Leu Lys  Arg Lys Gln Glu Glu  Val Ser Ala 
    1895                 1900                 1905             


Val Ile  Ile Gln Arg Ala Tyr  Arg Arg His Leu Leu  Lys Arg Thr 
    1910                 1915                 1920             


Val Lys  Gln Ala Ser Phe Thr  Tyr Asn Lys Asn Lys  Ile Lys Gly 
    1925                 1930                 1935             


Gly Ala  Asn Leu Leu Ile Lys  Glu Asp Met Ile Ile  Asp Arg Ile 
    1940                 1945                 1950             


Asn Glu  Asn Ser Ile Thr Glu  Lys Thr Asp Leu Thr  Met Ser Thr 
    1955                 1960                 1965             


Ala Ala  Cys Pro Pro Ser Tyr  Asp Arg Val Thr Lys  Pro Ile Val 
    1970                 1975                 1980             


Glu Lys  His Glu Gln Glu Gly  Lys Asp Glu Lys Ala  Lys Gly Lys 
    1985                 1990                 1995             


<210>  237
<211>  1466
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  collagen type III alpha 1 chain preproprotein (COL3A1), accession
       number NP_000081.2

<400>  237

Met Met Ser Phe Val Gln Lys Gly Ser Trp Leu Leu Leu Ala Leu Leu 
1               5                   10                  15      


His Pro Thr Ile Ile Leu Ala Gln Gln Glu Ala Val Glu Gly Gly Cys 
            20                  25                  30          


Ser His Leu Gly Gln Ser Tyr Ala Asp Arg Asp Val Trp Lys Pro Glu 
        35                  40                  45              


Pro Cys Gln Ile Cys Val Cys Asp Ser Gly Ser Val Leu Cys Asp Asp 
    50                  55                  60                  


Ile Ile Cys Asp Asp Gln Glu Leu Asp Cys Pro Asn Pro Glu Ile Pro 
65                  70                  75                  80  


Phe Gly Glu Cys Cys Ala Val Cys Pro Gln Pro Pro Thr Ala Pro Thr 
                85                  90                  95      


Arg Pro Pro Asn Gly Gln Gly Pro Gln Gly Pro Lys Gly Asp Pro Gly 
            100                 105                 110         


Pro Pro Gly Ile Pro Gly Arg Asn Gly Asp Pro Gly Ile Pro Gly Gln 
        115                 120                 125             


Pro Gly Ser Pro Gly Ser Pro Gly Pro Pro Gly Ile Cys Glu Ser Cys 
    130                 135                 140                 


Pro Thr Gly Pro Gln Asn Tyr Ser Pro Gln Tyr Asp Ser Tyr Asp Val 
145                 150                 155                 160 


Lys Ser Gly Val Ala Val Gly Gly Leu Ala Gly Tyr Pro Gly Pro Ala 
                165                 170                 175     


Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Thr Ser Gly His Pro Gly 
            180                 185                 190         


Ser Pro Gly Ser Pro Gly Tyr Gln Gly Pro Pro Gly Glu Pro Gly Gln 
        195                 200                 205             


Ala Gly Pro Ser Gly Pro Pro Gly Pro Pro Gly Ala Ile Gly Pro Ser 
    210                 215                 220                 


Gly Pro Ala Gly Lys Asp Gly Glu Ser Gly Arg Pro Gly Arg Pro Gly 
225                 230                 235                 240 


Glu Arg Gly Leu Pro Gly Pro Pro Gly Ile Lys Gly Pro Ala Gly Ile 
                245                 250                 255     


Pro Gly Phe Pro Gly Met Lys Gly His Arg Gly Phe Asp Gly Arg Asn 
            260                 265                 270         


Gly Glu Lys Gly Glu Thr Gly Ala Pro Gly Leu Lys Gly Glu Asn Gly 
        275                 280                 285             


Leu Pro Gly Glu Asn Gly Ala Pro Gly Pro Met Gly Pro Arg Gly Ala 
    290                 295                 300                 


Pro Gly Glu Arg Gly Arg Pro Gly Leu Pro Gly Ala Ala Gly Ala Arg 
305                 310                 315                 320 


Gly Asn Asp Gly Ala Arg Gly Ser Asp Gly Gln Pro Gly Pro Pro Gly 
                325                 330                 335     


Pro Pro Gly Thr Ala Gly Phe Pro Gly Ser Pro Gly Ala Lys Gly Glu 
            340                 345                 350         


Val Gly Pro Ala Gly Ser Pro Gly Ser Asn Gly Ala Pro Gly Gln Arg 
        355                 360                 365             


Gly Glu Pro Gly Pro Gln Gly His Ala Gly Ala Gln Gly Pro Pro Gly 
    370                 375                 380                 


Pro Pro Gly Ile Asn Gly Ser Pro Gly Gly Lys Gly Glu Met Gly Pro 
385                 390                 395                 400 


Ala Gly Ile Pro Gly Ala Pro Gly Leu Met Gly Ala Arg Gly Pro Pro 
                405                 410                 415     


Gly Pro Ala Gly Ala Asn Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly 
            420                 425                 430         


Glu Pro Gly Lys Asn Gly Ala Lys Gly Glu Pro Gly Pro Arg Gly Glu 
        435                 440                 445             


Arg Gly Glu Ala Gly Ile Pro Gly Val Pro Gly Ala Lys Gly Glu Asp 
    450                 455                 460                 


Gly Lys Asp Gly Ser Pro Gly Glu Pro Gly Ala Asn Gly Leu Pro Gly 
465                 470                 475                 480 


Ala Ala Gly Glu Arg Gly Ala Pro Gly Phe Arg Gly Pro Ala Gly Pro 
                485                 490                 495     


Asn Gly Ile Pro Gly Glu Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro 
            500                 505                 510         


Gly Pro Ala Gly Pro Arg Gly Ala Ala Gly Glu Pro Gly Arg Asp Gly 
        515                 520                 525             


Val Pro Gly Gly Pro Gly Met Arg Gly Met Pro Gly Ser Pro Gly Gly 
    530                 535                 540                 


Pro Gly Ser Asp Gly Lys Pro Gly Pro Pro Gly Ser Gln Gly Glu Ser 
545                 550                 555                 560 


Gly Arg Pro Gly Pro Pro Gly Pro Ser Gly Pro Arg Gly Gln Pro Gly 
                565                 570                 575     


Val Met Gly Phe Pro Gly Pro Lys Gly Asn Asp Gly Ala Pro Gly Lys 
            580                 585                 590         


Asn Gly Glu Arg Gly Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro 
        595                 600                 605             


Gly Lys Asn Gly Glu Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly 
    610                 615                 620                 


Pro Gly Gly Asp Lys Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu 
625                 630                 635                 640 


Gln Gly Leu Pro Gly Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro 
                645                 650                 655     


Gly Glu Pro Gly Pro Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly 
            660                 665                 670         


Gly Lys Gly Asp Ala Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu 
        675                 680                 685             


Ala Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu 
    690                 695                 700                 


Gly Gly Lys Gly Ala Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly 
705                 710                 715                 720 


Thr Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser 
                725                 730                 735     


Pro Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly Ala Asp 
            740                 745                 750         


Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro Ile Gly 
        755                 760                 765             


Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly Gly Ala 
    770                 775                 780                 


Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg 
785                 790                 795                 800 


Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala Pro Gly 
                805                 810                 815     


Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro Gly Glu 
            820                 825                 830         


Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly Gly Ser 
        835                 840                 845             


Gly Pro Ala Gly Pro Pro Gly Pro Gln Gly Val Lys Gly Glu Arg Gly 
    850                 855                 860                 


Ser Pro Gly Gly Pro Gly Ala Ala Gly Phe Pro Gly Ala Arg Gly Leu 
865                 870                 875                 880 


Pro Gly Pro Pro Gly Ser Asn Gly Asn Pro Gly Pro Pro Gly Pro Ser 
                885                 890                 895     


Gly Ser Pro Gly Lys Asp Gly Pro Pro Gly Pro Ala Gly Asn Thr Gly 
            900                 905                 910         


Ala Pro Gly Ser Pro Gly Val Ser Gly Pro Lys Gly Asp Ala Gly Gln 
        915                 920                 925             


Pro Gly Glu Lys Gly Ser Pro Gly Ala Gln Gly Pro Pro Gly Ala Pro 
    930                 935                 940                 


Gly Pro Leu Gly Ile Ala Gly Ile Thr Gly Ala Arg Gly Leu Ala Gly 
945                 950                 955                 960 


Pro Pro Gly Met Pro Gly Pro Arg Gly Ser Pro Gly Pro Gln Gly Val 
                965                 970                 975     


Lys Gly Glu Ser Gly Lys Pro Gly Ala Asn Gly Leu Ser Gly Glu Arg 
            980                 985                 990         


Gly Pro Pro Gly Pro Gln Gly Leu  Pro Gly Leu Ala Gly  Thr Ala Gly 
        995                 1000                 1005             


Glu Pro  Gly Arg Asp Gly Asn  Pro Gly Ser Asp Gly  Leu Pro Gly 
    1010                 1015                 1020             


Arg Asp  Gly Ser Pro Gly Gly  Lys Gly Asp Arg Gly  Glu Asn Gly 
    1025                 1030                 1035             


Ser Pro  Gly Ala Pro Gly Ala  Pro Gly His Pro Gly  Pro Pro Gly 
    1040                 1045                 1050             


Pro Val  Gly Pro Ala Gly Lys  Ser Gly Asp Arg Gly  Glu Ser Gly 
    1055                 1060                 1065             


Pro Ala  Gly Pro Ala Gly Ala  Pro Gly Pro Ala Gly  Ser Arg Gly 
    1070                 1075                 1080             


Ala Pro  Gly Pro Gln Gly Pro  Arg Gly Asp Lys Gly  Glu Thr Gly 
    1085                 1090                 1095             


Glu Arg  Gly Ala Ala Gly Ile  Lys Gly His Arg Gly  Phe Pro Gly 
    1100                 1105                 1110             


Asn Pro  Gly Ala Pro Gly Ser  Pro Gly Pro Ala Gly  Gln Gln Gly 
    1115                 1120                 1125             


Ala Ile  Gly Ser Pro Gly Pro  Ala Gly Pro Arg Gly  Pro Val Gly 
    1130                 1135                 1140             


Pro Ser  Gly Pro Pro Gly Lys  Asp Gly Thr Ser Gly  His Pro Gly 
    1145                 1150                 1155             


Pro Ile  Gly Pro Pro Gly Pro  Arg Gly Asn Arg Gly  Glu Arg Gly 
    1160                 1165                 1170             


Ser Glu  Gly Ser Pro Gly His  Pro Gly Gln Pro Gly  Pro Pro Gly 
    1175                 1180                 1185             


Pro Pro  Gly Ala Pro Gly Pro  Cys Cys Gly Gly Val  Gly Ala Ala 
    1190                 1195                 1200             


Ala Ile  Ala Gly Ile Gly Gly  Glu Lys Ala Gly Gly  Phe Ala Pro 
    1205                 1210                 1215             


Tyr Tyr  Gly Asp Glu Pro Met  Asp Phe Lys Ile Asn  Thr Asp Glu 
    1220                 1225                 1230             


Ile Met  Thr Ser Leu Lys Ser  Val Asn Gly Gln Ile  Glu Ser Leu 
    1235                 1240                 1245             


Ile Ser  Pro Asp Gly Ser Arg  Lys Asn Pro Ala Arg  Asn Cys Arg 
    1250                 1255                 1260             


Asp Leu  Lys Phe Cys His Pro  Glu Leu Lys Ser Gly  Glu Tyr Trp 
    1265                 1270                 1275             


Val Asp  Pro Asn Gln Gly Cys  Lys Leu Asp Ala Ile  Lys Val Phe 
    1280                 1285                 1290             


Cys Asn  Met Glu Thr Gly Glu  Thr Cys Ile Ser Ala  Asn Pro Leu 
    1295                 1300                 1305             


Asn Val  Pro Arg Lys His Trp  Trp Thr Asp Ser Ser  Ala Glu Lys 
    1310                 1315                 1320             


Lys His  Val Trp Phe Gly Glu  Ser Met Asp Gly Gly  Phe Gln Phe 
    1325                 1330                 1335             


Ser Tyr  Gly Asn Pro Glu Leu  Pro Glu Asp Val Leu  Asp Val His 
    1340                 1345                 1350             


Leu Ala  Phe Leu Arg Leu Leu  Ser Ser Arg Ala Ser  Gln Asn Ile 
    1355                 1360                 1365             


Thr Tyr  His Cys Lys Asn Ser  Ile Ala Tyr Met Asp  Gln Ala Ser 
    1370                 1375                 1380             


Gly Asn  Val Lys Lys Ala Leu  Lys Leu Met Gly Ser  Asn Glu Gly 
    1385                 1390                 1395             


Glu Phe  Lys Ala Glu Gly Asn  Ser Lys Phe Thr Tyr  Thr Val Leu 
    1400                 1405                 1410             


Glu Asp  Gly Cys Thr Lys His  Thr Gly Glu Trp Ser  Lys Thr Val 
    1415                 1420                 1425             


Phe Glu  Tyr Arg Thr Arg Lys  Ala Val Arg Leu Pro  Ile Val Asp 
    1430                 1435                 1440             


Ile Ala  Pro Tyr Asp Ile Gly  Gly Pro Asp Gln Glu  Phe Gly Val 
    1445                 1450                 1455             


Asp Val  Gly Pro Val Cys Phe  Leu 
    1460                 1465     


<210>  238
<211>  170
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  transmembrane protein 252 (TMEM252), also known as c9ORF 71-1, 
       accession number NP_694969.1

<400>  238

Met Gln Asn Arg Thr Gly Leu Ile Leu Cys Ala Leu Ala Leu Leu Met 
1               5                   10                  15      


Gly Phe Leu Met Val Cys Leu Gly Ala Phe Phe Ile Ser Trp Gly Ser 
            20                  25                  30          


Ile Phe Asp Cys Gln Gly Ser Leu Ile Ala Ala Tyr Leu Leu Leu Pro 
        35                  40                  45              


Leu Gly Phe Val Ile Leu Leu Ser Gly Ile Phe Trp Ser Asn Tyr Arg 
    50                  55                  60                  


Gln Val Thr Glu Ser Lys Gly Val Leu Arg His Met Leu Arg Gln His 
65                  70                  75                  80  


Leu Ala His Gly Ala Leu Pro Val Ala Thr Val Asp Arg Pro Asp Phe 
                85                  90                  95      


Tyr Pro Pro Ala Tyr Glu Glu Ser Leu Glu Val Glu Lys Gln Ser Cys 
            100                 105                 110         


Pro Ala Glu Arg Glu Ala Ser Gly Ile Pro Pro Pro Leu Tyr Thr Glu 
        115                 120                 125             


Thr Gly Leu Glu Phe Gln Asp Gly Asn Asp Ser His Pro Glu Ala Pro 
    130                 135                 140                 


Pro Ser Tyr Arg Glu Ser Ile Ala Gly Leu Val Val Thr Ala Ile Ser 
145                 150                 155                 160 


Glu Asp Ala Gln Arg Arg Gly Gln Glu Cys 
                165                 170 


<210>  239
<211>  147
<212>  PRT
<213>  Homo sapiens


<220>
<221>  MISC_FEATURE
<223>  hemoglobin subunit beta protein (HBB), accession number
       NP_000509.1

<400>  239

Met Val His Leu Thr Pro Glu Glu Lys Ser Ala Val Thr Ala Leu Trp 
1               5                   10                  15      


Gly Lys Val Asn Val Asp Glu Val Gly Gly Glu Ala Leu Gly Arg Leu 
            20                  25                  30          


Leu Val Val Tyr Pro Trp Thr Gln Arg Phe Phe Glu Ser Phe Gly Asp 
        35                  40                  45              


Leu Ser Thr Pro Asp Ala Val Met Gly Asn Pro Lys Val Lys Ala His 
    50                  55                  60                  


Gly Lys Lys Val Leu Gly Ala Phe Ser Asp Gly Leu Ala His Leu Asp 
65                  70                  75                  80  


Asn Leu Lys Gly Thr Phe Ala Thr Leu Ser Glu Leu His Cys Asp Lys 
                85                  90                  95      


Leu His Val Asp Pro Glu Asn Phe Arg Leu Leu Gly Asn Val Leu Val 
            100                 105                 110         


Cys Val Leu Ala His His Phe Gly Lys Glu Phe Thr Pro Pro Val Gln 
        115                 120                 125             


Ala Ala Tyr Gln Lys Val Val Ala Gly Val Ala Asn Ala Leu Ala His 
    130                 135                 140                 


Lys Tyr His 
145         


<210>  240
<211>  140
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P variant (tenP1) for attP donor cassette

<400>  240
agggtcctaa tactatctaa gtagttgatt catagtgact ggatatgttg cgttttgtcg       60

cattatgtag tctatcattt aaccacagat tagtgtaatg cgatgatttt taagtgatta      120

atgttatttt gtcatccttt                                                  140


<210>  241
<211>  140
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P variant (tenP2) for attP donor cassette

<400>  241
aggtcactaa tactatctaa gtagttgatt cataggacct ggatatgttg cgttttgtcg       60

cattatgtag tctatcattt aaccacagat tagtgtaatg cgatgatttt taagtgatta      120

atgttatttt gtcatccttt                                                  140


<210>  242
<211>  77
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P' variant (tenP'1) for attP donor cassette

<400>  242
taagttgtat atttaaaatc tctttaatta tcagtaaatt aatgtaagta gggtcttatt       60

agtcaaaata aaatcat                                                      77


<210>  243
<211>  77
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P' variant (tenP'2) for attP donor cassette

<400>  243
taagttgtat atttaaaatc tctttaatta tcagtaaatt aatgtaagta ggtcattatt       60

aggtcaaata aaatcat                                                      77


<210>  244
<211>  77
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P' variant (tenP'3) for attP donor cassette

<400>  244
taagttgtat atttaaaatc tctttaatta tcagtaaatt aatgtaagta ggtcattatt       60

agtcaaaata aaagtct                                                      77


