
                                SEQUENCE LISTING

<110> Vidanes, Genevieve M.
      Provenance Biofabrics, Inc.
      Dueber, John
      Gartner Zev
      Vazirani, Chirag

<120> EXPRESSION OF MODIFIED PROTEINS IN A
  PEROXISOME

<130> PBFAB.001WO2

<150> 62/847769
<151> 2019-05-14

<160> 82

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 3
<212> PRT
<213> Artificial Sequence

<220> 
<223> Peroxisome targeting sequence

<400> 1
Ser Leu Lys
 1          


<210> 2
<211> 9
<212> PRT
<213> Artificial Sequence

<220> 
<223> Peroxisome targeting sequence

<220> 
<221> VARIANT        
<222> (3)...(7)
<223> Xaa = any amino acid

<220> 
<221> VARIANT        
<222> 8
<223> Xaa = H or Q

<400> 2
Arg Leu Xaa Xaa Xaa Xaa Xaa Xaa Leu
 1               5              


<210> 3
<211> 9
<212> PRT
<213> Artificial Sequence

<220> 
<223> Peroxisome targeting sequence

<400> 3
Leu Gly Arg Gly Arg Arg Ser Lys Leu
 1               5                  


<210> 4
<211> 16
<212> PRT
<213> Artificial Sequence

<220> 
<223> Consensus sequence

<220> 
<221> VARIANT        
<222> 1,2,5-11,13,14,16
<223> Xaa = any amino acid

<220> 
<221> VARIANT        
<222> (3)...(4)
<223> Xaa = K or R

<220> 
<221> VARIANT        
<222> 12
<223> Xaa = T or S

<220> 
<221> VARIANT        
<222> 15
<223> Xaa = D or E

<400> 4
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
 1               5                  10                  15        


<210> 5
<211> 8
<212> PRT
<213> Artificial Sequence

<220> 
<223> tag

<400> 5
Asp Tyr Lys Asp Asp Asp Asp Lys
 1               5              


<210> 6
<211> 6
<212> PRT
<213> Artificial Sequence

<220> 
<223> tag

<400> 6
His His His His His His
 1               5      


<210> 7
<211> 26
<212> PRT
<213> Artificial Sequence

<220> 
<223> calmodulin tag

<400> 7
Lys Arg Arg Trp Lys Lys Asn Phe Ile Ala Val Ser Ala Ala Asn Arg
 1               5                  10                  15      
Phe Lys Lys Ile Ser Ser Ser Gly Ala Leu
            20                  25      


<210> 8
<211> 9
<212> PRT
<213> Artificial Sequence

<220> 
<223> HA tag

<400> 8
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
 1               5                  


<210> 9
<211> 10
<212> PRT
<213> Artificial Sequence

<220> 
<223> Myc tag

<400> 9
Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu
 1               5                  10  


<210> 10
<211> 38
<212> PRT
<213> Artificial Sequence

<220> 
<223> SBP tag

<400> 10
Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly
 1               5                  10                  15      
Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro
            20                  25                  30          
Gln Gly Gln Arg Glu Pro
        35              


<210> 11
<211> 8
<212> PRT
<213> Artificial Sequence

<220> 
<223> Strp tag

<400> 11
Trp Ser His Pro Gln Phe Glu Lys
 1               5              


<210> 12
<211> 27
<212> DNA
<213> Artificial Sequence

<220> 
<223> ePTS1 tag

<400> 12
ttgggaagag gtagaagatc caaattg 27      


<210> 13
<211> 238
<212> PRT
<213> Artificial Sequence

<220> 
<223> GFP tag

<400> 13
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235             


<210> 14
<211> 714
<212> DNA
<213> Artificial Sequence

<220> 
<223> GFP tag

<400> 14
atgcgtaaag gcgaagagct gttcactggt gtcgtcccta ttctggtgga actggatggt 60
gatgtcaacg gtcataagtt ttccgtgcgt ggcgagggtg aaggtgacgc aactaatggt 120
aaactgacgc tgaagttcat ctgtactact ggtaaactgc cggttccttg gccgactctg 180
gtaacgacgc tgacttatgg tgttcagtgc tttgctcgtt atccggacca tatgaagcag 240
catgacttct tcaagtccgc catgccggaa ggctatgtgc aggaacgcac gatttccttt 300
aaggatgacg gcacgtacaa aacgcgtgcg gaagtgaaat ttgaaggcga taccctggta 360
aaccgcattg agctgaaagg cattgacttt aaagaggacg gcaatatcct gggccataag 420
ctggaataca attttaacag ccacaatgtt tacatcaccg ccgataaaca aaaaaatggc 480
attaaagcga attttaaaat tcgccacaac gtggaggatg gcagcgtgca gctggctgat 540
cactaccagc aaaacactcc aatcggtgat ggtcctgttc tgctgccaga caatcactat 600
ctgagcacgc aaagcgttct gtctaaagat ccgaacgaga aacgcgatca tatggttctg 660
ctggagttcg taaccgcagc gggcatcacg catggtatgg atgaactgta caaa       714     


<210> 15
<211> 4392
<212> DNA
<213> Artificial Sequence

<220> 
<223> Btau COL1A1

<400> 15
atgttcagct ttgtggacct ccggctcctg ctcctcttag cggccaccgc cctcctgacg 60
cacggccaag aggagggcca ggaagaaggc caagaagaag acatcccacc agtcacctgc 120
gtacagaacg gcctcaggta ccatgaccga gacgtgtgga aacccgtgcc ctgccagatc 180
tgtgtctgcg acaacggcaa cgtgctgtgc gatgacgtga tctgcgacga acttaaggac 240
tgtcctaacg ccaaagtccc cacggacgaa tgctgccccg tctgccccga aggccaggaa 300
tcacccacgg accaagaaac caccggagtc gagggaccga aaggagacac tggcccccga 360
ggcccaaggg gacccgccgg cccccccggc cgagatggca tccctggaca acctggactt 420
cccggacccc ctggaccccc cggacctccc ggaccccctg gcctcggagg aaactttgct 480
ccccagttgt cttacggcta tgatgagaaa tcaacaggaa tttccgtgcc tggtcccatg 540
ggtccttctg gtcctcgtgg tctccctggc ccccctggcg cacctggtcc ccaaggtttc 600
caaggccccc ctggtgagcc tggcgagcca ggagcctcag gtcccatggg tccccgtggt 660
ccccctggcc cccctggcaa gaacggagat gatggcgaag ctggaaagcc tggtcgtcct 720
ggtgagcgcg ggcctcccgg acctcagggt gctcggggat tgcctggaac agctggcctc 780
cctggaatga agggacacag aggtttcagt ggtttggatg gtgccaaggg agatgctggt 840
cctgctggcc ccaagggcga gcctggtagc cccggtgaaa atggagctcc tggtcagatg 900
ggcccccgtg gtctgcctgg tgagagaggt cgccctggag cccctggccc tgctggtgct 960
cgaggaaatg atggtgcgac tggtgctgct gggccccctg gtcccactgg ccccgctggt 1020
cctcctggtt tccctggtgc tgtgggtgct aagggtgaag gtggtcccca aggaccccga 1080
ggttctgaag gtccccaggg tgtacgtggt gagcctggcc cccctggccc tgctggtgct 1140
gctggccctg ctggcaaccc tggtgctgat ggacagcctg gtgctaaagg agccaatggc 1200
gctcctggta ttgctggtgc tcctggcttc cctggtgccc gaggcccctc tggaccccag 1260
ggccccagcg gcccccctgg ccccaagggt aacagcggtg aacctggtgc tcctggcagc 1320
aaaggagaca ctggcgccaa gggagaaccc ggtcccactg gtattcaagg cccccctggc 1380
cccgctgggg aagaaggaaa gcgaggagcc cgaggtgaac ctggacctgc tggcctgcct 1440
ggaccccctg gcgagcgtgg tggacctgga agccgtggtt tccctggcgc cgacggtgtt 1500
gctggtccca agggtcctgc tggtgaacgc ggtgctcctg gccctgctgg ccccaaaggt 1560
tctcctggtg aagctggtcg ccccggtgaa gctggtctgc ccggtgccaa gggtctgact 1620
ggaagccctg gcagcccggg tcctgatggc aaaactggcc cccctggtcc cgccggtcaa 1680
gatggccgcc ctggacctcc aggccctccc ggtgcccgtg gtcaggctgg cgtgatgggt 1740
ttccctggac ctaaaggtgc tgctggagag cctggaaaag ctggagagcg aggtgttcct 1800
ggaccccctg gcgctgttgg tcctgctggc aaagacggag aagctggagc tcagggaccc 1860
ccaggacctg ctggccccgc tggtgagaga ggcgaacaag gccctgctgg ctcccctgga 1920
ttccagggtc tccccggccc tgctggtcct cctggtgaag caggcaaacc tggtgaacag 1980
ggtgttcctg gagatcttgg tgcccccggc ccctctggag caagaggcga gagaggtttc 2040
cccggcgagc gtggtgtgca agggccgccc ggtcctgcag gtccccgtgg ggccaatggt 2100
gcccctggca acgatggtgc taagggtgat gctggtgccc ctggagcccc cggtagccag 2160
ggtgcccctg gccttcaagg aatgcctggt gaacgaggtg cagctggtct tccaggccct 2220
aagggtgaca gaggggatgc tggtcccaaa ggtgctgatg gtgctcctgg caaagatggc 2280
gtccgtggtc tgactggtcc catcggtcct cctggccccg ctggtgcccc tggtgacaag 2340
ggtgaagctg gtcctagtgg cccagccggt cccactggag ctcgtggtgc ccccggtgac 2400
cgtggtgagc ctggtccccc cggccctgct ggcttcgctg gcccccctgg tgctgatggc 2460
caacctggtg ctaaaggcga acctggtgat gctggtgcta aaggtgacgc tggtcccccc 2520
ggccctgctg ggcccgctgg accccccggc cccattggta acgttggtgc tcccggaccc 2580
aaaggtgctc gtggcagcgc tggtccccct ggtgctactg gtttcccagg tgctgctggc 2640
cgagtcggtc cccccggccc ctctggaaat gctggacccc ctggccctcc tggccctgct 2700
ggcaaagaag gcagcaaagg cccccgcggt gagactggcc ccgctgggcg tcccggtgaa 2760
gtcggtcccc ctggtccccc tggccccgct ggtgagaaag gagcccctgg tgctgacgga 2820
cctgctggag ctcctggcac tcctggacct caaggtattg ctggacagcg tggtgtggtc 2880
ggcctgcctg gtcagagagg agaaagaggc ttccctggtc ttcctggccc ctctggtgaa 2940
cccggcaaac aaggtccttc tggagcaagt ggtgaacgtg gcccccctgg tcccatgggc 3000
ccccctggat tggctggacc ccctggcgag tctggacgtg agggagctcc tggtgctgaa 3060
ggatcccctg gacgagatgg ttctcctggc gccaagggtg accgtggtga gaccggccct 3120
gctggacctc ctggtgctcc tggcgctccc ggtgcccccg gccctgtcgg acctgccggc 3180
aagagcggtg atcgtggtga gaccggtcct gctggtcctg ctggtcccat tggccccgtt 3240
ggtgcccgtg gccccgctgg accccaaggc ccccgtggtg acaagggtga gacaggcgaa 3300
cagggcgaca gaggcattaa gggtcaccgt ggcttctctg gtctccaggg tccccccggc 3360
cctcccggct ctcctggtga gcaaggtcct tccggagcct ctggtcctgc tggtccccgc 3420
ggtccccctg gctctgctgg ttctcccggc aaagatggac tcaatggtct cccaggcccc 3480
atcggtcccc ctgggcctcg aggtcgcact ggtgatgctg gtcctgctgg tcctcccggc 3540
cctcctggac cccctggtcc cccaggtcct cccagcggcg gctacgactt gagcttcctg 3600
ccccagccac ctcaagagaa ggctcacgat ggtggccgct actaccgggc tgatgatgcc 3660
aatgtggtcc gtgaccgtga cctcgaggtg gacaccaccc tcaagagcct gagccagcag 3720
atcgagaaca tccggagccc tgaaggcagc cgcaagaacc ccgcccgcac ctgccgtgac 3780
ctcaagatgt gccactctga ctggaagagc ggagaatact ggattgaccc caaccaaggc 3840
tgcaacctgg atgccattaa ggtcttctgc aacatggaaa ccggtgagac ctgtgtatac 3900
cccactcagc ccagcgtggc ccagaagaac tggtatatca gcaagaaccc caaggaaaag 3960
aggcacgtct ggtacggcga gagcatgacc ggcggattcc agttcgagta tggcggccag 4020
gggtccgatc ctgccgatgt ggccatccag ctgactttcc tgcgcctgat gtccaccgag 4080
gcctcccaga acatcaccta ccactgcaag aacagcgtgg cctacatgga ccagcagact 4140
ggcaacctca agaaggccct gctcctccag ggctccaacg agatcgagat ccgggccgag 4200
ggcaacagcc gcttcaccta cagcgtcacc tacgatggct gcacgagtca caccggagcc 4260
tggggcaaga cagtgatcga atacaaaacc accaagacct cccgcttgcc catcatcgat 4320
gtggccccct tggacgttgg cgccccagac caggaattcg gcttcgacgt tggccctgcc 4380
tgcttcctgt aa                                                     4392


<210> 16
<211> 1463
<212> PRT
<213> Artificial Sequence

<220> 
<223> Btau COL1A1

<400> 16
Met Phe Ser Phe Val Asp Leu Arg Leu Leu Leu Leu Leu Ala Ala Thr
 1               5                  10                  15      
Ala Leu Leu Thr His Gly Gln Glu Glu Gly Gln Glu Glu Gly Gln Glu
            20                  25                  30          
Glu Asp Ile Pro Pro Val Thr Cys Val Gln Asn Gly Leu Arg Tyr His
        35                  40                  45              
Asp Arg Asp Val Trp Lys Pro Val Pro Cys Gln Ile Cys Val Cys Asp
    50                  55                  60                  
Asn Gly Asn Val Leu Cys Asp Asp Val Ile Cys Asp Glu Leu Lys Asp
65                  70                  75                  80  
Cys Pro Asn Ala Lys Val Pro Thr Asp Glu Cys Cys Pro Val Cys Pro
                85                  90                  95      
Glu Gly Gln Glu Ser Pro Thr Asp Gln Glu Thr Thr Gly Val Glu Gly
            100                 105                 110         
Pro Lys Gly Asp Thr Gly Pro Arg Gly Pro Arg Gly Pro Ala Gly Pro
        115                 120                 125             
Pro Gly Arg Asp Gly Ile Pro Gly Gln Pro Gly Leu Pro Gly Pro Pro
    130                 135                 140                 
Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala
145                 150                 155                 160 
Pro Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Ile Ser Val
                165                 170                 175     
Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro
            180                 185                 190         
Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly Glu Pro Gly
        195                 200                 205             
Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly Pro
    210                 215                 220                 
Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg Pro
225                 230                 235                 240 
Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly Ala Arg Gly Leu Pro Gly
                245                 250                 255     
Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe Ser Gly Leu
            260                 265                 270         
Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro
        275                 280                 285             
Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gln Met Gly Pro Arg Gly
    290                 295                 300                 
Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala
305                 310                 315                 320 
Arg Gly Asn Asp Gly Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro Thr
                325                 330                 335     
Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys Gly
            340                 345                 350         
Glu Gly Gly Pro Gln Gly Pro Arg Gly Ser Glu Gly Pro Gln Gly Val
        355                 360                 365             
Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala
    370                 375                 380                 
Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Ala Asn Gly
385                 390                 395                 400 
Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro
                405                 410                 415     
Ser Gly Pro Gln Gly Pro Ser Gly Pro Pro Gly Pro Lys Gly Asn Ser
            420                 425                 430         
Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys Gly
        435                 440                 445             
Glu Pro Gly Pro Thr Gly Ile Gln Gly Pro Pro Gly Pro Ala Gly Glu
    450                 455                 460                 
Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Ala Gly Leu Pro
465                 470                 475                 480 
Gly Pro Pro Gly Glu Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro Gly
                485                 490                 495     
Ala Asp Gly Val Ala Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly Ala
            500                 505                 510         
Pro Gly Pro Ala Gly Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg Pro
        515                 520                 525             
Gly Glu Ala Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro Gly
    530                 535                 540                 
Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly Gln
545                 550                 555                 560 
Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg Gly Gln Ala
                565                 570                 575     
Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly
            580                 585                 590         
Lys Ala Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro
        595                 600                 605             
Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala
    610                 615                 620                 
Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly
625                 630                 635                 640 
Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys
                645                 650                 655     
Pro Gly Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser
            660                 665                 670         
Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gln Gly
        675                 680                 685             
Pro Pro Gly Pro Ala Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly Asn
    690                 695                 700                 
Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser Gln
705                 710                 715                 720 
Gly Ala Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Ala Ala Gly
                725                 730                 735     
Leu Pro Gly Pro Lys Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly Ala
            740                 745                 750         
Asp Gly Ala Pro Gly Lys Asp Gly Val Arg Gly Leu Thr Gly Pro Ile
        755                 760                 765             
Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly Glu Ala Gly
    770                 775                 780                 
Pro Ser Gly Pro Ala Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly Asp
785                 790                 795                 800 
Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro Pro
                805                 810                 815     
Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala Gly
            820                 825                 830         
Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Pro
        835                 840                 845             
Pro Gly Pro Ile Gly Asn Val Gly Ala Pro Gly Pro Lys Gly Ala Arg
    850                 855                 860                 
Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala Gly
865                 870                 875                 880 
Arg Val Gly Pro Pro Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly Pro
                885                 890                 895     
Pro Gly Pro Ala Gly Lys Glu Gly Ser Lys Gly Pro Arg Gly Glu Thr
            900                 905                 910         
Gly Pro Ala Gly Arg Pro Gly Glu Val Gly Pro Pro Gly Pro Pro Gly
        915                 920                 925             
Pro Ala Gly Glu Lys Gly Ala Pro Gly Ala Asp Gly Pro Ala Gly Ala
    930                 935                 940                 
Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val Val
945                 950                 955                 960 
Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly
                965                 970                 975     
Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu
            980                 985                 990         
Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala Gly Pro Pro
        995                 1000                1005            
Gly Glu Ser Gly Arg Glu Gly Ala Pro Gly Ala Glu Gly Ser Pro Gly
    1010                1015                1020                
Arg Asp Gly Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro
1025                1030                1035               1040 
Ala Gly Pro Pro Gly Ala Pro Gly Ala Pro Gly Ala Pro Gly Pro Val
                1045                1050                1055    
Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly
            1060                1065                1070        
Pro Ala Gly Pro Ile Gly Pro Val Gly Ala Arg Gly Pro Ala Gly Pro
        1075                1080                1085            
Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg
    1090                1095                1100                
Gly Ile Lys Gly His Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly
1105                1110                1115               1120 
Pro Pro Gly Ser Pro Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro
                1125                1130                1135    
Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly Ser Pro Gly Lys Asp
            1140                1145                1150        
Gly Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly
        1155                1160                1165            
Arg Thr Gly Asp Ala Gly Pro Ala Gly Pro Pro Gly Pro Pro Gly Pro
    1170                1175                1180                
Pro Gly Pro Pro Gly Pro Pro Ser Gly Gly Tyr Asp Leu Ser Phe Leu
1185                1190                1195               1200 
Pro Gln Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg
                1205                1210                1215    
Ala Asp Asp Ala Asn Val Val Arg Asp Arg Asp Leu Glu Val Asp Thr
            1220                1225                1230        
Thr Leu Lys Ser Leu Ser Gln Gln Ile Glu Asn Ile Arg Ser Pro Glu
        1235                1240                1245            
Gly Ser Arg Lys Asn Pro Ala Arg Thr Cys Arg Asp Leu Lys Met Cys
    1250                1255                1260                
His Ser Asp Trp Lys Ser Gly Glu Tyr Trp Ile Asp Pro Asn Gln Gly
1265                1270                1275               1280 
Cys Asn Leu Asp Ala Ile Lys Val Phe Cys Asn Met Glu Thr Gly Glu
                1285                1290                1295    
Thr Cys Val Tyr Pro Thr Gln Pro Ser Val Ala Gln Lys Asn Trp Tyr
            1300                1305                1310        
Ile Ser Lys Asn Pro Lys Glu Lys Arg His Val Trp Tyr Gly Glu Ser
        1315                1320                1325            
Met Thr Gly Gly Phe Gln Phe Glu Tyr Gly Gly Gln Gly Ser Asp Pro
    1330                1335                1340                
Ala Asp Val Ala Ile Gln Leu Thr Phe Leu Arg Leu Met Ser Thr Glu
1345                1350                1355               1360 
Ala Ser Gln Asn Ile Thr Tyr His Cys Lys Asn Ser Val Ala Tyr Met
                1365                1370                1375    
Asp Gln Gln Thr Gly Asn Leu Lys Lys Ala Leu Leu Leu Gln Gly Ser
            1380                1385                1390        
Asn Glu Ile Glu Ile Arg Ala Glu Gly Asn Ser Arg Phe Thr Tyr Ser
        1395                1400                1405            
Val Thr Tyr Asp Gly Cys Thr Ser His Thr Gly Ala Trp Gly Lys Thr
    1410                1415                1420                
Val Ile Glu Tyr Lys Thr Thr Lys Thr Ser Arg Leu Pro Ile Ile Asp
1425                1430                1435               1440 
Val Ala Pro Leu Asp Val Gly Ala Pro Asp Gln Glu Phe Gly Phe Asp
                1445                1450                1455    
Val Gly Pro Ala Cys Phe Leu
            1460            


<210> 17
<211> 4095
<212> DNA
<213> Artificial Sequence

<220> 
<223> Btau COL1A1

<400> 17
atgctcagct ttgtggatac gcggactttg ttgctgcttg cagtaacttc gtgcctagca 60
acatgccaat ccttacaaga ggcaactgca agaaagggcc caagtggaga tagaggacca 120
cgcggagaaa ggggtccacc aggcccacca ggcagagatg gtgatgacgg catcccaggc 180
cctcctggcc cccctggccc tcctggcccc cctggtcttg gcgggaactt tgctgctcag 240
tttgatgcaa aaggaggtgg ccctggacca atggggctga tgggacctcg cggccctcct 300
ggggcttctg gagcccctgg ccctcaaggt ttccagggac ctccgggtga gcctggtgaa 360
cctggtcaga ctggtcctgc aggtgctcgt ggcccgcctg gccctcctgg caaggctggt 420
gaggatggtc accctggaaa acctggacga cctggtgaga gaggggttgt tggaccacag 480
ggtgctcgtg gctttcctgg aactcctgga ctccctggct tcaagggcat taggggtcac 540
aatggtctgg atggattgaa gggacagcct ggtgctccag gtgtgaaggg tgaacctggt 600
gcccctggtg aaaatggaac tccaggtcaa acgggagccc gtggtcttcc tggtgagaga 660
ggacgtgttg gtgcccctgg cccagctggt gcccgtggaa gtgatggaag tgtgggtcct 720
gtgggccctg ctggtcccat tgggtctgct ggccctccag gcttcccagg tgctcctggc 780
cccaagggtg aactcggacc tgttggtaac cctggccctg ctggtcccgc gggtccccgt 840
ggtgaagtgg gtctcccagg cctttctggc cctgtcggac ctcctggaaa ccccggagcc 900
aatgggcttc ctggcgctaa gggtgctgct ggccttcccg gtgttgctgg ggctcccggc 960
ctccctggac cccggggtat tcctggccct gttggcgctg ctggtgctac tggcgccaga 1020
ggacttgttg gtgagcccgg cccagctggt tcgaaaggag agagcggcaa caagggcgag 1080
cctggtgctg ttgggcagcc aggtcctcct ggccccagtg gtgaagaagg aaagagaggc 1140
tccactggag aaatcggacc cgctggcccc ccaggacctc ctgggctgag gggaaatcct 1200
ggctcccgtg gtctacctgg agctgacggc agagctggtg tcatgggtcc tgctggtagc 1260
cgtggtgcaa ctggccctgc tggtgtgcga ggtcccaatg gagattctgg tcgccctgga 1320
gagcctggcc tcatgggacc ccgaggtttc ccaggttccc ctggaaatat cggcccagct 1380
ggtaaagaag gtcctgtggg tctccctggt attgacggca gacctgggcc cattggccca 1440
gcgggagcaa gaggagagcc tggcaacatt ggattccctg gacccaaagg ccccagtggt 1500
gatcctggca aagctggtga aaaaggtcat gctggtcttg ctggtgctcg gggcgctcca 1560
ggtcccgatg gcaacaacgg tgctcaggga ccccctggac tacagggtgt ccaaggtgga 1620
aaaggtgaac agggtcctgc tggtcctcca ggcttccagg gtctgcctgg ccctgcaggc 1680
acagctggtg aagctggcaa accaggagaa aggggtatcc ctggtgaatt tggtctccct 1740
ggccctgctg gtgcaagagg ggagcggggg cccccaggtg aaagtggtgc tgctgggcct 1800
actgggccta ttggaagccg aggtccttct ggacccccag ggcctgatgg aaacaagggt 1860
gaaccgggtg tggttggcgc tccaggcact gctggcccat ctggtcctag cggactccca 1920
ggagagaggg gtgcggctgg cattcctgga ggcaagggag aaaagggtga aactggtctc 1980
agaggtgaca ttggtagccc tggtagagat ggtgctcgtg gtgctcctgg tgctattggt 2040
gctcctggcc ctgctggagc caatggggac cggggtgaag ctggtcccgc tggccctgct 2100
ggccctgctg gtcctcgtgg tagccctggt gaacgtggtg aggtcggtcc cgctggcccc 2160
aacggatttg ctggtcctgc tggtgctgct ggtcaacctg gtgctaaagg agagagagga 2220
accaaaggac ccaagggtga aaatggtcct gttggtccca caggccccgt tggagctgcc 2280
ggtccgtctg gtccaaatgg cccacctggt cctgctggaa gtcgtggtga tggagggccc 2340
cctggggcta ctggtttccc tggtgctgct ggacggactg gtccccctgg accctctggt 2400
atctctggcc cccctggccc ccctggtcct gctggtaaag aagggcttcg tgggcctcgt 2460
ggtgaccaag gtccagttgg tcgaagtgga gagacaggtg cctctggccc tcctggcttt 2520
gttggtgaga agggtccctc tggagagcct ggtactgctg ggcctcctgg aaccccaggt 2580
ccacaaggcc ttcttggtgc tcctggtttt ctgggtctcc caggctctag aggtgagcgt 2640
ggtctaccag gtgtcgctgg atctgtgggt gaacctggcc ccctcggcat cgcaggccca 2700
cctggggccc gtggtccccc tggtaatgtc ggtaatcctg gcgtcaatgg tgctcctggt 2760
gaagccggtc gtgacggcaa ccctgggaat gacggtcccc caggccgcga tggtcaaccc 2820
ggacacaagg gggagcgtgg ttaccccggt aacgcaggtc ctgttggtgc tgccggtgct 2880
cctggccctc aaggccctgt gggtcccgtt ggtaaacacg gaaaccgtgg tgaaccgggt 2940
cctgccggtg ctgttggtcc tgctggtgcc gttggcccaa gaggtcccag tggcccacaa 3000
ggtattcgag gtgacaaggg agagcctggt gataagggtc ccagaggtct tcctggctta 3060
aagggacaca atgggttgca aggtctcccg ggtcttgctg gtcatcatgg cgatcaaggt 3120
gctcccggtg ctgtgggtcc cgctggtccc aggggccctg ctggtccttc tggccccgct 3180
ggcaaagacg gtcgcattgg acagcctggt gcagtcggac ctgctggcat tcgtggctct 3240
cagggtagcc aaggtcctgc tggccctcct ggtccccctg gccctcctgg acctcctggc 3300
ccaagtggtg gtggttacga gtttggtttt gatggagact tctacagggc tgaccagcct 3360
cgctcaccaa cttctctcag acccaaggat tatgaagttg atgctactct gaaatctctc 3420
aacaaccaga ttgagaccct tcttactcca gaaggctcta ggaagaaccc agctcgcaca 3480
tgccgagact tgagactcag ccacccagaa tggagcagtg gttactactg gattgaccct 3540
aaccaaggat gtactatgga tgctatcaaa gtatactgtg atttctctac tggcgaaacc 3600
tgcatccggg ctcaacctga agacatccca gtcaagaact ggtacagaaa ttccaaggcc 3660
aagaagcatg tctgggtagg agaaactatc aacggtggta cccagtttga atataatgtt 3720
gaaggagtaa ccaccaagga aatggctacc caacttgcct tcatgcgtct gctggccaac 3780
catgcctctc agaacatcac ctaccattgc aagaacagca ttgcatacat ggatgaggaa 3840
actggcaacc tgaaaaaggc tgtcattctg caaggatcca atgatgtcga acttgttgcc 3900
gagggcaaca gcagattcac ttacactgtt cttgtagatg gctgctctaa aaagacaaat 3960
gaatggcaga agacaatcat tgaatataaa acaaacaagc catctcgcct gcctatcctt 4020
gatattgcac ctttggacat cggtggcgct gaccaagaaa tcagattgaa cattggccca 4080
gtctgtttca aataa                                                  4095


<210> 18
<211> 1364
<212> PRT
<213> Artificial Sequence

<220> 
<223> Btau COL1A1

<400> 18
Met Leu Ser Phe Val Asp Thr Arg Thr Leu Leu Leu Leu Ala Val Thr
 1               5                  10                  15      
Ser Cys Leu Ala Thr Cys Gln Ser Leu Gln Glu Ala Thr Ala Arg Lys
            20                  25                  30          
Gly Pro Ser Gly Asp Arg Gly Pro Arg Gly Glu Arg Gly Pro Pro Gly
        35                  40                  45              
Pro Pro Gly Arg Asp Gly Asp Asp Gly Ile Pro Gly Pro Pro Gly Pro
    50                  55                  60                  
Pro Gly Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala Ala Gln
65                  70                  75                  80  
Phe Asp Ala Lys Gly Gly Gly Pro Gly Pro Met Gly Leu Met Gly Pro
                85                  90                  95      
Arg Gly Pro Pro Gly Ala Ser Gly Ala Pro Gly Pro Gln Gly Phe Gln
            100                 105                 110         
Gly Pro Pro Gly Glu Pro Gly Glu Pro Gly Gln Thr Gly Pro Ala Gly
        115                 120                 125             
Ala Arg Gly Pro Pro Gly Pro Pro Gly Lys Ala Gly Glu Asp Gly His
    130                 135                 140                 
Pro Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Val Val Gly Pro Gln
145                 150                 155                 160 
Gly Ala Arg Gly Phe Pro Gly Thr Pro Gly Leu Pro Gly Phe Lys Gly
                165                 170                 175     
Ile Arg Gly His Asn Gly Leu Asp Gly Leu Lys Gly Gln Pro Gly Ala
            180                 185                 190         
Pro Gly Val Lys Gly Glu Pro Gly Ala Pro Gly Glu Asn Gly Thr Pro
        195                 200                 205             
Gly Gln Thr Gly Ala Arg Gly Leu Pro Gly Glu Arg Gly Arg Val Gly
    210                 215                 220                 
Ala Pro Gly Pro Ala Gly Ala Arg Gly Ser Asp Gly Ser Val Gly Pro
225                 230                 235                 240 
Val Gly Pro Ala Gly Pro Ile Gly Ser Ala Gly Pro Pro Gly Phe Pro
                245                 250                 255     
Gly Ala Pro Gly Pro Lys Gly Glu Leu Gly Pro Val Gly Asn Pro Gly
            260                 265                 270         
Pro Ala Gly Pro Ala Gly Pro Arg Gly Glu Val Gly Leu Pro Gly Leu
        275                 280                 285             
Ser Gly Pro Val Gly Pro Pro Gly Asn Pro Gly Ala Asn Gly Leu Pro
    290                 295                 300                 
Gly Ala Lys Gly Ala Ala Gly Leu Pro Gly Val Ala Gly Ala Pro Gly
305                 310                 315                 320 
Leu Pro Gly Pro Arg Gly Ile Pro Gly Pro Val Gly Ala Ala Gly Ala
                325                 330                 335     
Thr Gly Ala Arg Gly Leu Val Gly Glu Pro Gly Pro Ala Gly Ser Lys
            340                 345                 350         
Gly Glu Ser Gly Asn Lys Gly Glu Pro Gly Ala Val Gly Gln Pro Gly
        355                 360                 365             
Pro Pro Gly Pro Ser Gly Glu Glu Gly Lys Arg Gly Ser Thr Gly Glu
    370                 375                 380                 
Ile Gly Pro Ala Gly Pro Pro Gly Pro Pro Gly Leu Arg Gly Asn Pro
385                 390                 395                 400 
Gly Ser Arg Gly Leu Pro Gly Ala Asp Gly Arg Ala Gly Val Met Gly
                405                 410                 415     
Pro Ala Gly Ser Arg Gly Ala Thr Gly Pro Ala Gly Val Arg Gly Pro
            420                 425                 430         
Asn Gly Asp Ser Gly Arg Pro Gly Glu Pro Gly Leu Met Gly Pro Arg
        435                 440                 445             
Gly Phe Pro Gly Ser Pro Gly Asn Ile Gly Pro Ala Gly Lys Glu Gly
    450                 455                 460                 
Pro Val Gly Leu Pro Gly Ile Asp Gly Arg Pro Gly Pro Ile Gly Pro
465                 470                 475                 480 
Ala Gly Ala Arg Gly Glu Pro Gly Asn Ile Gly Phe Pro Gly Pro Lys
                485                 490                 495     
Gly Pro Ser Gly Asp Pro Gly Lys Ala Gly Glu Lys Gly His Ala Gly
            500                 505                 510         
Leu Ala Gly Ala Arg Gly Ala Pro Gly Pro Asp Gly Asn Asn Gly Ala
        515                 520                 525             
Gln Gly Pro Pro Gly Leu Gln Gly Val Gln Gly Gly Lys Gly Glu Gln
    530                 535                 540                 
Gly Pro Ala Gly Pro Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly
545                 550                 555                 560 
Thr Ala Gly Glu Ala Gly Lys Pro Gly Glu Arg Gly Ile Pro Gly Glu
                565                 570                 575     
Phe Gly Leu Pro Gly Pro Ala Gly Ala Arg Gly Glu Arg Gly Pro Pro
            580                 585                 590         
Gly Glu Ser Gly Ala Ala Gly Pro Thr Gly Pro Ile Gly Ser Arg Gly
        595                 600                 605             
Pro Ser Gly Pro Pro Gly Pro Asp Gly Asn Lys Gly Glu Pro Gly Val
    610                 615                 620                 
Val Gly Ala Pro Gly Thr Ala Gly Pro Ser Gly Pro Ser Gly Leu Pro
625                 630                 635                 640 
Gly Glu Arg Gly Ala Ala Gly Ile Pro Gly Gly Lys Gly Glu Lys Gly
                645                 650                 655     
Glu Thr Gly Leu Arg Gly Asp Ile Gly Ser Pro Gly Arg Asp Gly Ala
            660                 665                 670         
Arg Gly Ala Pro Gly Ala Ile Gly Ala Pro Gly Pro Ala Gly Ala Asn
        675                 680                 685             
Gly Asp Arg Gly Glu Ala Gly Pro Ala Gly Pro Ala Gly Pro Ala Gly
    690                 695                 700                 
Pro Arg Gly Ser Pro Gly Glu Arg Gly Glu Val Gly Pro Ala Gly Pro
705                 710                 715                 720 
Asn Gly Phe Ala Gly Pro Ala Gly Ala Ala Gly Gln Pro Gly Ala Lys
                725                 730                 735     
Gly Glu Arg Gly Thr Lys Gly Pro Lys Gly Glu Asn Gly Pro Val Gly
            740                 745                 750         
Pro Thr Gly Pro Val Gly Ala Ala Gly Pro Ser Gly Pro Asn Gly Pro
        755                 760                 765             
Pro Gly Pro Ala Gly Ser Arg Gly Asp Gly Gly Pro Pro Gly Ala Thr
    770                 775                 780                 
Gly Phe Pro Gly Ala Ala Gly Arg Thr Gly Pro Pro Gly Pro Ser Gly
785                 790                 795                 800 
Ile Ser Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Lys Glu Gly Leu
                805                 810                 815     
Arg Gly Pro Arg Gly Asp Gln Gly Pro Val Gly Arg Ser Gly Glu Thr
            820                 825                 830         
Gly Ala Ser Gly Pro Pro Gly Phe Val Gly Glu Lys Gly Pro Ser Gly
        835                 840                 845             
Glu Pro Gly Thr Ala Gly Pro Pro Gly Thr Pro Gly Pro Gln Gly Leu
    850                 855                 860                 
Leu Gly Ala Pro Gly Phe Leu Gly Leu Pro Gly Ser Arg Gly Glu Arg
865                 870                 875                 880 
Gly Leu Pro Gly Val Ala Gly Ser Val Gly Glu Pro Gly Pro Leu Gly
                885                 890                 895     
Ile Ala Gly Pro Pro Gly Ala Arg Gly Pro Pro Gly Asn Val Gly Asn
            900                 905                 910         
Pro Gly Val Asn Gly Ala Pro Gly Glu Ala Gly Arg Asp Gly Asn Pro
        915                 920                 925             
Gly Asn Asp Gly Pro Pro Gly Arg Asp Gly Gln Pro Gly His Lys Gly
    930                 935                 940                 
Glu Arg Gly Tyr Pro Gly Asn Ala Gly Pro Val Gly Ala Ala Gly Ala
945                 950                 955                 960 
Pro Gly Pro Gln Gly Pro Val Gly Pro Val Gly Lys His Gly Asn Arg
                965                 970                 975     
Gly Glu Pro Gly Pro Ala Gly Ala Val Gly Pro Ala Gly Ala Val Gly
            980                 985                 990         
Pro Arg Gly Pro Ser Gly Pro Gln Gly Ile Arg Gly Asp Lys Gly Glu
        995                 1000                1005            
Pro Gly Asp Lys Gly Pro Arg Gly Leu Pro Gly Leu Lys Gly His Asn
    1010                1015                1020                
Gly Leu Gln Gly Leu Pro Gly Leu Ala Gly His His Gly Asp Gln Gly
1025                1030                1035               1040 
Ala Pro Gly Ala Val Gly Pro Ala Gly Pro Arg Gly Pro Ala Gly Pro
                1045                1050                1055    
Ser Gly Pro Ala Gly Lys Asp Gly Arg Ile Gly Gln Pro Gly Ala Val
            1060                1065                1070        
Gly Pro Ala Gly Ile Arg Gly Ser Gln Gly Ser Gln Gly Pro Ala Gly
        1075                1080                1085            
Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Ser Gly Gly
    1090                1095                1100                
Gly Tyr Glu Phe Gly Phe Asp Gly Asp Phe Tyr Arg Ala Asp Gln Pro
1105                1110                1115               1120 
Arg Ser Pro Thr Ser Leu Arg Pro Lys Asp Tyr Glu Val Asp Ala Thr
                1125                1130                1135    
Leu Lys Ser Leu Asn Asn Gln Ile Glu Thr Leu Leu Thr Pro Glu Gly
            1140                1145                1150        
Ser Arg Lys Asn Pro Ala Arg Thr Cys Arg Asp Leu Arg Leu Ser His
        1155                1160                1165            
Pro Glu Trp Ser Ser Gly Tyr Tyr Trp Ile Asp Pro Asn Gln Gly Cys
    1170                1175                1180                
Thr Met Asp Ala Ile Lys Val Tyr Cys Asp Phe Ser Thr Gly Glu Thr
1185                1190                1195               1200 
Cys Ile Arg Ala Gln Pro Glu Asp Ile Pro Val Lys Asn Trp Tyr Arg
                1205                1210                1215    
Asn Ser Lys Ala Lys Lys His Val Trp Val Gly Glu Thr Ile Asn Gly
            1220                1225                1230        
Gly Thr Gln Phe Glu Tyr Asn Val Glu Gly Val Thr Thr Lys Glu Met
        1235                1240                1245            
Ala Thr Gln Leu Ala Phe Met Arg Leu Leu Ala Asn His Ala Ser Gln
    1250                1255                1260                
Asn Ile Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Glu Glu
1265                1270                1275               1280 
Thr Gly Asn Leu Lys Lys Ala Val Ile Leu Gln Gly Ser Asn Asp Val
                1285                1290                1295    
Glu Leu Val Ala Glu Gly Asn Ser Arg Phe Thr Tyr Thr Val Leu Val
            1300                1305                1310        
Asp Gly Cys Ser Lys Lys Thr Asn Glu Trp Gln Lys Thr Ile Ile Glu
        1315                1320                1325            
Tyr Lys Thr Asn Lys Pro Ser Arg Leu Pro Ile Leu Asp Ile Ala Pro
    1330                1335                1340                
Leu Asp Ile Gly Gly Ala Asp Gln Glu Ile Arg Leu Asn Ile Gly Pro
1345                1350                1355               1360 
Val Cys Phe Lys
                


<210> 19
<211> 4368
<212> DNA
<213> Artificial Sequence

<220> 
<223> Amis COL1A1

<400> 19
atgttcagct ttgtggattc tcggttactg ctgttgatag cagcgactgt actactcacc 60
aaaggtcaag gagaagaaga cattcaaact ggaagctgca tacaggatgg actagcgtac 120
aacaacacag acgtatggaa acccgagccc tgccagatct gcgtatgcga caatggcaac 180
atcctgtgtg acgatgtcat ctgtgatgat acctcggact gtaccaatgc tgagatcccc 240
tttggagaat gctgtcccat ctgtcctgac accgctggct cttctaccta ccccaaatcc 300
actggagtag agggtcctaa gggagacact ggccccagag gacagagggg actcccaggc 360
ccacctggca gagatggcat tcctggacag cctggtctcc ctggactccc aggacctcca 420
ggccctcctg gccttggtgg aaacttcgct cctcaaatgg cttacggtta cggagatgaa 480
accaaatctg ctggcatttc tgtccctgga cccatgggtc cagctggccc ccgtggtctc 540
cccggccccc ctggttctcc tggtcctcaa ggtttccaag gtcctcctgg agagcctgga 600
gagcctggtg cttcaggtcc aatgggtccc cgtggtccag ccggcccccc tggcaagaac 660
ggagatgatg gtgaagctgg aaagcccggc cgtcccggtg agcgcggccc tcctggcccc 720
cagggtgcac gtggtctgcc cggaactgct ggcctgccag gcatgaaggg tcacagaggt 780
ttcagtggtc tggatggtgc taagggtgat gctggtccat ccggccccaa gggtgagcct 840
ggtagccctg gtgagaacgg agctcctgga caaatgggcc ctcgtggtct tcccggtgag 900
agaggccgcc ctggtccatc tggccctgct ggtgctcgtg gtaacgatgg tagtcctggt 960
gctgctggcc ctccaggtcc aactggccca gctggccccc ctggcttccc tggtgctgct 1020
ggtgctaagg gtgaaactgg tcctcaaggt tctcgtggta gtgaaggccc acagggtgct 1080
cgtggtgagc ctggtcctcc tggccctgct ggtgctgctg gtcctgctgg caaccctggt 1140
tctgatggtc aagctggtgc caaaggtgca actggtgctc ctggtattgc tggtgctcct 1200
ggcttccctg gcgctcgtgg cccatctgga ccccagggtc ccagcggtgc tcctggcccc 1260
aagggtaaca gtggtgaacc cggtgctcaa ggcaacaagg gagacactgg tgcaaaagga 1320
gagcctggtc ctgctggtgt ccaaggccca cctggtccag ctggtgaaga aggcaagaga 1380
ggagcccgtg gtgagcccgg ccctggaggt cttcctggcc ctgctggcga acgtggtgct 1440
cctggaagcc gtggtttccc tggcgctgat ggcatttctg gtcccaaggg tccccctggt 1500
gaacgtggtt cccctggccc tgctggtccc aaaggatcta ctggtgaatc tggacgccct 1560
ggtgagcctg gtctccctgg tgccaagggt cttactggaa gcccaggtag cccaggtcct 1620
gatggcaaga ctggtccacc tggccccgct ggtcaagatg gtcgcccagg acccccaggc 1680
ccacctggtg ccagaggtca ggctggtgtg atgggtttcc ctggacctaa aggtgctgct 1740
ggtgagcctg gcaaacctgg tgagagagga gctcctggac cccctggtgc tgttggcgca 1800
gctggtaagg atggtgaagc tggtgcccaa ggttctcctg gcgctgctgg tcctgctgga 1860
gagagaggtg aacaaggtcc tgctggtgct cctggattcc agggtctgcc cggtcctgct 1920
ggcccatctg gtgaatctgg caagcctggt gaacagggtg ttcctggaga tgctggtgct 1980
cctggtccag ctggtgcaag aggcgagaga ggtttccctg gtgagcgtgg tgtccaaggt 2040
caaccaggtc cacagggtcc acgtggtgct aacggtgctc ccggtaacga tggtgctaag 2100
ggtgatgctg gtgctcctgg tgctcctggt ggccaaggtc ctcccggtct gcagggtatg 2160
cctggtgagc gtggtgctgc tggtctgcct ggttccaagg gtgacagagg cgatcctggt 2220
cccaaaggca ctgatggtgc tcctggcaaa gatggcgtca gaggtctaac tggccctatt 2280
ggtcctcctg gcccagctgg tgcccctggt gacaagggtg aagctggtcc ttctggccct 2340
gctggtccca ctggttctcg tggtgcccct ggagatcgtg gtgagcctgg tccacctggc 2400
cctgctggat tcgctggtcc ccctggtgct gatggacaac ctggtgctaa aggtgaatct 2460
ggtgatgctg gtgctaaagg tgatgctggt cctccaggcc ctgctggacc cactggtgct 2520
cctggacctt ctggcgctgt tggtgctcct ggacccaaag gtgctcgtgg tagtgctgga 2580
ccccctggtg ctactggttt ccctggtgct gctggaagag ttggtccacc tggccctgct 2640
ggtaacgtcg gtcttcctgg cccatcaggc cccagtggaa aagaaggctc taaaggaccc 2700
cgtggtgaga ctggccctgc tggacgcccc ggtgaacctg gacctgctgg cccaccagga 2760
ccttctggcg agaagggctc tcctggtggt gatggtcccg ctggtgctcc tggtactcca 2820
ggcccacagg gtattgctgg acagcgtggt gtagttggtc ttcctggaca gagaggcgag 2880
agaggtttcc ctggtctccc cggcccatct ggcgaacctg gcaaacaagg tccatctggc 2940
tcctctggtg aacgcggtcc tcctggtcca atgggaccac ctggcttggc tggacctcct 3000
ggtgaagctg gacgtgaggg tgctcctggt tctgaaggtg ctcctggtcg cgatggcgct 3060
gctggtccca agggtgaccg tggtgagact ggcccctctg gtcctcctgg tgctcccggt 3120
gcccctggag ctcctggccc tattggccct gctggcaaga atggagatcg tggtgagact 3180
ggtccttctg gtcctgctgg ccctgccggt cctgctggtg ctcgtggtcc tgctggtcca 3240
caaggtgccc gtggtgacaa aggtgaaact ggagaacatg gtgacagagg catgaagggt 3300
cacagaggat tccctggtcc ccagggtccc tctggtcctg ctggctctcc tggtgaacaa 3360
ggtccttctg gagcttccgg ccctgctggt ccaagaggtc ctcctggctc tgctggcacc 3420
cctggcaaag atggtctgaa tggtctccct ggccctattg gtccacctgg tccccggggt 3480
cgcactggtg atgttggtcc tgctggtccc cctggacctc ctgggccccc aggtcctcct 3540
ggtgcaccca gcggcggctt tgacttcagc ttcatgcccc agcctcctca ggagaaagcc 3600
catgatcctg gccgctacta cagagctgat gacgccaacg tgatgcgtga ccgtgacctg 3660
gaggtggaca ccaccctcaa gagcctgagc cagcagatcg agaacatccg cagccccgag 3720
ggcaccagga agaaccctgc ccgcacctgc cgtgacctga agatgtgcca caatgactgg 3780
aagagcggcg agtactggat tgaccccaac cagggctgca atctggatgc catcaaggtc 3840
tactgtaaca tggagactgg cgagacttgc gtccacccaa cccaggccac catcgctcag 3900
aagaactggt acatgagcaa gaaccccaag gagaagaaac acatctggtt tggcgagaca 3960
atgagcgatg gcttccagtt cgaatatggt ggggagggct ccaacccagc tgacgttgcc 4020
atccaactga ccttcctgcg cctgatgtcc actgaggcct cccagaacat cacctaccac 4080
tgcaagaaca gcgtggctta catggaccag gagactggca acctgaagaa ggctctgctc 4140
cttcagggct ccaacgagat cgagatcaga gcagaaggca acagccgctt cacctatgga 4200
gtcactgagg atggctgcac aactcacacc ggtgcctggg gcaagacagt cattgaatac 4260
aaaacaacaa aaacctctcg cctgcccgtc attgacgtgg ctcccatgga cgttggagca 4320
caagatcagg aattcggaat tgtcatcgga cctgtctgct tcttgtaa              4368


<210> 20
<211> 1455
<212> PRT
<213> Artificial Sequence

<220> 
<223> Amis COL1A1

<400> 20
Met Phe Ser Phe Val Asp Ser Arg Leu Leu Leu Leu Ile Ala Ala Thr
 1               5                  10                  15      
Val Leu Leu Thr Lys Gly Gln Gly Glu Glu Asp Ile Gln Thr Gly Ser
            20                  25                  30          
Cys Ile Gln Asp Gly Leu Ala Tyr Asn Asn Thr Asp Val Trp Lys Pro
        35                  40                  45              
Glu Pro Cys Gln Ile Cys Val Cys Asp Asn Gly Asn Ile Leu Cys Asp
    50                  55                  60                  
Asp Val Ile Cys Asp Asp Thr Ser Asp Cys Thr Asn Ala Glu Ile Pro
65                  70                  75                  80  
Phe Gly Glu Cys Cys Pro Ile Cys Pro Asp Thr Ala Gly Ser Ser Thr
                85                  90                  95      
Tyr Pro Lys Ser Thr Gly Val Glu Gly Pro Lys Gly Asp Thr Gly Pro
            100                 105                 110         
Arg Gly Gln Arg Gly Leu Pro Gly Pro Pro Gly Arg Asp Gly Ile Pro
        115                 120                 125             
Gly Gln Pro Gly Leu Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly
    130                 135                 140                 
Leu Gly Gly Asn Phe Ala Pro Gln Met Ala Tyr Gly Tyr Gly Asp Glu
145                 150                 155                 160 
Thr Lys Ser Ala Gly Ile Ser Val Pro Gly Pro Met Gly Pro Ala Gly
                165                 170                 175     
Pro Arg Gly Leu Pro Gly Pro Pro Gly Ser Pro Gly Pro Gln Gly Phe
            180                 185                 190         
Gln Gly Pro Pro Gly Glu Pro Gly Glu Pro Gly Ala Ser Gly Pro Met
        195                 200                 205             
Gly Pro Arg Gly Pro Ala Gly Pro Pro Gly Lys Asn Gly Asp Asp Gly
    210                 215                 220                 
Glu Ala Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Pro Pro Gly Pro
225                 230                 235                 240 
Gln Gly Ala Arg Gly Leu Pro Gly Thr Ala Gly Leu Pro Gly Met Lys
                245                 250                 255     
Gly His Arg Gly Phe Ser Gly Leu Asp Gly Ala Lys Gly Asp Ala Gly
            260                 265                 270         
Pro Ser Gly Pro Lys Gly Glu Pro Gly Ser Pro Gly Glu Asn Gly Ala
        275                 280                 285             
Pro Gly Gln Met Gly Pro Arg Gly Leu Pro Gly Glu Arg Gly Arg Pro
    290                 295                 300                 
Gly Pro Ser Gly Pro Ala Gly Ala Arg Gly Asn Asp Gly Ser Pro Gly
305                 310                 315                 320 
Ala Ala Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Pro Pro Gly Phe
                325                 330                 335     
Pro Gly Ala Ala Gly Ala Lys Gly Glu Thr Gly Pro Gln Gly Ser Arg
            340                 345                 350         
Gly Ser Glu Gly Pro Gln Gly Ala Arg Gly Glu Pro Gly Pro Pro Gly
        355                 360                 365             
Pro Ala Gly Ala Ala Gly Pro Ala Gly Asn Pro Gly Ser Asp Gly Gln
    370                 375                 380                 
Ala Gly Ala Lys Gly Ala Thr Gly Ala Pro Gly Ile Ala Gly Ala Pro
385                 390                 395                 400 
Gly Phe Pro Gly Ala Arg Gly Pro Ser Gly Pro Gln Gly Pro Ser Gly
                405                 410                 415     
Ala Pro Gly Pro Lys Gly Asn Ser Gly Glu Pro Gly Ala Gln Gly Asn
            420                 425                 430         
Lys Gly Asp Thr Gly Ala Lys Gly Glu Pro Gly Pro Ala Gly Val Gln
        435                 440                 445             
Gly Pro Pro Gly Pro Ala Gly Glu Glu Gly Lys Arg Gly Ala Arg Gly
    450                 455                 460                 
Glu Pro Gly Pro Gly Gly Leu Pro Gly Pro Ala Gly Glu Arg Gly Ala
465                 470                 475                 480 
Pro Gly Ser Arg Gly Phe Pro Gly Ala Asp Gly Ile Ser Gly Pro Lys
                485                 490                 495     
Gly Pro Pro Gly Glu Arg Gly Ser Pro Gly Pro Ala Gly Pro Lys Gly
            500                 505                 510         
Ser Thr Gly Glu Ser Gly Arg Pro Gly Glu Pro Gly Leu Pro Gly Ala
        515                 520                 525             
Lys Gly Leu Thr Gly Ser Pro Gly Ser Pro Gly Pro Asp Gly Lys Thr
    530                 535                 540                 
Gly Pro Pro Gly Pro Ala Gly Gln Asp Gly Arg Pro Gly Pro Pro Gly
545                 550                 555                 560 
Pro Pro Gly Ala Arg Gly Gln Ala Gly Val Met Gly Phe Pro Gly Pro
                565                 570                 575     
Lys Gly Ala Ala Gly Glu Pro Gly Lys Pro Gly Glu Arg Gly Ala Pro
            580                 585                 590         
Gly Pro Pro Gly Ala Val Gly Ala Ala Gly Lys Asp Gly Glu Ala Gly
        595                 600                 605             
Ala Gln Gly Ser Pro Gly Ala Ala Gly Pro Ala Gly Glu Arg Gly Glu
    610                 615                 620                 
Gln Gly Pro Ala Gly Ala Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala
625                 630                 635                 640 
Gly Pro Ser Gly Glu Ser Gly Lys Pro Gly Glu Gln Gly Val Pro Gly
                645                 650                 655     
Asp Ala Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Glu Arg Gly Phe
            660                 665                 670         
Pro Gly Glu Arg Gly Val Gln Gly Gln Pro Gly Pro Gln Gly Pro Arg
        675                 680                 685             
Gly Ala Asn Gly Ala Pro Gly Asn Asp Gly Ala Lys Gly Asp Ala Gly
    690                 695                 700                 
Ala Pro Gly Ala Pro Gly Gly Gln Gly Pro Pro Gly Leu Gln Gly Met
705                 710                 715                 720 
Pro Gly Glu Arg Gly Ala Ala Gly Leu Pro Gly Ser Lys Gly Asp Arg
                725                 730                 735     
Gly Asp Pro Gly Pro Lys Gly Thr Asp Gly Ala Pro Gly Lys Asp Gly
            740                 745                 750         
Val Arg Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Ala
        755                 760                 765             
Pro Gly Asp Lys Gly Glu Ala Gly Pro Ser Gly Pro Ala Gly Pro Thr
    770                 775                 780                 
Gly Ser Arg Gly Ala Pro Gly Asp Arg Gly Glu Pro Gly Pro Pro Gly
785                 790                 795                 800 
Pro Ala Gly Phe Ala Gly Pro Pro Gly Ala Asp Gly Gln Pro Gly Ala
                805                 810                 815     
Lys Gly Glu Ser Gly Asp Ala Gly Ala Lys Gly Asp Ala Gly Pro Pro
            820                 825                 830         
Gly Pro Ala Gly Pro Thr Gly Ala Pro Gly Pro Ser Gly Ala Val Gly
        835                 840                 845             
Ala Pro Gly Pro Lys Gly Ala Arg Gly Ser Ala Gly Pro Pro Gly Ala
    850                 855                 860                 
Thr Gly Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ala
865                 870                 875                 880 
Gly Asn Val Gly Leu Pro Gly Pro Ser Gly Pro Ser Gly Lys Glu Gly
                885                 890                 895     
Ser Lys Gly Pro Arg Gly Glu Thr Gly Pro Ala Gly Arg Pro Gly Glu
            900                 905                 910         
Pro Gly Pro Ala Gly Pro Pro Gly Pro Ser Gly Glu Lys Gly Ser Pro
        915                 920                 925             
Gly Gly Asp Gly Pro Ala Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
    930                 935                 940                 
Ile Ala Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg Gly Glu
945                 950                 955                 960 
Arg Gly Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly Lys Gln
                965                 970                 975     
Gly Pro Ser Gly Ser Ser Gly Glu Arg Gly Pro Pro Gly Pro Met Gly
            980                 985                 990         
Pro Pro Gly Leu Ala Gly Pro Pro Gly Glu Ala Gly Arg Glu Gly Ala
        995                 1000                1005            
Pro Gly Ser Glu Gly Ala Pro Gly Arg Asp Gly Ala Ala Gly Pro Lys
    1010                1015                1020                
Gly Asp Arg Gly Glu Thr Gly Pro Ser Gly Pro Pro Gly Ala Pro Gly
1025                1030                1035               1040 
Ala Pro Gly Ala Pro Gly Pro Ile Gly Pro Ala Gly Lys Asn Gly Asp
                1045                1050                1055    
Arg Gly Glu Thr Gly Pro Ser Gly Pro Ala Gly Pro Ala Gly Pro Ala
            1060                1065                1070        
Gly Ala Arg Gly Pro Ala Gly Pro Gln Gly Ala Arg Gly Asp Lys Gly
        1075                1080                1085            
Glu Thr Gly Glu His Gly Asp Arg Gly Met Lys Gly His Arg Gly Phe
    1090                1095                1100                
Pro Gly Pro Gln Gly Pro Ser Gly Pro Ala Gly Ser Pro Gly Glu Gln
1105                1110                1115               1120 
Gly Pro Ser Gly Ala Ser Gly Pro Ala Gly Pro Arg Gly Pro Pro Gly
                1125                1130                1135    
Ser Ala Gly Thr Pro Gly Lys Asp Gly Leu Asn Gly Leu Pro Gly Pro
            1140                1145                1150        
Ile Gly Pro Pro Gly Pro Arg Gly Arg Thr Gly Asp Val Gly Pro Ala
        1155                1160                1165            
Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Ser
    1170                1175                1180                
Gly Gly Phe Asp Phe Ser Phe Met Pro Gln Pro Pro Gln Glu Lys Ala
1185                1190                1195               1200 
His Asp Pro Gly Arg Tyr Tyr Arg Ala Asp Asp Ala Asn Val Met Arg
                1205                1210                1215    
Asp Arg Asp Leu Glu Val Asp Thr Thr Leu Lys Ser Leu Ser Gln Gln
            1220                1225                1230        
Ile Glu Asn Ile Arg Ser Pro Glu Gly Thr Arg Lys Asn Pro Ala Arg
        1235                1240                1245            
Thr Cys Arg Asp Leu Lys Met Cys His Asn Asp Trp Lys Ser Gly Glu
    1250                1255                1260                
Tyr Trp Ile Asp Pro Asn Gln Gly Cys Asn Leu Asp Ala Ile Lys Val
1265                1270                1275               1280 
Tyr Cys Asn Met Glu Thr Gly Glu Thr Cys Val His Pro Thr Gln Ala
                1285                1290                1295    
Thr Ile Ala Gln Lys Asn Trp Tyr Met Ser Lys Asn Pro Lys Glu Lys
            1300                1305                1310        
Lys His Ile Trp Phe Gly Glu Thr Met Ser Asp Gly Phe Gln Phe Glu
        1315                1320                1325            
Tyr Gly Gly Glu Gly Ser Asn Pro Ala Asp Val Ala Ile Gln Leu Thr
    1330                1335                1340                
Phe Leu Arg Leu Met Ser Thr Glu Ala Ser Gln Asn Ile Thr Tyr His
1345                1350                1355               1360 
Cys Lys Asn Ser Val Ala Tyr Met Asp Gln Glu Thr Gly Asn Leu Lys
                1365                1370                1375    
Lys Ala Leu Leu Leu Gln Gly Ser Asn Glu Ile Glu Ile Arg Ala Glu
            1380                1385                1390        
Gly Asn Ser Arg Phe Thr Tyr Gly Val Thr Glu Asp Gly Cys Thr Thr
        1395                1400                1405            
His Thr Gly Ala Trp Gly Lys Thr Val Ile Glu Tyr Lys Thr Thr Lys
    1410                1415                1420                
Thr Ser Arg Leu Pro Val Ile Asp Val Ala Pro Met Asp Val Gly Ala
1425                1430                1435               1440 
Gln Asp Gln Glu Phe Gly Ile Val Ile Gly Pro Val Cys Phe Leu
                1445                1450                1455


<210> 21
<211> 4092
<212> DNA
<213> Artificial Sequence

<220> 
<223> Amis COL1A2

<400> 21
atgctcagct ttgtggatac acggattttg ttgctgctcg cagtaacttc gtacctagca 60
acatgtcaac aagcaaatga ggcaactgca ggacggaagg gcccaagagg agacaaaggg 120
ccacagggag aaaggggtcc accaggtcca ccaggcagag atggtgaaga tggtccacca 180
gggcctccag ggccccctgg tcctccaggt cttggcggaa actttgctgc tcagtatgac 240
ggagcaaaag caggtgacta tggctcagga ccaatgggtt taatgggacc cagaggccca 300
cctggaacaa gtggacctcc tggtcctcct ggcttccaag gacctcatgg tgagcctggt 360
gaacctggtc aaacaggtcc ccagggtccc cgtggtccat ctggtcctcc tggaaaggct 420
ggtgaagatg gccatcctgg aaaatctgga cgatctggtg agaggggcgt ctctggtcct 480
cagggtgctc gtggtttccc tggaactcct ggtctgcctg gctttaaggg aattagagga 540
cacaatggtc tggatggtca gaagggacaa cctggtactc caggcattaa gggtgaatcc 600
ggtgcccctg gtgaaaatgg taccccagga caatctggtg ctcgtggcct tcccggtgaa 660
agaggaagaa ttggtgcacc tggcccagct ggtgcccgtg gcagcgatgg tagcactggt 720
cccactggtc ctgctggccc tatcggttct gctggtgctc caggtttccc aggtgctcct 780
ggagccaagg gtgaaattgg agctgctggt aatgtaggtc cttctggccc tgctggtcca 840
cgaggagagg ctggacttcc tggttcttct ggtcccgttg gccctcctgg aaaccctggt 900
tctaatggtc ttgctggtgc taaaggtgca actggtcttc ctggtgttgc tggtgctcct 960
ggcttgcctg gtccacgtgg tattcctgga ccttctggcc ctgccggagc tgctggcacc 1020
agaggtcttg ttggtgaacc aggccctgct ggtgccaagg gagaaagtgg taacaagggt 1080
gaacccggtg ctgctggtcc atcaggtccc gctggtccaa gtggtgaaga aggcaagaaa 1140
ggtactactg gtgaacctgg ctcttctggc ccccctggtc cagctggtct aagaggcgtt 1200
cctggatctc gtggtctccc tggagctgac ggcagagctg gtgttatggg acctgctggc 1260
agccgtggtg ctactggtcc tgctggtgct aaaggtccta gtggtgataa tggtcgccct 1320
ggtgagcctg gccttatggg tccaagaggt ctccctggtc aacctggaag ctcaggccct 1380
gctggcaagg aaggtcctgt tggtttccct ggtgcagatg gtagagttgg cccaactggt 1440
ccagctggtg caagaggtga gcctggcaac attggattcc ctggacccaa aggccccact 1500
ggtgaccctg gcaaacctgg tgacagaggc catgctggtc ttgctggtgc tcggggtgcg 1560
cctggtcctg agggcaacaa tggggctcaa ggtcctcctg gtgttgctgg caaccctggt 1620
gcaaaaggtg aacaaggtcc agctggtcct cccggtttcc agggtctccc aggcccctca 1680
ggtccagctg gtgaagctgg caaaccaggt gaaaggggta tggctggtga atttggtgcc 1740
cctggccctg cgggttcaag aggtgaacgt ggtcctccag gcgaaagtgg tgctgttggt 1800
cctgtaggtc ccattggaag ccgtggtcca tctggtccac caggcactga tggcaacaag 1860
ggtgaacctg gtaatgttgg taatgctggt actgcaggcc cctctggcgc tggtggagcc 1920
ccaggagaga gaggcattgc tggtattcca ggacccaagg gtgaaaaggg tgctacaggt 1980
ctgagagggg atactggcgc aacaggaaga gatggtgctc gtggtgctcc tggtgctatt 2040
ggagcccctg gccccgctgg tggagctggt gagcggggtg aaggtggtcc tgctggtgct 2100
gctggccctt ctggtgcccg tggtattcct ggtgaacgtg gtgagcctgg tcctgctggc 2160
cctactggat ttgctggacc tgctggtgca gctggccaac ctggtgctaa aggtgaacga 2220
ggtacaaaag gacccaaggg tgaaaatggt ccacaaggtg ctgttggccc agttggttct 2280
tctggaccat caggtcctgt tggtgcctct ggtcctgctg gtcctcgtgg tgatggtggt 2340
cctcctggtg tcactggttt ccctggagct gctggcagaa ctggtcctcc cggcccctct 2400
ggtatcactg gcccccctgg tccccctggc tcagctggca aagatggtat gagaggccca 2460
cgtggtgata ctggtccagt tggccgcact ggagaacaag gcattgttgg cccacctggc 2520
ttcagtggtg agaaaggtcc atctggagag cctggtgctg ctggtccccc tggtacccca 2580
ggtcctcagg gtattcttgg tgctcctggt atccttggtc tgcctggctc tcggggagaa 2640
cgtggtcttc caggcatctc tggagcaaca ggtgaaccag gtcctcttgg tatttccggt 2700
cctcctggtg cacgtggtcc ctctggcccc gtgggttctg ctggtctgaa tggtgcccct 2760
ggtgaagctg gccgtgatgg caatcctggc catgatggtg ctccaggccg tgatggtgct 2820
cctggtttca agggtgagcg tggtgctcct gggaacaatg gacctgctgg tgctgttggt 2880
gctcctggcg cccatggtca agttggtcct gctggaaagc ctggaaatcg tggtgatcct 2940
ggtcctgttg gtccttctgg tcctgctggt gcttttggtg caaggggtcc ttctggccca 3000
caaggtgcac gtggtgagaa gggagaaaca ggtgaaaagg gacacagagg tatgcctgga 3060
tttaaggggc acaatggact tcagggtctg cctggtcttg ctggccaaca tggagatcaa 3120
ggtcctccag gttctactgg ccccgctggc ccaaggggtc cctctggtcc ttctggtcct 3180
gctggaaaag atggtcgcaa tggactccct ggccctattg gacctgctgg tgtgcgtggt 3240
tctcagggta gccaaggtcc ttcgggtcca cctggcccac ctggtctccc tggtccccct 3300
ggtgcaaatg gtggtggata cgaagttggc tatgatcttg aatactaccg ggctgatcag 3360
cctgctctca gacctaagga ctatgaagtt gatgccactc tgaaaacatt gaacaaccaa 3420
attgagaccc tcctgacccc agaaggctcc aggaagaacc cagctcgcac ctgccgtgac 3480
ctgagactca gccacccaga atggaccagt ggtttctact ggattgatcc caaccagggc 3540
tgtactatgg atgccattag agtgtattgt gacttctcca ctggtgagac ttgcatacat 3600
gccaatctag aaaacatccc cactaagaac tggtatgtca gcaagaactc caaggaaaag 3660
aagcacatgt ggtttggtga aactatcaat ggtggtaccc agtttgaata taacgatgaa 3720
ggagtgactt ccaaggacat ggctacccaa cttgccttca tgcgtctgct ggccaaccat 3780
gcctcccaga acatcaccta ccactgcaag aacagtattg catacatgga tgaagaaact 3840
ggcaacctta agaaggctgt aatactgcag ggatccaatg atgttgaact acgagctgaa 3900
ggcaacagca gattcacttt cagtgttctg gaagatggct gctctagaaa gaacaacgca 3960
tggggcaaaa caatcattga atatagaaca aacaaaccat ctcgcttgcc catccttgac 4020
attgcacctt tggacattgg tggagctgat caagaattcg gtttggacat tggcccagtc 4080
tgtttcaaat ga                                                     4092

<210> 22
<211> 1363
<212> PRT
<213> Artificial Sequence

<220> 
<223> Amis COL1A2

<400> 22
Met Leu Ser Phe Val Asp Thr Arg Ile Leu Leu Leu Leu Ala Val Thr
 1               5                  10                  15      
Ser Tyr Leu Ala Thr Cys Gln Gln Ala Asn Glu Ala Thr Ala Gly Arg
            20                  25                  30          
Lys Gly Pro Arg Gly Asp Lys Gly Pro Gln Gly Glu Arg Gly Pro Pro
        35                  40                  45              
Gly Pro Pro Gly Arg Asp Gly Glu Asp Gly Pro Pro Gly Pro Pro Gly
    50                  55                  60                  
Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala Ala Gln Tyr Asp
65                  70                  75                  80  
Gly Ala Lys Ala Gly Asp Tyr Gly Ser Gly Pro Met Gly Leu Met Gly
                85                  90                  95      
Pro Arg Gly Pro Pro Gly Thr Ser Gly Pro Pro Gly Pro Pro Gly Phe
            100                 105                 110         
Gln Gly Pro His Gly Glu Pro Gly Glu Pro Gly Gln Thr Gly Pro Gln
        115                 120                 125             
Gly Pro Arg Gly Pro Ser Gly Pro Pro Gly Lys Ala Gly Glu Asp Gly
    130                 135                 140                 
His Pro Gly Lys Ser Gly Arg Ser Gly Glu Arg Gly Val Ser Gly Pro
145                 150                 155                 160 
Gln Gly Ala Arg Gly Phe Pro Gly Thr Pro Gly Leu Pro Gly Phe Lys
                165                 170                 175     
Gly Ile Arg Gly His Asn Gly Leu Asp Gly Gln Lys Gly Gln Pro Gly
            180                 185                 190         
Thr Pro Gly Ile Lys Gly Glu Ser Gly Ala Pro Gly Glu Asn Gly Thr
        195                 200                 205             
Pro Gly Gln Ser Gly Ala Arg Gly Leu Pro Gly Glu Arg Gly Arg Ile
    210                 215                 220                 
Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Ser Asp Gly Ser Thr Gly
225                 230                 235                 240 
Pro Thr Gly Pro Ala Gly Pro Ile Gly Ser Ala Gly Ala Pro Gly Phe
                245                 250                 255     
Pro Gly Ala Pro Gly Ala Lys Gly Glu Ile Gly Ala Ala Gly Asn Val
            260                 265                 270         
Gly Pro Ser Gly Pro Ala Gly Pro Arg Gly Glu Ala Gly Leu Pro Gly
        275                 280                 285             
Ser Ser Gly Pro Val Gly Pro Pro Gly Asn Pro Gly Ser Asn Gly Leu
    290                 295                 300                 
Ala Gly Ala Lys Gly Ala Thr Gly Leu Pro Gly Val Ala Gly Ala Pro
305                 310                 315                 320 
Gly Leu Pro Gly Pro Arg Gly Ile Pro Gly Pro Ser Gly Pro Ala Gly
                325                 330                 335     
Ala Ala Gly Thr Arg Gly Leu Val Gly Glu Pro Gly Pro Ala Gly Ala
            340                 345                 350         
Lys Gly Glu Ser Gly Asn Lys Gly Glu Pro Gly Ala Ala Gly Pro Ser
        355                 360                 365             
Gly Pro Ala Gly Pro Ser Gly Glu Glu Gly Lys Lys Gly Thr Thr Gly
    370                 375                 380                 
Glu Pro Gly Ser Ser Gly Pro Pro Gly Pro Ala Gly Leu Arg Gly Val
385                 390                 395                 400 
Pro Gly Ser Arg Gly Leu Pro Gly Ala Asp Gly Arg Ala Gly Val Met
                405                 410                 415     
Gly Pro Ala Gly Ser Arg Gly Ala Thr Gly Pro Ala Gly Ala Lys Gly
            420                 425                 430         
Pro Ser Gly Asp Asn Gly Arg Pro Gly Glu Pro Gly Leu Met Gly Pro
        435                 440                 445             
Arg Gly Leu Pro Gly Gln Pro Gly Ser Ser Gly Pro Ala Gly Lys Glu
    450                 455                 460                 
Gly Pro Val Gly Phe Pro Gly Ala Asp Gly Arg Val Gly Pro Thr Gly
465                 470                 475                 480 
Pro Ala Gly Ala Arg Gly Glu Pro Gly Asn Ile Gly Phe Pro Gly Pro
                485                 490                 495     
Lys Gly Pro Thr Gly Asp Pro Gly Lys Pro Gly Asp Arg Gly His Ala
            500                 505                 510         
Gly Leu Ala Gly Ala Arg Gly Ala Pro Gly Pro Glu Gly Asn Asn Gly
        515                 520                 525             
Ala Gln Gly Pro Pro Gly Val Ala Gly Asn Pro Gly Ala Lys Gly Glu
    530                 535                 540                 
Gln Gly Pro Ala Gly Pro Pro Gly Phe Gln Gly Leu Pro Gly Pro Ser
545                 550                 555                 560 
Gly Pro Ala Gly Glu Ala Gly Lys Pro Gly Glu Arg Gly Met Ala Gly
                565                 570                 575     
Glu Phe Gly Ala Pro Gly Pro Ala Gly Ser Arg Gly Glu Arg Gly Pro
            580                 585                 590         
Pro Gly Glu Ser Gly Ala Val Gly Pro Val Gly Pro Ile Gly Ser Arg
        595                 600                 605             
Gly Pro Ser Gly Pro Pro Gly Thr Asp Gly Asn Lys Gly Glu Pro Gly
    610                 615                 620                 
Asn Val Gly Asn Ala Gly Thr Ala Gly Pro Ser Gly Ala Gly Gly Ala
625                 630                 635                 640 
Pro Gly Glu Arg Gly Ile Ala Gly Ile Pro Gly Pro Lys Gly Glu Lys
                645                 650                 655     
Gly Ala Thr Gly Leu Arg Gly Asp Thr Gly Ala Thr Gly Arg Asp Gly
            660                 665                 670         
Ala Arg Gly Ala Pro Gly Ala Ile Gly Ala Pro Gly Pro Ala Gly Gly
        675                 680                 685             
Ala Gly Glu Arg Gly Glu Gly Gly Pro Ala Gly Ala Ala Gly Pro Ser
    690                 695                 700                 
Gly Ala Arg Gly Ile Pro Gly Glu Arg Gly Glu Pro Gly Pro Ala Gly
705                 710                 715                 720 
Pro Thr Gly Phe Ala Gly Pro Ala Gly Ala Ala Gly Gln Pro Gly Ala
                725                 730                 735     
Lys Gly Glu Arg Gly Thr Lys Gly Pro Lys Gly Glu Asn Gly Pro Gln
            740                 745                 750         
Gly Ala Val Gly Pro Val Gly Ser Ser Gly Pro Ser Gly Pro Val Gly
        755                 760                 765             
Ala Ser Gly Pro Ala Gly Pro Arg Gly Asp Gly Gly Pro Pro Gly Val
    770                 775                 780                 
Thr Gly Phe Pro Gly Ala Ala Gly Arg Thr Gly Pro Pro Gly Pro Ser
785                 790                 795                 800 
Gly Ile Thr Gly Pro Pro Gly Pro Pro Gly Ser Ala Gly Lys Asp Gly
                805                 810                 815     
Met Arg Gly Pro Arg Gly Asp Thr Gly Pro Val Gly Arg Thr Gly Glu
            820                 825                 830         
Gln Gly Ile Val Gly Pro Pro Gly Phe Ser Gly Glu Lys Gly Pro Ser
        835                 840                 845             
Gly Glu Pro Gly Ala Ala Gly Pro Pro Gly Thr Pro Gly Pro Gln Gly
    850                 855                 860                 
Ile Leu Gly Ala Pro Gly Ile Leu Gly Leu Pro Gly Ser Arg Gly Glu
865                 870                 875                 880 
Arg Gly Leu Pro Gly Ile Ser Gly Ala Thr Gly Glu Pro Gly Pro Leu
                885                 890                 895     
Gly Ile Ser Gly Pro Pro Gly Ala Arg Gly Pro Ser Gly Pro Val Gly
            900                 905                 910         
Ser Ala Gly Leu Asn Gly Ala Pro Gly Glu Ala Gly Arg Asp Gly Asn
        915                 920                 925             
Pro Gly His Asp Gly Ala Pro Gly Arg Asp Gly Ala Pro Gly Phe Lys
    930                 935                 940                 
Gly Glu Arg Gly Ala Pro Gly Asn Asn Gly Pro Ala Gly Ala Val Gly
945                 950                 955                 960 
Ala Pro Gly Ala His Gly Gln Val Gly Pro Ala Gly Lys Pro Gly Asn
                965                 970                 975     
Arg Gly Asp Pro Gly Pro Val Gly Pro Ser Gly Pro Ala Gly Ala Phe
            980                 985                 990         
Gly Ala Arg Gly Pro Ser Gly Pro Gln Gly Ala Arg Gly Glu Lys Gly
        995                 1000                1005            
Glu Thr Gly Glu Lys Gly His Arg Gly Met Pro Gly Phe Lys Gly His
    1010                1015                1020                
Asn Gly Leu Gln Gly Leu Pro Gly Leu Ala Gly Gln His Gly Asp Gln
1025                1030                1035               1040 
Gly Pro Pro Gly Ser Thr Gly Pro Ala Gly Pro Arg Gly Pro Ser Gly
                1045                1050                1055    
Pro Ser Gly Pro Ala Gly Lys Asp Gly Arg Asn Gly Leu Pro Gly Pro
            1060                1065                1070        
Ile Gly Pro Ala Gly Val Arg Gly Ser Gln Gly Ser Gln Gly Pro Ser
        1075                1080                1085            
Gly Pro Pro Gly Pro Pro Gly Leu Pro Gly Pro Pro Gly Ala Asn Gly
    1090                1095                1100                
Gly Gly Tyr Glu Val Gly Tyr Asp Leu Glu Tyr Tyr Arg Ala Asp Gln
1105                1110                1115               1120 
Pro Ala Leu Arg Pro Lys Asp Tyr Glu Val Asp Ala Thr Leu Lys Thr
                1125                1130                1135    
Leu Asn Asn Gln Ile Glu Thr Leu Leu Thr Pro Glu Gly Ser Arg Lys
            1140                1145                1150        
Asn Pro Ala Arg Thr Cys Arg Asp Leu Arg Leu Ser His Pro Glu Trp
        1155                1160                1165            
Thr Ser Gly Phe Tyr Trp Ile Asp Pro Asn Gln Gly Cys Thr Met Asp
    1170                1175                1180                
Ala Ile Arg Val Tyr Cys Asp Phe Ser Thr Gly Glu Thr Cys Ile His
1185                1190                1195               1200 
Ala Asn Leu Glu Asn Ile Pro Thr Lys Asn Trp Tyr Val Ser Lys Asn
                1205                1210                1215    
Ser Lys Glu Lys Lys His Met Trp Phe Gly Glu Thr Ile Asn Gly Gly
            1220                1225                1230        
Thr Gln Phe Glu Tyr Asn Asp Glu Gly Val Thr Ser Lys Asp Met Ala
        1235                1240                1245            
Thr Gln Leu Ala Phe Met Arg Leu Leu Ala Asn His Ala Ser Gln Asn
    1250                1255                1260                
Ile Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Glu Glu Thr
1265                1270                1275               1280 
Gly Asn Leu Lys Lys Ala Val Ile Leu Gln Gly Ser Asn Asp Val Glu
                1285                1290                1295    
Leu Arg Ala Glu Gly Asn Ser Arg Phe Thr Phe Ser Val Leu Glu Asp
            1300                1305                1310        
Gly Cys Ser Arg Lys Asn Asn Ala Trp Gly Lys Thr Ile Ile Glu Tyr
        1315                1320                1325            
Arg Thr Asn Lys Pro Ser Arg Leu Pro Ile Leu Asp Ile Ala Pro Leu
    1330                1335                1340                
Asp Ile Gly Gly Ala Asp Gln Glu Phe Gly Leu Asp Ile Gly Pro Val
1345                1350                1355               1360 
Cys Phe Lys
            


<210> 23
<211> 90
<212> DNA
<213> Artificial Sequence

<220> 
<223> COLsyn1a

<400> 23
ggtcctaagg gtccaaaggg ccctaaggga cccaaaggtc cacctggccc tccaggcgat 60
ccaggtgacc ctggcgaccc cggagatcca                                  90

<210> 24
<211> 30
<212> PRT
<213> Artificial Sequence

<220> 
<223> COLsyn1a

<400> 24
Gly Pro Lys Gly Pro Lys Gly Pro Lys Gly Pro Lys Gly Pro Pro Gly
 1               5                  10                  15      
Pro Pro Gly Asp Pro Gly Asp Pro Gly Asp Pro Gly Asp Pro
            20                  25                  30  


<210> 25
<211> 136
<212> DNA
<213> Artificial Sequence

<220> 
<223> COLsyn2

<400> 25
gcatcgtctc atcggtctca ttctggtcct aaaggacccg acggaccaaa gggcccagac 60
ggaccccctg gtccaccagg tgaccccggc aagccaggag atcccggtaa accaatcctg 120
agacctgaga cggcat                                                 136

<210> 26
<211> 30
<212> PRT
<213> Artificial Sequence

<220> 
<223> COLsyn2

<400> 26
Gly Pro Lys Gly Pro Asp Gly Pro Lys Gly Pro Asp Gly Pro Pro Gly
 1               5                  10                  15      
Pro Pro Gly Asp Pro Gly Lys Pro Gly Asp Pro Gly Lys Pro
            20                  25                  30  


<210> 27
<211> 90
<212> DNA
<213> Artificial Sequence

<220> 
<223> COLsyn3

<400> 27
ggaccaaagg gacccaaagg accagacggc ccagatggcc ccccaggacc tcctggcgac 60
ccaggtgacc caggtaagcc tggcaagcct                                  90

<210> 28
<211> 30
<212> PRT
<213> Artificial Sequence

<220> 
<223> COLsyn3

<400> 28
Gly Pro Lys Gly Pro Lys Gly Pro Asp Gly Pro Asp Gly Pro Pro Gly
 1               5                  10                  15      
Pro Pro Gly Asp Pro Gly Asp Pro Gly Lys Pro Gly Lys Pro
            20                  25                  30  


<210> 29
<211> 90
<212> DNA
<213> Artificial Sequence

<220> 
<223> COLsyn4
<400> 29
ggtcctaaag gaccaaaggg tcccaagggc ccaaagggtc ctccaggagc tcctggacca 60
cctggccctc caggtgtccc aggtccacca                                  90

<210> 30
<211> 30
<212> PRT
<213> Artificial Sequence

<220> 
<223> COLsyn4

<400> 30
Gly Pro Lys Gly Pro Lys Gly Pro Lys Gly Pro Lys Gly Pro Pro Gly
 1               5                  10                  15      
Ala Pro Gly Pro Pro Gly Pro Pro Gly Val Pro Gly Pro Pro
            20                  25                  30  


<210> 31
<211> 90
<212> DNA
<213> Artificial Sequence

<220> 
<223> COLsyn5

<400> 31
ggtcctgacg gacctgatgg accagatggt cctgatggtc ctccaggagc tcctggacca 60
cctggccctc caggtgtccc aggtccacca                                  90

<210> 32
<211> 30
<212> PRT
<213> Artificial Sequence

<220> 
<223> COLsyn5

<400> 32
Gly Pro Asp Gly Pro Asp Gly Pro Asp Gly Pro Asp Gly Pro Pro Gly
 1               5                  10                  15      
Ala Pro Gly Pro Pro Gly Pro Pro Gly Val Pro Gly Pro Pro
            20                  25                  30  


<210> 33
<211> 90
<212> DNA
<213> Artificial Sequence

<220> 
<223> COLsyn6

<400> 33
ggtttagctg gtcccccagg tcctgcagga gctcccggtc ctccaggagc tcctggacca 60
cctggccctc caggtgtccc aggtccacca                                  90

<210> 34
<211> 30
<212> PRT
<213> Artificial Sequence

<220> 
<223> COLsyn6

<400> 34
Gly Leu Ala Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Pro Pro Gly
 1               5                  10                  15      
Ala Pro Gly Pro Pro Gly Pro Pro Gly Val Pro Gly Pro Pro
            20                  25                  30  


<210> 35
<211> 877
<212> DNA
<213> Artificial Sequence

<220> 
<223> GFP-COLsyn2-ePTS1

<400> 35
atgcgtaaag gcgaagagct gttcactggt gtcgtcccta ttctggtgga actggatggt 60
gatgtcaacg gtcataagtt ttccgtgcgt ggcgagggtg aaggtgacgc aactaatggt 120
aaactgacgc tgaagttcat ctgtactact ggtaaactgc cggttccttg gccgactctg 180
gtaacgacgc tgacttatgg tgttcagtgc tttgctcgtt atccggacca tatgaagcag 240
catgacttct tcaagtccgc catgccggaa ggctatgtgc aggaacgcac gatttccttt 300
aaggatgacg gcacgtacaa aacgcgtgcg gaagtgaaat ttgaaggcga taccctggta 360
aaccgcattg agctgaaagg cattgacttt aaagaggacg gcaatatcct gggccataag 420
ctggaataca attttaacag ccacaatgtt tacatcaccg ccgataaaca aaaaaatggc 480
attaaagcga attttaaaat tcgccacaac gtggaggatg gcagcgtgca gctggctgat 540
cactaccagc aaaacactcc aatcggtgat ggtcctgttc tgctgccaga caatcactat 600
ctgagcacgc aaagcgttct gtctaaagat ccgaacgaga aacgcgatca tatggttctg 660
ctggagttcg taaccgcagc gggcatcacg catggtatgg atgaactgta caaagcatcg 720
tctcatcggt ctcattctgg tcctaaagga cccgacggac caaagggccc agacggaccc 780
cctggtccac caggtgaccc cggcaagcca ggagatcccg gtaaaccaat cctgagacct 840
gagacggcat ttgggaagag gtagaagatc caaattg                          877

<210> 36
<211> 277
<212> PRT
<213> Artificial Sequence

<220> 
<223> GFP-COLsyn2-ePTS1

<400> 36
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Pro
225                 230                 235                 240 
Lys Gly Pro Asp Gly Pro Lys Gly Pro Asp Gly Pro Pro Gly Pro Pro
                245                 250                 255     
Gly Asp Pro Gly Lys Pro Gly Asp Pro Gly Lys Pro Leu Gly Arg Gly
            260                 265                 270         
Arg Arg Ser Lys Leu
        275         


<210> 37
<211> 831
<212> DNA
<213> Artificial Sequence

<220> 
<223> GFP-COLsyn3-ePTS1

<400> 37
atgcgtaaag gcgaagagct gttcactggt gtcgtcccta ttctggtgga actggatggt 60
gatgtcaacg gtcataagtt ttccgtgcgt ggcgagggtg aaggtgacgc aactaatggt 120
aaactgacgc tgaagttcat ctgtactact ggtaaactgc cggttccttg gccgactctg 180
gtaacgacgc tgacttatgg tgttcagtgc tttgctcgtt atccggacca tatgaagcag 240
catgacttct tcaagtccgc catgccggaa ggctatgtgc aggaacgcac gatttccttt 300
aaggatgacg gcacgtacaa aacgcgtgcg gaagtgaaat ttgaaggcga taccctggta 360
aaccgcattg agctgaaagg cattgacttt aaagaggacg gcaatatcct gggccataag 420
ctggaataca attttaacag ccacaatgtt tacatcaccg ccgataaaca aaaaaatggc 480
attaaagcga attttaaaat tcgccacaac gtggaggatg gcagcgtgca gctggctgat 540
cactaccagc aaaacactcc aatcggtgat ggtcctgttc tgctgccaga caatcactat 600
ctgagcacgc aaagcgttct gtctaaagat ccgaacgaga aacgcgatca tatggttctg 660
ctggagttcg taaccgcagc gggcatcacg catggtatgg atgaactgta caaaggacca 720
aagggaccca aaggaccaga cggcccagat ggccccccag gacctcctgg cgacccaggt 780
gacccaggta agcctggcaa gcctttggga agaggtagaa gatccaaatt g          831


<210> 38
<211> 277
<212> PRT
<213> Artificial Sequence

<220> 
<223> GFP-COLsyn3-ePTS1

<400> 38
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Pro
225                 230                 235                 240 
Lys Gly Pro Lys Gly Pro Asp Gly Pro Asp Gly Pro Pro Gly Pro Pro
                245                 250                 255     
Gly Asp Pro Gly Asp Pro Gly Lys Pro Gly Lys Pro Leu Gly Arg Gly
            260                 265                 270         
Arg Arg Ser Lys Leu
        275         


<210> 39
<211> 831
<212> DNA
<213> Artificial Sequence

<220> 
<223> GFP-COLsyn6-ePTS1

<400> 39
atgcgtaaag gcgaagagct gttcactggt gtcgtcccta ttctggtgga actggatggt 60
gatgtcaacg gtcataagtt ttccgtgcgt ggcgagggtg aaggtgacgc aactaatggt 120
aaactgacgc tgaagttcat ctgtactact ggtaaactgc cggttccttg gccgactctg 180
gtaacgacgc tgacttatgg tgttcagtgc tttgctcgtt atccggacca tatgaagcag 240
catgacttct tcaagtccgc catgccggaa ggctatgtgc aggaacgcac gatttccttt 300
aaggatgacg gcacgtacaa aacgcgtgcg gaagtgaaat ttgaaggcga taccctggta 360
aaccgcattg agctgaaagg cattgacttt aaagaggacg gcaatatcct gggccataag 420
ctggaataca attttaacag ccacaatgtt tacatcaccg ccgataaaca aaaaaatggc 480
attaaagcga attttaaaat tcgccacaac gtggaggatg gcagcgtgca gctggctgat 540
cactaccagc aaaacactcc aatcggtgat ggtcctgttc tgctgccaga caatcactat 600
ctgagcacgc aaagcgttct gtctaaagat ccgaacgaga aacgcgatca tatggttctg 660
ctggagttcg taaccgcagc gggcatcacg catggtatgg atgaactgta caaaggttta 720
gctggtcccc caggtcctgc aggagctccc ggtcctccag gagctcctgg accacctggc 780
cctccaggtg tcccaggtcc accattggga agaggtagaa gatccaaatt g          831

<210> 40
<211> 277
<212> PRT
<213> Artificial Sequence

<220> 
<223> GFP-COLsyn6-ePTS1

<400> 40
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Leu
225                 230                 235                 240 
Ala Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Pro Pro Gly Ala Pro
                245                 250                 255     
Gly Pro Pro Gly Pro Pro Gly Val Pro Gly Pro Pro Leu Gly Arg Gly
            260                 265                 270         
Arg Arg Ser Lys Leu
        275         


<210> 41
<211> 1605
<212> DNA
<213> Artificial Sequence

<220> 
<223> BtauP4HA1

<400> 41
atgatctggt atattttagt tgtagggatt ctacttcccc agtctttggc ccatccaggc 60
ttttttactt ctattggtca gatgactgat ttgattcata ctgaaaaaga tctggtgact 120
tccctgaaag actatataaa ggcagaagag gacaaattag aacaaataaa aaaatgggca 180
gagaaattag atcgattaac cagcacagcg acaaaagatc cagaaggatt tgttggacac 240
cctgtaaatg cattcaaatt aatgaaacgt ctgaacactg agtggagtga gttggagaat 300
ctggtcctta aggatatgtc agatggtttt atctctaacc taaccattca gagacagtac 360
ttccctaatg atgaagatca ggttggggca gccaaagctc tgttgcgtct acaggacacc 420
tacaatttgg atacagatac catctcaaag ggtgatcttc caggagtaaa acacaaatct 480
tttctaacag ttgaggactg ttttgagttg ggcaaagtgg cctacacaga agcagattat 540
taccatacag agctgtggat ggaacaagca ctgaggcagc tggatgaagg cgaggtttct 600
accgttgata aagtctctgt tctggattat ttgagctatg cagtatacca gcagggagac 660
ctggataagg cgcttttgct cacaaagaag cttcttgaac tagatcctga acatcagaga 720
gctaacggta acttaaaata ctttgagtat ataatggcta aagaaaaaga tgccaataag 780
tcttcttcag atgaccaatc tgatcagaaa accacactga agaagaaagg tgctgctgtg 840
gattacctgc cagagagaca gaagtacgaa atgctgtgcc gtggggaggg tatcaaaatg 900
actcctcgga gacagaaaaa actcttctgt cgctaccatg atggaaaccg gaatcctaaa 960
tttatcctgg ctccagccaa acaggaggat gagtgggaca agcctcgtat tatccgcttc 1020
catgatatta tttctgatgc agaaattgaa gtcgttaaag atctagcaaa accaaggctg 1080
aggcgagcca ccatttcaaa cccaataaca ggagacttgg agacggtaca ttacagaatt 1140
agcaaaagtg cctggctgtc tggctatgaa aaccctgtgg tgtcacgaat taatatgaga 1200
atccaagatc tgacaggact agatgtctcc acagcagagg aattacaggt agcaaattat 1260
ggagttggag gacagtatga accccatttt gattttgcac ggaaagatga gccagatgct 1320
ttcaaagagc tggggacagg aaatagaatt gctacatggc tgttttatat gagtgatgtg 1380
ttagcaggag gagccactgt ttttcctgaa gtaggagcta gtgtttggcc caaaaaggga 1440
actgctgttt tctggtataa tctgtttgcc agtggagaag gagattatag tacacggcat 1500
gcagcctgtc cagtgctggt tggaaacaaa tgggtatcca ataaatggct ccatgaacgt 1560
ggacaggaat ttcgaagacc atgcaccttg tcagaattgg aatga                 1605

<210> 42
<211> 534
<212> PRT
<213> Artificial Sequence

<220> 
<223> BtauP4HA1

<400> 42
Met Ile Trp Tyr Ile Leu Val Val Gly Ile Leu Leu Pro Gln Ser Leu
 1               5                  10                  15      
Ala His Pro Gly Phe Phe Thr Ser Ile Gly Gln Met Thr Asp Leu Ile
            20                  25                  30          
His Thr Glu Lys Asp Leu Val Thr Ser Leu Lys Asp Tyr Ile Lys Ala
        35                  40                  45              
Glu Glu Asp Lys Leu Glu Gln Ile Lys Lys Trp Ala Glu Lys Leu Asp
    50                  55                  60                  
Arg Leu Thr Ser Thr Ala Thr Lys Asp Pro Glu Gly Phe Val Gly His
65                  70                  75                  80  
Pro Val Asn Ala Phe Lys Leu Met Lys Arg Leu Asn Thr Glu Trp Ser
                85                  90                  95      
Glu Leu Glu Asn Leu Val Leu Lys Asp Met Ser Asp Gly Phe Ile Ser
            100                 105                 110         
Asn Leu Thr Ile Gln Arg Gln Tyr Phe Pro Asn Asp Glu Asp Gln Val
        115                 120                 125             
Gly Ala Ala Lys Ala Leu Leu Arg Leu Gln Asp Thr Tyr Asn Leu Asp
    130                 135                 140                 
Thr Asp Thr Ile Ser Lys Gly Asp Leu Pro Gly Val Lys His Lys Ser
145                 150                 155                 160 
Phe Leu Thr Val Glu Asp Cys Phe Glu Leu Gly Lys Val Ala Tyr Thr
                165                 170                 175     
Glu Ala Asp Tyr Tyr His Thr Glu Leu Trp Met Glu Gln Ala Leu Arg
            180                 185                 190         
Gln Leu Asp Glu Gly Glu Val Ser Thr Val Asp Lys Val Ser Val Leu
        195                 200                 205             
Asp Tyr Leu Ser Tyr Ala Val Tyr Gln Gln Gly Asp Leu Asp Lys Ala
    210                 215                 220                 
Leu Leu Leu Thr Lys Lys Leu Leu Glu Leu Asp Pro Glu His Gln Arg
225                 230                 235                 240 
Ala Asn Gly Asn Leu Lys Tyr Phe Glu Tyr Ile Met Ala Lys Glu Lys
                245                 250                 255     
Asp Ala Asn Lys Ser Ser Ser Asp Asp Gln Ser Asp Gln Lys Thr Thr
            260                 265                 270         
Leu Lys Lys Lys Gly Ala Ala Val Asp Tyr Leu Pro Glu Arg Gln Lys
        275                 280                 285             
Tyr Glu Met Leu Cys Arg Gly Glu Gly Ile Lys Met Thr Pro Arg Arg
    290                 295                 300                 
Gln Lys Lys Leu Phe Cys Arg Tyr His Asp Gly Asn Arg Asn Pro Lys
305                 310                 315                 320 
Phe Ile Leu Ala Pro Ala Lys Gln Glu Asp Glu Trp Asp Lys Pro Arg
                325                 330                 335     
Ile Ile Arg Phe His Asp Ile Ile Ser Asp Ala Glu Ile Glu Val Val
            340                 345                 350         
Lys Asp Leu Ala Lys Pro Arg Leu Arg Arg Ala Thr Ile Ser Asn Pro
        355                 360                 365             
Ile Thr Gly Asp Leu Glu Thr Val His Tyr Arg Ile Ser Lys Ser Ala
    370                 375                 380                 
Trp Leu Ser Gly Tyr Glu Asn Pro Val Val Ser Arg Ile Asn Met Arg
385                 390                 395                 400 
Ile Gln Asp Leu Thr Gly Leu Asp Val Ser Thr Ala Glu Glu Leu Gln
                405                 410                 415     
Val Ala Asn Tyr Gly Val Gly Gly Gln Tyr Glu Pro His Phe Asp Phe
            420                 425                 430         
Ala Arg Lys Asp Glu Pro Asp Ala Phe Lys Glu Leu Gly Thr Gly Asn
        435                 440                 445             
Arg Ile Ala Thr Trp Leu Phe Tyr Met Ser Asp Val Leu Ala Gly Gly
    450                 455                 460                 
Ala Thr Val Phe Pro Glu Val Gly Ala Ser Val Trp Pro Lys Lys Gly
465                 470                 475                 480 
Thr Ala Val Phe Trp Tyr Asn Leu Phe Ala Ser Gly Glu Gly Asp Tyr
                485                 490                 495     
Ser Thr Arg His Ala Ala Cys Pro Val Leu Val Gly Asn Lys Trp Val
            500                 505                 510         
Ser Asn Lys Trp Leu His Glu Arg Gly Gln Glu Phe Arg Arg Pro Cys
        515                 520                 525             
Thr Leu Ser Glu Leu Glu
    530                 


<210> 43
<211> 1533
<212> DNA
<213> Artificial Sequence

<220> 
<223> BtauP4HB

<400> 43
atgctgcgcc gcgctctgct ctgcctggcc ctgaccgcgc tattccgcgc gggtgccggc 60
gcccccgacg aggaggacca cgtcctggtg ctccataagg gcaacttcga cgaggcgctg 120
gcggcccaca agtacctgct ggtggagttc tacgccccat ggtgcggcca ctgcaaggct 180
ctggccccgg agtatgccaa agcagctggg aagctgaagg cagaaggttc tgagatcaga 240
ctggccaagg tggatgccac tgaagagtct gacctggccc agcagtatgg tgtccgaggc 300
taccccacca tcaagttctt caagaatgga gacacagctt cccccaaaga gtacacagct 360
ggccgagaag cggatgatat cgtgaactgg ctgaagaagc gcacgggccc cgctgccagc 420
acgctgtccg acggggctgc tgcagaggcc ttggtggagt ccagtgaggt ggccgtcatt 480
ggcttcttca aggacatgga gtcggactcc gcaaagcagt tcttcttggc agcagaggtc 540
attgatgaca tccccttcgg gatcacatct aacagcgatg tgttctccaa ataccagctg 600
gacaaggatg gggttgtcct ctttaagaag tttgacgaag gccggaacaa ctttgagggg 660
gaggtcacca aagaaaagct tctggacttc atcaagcaca accagttgcc cctggtcatt 720
gagttcaccg agcagacagc cccgaagatc ttcggagggg aaatcaagac tcacatcctg 780
ctgttcctgc cgaaaagcgt gtctgactat gagggcaagc tgagcaactt caaaaaagcg 840
gctgagagct tcaagggcaa gatcctgttt atcttcatcg acagcgacca cactgacaac 900
cagcgcatcc tggaattctt cggcctaaag aaagaggagt gcccggccgt gcgcctcatc 960
acgctggagg aggagatgac caaatataag ccagagtcag atgagctgac ggcagagaag 1020
atcaccgagt tctgccaccg cttcctggag ggcaagatta agccccacct gatgagccag 1080
gagctgcctg acgactggga caagcagcct gtcaaagtgc tggttgggaa gaactttgaa 1140
gaggttgctt ttgatgagaa aaagaacgtc tttgtagagt tctatgcccc gtggtgcggt 1200
cactgcaagc agctggcccc catctgggat aagctgggag agacgtacaa ggaccacgag 1260
aacatagtca tcgccaagat ggactccacg gccaacgagg tggaggcggt gaaagtgcac 1320
agcttcccca cgctcaagtt cttccccgcc agcgccgaca ggacggtcat cgactacaat 1380
ggggagcgga cactggatgg ttttaagaag ttcctggaga gtggtggcca ggatggggcc 1440
ggagatgatg acgatctaga agatcttgaa gaagcagaag agcctgatct ggaggaagat 1500
gatgatcaaa aagctgtgaa agatgaactg taa                              1533

<210> 44
<211> 510
<212> PRT
<213> Artificial Sequence

<220> 
<223> BtauP4HB

<400> 44
Met Leu Arg Arg Ala Leu Leu Cys Leu Ala Leu Thr Ala Leu Phe Arg
 1               5                  10                  15      
Ala Gly Ala Gly Ala Pro Asp Glu Glu Asp His Val Leu Val Leu His
            20                  25                  30          
Lys Gly Asn Phe Asp Glu Ala Leu Ala Ala His Lys Tyr Leu Leu Val
        35                  40                  45              
Glu Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu
    50                  55                  60                  
Tyr Ala Lys Ala Ala Gly Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg
65                  70                  75                  80  
Leu Ala Lys Val Asp Ala Thr Glu Glu Ser Asp Leu Ala Gln Gln Tyr
                85                  90                  95      
Gly Val Arg Gly Tyr Pro Thr Ile Lys Phe Phe Lys Asn Gly Asp Thr
            100                 105                 110         
Ala Ser Pro Lys Glu Tyr Thr Ala Gly Arg Glu Ala Asp Asp Ile Val
        115                 120                 125             
Asn Trp Leu Lys Lys Arg Thr Gly Pro Ala Ala Ser Thr Leu Ser Asp
    130                 135                 140                 
Gly Ala Ala Ala Glu Ala Leu Val Glu Ser Ser Glu Val Ala Val Ile
145                 150                 155                 160 
Gly Phe Phe Lys Asp Met Glu Ser Asp Ser Ala Lys Gln Phe Phe Leu
                165                 170                 175     
Ala Ala Glu Val Ile Asp Asp Ile Pro Phe Gly Ile Thr Ser Asn Ser
            180                 185                 190         
Asp Val Phe Ser Lys Tyr Gln Leu Asp Lys Asp Gly Val Val Leu Phe
        195                 200                 205             
Lys Lys Phe Asp Glu Gly Arg Asn Asn Phe Glu Gly Glu Val Thr Lys
    210                 215                 220                 
Glu Lys Leu Leu Asp Phe Ile Lys His Asn Gln Leu Pro Leu Val Ile
225                 230                 235                 240 
Glu Phe Thr Glu Gln Thr Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys
                245                 250                 255     
Thr His Ile Leu Leu Phe Leu Pro Lys Ser Val Ser Asp Tyr Glu Gly
            260                 265                 270         
Lys Leu Ser Asn Phe Lys Lys Ala Ala Glu Ser Phe Lys Gly Lys Ile
        275                 280                 285             
Leu Phe Ile Phe Ile Asp Ser Asp His Thr Asp Asn Gln Arg Ile Leu
    290                 295                 300                 
Glu Phe Phe Gly Leu Lys Lys Glu Glu Cys Pro Ala Val Arg Leu Ile
305                 310                 315                 320 
Thr Leu Glu Glu Glu Met Thr Lys Tyr Lys Pro Glu Ser Asp Glu Leu
                325                 330                 335     
Thr Ala Glu Lys Ile Thr Glu Phe Cys His Arg Phe Leu Glu Gly Lys
            340                 345                 350         
Ile Lys Pro His Leu Met Ser Gln Glu Leu Pro Asp Asp Trp Asp Lys
        355                 360                 365             
Gln Pro Val Lys Val Leu Val Gly Lys Asn Phe Glu Glu Val Ala Phe
    370                 375                 380                 
Asp Glu Lys Lys Asn Val Phe Val Glu Phe Tyr Ala Pro Trp Cys Gly
385                 390                 395                 400 
His Cys Lys Gln Leu Ala Pro Ile Trp Asp Lys Leu Gly Glu Thr Tyr
                405                 410                 415     
Lys Asp His Glu Asn Ile Val Ile Ala Lys Met Asp Ser Thr Ala Asn
            420                 425                 430         
Glu Val Glu Ala Val Lys Val His Ser Phe Pro Thr Leu Lys Phe Phe
        435                 440                 445             
Pro Ala Ser Ala Asp Arg Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr
    450                 455                 460                 
Leu Asp Gly Phe Lys Lys Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala
465                 470                 475                 480 
Gly Asp Asp Asp Asp Leu Glu Asp Leu Glu Glu Ala Glu Glu Pro Asp
                485                 490                 495     
Leu Glu Glu Asp Asp Asp Gln Lys Ala Val Lys Asp Glu Leu
            500                 505                 510 


<210> 45
<211> 1470
<212> DNA
<213> Artificial Sequence

<220> 
<223> BtP4HB

<400> 45
gcccccgacg aggaggacca cgtcctggtg ctccataagg gcaacttcga cgaggcgctg 60
gcggcccaca agtacctgct ggtggagttc tacgccccat ggtgcggcca ctgcaaggct 120
ctggccccgg agtatgccaa agcagctggg aagctgaagg cagaaggttc tgagatcaga 180
ctggccaagg tggatgccac tgaagagtct gacctggccc agcagtatgg tgtccgaggc 240
taccccacca tcaagttctt caagaatgga gacacagctt cccccaaaga gtacacagct 300
ggccgagaag cggatgatat cgtgaactgg ctgaagaagc gcacgggccc cgctgccagc 360
acgctgtccg acggggctgc tgcagaggcc ttggtggagt ccagtgaggt ggccgtcatt 420
ggcttcttca aggacatgga gtcggactcc gcaaagcagt tcttcttggc agcagaggtc 480
attgatgaca tccccttcgg gatcacatct aacagcgatg tgttctccaa ataccagctg 540
gacaaggatg gggttgtcct ctttaagaag tttgacgaag gccggaacaa ctttgagggg 600
gaggtcacca aagaaaagct tctggacttc atcaagcaca accagttgcc cctggtcatt 660
gagttcaccg agcagacagc cccgaagatc ttcggagggg aaatcaagac tcacatcctg 720
ctgttcctgc cgaaaagcgt gtctgactat gagggcaagc tgagcaactt caaaaaagcg 780
gctgagagct tcaagggcaa gatcctgttt atcttcatcg acagcgacca cactgacaac 840
cagcgcatcc tggaattctt cggcctaaag aaagaggagt gcccggccgt gcgcctcatc 900
acgctggagg aggagatgac caaatataag ccagagtcag atgagctgac ggcagagaag 960
atcaccgagt tctgccaccg cttcctggag ggcaagatta agccccacct gatgagccag 1020
gagctgcctg acgactggga caagcagcct gtcaaagtgc tggttgggaa gaactttgaa 1080
gaggttgctt ttgatgagaa aaagaacgtc tttgtagagt tctatgcccc gtggtgcggt 1140
cactgcaagc agctggcccc catctgggat aagctgggag agacgtacaa ggaccacgag 1200
aacatagtca tcgccaagat ggactccacg gccaacgagg tggaggcggt gaaagtgcac 1260
agcttcccca cgctcaagtt cttccccgcc agcgccgaca ggacggtcat cgactacaat 1320
ggggagcgga cactggatgg ttttaagaag ttcctggaga gtggtggcca ggatggggcc 1380
ggagatgatg acgatctaga agatcttgaa gaagcagaag agcctgatct ggaggaagat 1440
gatgatcaaa aagctgtgaa agatgaactg                                  1470

<210> 46
<211> 490
<212> PRT
<213> Artificial Sequence

<220> 
<223> BtP4HB

<400> 46
Ala Pro Asp Glu Glu Asp His Val Leu Val Leu His Lys Gly Asn Phe
 1               5                  10                  15      
Asp Glu Ala Leu Ala Ala His Lys Tyr Leu Leu Val Glu Phe Tyr Ala
            20                  25                  30          
Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu Tyr Ala Lys Ala
        35                  40                  45              
Ala Gly Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg Leu Ala Lys Val
    50                  55                  60                  
Asp Ala Thr Glu Glu Ser Asp Leu Ala Gln Gln Tyr Gly Val Arg Gly
65                  70                  75                  80  
Tyr Pro Thr Ile Lys Phe Phe Lys Asn Gly Asp Thr Ala Ser Pro Lys
                85                  90                  95      
Glu Tyr Thr Ala Gly Arg Glu Ala Asp Asp Ile Val Asn Trp Leu Lys
            100                 105                 110         
Lys Arg Thr Gly Pro Ala Ala Ser Thr Leu Ser Asp Gly Ala Ala Ala
        115                 120                 125             
Glu Ala Leu Val Glu Ser Ser Glu Val Ala Val Ile Gly Phe Phe Lys
    130                 135                 140                 
Asp Met Glu Ser Asp Ser Ala Lys Gln Phe Phe Leu Ala Ala Glu Val
145                 150                 155                 160 
Ile Asp Asp Ile Pro Phe Gly Ile Thr Ser Asn Ser Asp Val Phe Ser
                165                 170                 175     
Lys Tyr Gln Leu Asp Lys Asp Gly Val Val Leu Phe Lys Lys Phe Asp
            180                 185                 190         
Glu Gly Arg Asn Asn Phe Glu Gly Glu Val Thr Lys Glu Lys Leu Leu
        195                 200                 205             
Asp Phe Ile Lys His Asn Gln Leu Pro Leu Val Ile Glu Phe Thr Glu
    210                 215                 220                 
Gln Thr Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys Thr His Ile Leu
225                 230                 235                 240 
Leu Phe Leu Pro Lys Ser Val Ser Asp Tyr Glu Gly Lys Leu Ser Asn
                245                 250                 255     
Phe Lys Lys Ala Ala Glu Ser Phe Lys Gly Lys Ile Leu Phe Ile Phe
            260                 265                 270         
Ile Asp Ser Asp His Thr Asp Asn Gln Arg Ile Leu Glu Phe Phe Gly
        275                 280                 285             
Leu Lys Lys Glu Glu Cys Pro Ala Val Arg Leu Ile Thr Leu Glu Glu
    290                 295                 300                 
Glu Met Thr Lys Tyr Lys Pro Glu Ser Asp Glu Leu Thr Ala Glu Lys
305                 310                 315                 320 
Ile Thr Glu Phe Cys His Arg Phe Leu Glu Gly Lys Ile Lys Pro His
                325                 330                 335     
Leu Met Ser Gln Glu Leu Pro Asp Asp Trp Asp Lys Gln Pro Val Lys
            340                 345                 350         
Val Leu Val Gly Lys Asn Phe Glu Glu Val Ala Phe Asp Glu Lys Lys
        355                 360                 365             
Asn Val Phe Val Glu Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Gln
    370                 375                 380                 
Leu Ala Pro Ile Trp Asp Lys Leu Gly Glu Thr Tyr Lys Asp His Glu
385                 390                 395                 400 
Asn Ile Val Ile Ala Lys Met Asp Ser Thr Ala Asn Glu Val Glu Ala
                405                 410                 415     
Val Lys Val His Ser Phe Pro Thr Leu Lys Phe Phe Pro Ala Ser Ala
            420                 425                 430         
Asp Arg Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr Leu Asp Gly Phe
        435                 440                 445             
Lys Lys Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala Gly Asp Asp Asp
    450                 455                 460                 
Asp Leu Glu Asp Leu Glu Glu Ala Glu Glu Pro Asp Leu Glu Glu Asp
465                 470                 475                 480 
Asp Asp Gln Lys Ala Val Lys Asp Glu Leu
                485                 490 


<210> 47
<211> 2211
<212> DNA
<213> Artificial Sequence

<220> 
<223> GFP-BtP4HB-ePTS1

<400> 47
atgcgtaaag gcgaagagct gttcactggt gtcgtcccta ttctggtgga actggatggt 60
gatgtcaacg gtcataagtt ttccgtgcgt ggcgagggtg aaggtgacgc aactaatggt 120
aaactgacgc tgaagttcat ctgtactact ggtaaactgc cggttccttg gccgactctg 180
gtaacgacgc tgacttatgg tgttcagtgc tttgctcgtt atccggacca tatgaagcag 240
catgacttct tcaagtccgc catgccggaa ggctatgtgc aggaacgcac gatttccttt 300
aaggatgacg gcacgtacaa aacgcgtgcg gaagtgaaat ttgaaggcga taccctggta 360
aaccgcattg agctgaaagg cattgacttt aaagaggacg gcaatatcct gggccataag 420
ctggaataca attttaacag ccacaatgtt tacatcaccg ccgataaaca aaaaaatggc 480
attaaagcga attttaaaat tcgccacaac gtggaggatg gcagcgtgca gctggctgat 540
cactaccagc aaaacactcc aatcggtgat ggtcctgttc tgctgccaga caatcactat 600
ctgagcacgc aaagcgttct gtctaaagat ccgaacgaga aacgcgatca tatggttctg 660
ctggagttcg taaccgcagc gggcatcacg catggtatgg atgaactgta caaagccccc 720
gacgaggagg accacgtcct ggtgctccat aagggcaact tcgacgaggc gctggcggcc 780
cacaagtacc tgctggtgga gttctacgcc ccatggtgcg gccactgcaa ggctctggcc 840
ccggagtatg ccaaagcagc tgggaagctg aaggcagaag gttctgagat cagactggcc 900
aaggtggatg ccactgaaga gtctgacctg gcccagcagt atggtgtccg aggctacccc 960
accatcaagt tcttcaagaa tggagacaca gcttccccca aagagtacac agctggccga 1020
gaagcggatg atatcgtgaa ctggctgaag aagcgcacgg gccccgctgc cagcacgctg 1080
tccgacgggg ctgctgcaga ggccttggtg gagtccagtg aggtggccgt cattggcttc 1140
ttcaaggaca tggagtcgga ctccgcaaag cagttcttct tggcagcaga ggtcattgat 1200
gacatcccct tcgggatcac atctaacagc gatgtgttct ccaaatacca gctggacaag 1260
gatggggttg tcctctttaa gaagtttgac gaaggccgga acaactttga gggggaggtc 1320
accaaagaaa agcttctgga cttcatcaag cacaaccagt tgcccctggt cattgagttc 1380
accgagcaga cagccccgaa gatcttcgga ggggaaatca agactcacat cctgctgttc 1440
ctgccgaaaa gcgtgtctga ctatgagggc aagctgagca acttcaaaaa agcggctgag 1500
agcttcaagg gcaagatcct gtttatcttc atcgacagcg accacactga caaccagcgc 1560
atcctggaat tcttcggcct aaagaaagag gagtgcccgg ccgtgcgcct catcacgctg 1620
gaggaggaga tgaccaaata taagccagag tcagatgagc tgacggcaga gaagatcacc 1680
gagttctgcc accgcttcct ggagggcaag attaagcccc acctgatgag ccaggagctg 1740
cctgacgact gggacaagca gcctgtcaaa gtgctggttg ggaagaactt tgaagaggtt 1800
gcttttgatg agaaaaagaa cgtctttgta gagttctatg ccccgtggtg cggtcactgc 1860
aagcagctgg cccccatctg ggataagctg ggagagacgt acaaggacca cgagaacata 1920
gtcatcgcca agatggactc cacggccaac gaggtggagg cggtgaaagt gcacagcttc 1980
cccacgctca agttcttccc cgccagcgcc gacaggacgg tcatcgacta caatggggag 2040
cggacactgg atggttttaa gaagttcctg gagagtggtg gccaggatgg ggccggagat 2100
gatgacgatc tagaagatct tgaagaagca gaagagcctg atctggagga agatgatgat 2160
caaaaagctg tgaaagatga actgttggga agaggtagaa gatccaaatt g          2211

<210> 48
<211> 737
<212> PRT
<213> Artificial Sequence

<220> 
<223> GFP-BtP4HB-ePTS1

<400> 48
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Ala Pro
225                 230                 235                 240 
Asp Glu Glu Asp His Val Leu Val Leu His Lys Gly Asn Phe Asp Glu
                245                 250                 255     
Ala Leu Ala Ala His Lys Tyr Leu Leu Val Glu Phe Tyr Ala Pro Trp
            260                 265                 270         
Cys Gly His Cys Lys Ala Leu Ala Pro Glu Tyr Ala Lys Ala Ala Gly
        275                 280                 285             
Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg Leu Ala Lys Val Asp Ala
    290                 295                 300                 
Thr Glu Glu Ser Asp Leu Ala Gln Gln Tyr Gly Val Arg Gly Tyr Pro
305                 310                 315                 320 
Thr Ile Lys Phe Phe Lys Asn Gly Asp Thr Ala Ser Pro Lys Glu Tyr
                325                 330                 335     
Thr Ala Gly Arg Glu Ala Asp Asp Ile Val Asn Trp Leu Lys Lys Arg
            340                 345                 350         
Thr Gly Pro Ala Ala Ser Thr Leu Ser Asp Gly Ala Ala Ala Glu Ala
        355                 360                 365             
Leu Val Glu Ser Ser Glu Val Ala Val Ile Gly Phe Phe Lys Asp Met
    370                 375                 380                 
Glu Ser Asp Ser Ala Lys Gln Phe Phe Leu Ala Ala Glu Val Ile Asp
385                 390                 395                 400 
Asp Ile Pro Phe Gly Ile Thr Ser Asn Ser Asp Val Phe Ser Lys Tyr
                405                 410                 415     
Gln Leu Asp Lys Asp Gly Val Val Leu Phe Lys Lys Phe Asp Glu Gly
            420                 425                 430         
Arg Asn Asn Phe Glu Gly Glu Val Thr Lys Glu Lys Leu Leu Asp Phe
        435                 440                 445             
Ile Lys His Asn Gln Leu Pro Leu Val Ile Glu Phe Thr Glu Gln Thr
    450                 455                 460                 
Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys Thr His Ile Leu Leu Phe
465                 470                 475                 480 
Leu Pro Lys Ser Val Ser Asp Tyr Glu Gly Lys Leu Ser Asn Phe Lys
                485                 490                 495     
Lys Ala Ala Glu Ser Phe Lys Gly Lys Ile Leu Phe Ile Phe Ile Asp
            500                 505                 510         
Ser Asp His Thr Asp Asn Gln Arg Ile Leu Glu Phe Phe Gly Leu Lys
        515                 520                 525             
Lys Glu Glu Cys Pro Ala Val Arg Leu Ile Thr Leu Glu Glu Glu Met
    530                 535                 540                 
Thr Lys Tyr Lys Pro Glu Ser Asp Glu Leu Thr Ala Glu Lys Ile Thr
545                 550                 555                 560 
Glu Phe Cys His Arg Phe Leu Glu Gly Lys Ile Lys Pro His Leu Met
                565                 570                 575     
Ser Gln Glu Leu Pro Asp Asp Trp Asp Lys Gln Pro Val Lys Val Leu
            580                 585                 590         
Val Gly Lys Asn Phe Glu Glu Val Ala Phe Asp Glu Lys Lys Asn Val
        595                 600                 605             
Phe Val Glu Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Gln Leu Ala
    610                 615                 620                 
Pro Ile Trp Asp Lys Leu Gly Glu Thr Tyr Lys Asp His Glu Asn Ile
625                 630                 635                 640 
Val Ile Ala Lys Met Asp Ser Thr Ala Asn Glu Val Glu Ala Val Lys
                645                 650                 655     
Val His Ser Phe Pro Thr Leu Lys Phe Phe Pro Ala Ser Ala Asp Arg
            660                 665                 670         
Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr Leu Asp Gly Phe Lys Lys
        675                 680                 685             
Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala Gly Asp Asp Asp Asp Leu
    690                 695                 700                 
Glu Asp Leu Glu Glu Ala Glu Glu Pro Asp Leu Glu Glu Asp Asp Asp
705                 710                 715                 720 
Gln Lys Ala Val Lys Asp Glu Leu Leu Gly Arg Gly Arg Arg Ser Lys
                725                 730                 735     
Leu
    


<210> 49
<211> 708
<212> DNA
<213> Artificial Sequence

<220> 
<223> TEV protease

<400> 49
ggagagtccc tgtttaaagg acccagagac tataacccga ttagtagcac tatttgtcat 60
cttacaaacg aaagtgatgg tcacacgact agtctttacg gaatcggatt cggcccattt 120
attatcacaa acaagcatct gttcagaaga aataacggga cgttgttggt ccaatctctt 180
catggagtat ttaaggtaaa gaacactaca actcttcagc agcatctgat cgacggtagg 240
gatatgatca tcatccgtat gccgaaagac tttccacctt ttcctcagaa gttgaagttt 300
agagaacccc agcgtgagga gcgtatctgt ttagtaacaa caaatttcca aacgaaatct 360
atgtcatcaa tggttagcga taccagttgt actttcccca gttcagatgg gattttctgg 420
aagcactgga ttcagacaaa ggacggtcag tgtggtagtc cgcttgtttc tacaagggac 480
ggatttattg tcgggataca cagtgcttct aactttacga atacaaacaa ctacttcacg 540
tctgtcccta aaaattttat ggagctgttg actaatcagg aagcccaaca gtgggtatct 600
ggctggcgtt tgaacgcgga ttccgtactg tggggtggcc acaaggtttt tatggttaag 660
cctgaagagc cgttccaacc tgtgaaggag gcaacacagc taatgaat              708

<210> 50
<211> 236
<212> PRT
<213> Artificial Sequence

<220> 
<223> TEV protease

<400> 50
Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser Ser
 1               5                  10                  15      
Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser Leu
            20                  25                  30          
Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu Phe
        35                  40                  45              
Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly Val Phe
    50                  55                  60                  
Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly Arg
65                  70                  75                  80  
Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro Gln
                85                  90                  95      
Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu Val
            100                 105                 110         
Thr Thr Asn Phe Gln Thr Lys Ser Met Ser Ser Met Val Ser Asp Thr
        115                 120                 125             
Ser Cys Thr Phe Pro Ser Ser Asp Gly Ile Phe Trp Lys His Trp Ile
    130                 135                 140                 
Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val Ser Thr Arg Asp
145                 150                 155                 160 
Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe Thr Asn Thr Asn
                165                 170                 175     
Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu Thr Asn
            180                 185                 190         
Gln Glu Ala Gln Gln Trp Val Ser Gly Trp Arg Leu Asn Ala Asp Ser
        195                 200                 205             
Val Leu Trp Gly Gly His Lys Val Phe Met Val Lys Pro Glu Glu Pro
    210                 215                 220                 
Phe Gln Pro Val Lys Glu Ala Thr Gln Leu Met Asn
225                 230                 235     


<210> 51
<211> 1182
<212> DNA
<213> Artificial Sequence

<220> 
<223> mRuby-BantP4H-ePTS1

<400> 51
atggtgtcca aaggagagga gttaatcaag gaaaacatga gaatgaaagt tgtcatggag 60
ggctccgtta atggtcacca attcaagtgt acaggggaag gtgaaggtaa tccttacatg 120
ggtacacaaa ctatgagaat taaagtaatt gaaggcggac cactaccatt tgcatttgac 180
attctggcaa cgtcattcat gtacggatca cgaactttca tcaagtaccc taaaggtata 240
ccagactttt tcaagcaatc ttttccagag ggttttacat gggaaagggt tacaagatac 300
gaagatgggg gtgtcgtcac agttatgcaa gatacttcat tagaagatgg ctgccttgtc 360
tatcatgtgc aagtaagagg ggtgaatttt ccttctaacg gacctgtgat gcagaaaaag 420
accaaaggtt gggaaccaaa tactgaaatg atgtacccag ctgatggagg tttgagaggc 480
tacacacaca tggcgcttaa agttgatggt ggaggtcatt tgtcttgtag ttttgttacc 540
acttatcgtt ctaaaaagac tgttggcaat atcaaaatgc caggaataca tgctgtagac 600
cacagactag aaagactcga agagagcgat aacgaaatgt tcgttgtaca gagagagcat 660
gccgtagcca aatttgctgg cttaggcggt ggtatggatg aattgtataa gggttctaaa 720
actttaggct atatccttat ggagaatggc gagaagattg acttggagtt ctttcccgag 780
gaagcaccta aaacagtaga aaacttcaaa aagctggctg agcaaggttt ctacgatggt 840
gttacattcc acagggttat cccaggattc gtgtcacaag gtggagatcc aaccggtacg 900
ggagctggcg gtccaggtta ctcaatccca tgtgaaacgg acggcaatcc ccataggcac 960
cttgttggca gtctaagtat ggcacatgct ggtcgtaaca ccggcggttc acagttcttc 1020
atagtacacg aaccccaacc ccaccttgac ggagtgcaca cagtctttgg taaggctacc 1080
tcaggtatcg aaacggtact aaatatgagg caaggtgacg taatgaaaga ggtcaaggtc 1140
tgggaggaag gatccttggg aagaggtaga agatccaaat tg                    1182

<210> 52
<211> 394
<212> PRT
<213> Artificial Sequence

<220> 
<223> mRuby-BantP4H-ePTS1

<400> 52
Met Val Ser Lys Gly Glu Glu Leu Ile Lys Glu Asn Met Arg Met Lys
 1               5                  10                  15      
Val Val Met Glu Gly Ser Val Asn Gly His Gln Phe Lys Cys Thr Gly
            20                  25                  30          
Glu Gly Glu Gly Asn Pro Tyr Met Gly Thr Gln Thr Met Arg Ile Lys
        35                  40                  45              
Val Ile Glu Gly Gly Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Thr
    50                  55                  60                  
Ser Phe Met Tyr Gly Ser Arg Thr Phe Ile Lys Tyr Pro Lys Gly Ile
65                  70                  75                  80  
Pro Asp Phe Phe Lys Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg
                85                  90                  95      
Val Thr Arg Tyr Glu Asp Gly Gly Val Val Thr Val Met Gln Asp Thr
            100                 105                 110         
Ser Leu Glu Asp Gly Cys Leu Val Tyr His Val Gln Val Arg Gly Val
        115                 120                 125             
Asn Phe Pro Ser Asn Gly Pro Val Met Gln Lys Lys Thr Lys Gly Trp
    130                 135                 140                 
Glu Pro Asn Thr Glu Met Met Tyr Pro Ala Asp Gly Gly Leu Arg Gly
145                 150                 155                 160 
Tyr Thr His Met Ala Leu Lys Val Asp Gly Gly Gly His Leu Ser Cys
                165                 170                 175     
Ser Phe Val Thr Thr Tyr Arg Ser Lys Lys Thr Val Gly Asn Ile Lys
            180                 185                 190         
Met Pro Gly Ile His Ala Val Asp His Arg Leu Glu Arg Leu Glu Glu
        195                 200                 205             
Ser Asp Asn Glu Met Phe Val Val Gln Arg Glu His Ala Val Ala Lys
    210                 215                 220                 
Phe Ala Gly Leu Gly Gly Gly Met Asp Glu Leu Tyr Lys Gly Ser Lys
225                 230                 235                 240 
Thr Leu Gly Tyr Ile Leu Met Glu Asn Gly Glu Lys Ile Asp Leu Glu
                245                 250                 255     
Phe Phe Pro Glu Glu Ala Pro Lys Thr Val Glu Asn Phe Lys Lys Leu
            260                 265                 270         
Ala Glu Gln Gly Phe Tyr Asp Gly Val Thr Phe His Arg Val Ile Pro
        275                 280                 285             
Gly Phe Val Ser Gln Gly Gly Asp Pro Thr Gly Thr Gly Ala Gly Gly
    290                 295                 300                 
Pro Gly Tyr Ser Ile Pro Cys Glu Thr Asp Gly Asn Pro His Arg His
305                 310                 315                 320 
Leu Val Gly Ser Leu Ser Met Ala His Ala Gly Arg Asn Thr Gly Gly
                325                 330                 335     
Ser Gln Phe Phe Ile Val His Glu Pro Gln Pro His Leu Asp Gly Val
            340                 345                 350         
His Thr Val Phe Gly Lys Ala Thr Ser Gly Ile Glu Thr Val Leu Asn
        355                 360                 365             
Met Arg Gln Gly Asp Val Met Lys Glu Val Lys Val Trp Glu Glu Gly
    370                 375                 380                 
Ser Leu Gly Arg Gly Arg Arg Ser Lys Leu
385                 390                 


<210> 53
<211> 1053
<212> DNA
<213> Artificial Sequence

<220> 
<223> GFP-BTCol1A1 403-0P-ePTS1

<400> 53
atgcgtaaag gcgaagagct gttcactggt gtcgtcccta ttctggtgga actggatggt 60
gatgtcaacg gtcataagtt ttccgtgcgt ggcgagggtg aaggtgacgc aactaatggt 120
aaactgacgc tgaagttcat ctgtactact ggtaaactgc cggttccttg gccgactctg 180
gtaacgacgc tgacttatgg tgttcagtgc tttgctcgtt atccggacca tatgaagcag 240
catgacttct tcaagtccgc catgccggaa ggctatgtgc aggaacgcac gatttccttt 300
aaggatgacg gcacgtacaa aacgcgtgcg gaagtgaaat ttgaaggcga taccctggta 360
aaccgcattg agctgaaagg cattgacttt aaagaggacg gcaatatcct gggccataag 420
ctggaataca attttaacag ccacaatgtt tacatcaccg ccgataaaca aaaaaatggc 480
attaaagcga attttaaaat tcgccacaac gtggaggatg gcagcgtgca gctggctgat 540
cactaccagc aaaacactcc aatcggtgat ggtcctgttc tgctgccaga caatcactat 600
ctgagcacgc aaagcgttct gtctaaagat ccgaacgaga aacgcgatca tatggttctg 660
ctggagttcg taaccgcagc gggcatcacg catggtatgg atgaactgta caaaggttct 720
ggatttgctg gccctaaagg agccgcagga gaggctggta aagcaggcga acgtggcgta 780
gctggcccag ctggtgctgt gggtccagtt ggtaaagatg gagaggccgg tgcccaaggt 840
ccagccggtc ctgtaggccc agcaggtgaa agaggagaac aaggtcccgc aggctcagct 900
ggattccaag gattagcagg acctgccggc ccagttggcg aagccggtaa agccggagag 960
caaggcgtgg ctggtgacct aggtgcagtt ggccctagtg gcgccagggg agaaagagga 1020
tccttgggaa gaggtagaag atccaaattg taa                              1053

<210> 54
<211> 350
<212> PRT
<213> Artificial Sequence

<220> 
<223> GFP-BTCol1A1 403-0P-ePTS1

<400> 54
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Ser
225                 230                 235                 240 
Gly Phe Ala Gly Pro Lys Gly Ala Ala Gly Glu Ala Gly Lys Ala Gly
                245                 250                 255     
Glu Arg Gly Val Ala Gly Pro Ala Gly Ala Val Gly Pro Val Gly Lys
            260                 265                 270         
Asp Gly Glu Ala Gly Ala Gln Gly Pro Ala Gly Pro Val Gly Pro Ala
        275                 280                 285             
Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Ala Gly Phe Gln Gly
    290                 295                 300                 
Leu Ala Gly Pro Ala Gly Pro Val Gly Glu Ala Gly Lys Ala Gly Glu
305                 310                 315                 320 
Gln Gly Val Ala Gly Asp Leu Gly Ala Val Gly Pro Ser Gly Ala Arg
                325                 330                 335     
Gly Glu Arg Gly Ser Leu Gly Arg Gly Arg Arg Ser Lys Leu
            340                 345                 350 


<210> 55
<211> 1053
<212> DNA
<213> Artificial Sequence

<220> 
<223> GFP-BTCol1A1 403-11P-ePTS1

<400> 55
atgcgtaaag gcgaagagct gttcactggt gtcgtcccta ttctggtgga actggatggt 60
gatgtcaacg gtcataagtt ttccgtgcgt ggcgagggtg aaggtgacgc aactaatggt 120
aaactgacgc tgaagttcat ctgtactact ggtaaactgc cggttccttg gccgactctg 180
gtaacgacgc tgacttatgg tgttcagtgc tttgctcgtt atccggacca tatgaagcag 240
catgacttct tcaagtccgc catgccggaa ggctatgtgc aggaacgcac gatttccttt 300
aaggatgacg gcacgtacaa aacgcgtgcg gaagtgaaat ttgaaggcga taccctggta 360
aaccgcattg agctgaaagg cattgacttt aaagaggacg gcaatatcct gggccataag 420
ctggaataca attttaacag ccacaatgtt tacatcaccg ccgataaaca aaaaaatggc 480
attaaagcga attttaaaat tcgccacaac gtggaggatg gcagcgtgca gctggctgat 540
cactaccagc aaaacactcc aatcggtgat ggtcctgttc tgctgccaga caatcactat 600
ctgagcacgc aaagcgttct gtctaaagat ccgaacgaga aacgcgatca tatggttctg 660
ctggagttcg taaccgcagc gggcatcacg catggtatgg atgaactgta caaaggttct 720
ggatttcctg gccctaaggg agccgcagga gagcccggta aagcaggcga aagaggcgta 780
cctggtccac ccggtgctgt gggtccagct ggtaaagatg gagaggccgg tgcccaaggt 840
cctcctggtc ctgctggccc agcaggtgaa agaggagaac aaggtcccgc aggctcacct 900
ggattccaag gattaccagg tccagccgga ccacctggcg aagccggtaa acccggagag 960
caaggcgtgc ctggtgacct aggtgcacca ggacctagtg gcgccagggg agaaagagga 1020
tccttgggaa gaggtagaag atccaaattg taa                              1053

<210> 56
<211> 350
<212> PRT
<213> Artificial Sequence

<220> 
<223> GFP-BTCol1A1 403-11P-ePTS1

<400> 56
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Ser
225                 230                 235                 240 
Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly Lys Ala Gly
                245                 250                 255     
Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro Ala Gly Lys
            260                 265                 270         
Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala Gly Pro Ala
        275                 280                 285             
Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly
    290                 295                 300                 
Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu
305                 310                 315                 320 
Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg
                325                 330                 335     
Gly Glu Arg Gly Ser Leu Gly Arg Gly Arg Arg Ser Lys Leu
            340                 345                 350 


<210> 57
<211> 5094
<212> DNA
<213> Artificial Sequence

<220> 
<223> AmisCOL1A1-TEV-GFP-HIS-ePTS1

<400> 57
atgcaaggag aagaagacau ucaaacugga agcugcauac aggauggacu agcguacaac 60
aacacagacg uauggaaacc cgagcccugc cagaucugcg uaugcgacaa uggcaacauc 120
cugugugacg augucaucug ugaugauacc ucggacugua ccaaugcuga gauccccuuu 180
ggagaaugcu gucccaucug uccugacacc gcuggcucuu cuaccuaccc caaauccacu 240
ggaguagagg guccuaaggg agacacuggc cccagaggac agaggggacu cccaggccca 300
ccuggcagag auggcauucc uggacagccu ggucucccug gacucccagg accuccaggc 360
ccuccuggcc uugguggaaa cuucgcuccu caaauggcuu acgguuacgg agaugaaacc 420
aaaucugcug gcauuucugu cccuggaccc auggguccag cuggcccccg uggucucccc 480
ggccccccug guucuccugg uccucaaggu uuccaagguc cuccuggaga gccuggagag 540
ccuggugcuu cagguccaau ggguccccgu gguccagccg gccccccugg caagaacgga 600
gaugauggug aagcuggaaa gcccggccgu cccggugagc gcggcccucc uggcccccag 660
ggugcacgug gucugcccgg aacugcuggc cugccaggca ugaaggguca cagagguuuc 720
aguggucugg auggugcuaa gggugaugcu gguccauccg gccccaaggg ugagccuggu 780
agcccuggug agaacggagc uccuggacaa augggcccuc guggucuucc cggugagaga 840
ggccgcccug guccaucugg cccugcuggu gcucguggua acgaugguag uccuggugcu 900
gcuggcccuc cagguccaac uggcccagcu ggccccccug gcuucccugg ugcugcuggu 960
gcuaagggug aaacuggucc ucaagguucu cgugguagug aaggcccaca gggugcucgu 1020
ggugagccug guccuccugg cccugcuggu gcugcugguc cugcuggcaa cccugguucu 1080
gauggucaag cuggugccaa aggugcaacu ggugcuccug guauugcugg ugcuccuggc 1140
uucccuggcg cucguggccc aucuggaccc caggguccca gcggugcucc uggccccaag 1200
gguaacagug gugaacccgg ugcucaaggc aacaagggag acacuggugc aaaaggagag 1260
ccugguccug cuggugucca aggcccaccu gguccagcug gugaagaagg caagagagga 1320
gcccguggug agcccggccc uggaggucuu ccuggcccug cuggcgaacg uggugcuccu 1380
ggaagccgug guuucccugg cgcugauggc auuucugguc ccaagggucc cccuggugaa 1440
cgugguuccc cuggcccugc uggucccaaa ggaucuacug gugaaucugg acgcccuggu 1500
gagccugguc ucccuggugc caagggucuu acuggaagcc cagguagccc agguccugau 1560
ggcaagacug guccaccugg ccccgcuggu caagaugguc gcccaggacc cccaggccca 1620
ccuggugcca gaggucaggc uggugugaug gguuucccug gaccuaaagg ugcugcuggu 1680
gagccuggca aaccugguga gagaggagcu ccuggacccc cuggugcugu uggcgcagcu 1740
gguaaggaug gugaagcugg ugcccaaggu ucuccuggcg cugcuggucc ugcuggagag 1800
agaggugaac aagguccugc uggugcuccu ggauuccagg gucugcccgg uccugcuggc 1860
ccaucuggug aaucuggcaa gccuggugaa caggguguuc cuggagaugc uggugcuccu 1920
gguccagcug gugcaagagg cgagagaggu uucccuggug agcguggugu ccaaggucaa 1980
ccagguccac aggguccacg uggugcuaac ggugcucccg guaacgaugg ugcuaagggu 2040
gaugcuggug cuccuggugc uccugguggc caagguccuc ccggucugca ggguaugccu 2100
ggugagcgug gugcugcugg ucugccuggu uccaagggug acagaggcga uccugguccc 2160
aaaggcacug auggugcucc uggcaaagau ggcgucagag gucuaacugg cccuauuggu 2220
ccuccuggcc cagcuggugc cccuggugac aagggugaag cugguccuuc uggcccugcu 2280
ggucccacug guucucgugg ugccccugga gaucguggug agccuggucc accuggcccu 2340
gcuggauucg cugguccccc uggugcugau ggacaaccug gugcuaaagg ugaaucuggu 2400
gaugcuggug cuaaagguga ugcugguccu ccaggcccug cuggacccac uggugcuccu 2460
ggaccuucug gcgcuguugg ugcuccugga cccaaaggug cucgugguag ugcuggaccc 2520
ccuggugcua cugguuuccc uggugcugcu ggaagaguug guccaccugg cccugcuggu 2580
aacgucgguc uuccuggccc aucaggcccc aguggaaaag aaggcucuaa aggaccccgu 2640
ggugagacug gcccugcugg acgccccggu gaaccuggac cugcuggccc accaggaccu 2700
ucuggcgaga agggcucucc ugguggugau ggucccgcug gugcuccugg uacuccaggc 2760
ccacagggua uugcuggaca gcguggugua guuggucuuc cuggacagag aggcgagaga 2820
gguuucccug gucuccccgg cccaucuggc gaaccuggca aacaaggucc aucuggcucc 2880
ucuggugaac gcgguccucc ugguccaaug ggaccaccug gcuuggcugg accuccuggu 2940
gaagcuggac gugagggugc uccugguucu gaaggugcuc cuggucgcga uggcgcugcu 3000
ggucccaagg gugaccgugg ugagacuggc cccucugguc cuccuggugc ucccggugcc 3060
ccuggagcuc cuggcccuau uggcccugcu ggcaagaaug gagaucgugg ugagacuggu 3120
ccuucugguc cugcuggccc ugccgguccu gcuggugcuc gugguccugc ugguccacaa 3180
ggugcccgug gugacaaagg ugaaacugga gaacauggug acagaggcau gaagggucac 3240
agaggauucc cuggucccca gggucccucu gguccugcug gcucuccugg ugaacaaggu 3300
ccuucuggag cuuccggccc ugcuggucca agagguccuc cuggcucugc uggcaccccu 3360
ggcaaagaug gucugaaugg ucucccuggc ccuauugguc caccuggucc ccggggucgc 3420
acuggugaug uugguccugc uggucccccu ggaccuccug ggcccccagg uccuccuggu 3480
gcacccagcg gcggcuuuga cuucagcuuc augccccagc cuccucagga gaaagcccau 3540
gauccuggcc gcuacuacag agcugaugac gccaacguga ugcgugaccg ugaccuggag 3600
guggacacca cccucaagag ccugagccag cagaucgaga acauccgcag ccccgagggc 3660
accaggaaga acccugcccg caccugccgu gaccugaaga ugugccacaa ugacuggaag 3720
agcggcgagu acuggauuga ccccaaccag ggcugcaauc uggaugccau caaggucuac 3780
uguaacaugg agacuggcga gacuugcguc cacccaaccc aggccaccau cgcucagaag 3840
aacugguaca ugagcaagaa ccccaaggag aagaaacaca ucugguuugg cgagacaaug 3900
agcgauggcu uccaguucga auaugguggg gagggcucca acccagcuga cguugccauc 3960
caacugaccu uccugcgccu gauguccacu gaggccuccc agaacaucac cuaccacugc 4020
aagaacagcg uggcuuacau ggaccaggag acuggcaacc ugaagaaggc ucugcuccuu 4080
cagggcucca acgagaucga gaucagagca gaaggcaaca gccgcuucac cuauggaguc 4140
acugaggaug gcugcacaac ucacaccggu gccuggggca agacagucau ugaauacaaa 4200
acaacaaaaa ccucucgccu gcccgucauu gacguggcuc ccauggacgu uggagcacaa 4260
gaucaggaau ucggaauugu caucggaccu gucugcuucu ugggttctga gaatctttat 4320
tttcagggcc gtaaaggcga agagctgttc actggtgtcg tccctattct ggtggaactg 4380
gatggtgatg tcaacggtca taagttttcc gtgcgtggcg agggtgaagg tgacgcaact 4440
aatggtaaac tgacgctgaa gttcatctgt actactggta aactgccggt tccttggccg 4500
actctggtaa cgacgctgac ttatggtgtt cagtgctttg ctcgttatcc ggaccatatg 4560
aagcagcatg acttcttcaa gtccgccatg ccggaaggct atgtgcagga acgcacgatt 4620
tcctttaagg atgacggcac gtacaaaacg cgtgcggaag tgaaatttga aggcgatacc 4680
ctggtaaacc gcattgagct gaaaggcatt gactttaaag aggacggcaa tatcctgggc 4740
cataagctgg aatacaattt taacagccac aatgtttaca tcaccgccga taaacaaaaa 4800
aatggcatta aagcgaattt taaaattcgc cacaacgtgg aggatggcag cgtgcagctg 4860
gctgatcact accagcaaaa cactccaatc ggtgatggtc ctgttctgct gccagacaat 4920
cactatctga gcacgcaaag cgttctgtct aaagatccga acgagaaacg cgatcatatg 4980
gttctgctgg agttcgtaac cgcagcgggc atcacgcatg gtatggatga actgtacaaa 5040
ggttctcatc atcatcatca tcacttggga agaggtagaa gatccaaatt gtaa       5094

<210> 58
<211> 1697
<212> PRT
<213> Artificial Sequence

<220> 
<223> AmisCOL1A1-TEV-GFP-HIS-ePTS1

<400> 58
Met Gln Gly Glu Glu Asp Ile Gln Thr Gly Ser Cys Ile Gln Asp Gly
 1               5                  10                  15      
Leu Ala Tyr Asn Asn Thr Asp Val Trp Lys Pro Glu Pro Cys Gln Ile
            20                  25                  30          
Cys Val Cys Asp Asn Gly Asn Ile Leu Cys Asp Asp Val Ile Cys Asp
        35                  40                  45              
Asp Thr Ser Asp Cys Thr Asn Ala Glu Ile Pro Phe Gly Glu Cys Cys
    50                  55                  60                  
Pro Ile Cys Pro Asp Thr Ala Gly Ser Ser Thr Tyr Pro Lys Ser Thr
65                  70                  75                  80  
Gly Val Glu Gly Pro Lys Gly Asp Thr Gly Pro Arg Gly Gln Arg Gly
                85                  90                  95      
Leu Pro Gly Pro Pro Gly Arg Asp Gly Ile Pro Gly Gln Pro Gly Leu
            100                 105                 110         
Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe
        115                 120                 125             
Ala Pro Gln Met Ala Tyr Gly Tyr Gly Asp Glu Thr Lys Ser Ala Gly
    130                 135                 140                 
Ile Ser Val Pro Gly Pro Met Gly Pro Ala Gly Pro Arg Gly Leu Pro
145                 150                 155                 160 
Gly Pro Pro Gly Ser Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly
                165                 170                 175     
Glu Pro Gly Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro
            180                 185                 190         
Ala Gly Pro Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro
        195                 200                 205             
Gly Arg Pro Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly Ala Arg Gly
    210                 215                 220                 
Leu Pro Gly Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe
225                 230                 235                 240 
Ser Gly Leu Asp Gly Ala Lys Gly Asp Ala Gly Pro Ser Gly Pro Lys
                245                 250                 255     
Gly Glu Pro Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gln Met Gly
            260                 265                 270         
Pro Arg Gly Leu Pro Gly Glu Arg Gly Arg Pro Gly Pro Ser Gly Pro
        275                 280                 285             
Ala Gly Ala Arg Gly Asn Asp Gly Ser Pro Gly Ala Ala Gly Pro Pro
    290                 295                 300                 
Gly Pro Thr Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Ala Gly
305                 310                 315                 320 
Ala Lys Gly Glu Thr Gly Pro Gln Gly Ser Arg Gly Ser Glu Gly Pro
                325                 330                 335     
Gln Gly Ala Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala
            340                 345                 350         
Gly Pro Ala Gly Asn Pro Gly Ser Asp Gly Gln Ala Gly Ala Lys Gly
        355                 360                 365             
Ala Thr Gly Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe Pro Gly Ala
    370                 375                 380                 
Arg Gly Pro Ser Gly Pro Gln Gly Pro Ser Gly Ala Pro Gly Pro Lys
385                 390                 395                 400 
Gly Asn Ser Gly Glu Pro Gly Ala Gln Gly Asn Lys Gly Asp Thr Gly
                405                 410                 415     
Ala Lys Gly Glu Pro Gly Pro Ala Gly Val Gln Gly Pro Pro Gly Pro
            420                 425                 430         
Ala Gly Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Gly
        435                 440                 445             
Gly Leu Pro Gly Pro Ala Gly Glu Arg Gly Ala Pro Gly Ser Arg Gly
    450                 455                 460                 
Phe Pro Gly Ala Asp Gly Ile Ser Gly Pro Lys Gly Pro Pro Gly Glu
465                 470                 475                 480 
Arg Gly Ser Pro Gly Pro Ala Gly Pro Lys Gly Ser Thr Gly Glu Ser
                485                 490                 495     
Gly Arg Pro Gly Glu Pro Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly
            500                 505                 510         
Ser Pro Gly Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro
        515                 520                 525             
Ala Gly Gln Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg
    530                 535                 540                 
Gly Gln Ala Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly
545                 550                 555                 560 
Glu Pro Gly Lys Pro Gly Glu Arg Gly Ala Pro Gly Pro Pro Gly Ala
                565                 570                 575     
Val Gly Ala Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln Gly Ser Pro
            580                 585                 590         
Gly Ala Ala Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly
        595                 600                 605             
Ala Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Ser Gly Glu
    610                 615                 620                 
Ser Gly Lys Pro Gly Glu Gln Gly Val Pro Gly Asp Ala Gly Ala Pro
625                 630                 635                 640 
Gly Pro Ala Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly
                645                 650                 655     
Val Gln Gly Gln Pro Gly Pro Gln Gly Pro Arg Gly Ala Asn Gly Ala
            660                 665                 670         
Pro Gly Asn Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro
        675                 680                 685             
Gly Gly Gln Gly Pro Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly
    690                 695                 700                 
Ala Ala Gly Leu Pro Gly Ser Lys Gly Asp Arg Gly Asp Pro Gly Pro
705                 710                 715                 720 
Lys Gly Thr Asp Gly Ala Pro Gly Lys Asp Gly Val Arg Gly Leu Thr
                725                 730                 735     
Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly
            740                 745                 750         
Glu Ala Gly Pro Ser Gly Pro Ala Gly Pro Thr Gly Ser Arg Gly Ala
        755                 760                 765             
Pro Gly Asp Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala
    770                 775                 780                 
Gly Pro Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Glu Ser Gly
785                 790                 795                 800 
Asp Ala Gly Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro
                805                 810                 815     
Thr Gly Ala Pro Gly Pro Ser Gly Ala Val Gly Ala Pro Gly Pro Lys
            820                 825                 830         
Gly Ala Arg Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly
        835                 840                 845             
Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ala Gly Asn Val Gly Leu
    850                 855                 860                 
Pro Gly Pro Ser Gly Pro Ser Gly Lys Glu Gly Ser Lys Gly Pro Arg
865                 870                 875                 880 
Gly Glu Thr Gly Pro Ala Gly Arg Pro Gly Glu Pro Gly Pro Ala Gly
                885                 890                 895     
Pro Pro Gly Pro Ser Gly Glu Lys Gly Ser Pro Gly Gly Asp Gly Pro
            900                 905                 910         
Ala Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg
        915                 920                 925             
Gly Val Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly
    930                 935                 940                 
Leu Pro Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ser
945                 950                 955                 960 
Ser Gly Glu Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala
                965                 970                 975     
Gly Pro Pro Gly Glu Ala Gly Arg Glu Gly Ala Pro Gly Ser Glu Gly
            980                 985                 990         
Ala Pro Gly Arg Asp Gly Ala Ala Gly Pro Lys Gly Asp Arg Gly Glu
        995                 1000                1005            
Thr Gly Pro Ser Gly Pro Pro Gly Ala Pro Gly Ala Pro Gly Ala Pro
    1010                1015                1020                
Gly Pro Ile Gly Pro Ala Gly Lys Asn Gly Asp Arg Gly Glu Thr Gly
1025                1030                1035               1040 
Pro Ser Gly Pro Ala Gly Pro Ala Gly Pro Ala Gly Ala Arg Gly Pro
                1045                1050                1055    
Ala Gly Pro Gln Gly Ala Arg Gly Asp Lys Gly Glu Thr Gly Glu His
            1060                1065                1070        
Gly Asp Arg Gly Met Lys Gly His Arg Gly Phe Pro Gly Pro Gln Gly
        1075                1080                1085            
Pro Ser Gly Pro Ala Gly Ser Pro Gly Glu Gln Gly Pro Ser Gly Ala
    1090                1095                1100                
Ser Gly Pro Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly Thr Pro
1105                1110                1115               1120 
Gly Lys Asp Gly Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro Pro Gly
                1125                1130                1135    
Pro Arg Gly Arg Thr Gly Asp Val Gly Pro Ala Gly Pro Pro Gly Pro
            1140                1145                1150        
Pro Gly Pro Pro Gly Pro Pro Gly Ala Pro Ser Gly Gly Phe Asp Phe
        1155                1160                1165            
Ser Phe Met Pro Gln Pro Pro Gln Glu Lys Ala His Asp Pro Gly Arg
    1170                1175                1180                
Tyr Tyr Arg Ala Asp Asp Ala Asn Val Met Arg Asp Arg Asp Leu Glu
1185                1190                1195               1200 
Val Asp Thr Thr Leu Lys Ser Leu Ser Gln Gln Ile Glu Asn Ile Arg
                1205                1210                1215    
Ser Pro Glu Gly Thr Arg Lys Asn Pro Ala Arg Thr Cys Arg Asp Leu
            1220                1225                1230        
Lys Met Cys His Asn Asp Trp Lys Ser Gly Glu Tyr Trp Ile Asp Pro
        1235                1240                1245            
Asn Gln Gly Cys Asn Leu Asp Ala Ile Lys Val Tyr Cys Asn Met Glu
    1250                1255                1260                
Thr Gly Glu Thr Cys Val His Pro Thr Gln Ala Thr Ile Ala Gln Lys
1265                1270                1275               1280 
Asn Trp Tyr Met Ser Lys Asn Pro Lys Glu Lys Lys His Ile Trp Phe
                1285                1290                1295    
Gly Glu Thr Met Ser Asp Gly Phe Gln Phe Glu Tyr Gly Gly Glu Gly
            1300                1305                1310        
Ser Asn Pro Ala Asp Val Ala Ile Gln Leu Thr Phe Leu Arg Leu Met
        1315                1320                1325            
Ser Thr Glu Ala Ser Gln Asn Ile Thr Tyr His Cys Lys Asn Ser Val
    1330                1335                1340                
Ala Tyr Met Asp Gln Glu Thr Gly Asn Leu Lys Lys Ala Leu Leu Leu
1345                1350                1355               1360 
Gln Gly Ser Asn Glu Ile Glu Ile Arg Ala Glu Gly Asn Ser Arg Phe
                1365                1370                1375    
Thr Tyr Gly Val Thr Glu Asp Gly Cys Thr Thr His Thr Gly Ala Trp
            1380                1385                1390        
Gly Lys Thr Val Ile Glu Tyr Lys Thr Thr Lys Thr Ser Arg Leu Pro
        1395                1400                1405            
Val Ile Asp Val Ala Pro Met Asp Val Gly Ala Gln Asp Gln Glu Phe
    1410                1415                1420                
Gly Ile Val Ile Gly Pro Val Cys Phe Leu Gly Ser Glu Asn Leu Tyr
1425                1430                1435               1440 
Phe Gln Gly Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile
                1445                1450                1455    
Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg
            1460                1465                1470        
Gly Glu Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe
        1475                1480                1485            
Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr
    1490                1495                1500                
Thr Leu Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met
1505                1510                1515               1520 
Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln
                1525                1530                1535    
Glu Arg Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala
            1540                1545                1550        
Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys
        1555                1560                1565            
Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu
    1570                1575                1580                
Tyr Asn Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys
1585                1590                1595               1600 
Asn Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly
                1605                1610                1615    
Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp
            1620                1625                1630        
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val
        1635                1640                1645            
Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu
    1650                1655                1660                
Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
1665                1670                1675               1680 
Gly Ser His His His His His His Leu Gly Arg Gly Arg Arg Ser Lys
                1685                1690                1695    
Leu
    


<210> 59
<211> 4818
<212> DNA
<213> Artificial Sequence

<220> 
<223> AmisCOL1A2-TEV-GFP-HIS-ePTS1

<400> 59
atgcaacaag caaatgaggc aactgcagga cggaagggcc caagaggaga caaagggcca 60
cagggagaaa ggggtccacc aggtccacca ggcagagatg gtgaagatgg tccaccaggg 120
cctccagggc cccctggtcc tccaggtctt ggcggaaact ttgctgctca gtatgacgga 180
gcaaaagcag gtgactatgg ctcaggacca atgggtttaa tgggacccag aggcccacct 240
ggaacaagtg gacctcctgg tcctcctggc ttccaaggac ctcatggtga gcctggtgaa 300
cctggtcaaa caggtcccca gggtccccgt ggtccatctg gtcctcctgg aaaggctggt 360
gaagatggcc atcctggaaa atctggacga tctggtgaga ggggcgtctc tggtcctcag 420
ggtgctcgtg gtttccctgg aactcctggt ctgcctggct ttaagggaat tagaggacac 480
aatggtctgg atggtcagaa gggacaacct ggtactccag gcattaaggg tgaatccggt 540
gcccctggtg aaaatggtac cccaggacaa tctggtgctc gtggccttcc cggtgaaaga 600
ggaagaattg gtgcacctgg cccagctggt gcccgtggca gcgatggtag cactggtccc 660
actggtcctg ctggccctat cggttctgct ggtgctccag gtttcccagg tgctcctgga 720
gccaagggtg aaattggagc tgctggtaat gtaggtcctt ctggccctgc tggtccacga 780
ggagaggctg gacttcctgg ttcttctggt cccgttggcc ctcctggaaa ccctggttct 840
aatggtcttg ctggtgctaa aggtgcaact ggtcttcctg gtgttgctgg tgctcctggc 900
ttgcctggtc cacgtggtat tcctggacct tctggccctg ccggagctgc tggcaccaga 960
ggtcttgttg gtgaaccagg ccctgctggt gccaagggag aaagtggtaa caagggtgaa 1020
cccggtgctg ctggtccatc aggtcccgct ggtccaagtg gtgaagaagg caagaaaggt 1080
actactggtg aacctggctc ttctggcccc cctggtccag ctggtctaag aggcgttcct 1140
ggatctcgtg gtctccctgg agctgacggc agagctggtg ttatgggacc tgctggcagc 1200
cgtggtgcta ctggtcctgc tggtgctaaa ggtcctagtg gtgataatgg tcgccctggt 1260
gagcctggcc ttatgggtcc aagaggtctc cctggtcaac ctggaagctc aggccctgct 1320
ggcaaggaag gtcctgttgg tttccctggt gcagatggta gagttggccc aactggtcca 1380
gctggtgcaa gaggtgagcc tggcaacatt ggattccctg gacccaaagg ccccactggt 1440
gaccctggca aacctggtga cagaggccat gctggtcttg ctggtgctcg gggtgcgcct 1500
ggtcctgagg gcaacaatgg ggctcaaggt cctcctggtg ttgctggcaa ccctggtgca 1560
aaaggtgaac aaggtccagc tggtcctccc ggtttccagg gtctcccagg cccctcaggt 1620
ccagctggtg aagctggcaa accaggtgaa aggggtatgg ctggtgaatt tggtgcccct 1680
ggccctgcgg gttcaagagg tgaacgtggt cctccaggcg aaagtggtgc tgttggtcct 1740
gtaggtccca ttggaagccg tggtccatct ggtccaccag gcactgatgg caacaagggt 1800
gaacctggta atgttggtaa tgctggtact gcaggcccct ctggcgctgg tggagcccca 1860
ggagagagag gcattgctgg tattccagga cccaagggtg aaaagggtgc tacaggtctg 1920
agaggggata ctggcgcaac aggaagagat ggtgctcgtg gtgctcctgg tgctattgga 1980
gcccctggcc ccgctggtgg agctggtgag cggggtgaag gtggtcctgc tggtgctgct 2040
ggcccttctg gtgcccgtgg tattcctggt gaacgtggtg agcctggtcc tgctggccct 2100
actggatttg ctggacctgc tggtgcagct ggccaacctg gtgctaaagg tgaacgaggt 2160
acaaaaggac ccaagggtga aaatggtcca caaggtgctg ttggcccagt tggttcttct 2220
ggaccatcag gtcctgttgg tgcctctggt cctgctggtc ctcgtggtga tggtggtcct 2280
cctggtgtca ctggtttccc tggagctgct ggcagaactg gtcctcccgg cccctctggt 2340
atcactggcc cccctggtcc ccctggctca gctggcaaag atggtatgag aggcccacgt 2400
ggtgatactg gtccagttgg ccgcactgga gaacaaggca ttgttggccc acctggcttc 2460
agtggtgaga aaggtccatc tggagagcct ggtgctgctg gtccccctgg taccccaggt 2520
cctcagggta ttcttggtgc tcctggtatc cttggtctgc ctggctctcg gggagaacgt 2580
ggtcttccag gcatctctgg agcaacaggt gaaccaggtc ctcttggtat ttccggtcct 2640
cctggtgcac gtggtccctc tggccccgtg ggttctgctg gtctgaatgg tgcccctggt 2700
gaagctggcc gtgatggcaa tcctggccat gatggtgctc caggccgtga tggtgctcct 2760
ggtttcaagg gtgagcgtgg tgctcctggg aacaatggac ctgctggtgc tgttggtgct 2820
cctggcgccc atggtcaagt tggtcctgct ggaaagcctg gaaatcgtgg tgatcctggt 2880
cctgttggtc cttctggtcc tgctggtgct tttggtgcaa ggggtccttc tggcccacaa 2940
ggtgcacgtg gtgagaaggg agaaacaggt gaaaagggac acagaggtat gcctggattt 3000
aaggggcaca atggacttca gggtctgcct ggtcttgctg gccaacatgg agatcaaggt 3060
cctccaggtt ctactggccc cgctggccca aggggtccct ctggtccttc tggtcctgct 3120
ggaaaagatg gtcgcaatgg actccctggc cctattggac ctgctggtgt gcgtggttct 3180
cagggtagcc aaggtccttc gggtccacct ggcccacctg gtctccctgg tccccctggt 3240
gcaaatggtg gtggatacga agttggctat gatcttgaat actaccgggc tgatcagcct 3300
gctctcagac ctaaggacta tgaagttgat gccactctga aaacattgaa caaccaaatt 3360
gagaccctcc tgaccccaga aggctccagg aagaacccag ctcgcacctg ccgtgacctg 3420
agactcagcc acccagaatg gaccagtggt ttctactgga ttgatcccaa ccagggctgt 3480
actatggatg ccattagagt gtattgtgac ttctccactg gtgagacttg catacatgcc 3540
aatctagaaa acatccccac taagaactgg tatgtcagca agaactccaa ggaaaagaag 3600
cacatgtggt ttggtgaaac tatcaatggt ggtacccagt ttgaatataa cgatgaagga 3660
gtgacttcca aggacatggc tacccaactt gccttcatgc gtctgctggc caaccatgcc 3720
tcccagaaca tcacctacca ctgcaagaac agtattgcat acatggatga agaaactggc 3780
aaccttaaga aggctgtaat actgcaggga tccaatgatg ttgaactacg agctgaaggc 3840
aacagcagat tcactttcag tgttctggaa gatggctgct ctagaaagaa caacgcatgg 3900
ggcaaaacaa tcattgaata tagaacaaac aaaccatctc gcttgcccat ccttgacatt 3960
gcacctttgg acattggtgg agctgatcaa gaattcggtt tggacattgg cccagtctgt 4020
ttcaaaggtt ctgagaatct ttattttcag ggccgtaaag gcgaagagct gttcactggt 4080
gtcgtcccta ttctggtgga actggatggt gatgtcaacg gtcataagtt ttccgtgcgt 4140
ggcgagggtg aaggtgacgc aactaatggt aaactgacgc tgaagttcat ctgtactact 4200
ggtaaactgc cggttccttg gccgactctg gtaacgacgc tgacttatgg tgttcagtgc 4260
tttgctcgtt atccggacca tatgaagcag catgacttct tcaagtccgc catgccggaa 4320
ggctatgtgc aggaacgcac gatttccttt aaggatgacg gcacgtacaa aacgcgtgcg 4380
gaagtgaaat ttgaaggcga taccctggta aaccgcattg agctgaaagg cattgacttt 4440
aaagaggacg gcaatatcct gggccataag ctggaataca attttaacag ccacaatgtt 4500
tacatcaccg ccgataaaca aaaaaatggc attaaagcga attttaaaat tcgccacaac 4560
gtggaggatg gcagcgtgca gctggctgat cactaccagc aaaacactcc aatcggtgat 4620
ggtcctgttc tgctgccaga caatcactat ctgagcacgc aaagcgttct gtctaaagat 4680
ccgaacgaga aacgcgatca tatggttctg ctggagttcg taaccgcagc gggcatcacg 4740
catggtatgg atgaactgta caaaggttct catcatcatc atcatcactt gggaagaggt 4800
agaagatcca aattgtaa                                               4818

<210> 60
<211> 1605
<212> PRT
<213> Artificial Sequence

<220> 
<223> AmisCOL1A2-TEV-GFP-HIS-ePTS1

<400> 60
Met Gln Gln Ala Asn Glu Ala Thr Ala Gly Arg Lys Gly Pro Arg Gly
 1               5                  10                  15      
Asp Lys Gly Pro Gln Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Arg
            20                  25                  30          
Asp Gly Glu Asp Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro
        35                  40                  45              
Gly Leu Gly Gly Asn Phe Ala Ala Gln Tyr Asp Gly Ala Lys Ala Gly
    50                  55                  60                  
Asp Tyr Gly Ser Gly Pro Met Gly Leu Met Gly Pro Arg Gly Pro Pro
65                  70                  75                  80  
Gly Thr Ser Gly Pro Pro Gly Pro Pro Gly Phe Gln Gly Pro His Gly
                85                  90                  95      
Glu Pro Gly Glu Pro Gly Gln Thr Gly Pro Gln Gly Pro Arg Gly Pro
            100                 105                 110         
Ser Gly Pro Pro Gly Lys Ala Gly Glu Asp Gly His Pro Gly Lys Ser
        115                 120                 125             
Gly Arg Ser Gly Glu Arg Gly Val Ser Gly Pro Gln Gly Ala Arg Gly
    130                 135                 140                 
Phe Pro Gly Thr Pro Gly Leu Pro Gly Phe Lys Gly Ile Arg Gly His
145                 150                 155                 160 
Asn Gly Leu Asp Gly Gln Lys Gly Gln Pro Gly Thr Pro Gly Ile Lys
                165                 170                 175     
Gly Glu Ser Gly Ala Pro Gly Glu Asn Gly Thr Pro Gly Gln Ser Gly
            180                 185                 190         
Ala Arg Gly Leu Pro Gly Glu Arg Gly Arg Ile Gly Ala Pro Gly Pro
        195                 200                 205             
Ala Gly Ala Arg Gly Ser Asp Gly Ser Thr Gly Pro Thr Gly Pro Ala
    210                 215                 220                 
Gly Pro Ile Gly Ser Ala Gly Ala Pro Gly Phe Pro Gly Ala Pro Gly
225                 230                 235                 240 
Ala Lys Gly Glu Ile Gly Ala Ala Gly Asn Val Gly Pro Ser Gly Pro
                245                 250                 255     
Ala Gly Pro Arg Gly Glu Ala Gly Leu Pro Gly Ser Ser Gly Pro Val
            260                 265                 270         
Gly Pro Pro Gly Asn Pro Gly Ser Asn Gly Leu Ala Gly Ala Lys Gly
        275                 280                 285             
Ala Thr Gly Leu Pro Gly Val Ala Gly Ala Pro Gly Leu Pro Gly Pro
    290                 295                 300                 
Arg Gly Ile Pro Gly Pro Ser Gly Pro Ala Gly Ala Ala Gly Thr Arg
305                 310                 315                 320 
Gly Leu Val Gly Glu Pro Gly Pro Ala Gly Ala Lys Gly Glu Ser Gly
                325                 330                 335     
Asn Lys Gly Glu Pro Gly Ala Ala Gly Pro Ser Gly Pro Ala Gly Pro
            340                 345                 350         
Ser Gly Glu Glu Gly Lys Lys Gly Thr Thr Gly Glu Pro Gly Ser Ser
        355                 360                 365             
Gly Pro Pro Gly Pro Ala Gly Leu Arg Gly Val Pro Gly Ser Arg Gly
    370                 375                 380                 
Leu Pro Gly Ala Asp Gly Arg Ala Gly Val Met Gly Pro Ala Gly Ser
385                 390                 395                 400 
Arg Gly Ala Thr Gly Pro Ala Gly Ala Lys Gly Pro Ser Gly Asp Asn
                405                 410                 415     
Gly Arg Pro Gly Glu Pro Gly Leu Met Gly Pro Arg Gly Leu Pro Gly
            420                 425                 430         
Gln Pro Gly Ser Ser Gly Pro Ala Gly Lys Glu Gly Pro Val Gly Phe
        435                 440                 445             
Pro Gly Ala Asp Gly Arg Val Gly Pro Thr Gly Pro Ala Gly Ala Arg
    450                 455                 460                 
Gly Glu Pro Gly Asn Ile Gly Phe Pro Gly Pro Lys Gly Pro Thr Gly
465                 470                 475                 480 
Asp Pro Gly Lys Pro Gly Asp Arg Gly His Ala Gly Leu Ala Gly Ala
                485                 490                 495     
Arg Gly Ala Pro Gly Pro Glu Gly Asn Asn Gly Ala Gln Gly Pro Pro
            500                 505                 510         
Gly Val Ala Gly Asn Pro Gly Ala Lys Gly Glu Gln Gly Pro Ala Gly
        515                 520                 525             
Pro Pro Gly Phe Gln Gly Leu Pro Gly Pro Ser Gly Pro Ala Gly Glu
    530                 535                 540                 
Ala Gly Lys Pro Gly Glu Arg Gly Met Ala Gly Glu Phe Gly Ala Pro
545                 550                 555                 560 
Gly Pro Ala Gly Ser Arg Gly Glu Arg Gly Pro Pro Gly Glu Ser Gly
                565                 570                 575     
Ala Val Gly Pro Val Gly Pro Ile Gly Ser Arg Gly Pro Ser Gly Pro
            580                 585                 590         
Pro Gly Thr Asp Gly Asn Lys Gly Glu Pro Gly Asn Val Gly Asn Ala
        595                 600                 605             
Gly Thr Ala Gly Pro Ser Gly Ala Gly Gly Ala Pro Gly Glu Arg Gly
    610                 615                 620                 
Ile Ala Gly Ile Pro Gly Pro Lys Gly Glu Lys Gly Ala Thr Gly Leu
625                 630                 635                 640 
Arg Gly Asp Thr Gly Ala Thr Gly Arg Asp Gly Ala Arg Gly Ala Pro
                645                 650                 655     
Gly Ala Ile Gly Ala Pro Gly Pro Ala Gly Gly Ala Gly Glu Arg Gly
            660                 665                 670         
Glu Gly Gly Pro Ala Gly Ala Ala Gly Pro Ser Gly Ala Arg Gly Ile
        675                 680                 685             
Pro Gly Glu Arg Gly Glu Pro Gly Pro Ala Gly Pro Thr Gly Phe Ala
    690                 695                 700                 
Gly Pro Ala Gly Ala Ala Gly Gln Pro Gly Ala Lys Gly Glu Arg Gly
705                 710                 715                 720 
Thr Lys Gly Pro Lys Gly Glu Asn Gly Pro Gln Gly Ala Val Gly Pro
                725                 730                 735     
Val Gly Ser Ser Gly Pro Ser Gly Pro Val Gly Ala Ser Gly Pro Ala
            740                 745                 750         
Gly Pro Arg Gly Asp Gly Gly Pro Pro Gly Val Thr Gly Phe Pro Gly
        755                 760                 765             
Ala Ala Gly Arg Thr Gly Pro Pro Gly Pro Ser Gly Ile Thr Gly Pro
    770                 775                 780                 
Pro Gly Pro Pro Gly Ser Ala Gly Lys Asp Gly Met Arg Gly Pro Arg
785                 790                 795                 800 
Gly Asp Thr Gly Pro Val Gly Arg Thr Gly Glu Gln Gly Ile Val Gly
                805                 810                 815     
Pro Pro Gly Phe Ser Gly Glu Lys Gly Pro Ser Gly Glu Pro Gly Ala
            820                 825                 830         
Ala Gly Pro Pro Gly Thr Pro Gly Pro Gln Gly Ile Leu Gly Ala Pro
        835                 840                 845             
Gly Ile Leu Gly Leu Pro Gly Ser Arg Gly Glu Arg Gly Leu Pro Gly
    850                 855                 860                 
Ile Ser Gly Ala Thr Gly Glu Pro Gly Pro Leu Gly Ile Ser Gly Pro
865                 870                 875                 880 
Pro Gly Ala Arg Gly Pro Ser Gly Pro Val Gly Ser Ala Gly Leu Asn
                885                 890                 895     
Gly Ala Pro Gly Glu Ala Gly Arg Asp Gly Asn Pro Gly His Asp Gly
            900                 905                 910         
Ala Pro Gly Arg Asp Gly Ala Pro Gly Phe Lys Gly Glu Arg Gly Ala
        915                 920                 925             
Pro Gly Asn Asn Gly Pro Ala Gly Ala Val Gly Ala Pro Gly Ala His
    930                 935                 940                 
Gly Gln Val Gly Pro Ala Gly Lys Pro Gly Asn Arg Gly Asp Pro Gly
945                 950                 955                 960 
Pro Val Gly Pro Ser Gly Pro Ala Gly Ala Phe Gly Ala Arg Gly Pro
                965                 970                 975     
Ser Gly Pro Gln Gly Ala Arg Gly Glu Lys Gly Glu Thr Gly Glu Lys
            980                 985                 990         
Gly His Arg Gly Met Pro Gly Phe Lys Gly His Asn Gly Leu Gln Gly
        995                 1000                1005            
Leu Pro Gly Leu Ala Gly Gln His Gly Asp Gln Gly Pro Pro Gly Ser
    1010                1015                1020                
Thr Gly Pro Ala Gly Pro Arg Gly Pro Ser Gly Pro Ser Gly Pro Ala
1025                1030                1035               1040 
Gly Lys Asp Gly Arg Asn Gly Leu Pro Gly Pro Ile Gly Pro Ala Gly
                1045                1050                1055    
Val Arg Gly Ser Gln Gly Ser Gln Gly Pro Ser Gly Pro Pro Gly Pro
            1060                1065                1070        
Pro Gly Leu Pro Gly Pro Pro Gly Ala Asn Gly Gly Gly Tyr Glu Val
        1075                1080                1085            
Gly Tyr Asp Leu Glu Tyr Tyr Arg Ala Asp Gln Pro Ala Leu Arg Pro
    1090                1095                1100                
Lys Asp Tyr Glu Val Asp Ala Thr Leu Lys Thr Leu Asn Asn Gln Ile
1105                1110                1115               1120 
Glu Thr Leu Leu Thr Pro Glu Gly Ser Arg Lys Asn Pro Ala Arg Thr
                1125                1130                1135    
Cys Arg Asp Leu Arg Leu Ser His Pro Glu Trp Thr Ser Gly Phe Tyr
            1140                1145                1150        
Trp Ile Asp Pro Asn Gln Gly Cys Thr Met Asp Ala Ile Arg Val Tyr
        1155                1160                1165            
Cys Asp Phe Ser Thr Gly Glu Thr Cys Ile His Ala Asn Leu Glu Asn
    1170                1175                1180                
Ile Pro Thr Lys Asn Trp Tyr Val Ser Lys Asn Ser Lys Glu Lys Lys
1185                1190                1195               1200 
His Met Trp Phe Gly Glu Thr Ile Asn Gly Gly Thr Gln Phe Glu Tyr
                1205                1210                1215    
Asn Asp Glu Gly Val Thr Ser Lys Asp Met Ala Thr Gln Leu Ala Phe
            1220                1225                1230        
Met Arg Leu Leu Ala Asn His Ala Ser Gln Asn Ile Thr Tyr His Cys
        1235                1240                1245            
Lys Asn Ser Ile Ala Tyr Met Asp Glu Glu Thr Gly Asn Leu Lys Lys
    1250                1255                1260                
Ala Val Ile Leu Gln Gly Ser Asn Asp Val Glu Leu Arg Ala Glu Gly
1265                1270                1275               1280 
Asn Ser Arg Phe Thr Phe Ser Val Leu Glu Asp Gly Cys Ser Arg Lys
                1285                1290                1295    
Asn Asn Ala Trp Gly Lys Thr Ile Ile Glu Tyr Arg Thr Asn Lys Pro
            1300                1305                1310        
Ser Arg Leu Pro Ile Leu Asp Ile Ala Pro Leu Asp Ile Gly Gly Ala
        1315                1320                1325            
Asp Gln Glu Phe Gly Leu Asp Ile Gly Pro Val Cys Phe Lys Gly Ser
    1330                1335                1340                
Glu Asn Leu Tyr Phe Gln Gly Arg Lys Gly Glu Glu Leu Phe Thr Gly
1345                1350                1355               1360 
Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys
                1365                1370                1375    
Phe Ser Val Arg Gly Glu Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu
            1380                1385                1390        
Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro
        1395                1400                1405            
Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr
    1410                1415                1420                
Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu
1425                1430                1435               1440 
Gly Tyr Val Gln Glu Arg Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr
                1445                1450                1455    
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg
            1460                1465                1470        
Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly
        1475                1480                1485            
His Lys Leu Glu Tyr Asn Phe Asn Ser His Asn Val Tyr Ile Thr Ala
    1490                1495                1500                
Asp Lys Gln Lys Asn Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn
1505                1510                1515               1520 
Val Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
                1525                1530                1535    
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
            1540                1545                1550        
Thr Gln Ser Val Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
        1555                1560                1565            
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
    1570                1575                1580                
Glu Leu Tyr Lys Gly Ser His His His His His His Leu Gly Arg Gly
1585                1590                1595               1600 
Arg Arg Ser Lys Leu
                1605


<210> 61
<211> 5118
<212> DNA
<213> Artificial Sequence

<220> 
<223> BtCOL1A1-TEV-GFP-HIS-ePTS1

<400> 61
atgcaagagg agggccagga agaaggccaa gaagaagaca tcccaccagt cacctgcgta 60
cagaacggcc tcaggtacca tgaccgagac gtgtggaaac ccgtgccctg ccagatctgt 120
gtctgcgaca acggcaacgt gctgtgcgat gacgtgatct gcgacgaact taaggactgt 180
cctaacgcca aagtccccac ggacgaatgc tgccccgtct gccccgaagg ccaggaatca 240
cccacggacc aagaaaccac cggagtcgag ggaccgaaag gagacactgg cccccgaggc 300
ccaaggggac ccgccggccc ccccggccga gatggcatcc ctggacaacc tggacttccc 360
ggaccccctg gaccccccgg acctcccgga ccccctggcc tcggaggaaa ctttgctccc 420
cagttgtctt acggctatga tgagaaatca acaggaattt ccgtgcctgg tcccatgggt 480
ccttctggtc ctcgtggtct ccctggcccc cctggcgcac ctggtcccca aggtttccaa 540
ggcccccctg gtgagcctgg cgagccagga gcctcaggtc ccatgggtcc ccgtggtccc 600
cctggccccc ctggcaagaa cggagatgat ggcgaagctg gaaagcctgg tcgtcctggt 660
gagcgcgggc ctcccggacc tcagggtgct cggggattgc ctggaacagc tggcctccct 720
ggaatgaagg gacacagagg tttcagtggt ttggatggtg ccaagggaga tgctggtcct 780
gctggcccca agggcgagcc tggtagcccc ggtgaaaatg gagctcctgg tcagatgggc 840
ccccgtggtc tgcctggtga gagaggtcgc cctggagccc ctggccctgc tggtgctcga 900
ggaaatgatg gtgcgactgg tgctgctggg ccccctggtc ccactggccc cgctggtcct 960
cctggtttcc ctggtgctgt gggtgctaag ggtgaaggtg gtccccaagg accccgaggt 1020
tctgaaggtc cccagggtgt acgtggtgag cctggccccc ctggccctgc tggtgctgct 1080
ggccctgctg gcaaccctgg tgctgatgga cagcctggtg ctaaaggagc caatggcgct 1140
cctggtattg ctggtgctcc tggcttccct ggtgcccgag gcccctctgg accccagggc 1200
cccagcggcc cccctggccc caagggtaac agcggtgaac ctggtgctcc tggcagcaaa 1260
ggagacactg gcgccaaggg agaacccggt cccactggta ttcaaggccc ccctggcccc 1320
gctggggaag aaggaaagcg aggagcccga ggtgaacctg gacctgctgg cctgcctgga 1380
ccccctggcg agcgtggtgg acctggaagc cgtggtttcc ctggcgccga cggtgttgct 1440
ggtcccaagg gtcctgctgg tgaacgcggt gctcctggcc ctgctggccc caaaggttct 1500
cctggtgaag ctggtcgccc cggtgaagct ggtctgcccg gtgccaaggg tctgactgga 1560
agccctggca gcccgggtcc tgatggcaaa actggccccc ctggtcccgc cggtcaagat 1620
ggccgccctg gacctccagg ccctcccggt gcccgtggtc aggctggcgt gatgggtttc 1680
cctggaccta aaggtgctgc tggagagcct ggaaaagctg gagagcgagg tgttcctgga 1740
ccccctggcg ctgttggtcc tgctggcaaa gacggagaag ctggagctca gggaccccca 1800
ggacctgctg gccccgctgg tgagagaggc gaacaaggcc ctgctggctc ccctggattc 1860
cagggtctcc ccggccctgc tggtcctcct ggtgaagcag gcaaacctgg tgaacagggt 1920
gttcctggag atcttggtgc ccccggcccc tctggagcaa gaggcgagag aggtttcccc 1980
ggcgagcgtg gtgtgcaagg gccgcccggt cctgcaggtc cccgtggggc caatggtgcc 2040
cctggcaacg atggtgctaa gggtgatgct ggtgcccctg gagcccccgg tagccagggt 2100
gcccctggcc ttcaaggaat gcctggtgaa cgaggtgcag ctggtcttcc aggccctaag 2160
ggtgacagag gggatgctgg tcccaaaggt gctgatggtg ctcctggcaa agatggcgtc 2220
cgtggtctga ctggtcccat cggtcctcct ggccccgctg gtgcccctgg tgacaagggt 2280
gaagctggtc ctagtggccc agccggtccc actggagctc gtggtgcccc cggtgaccgt 2340
ggtgagcctg gtccccccgg ccctgctggc ttcgctggcc cccctggtgc tgatggccaa 2400
cctggtgcta aaggcgaacc tggtgatgct ggtgctaaag gtgacgctgg tccccccggc 2460
cctgctgggc ccgctggacc ccccggcccc attggtaacg ttggtgctcc cggacccaaa 2520
ggtgctcgtg gcagcgctgg tccccctggt gctactggtt tcccaggtgc tgctggccga 2580
gtcggtcccc ccggcccctc tggaaatgct ggaccccctg gccctcctgg ccctgctggc 2640
aaagaaggca gcaaaggccc ccgcggtgag actggccccg ctgggcgtcc cggtgaagtc 2700
ggtccccctg gtccccctgg ccccgctggt gagaaaggag cccctggtgc tgacggacct 2760
gctggagctc ctggcactcc tggacctcaa ggtattgctg gacagcgtgg tgtggtcggc 2820
ctgcctggtc agagaggaga aagaggcttc cctggtcttc ctggcccctc tggtgaaccc 2880
ggcaaacaag gtccttctgg agcaagtggt gaacgtggcc cccctggtcc catgggcccc 2940
cctggattgg ctggaccccc tggcgagtct ggacgtgagg gagctcctgg tgctgaagga 3000
tcccctggac gagatggttc tcctggcgcc aagggtgacc gtggtgagac cggccctgct 3060
ggacctcctg gtgctcctgg cgctcccggt gcccccggcc ctgtcggacc tgccggcaag 3120
agcggtgatc gtggtgagac cggtcctgct ggtcctgctg gtcccattgg ccccgttggt 3180
gcccgtggcc ccgctggacc ccaaggcccc cgtggtgaca agggtgagac aggcgaacag 3240
ggcgacagag gcattaaggg tcaccgtggc ttctctggtc tccagggtcc ccccggccct 3300
cccggctctc ctggtgagca aggtccttcc ggagcctctg gtcctgctgg tccccgcggt 3360
ccccctggct ctgctggttc tcccggcaaa gatggactca atggtctccc aggccccatc 3420
ggtccccctg ggcctcgagg tcgcactggt gatgctggtc ctgctggtcc tcccggccct 3480
cctggacccc ctggtccccc aggtcctccc agcggcggct acgacttgag cttcctgccc 3540
cagccacctc aagagaaggc tcacgatggt ggccgctact accgggctga tgatgccaat 3600
gtggtccgtg accgtgacct cgaggtggac accaccctca agagcctgag ccagcagatc 3660
gagaacatcc ggagccctga aggcagccgc aagaaccccg cccgcacctg ccgtgacctc 3720
aagatgtgcc actctgactg gaagagcgga gaatactgga ttgaccccaa ccaaggctgc 3780
aacctggatg ccattaaggt cttctgcaac atggaaaccg gtgagacctg tgtatacccc 3840
actcagccca gcgtggccca gaagaactgg tatatcagca agaaccccaa ggaaaagagg 3900
cacgtctggt acggcgagag catgaccggc ggattccagt tcgagtatgg cggccagggg 3960
tccgatcctg ccgatgtggc catccagctg actttcctgc gcctgatgtc caccgaggcc 4020
tcccagaaca tcacctacca ctgcaagaac agcgtggcct acatggacca gcagactggc 4080
aacctcaaga aggccctgct cctccagggc tccaacgaga tcgagatccg ggccgagggc 4140
aacagccgct tcacctacag cgtcacctac gatggctgca cgagtcacac cggagcctgg 4200
ggcaagacag tgatcgaata caaaaccacc aagacctccc gcttgcccat catcgatgtg 4260
gcccccttgg acgttggcgc cccagaccag gaattcggct tcgacgttgg ccctgcctgc 4320
ttcctgggtt ctgagaatct ttattttcag ggccgtaaag gcgaagagct gttcactggt 4380
gtcgtcccta ttctggtgga actggatggt gatgtcaacg gtcataagtt ttccgtgcgt 4440
ggcgagggtg aaggtgacgc aactaatggt aaactgacgc tgaagttcat ctgtactact 4500
ggtaaactgc cggttccttg gccgactctg gtaacgacgc tgacttatgg tgttcagtgc 4560
tttgctcgtt atccggacca tatgaagcag catgacttct tcaagtccgc catgccggaa 4620
ggctatgtgc aggaacgcac gatttccttt aaggatgacg gcacgtacaa aacgcgtgcg 4680
gaagtgaaat ttgaaggcga taccctggta aaccgcattg agctgaaagg cattgacttt 4740
aaagaggacg gcaatatcct gggccataag ctggaataca attttaacag ccacaatgtt 4800
tacatcaccg ccgataaaca aaaaaatggc attaaagcga attttaaaat tcgccacaac 4860
gtggaggatg gcagcgtgca gctggctgat cactaccagc aaaacactcc aatcggtgat 4920
ggtcctgttc tgctgccaga caatcactat ctgagcacgc aaagcgttct gtctaaagat 4980
ccgaacgaga aacgcgatca tatggttctg ctggagttcg taaccgcagc gggcatcacg 5040
catggtatgg atgaactgta caaaggttct catcatcatc atcatcactt gggaagaggt 5100
agaagatcca aattgtaa                                               5118

<210> 62
<211> 1705
<212> PRT
<213> Artificial Sequence

<220> 
<223> BtCOL1A1-TEV-GFP-HIS-ePTS1

<400> 62
Met Gln Glu Glu Gly Gln Glu Glu Gly Gln Glu Glu Asp Ile Pro Pro
 1               5                  10                  15      
Val Thr Cys Val Gln Asn Gly Leu Arg Tyr His Asp Arg Asp Val Trp
            20                  25                  30          
Lys Pro Val Pro Cys Gln Ile Cys Val Cys Asp Asn Gly Asn Val Leu
        35                  40                  45              
Cys Asp Asp Val Ile Cys Asp Glu Leu Lys Asp Cys Pro Asn Ala Lys
    50                  55                  60                  
Val Pro Thr Asp Glu Cys Cys Pro Val Cys Pro Glu Gly Gln Glu Ser
65                  70                  75                  80  
Pro Thr Asp Gln Glu Thr Thr Gly Val Glu Gly Pro Lys Gly Asp Thr
                85                  90                  95      
Gly Pro Arg Gly Pro Arg Gly Pro Ala Gly Pro Pro Gly Arg Asp Gly
            100                 105                 110         
Ile Pro Gly Gln Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly Pro
        115                 120                 125             
Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala Pro Gln Leu Ser Tyr
    130                 135                 140                 
Gly Tyr Asp Glu Lys Ser Thr Gly Ile Ser Val Pro Gly Pro Met Gly
145                 150                 155                 160 
Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro Gly Ala Pro Gly Pro
                165                 170                 175     
Gln Gly Phe Gln Gly Pro Pro Gly Glu Pro Gly Glu Pro Gly Ala Ser
            180                 185                 190         
Gly Pro Met Gly Pro Arg Gly Pro Pro Gly Pro Pro Gly Lys Asn Gly
        195                 200                 205             
Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Pro
    210                 215                 220                 
Pro Gly Pro Gln Gly Ala Arg Gly Leu Pro Gly Thr Ala Gly Leu Pro
225                 230                 235                 240 
Gly Met Lys Gly His Arg Gly Phe Ser Gly Leu Asp Gly Ala Lys Gly
                245                 250                 255     
Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro Gly Ser Pro Gly Glu
            260                 265                 270         
Asn Gly Ala Pro Gly Gln Met Gly Pro Arg Gly Leu Pro Gly Glu Arg
        275                 280                 285             
Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Asn Asp Gly
    290                 295                 300                 
Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Pro
305                 310                 315                 320 
Pro Gly Phe Pro Gly Ala Val Gly Ala Lys Gly Glu Gly Gly Pro Gln
                325                 330                 335     
Gly Pro Arg Gly Ser Glu Gly Pro Gln Gly Val Arg Gly Glu Pro Gly
            340                 345                 350         
Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala Gly Asn Pro Gly Ala
        355                 360                 365             
Asp Gly Gln Pro Gly Ala Lys Gly Ala Asn Gly Ala Pro Gly Ile Ala
    370                 375                 380                 
Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro Ser Gly Pro Gln Gly
385                 390                 395                 400 
Pro Ser Gly Pro Pro Gly Pro Lys Gly Asn Ser Gly Glu Pro Gly Ala
                405                 410                 415     
Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys Gly Glu Pro Gly Pro Thr
            420                 425                 430         
Gly Ile Gln Gly Pro Pro Gly Pro Ala Gly Glu Glu Gly Lys Arg Gly
        435                 440                 445             
Ala Arg Gly Glu Pro Gly Pro Ala Gly Leu Pro Gly Pro Pro Gly Glu
    450                 455                 460                 
Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro Gly Ala Asp Gly Val Ala
465                 470                 475                 480 
Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro Gly Pro Ala Gly
                485                 490                 495     
Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg Pro Gly Glu Ala Gly Leu
            500                 505                 510         
Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro Gly Ser Pro Gly Pro Asp
        515                 520                 525             
Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly Gln Asp Gly Arg Pro Gly
    530                 535                 540                 
Pro Pro Gly Pro Pro Gly Ala Arg Gly Gln Ala Gly Val Met Gly Phe
545                 550                 555                 560 
Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly Lys Ala Gly Glu Arg
                565                 570                 575     
Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro Ala Gly Lys Asp Gly
            580                 585                 590         
Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Glu
        595                 600                 605             
Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly Leu Pro
    610                 615                 620                 
Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly
625                 630                 635                 640 
Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg Gly Glu
                645                 650                 655     
Arg Gly Phe Pro Gly Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Ala
            660                 665                 670         
Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly Asn Asp Gly Ala Lys Gly
        675                 680                 685             
Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser Gln Gly Ala Pro Gly Leu
    690                 695                 700                 
Gln Gly Met Pro Gly Glu Arg Gly Ala Ala Gly Leu Pro Gly Pro Lys
705                 710                 715                 720 
Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly Ala Asp Gly Ala Pro Gly
                725                 730                 735     
Lys Asp Gly Val Arg Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro
            740                 745                 750         
Ala Gly Ala Pro Gly Asp Lys Gly Glu Ala Gly Pro Ser Gly Pro Ala
        755                 760                 765             
Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly Asp Arg Gly Glu Pro Gly
    770                 775                 780                 
Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro Pro Gly Ala Asp Gly Gln
785                 790                 795                 800 
Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala Gly Ala Lys Gly Asp Ala
                805                 810                 815     
Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Pro Pro Gly Pro Ile Gly
            820                 825                 830         
Asn Val Gly Ala Pro Gly Pro Lys Gly Ala Arg Gly Ser Ala Gly Pro
        835                 840                 845             
Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro
    850                 855                 860                 
Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly
865                 870                 875                 880 
Lys Glu Gly Ser Lys Gly Pro Arg Gly Glu Thr Gly Pro Ala Gly Arg
                885                 890                 895     
Pro Gly Glu Val Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Glu Lys
            900                 905                 910         
Gly Ala Pro Gly Ala Asp Gly Pro Ala Gly Ala Pro Gly Thr Pro Gly
        915                 920                 925             
Pro Gln Gly Ile Ala Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln
    930                 935                 940                 
Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro
945                 950                 955                 960 
Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu Arg Gly Pro Pro Gly
                965                 970                 975     
Pro Met Gly Pro Pro Gly Leu Ala Gly Pro Pro Gly Glu Ser Gly Arg
            980                 985                 990         
Glu Gly Ala Pro Gly Ala Glu Gly Ser Pro Gly Arg Asp Gly Ser Pro
        995                 1000                1005            
Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Pro Gly
    1010                1015                1020                
Ala Pro Gly Ala Pro Gly Ala Pro Gly Pro Val Gly Pro Ala Gly Lys
1025                1030                1035               1040 
Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Ala Gly Pro Ile
                1045                1050                1055    
Gly Pro Val Gly Ala Arg Gly Pro Ala Gly Pro Gln Gly Pro Arg Gly
            1060                1065                1070        
Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg Gly Ile Lys Gly His
        1075                1080                1085            
Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly Pro Pro Gly Ser Pro
    1090                1095                1100                
Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro Ala Gly Pro Arg Gly
1105                1110                1115               1120 
Pro Pro Gly Ser Ala Gly Ser Pro Gly Lys Asp Gly Leu Asn Gly Leu
                1125                1130                1135    
Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly Arg Thr Gly Asp Ala
            1140                1145                1150        
Gly Pro Ala Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly
        1155                1160                1165            
Pro Pro Ser Gly Gly Tyr Asp Leu Ser Phe Leu Pro Gln Pro Pro Gln
    1170                1175                1180                
Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg Ala Asp Asp Ala Asn
1185                1190                1195               1200 
Val Val Arg Asp Arg Asp Leu Glu Val Asp Thr Thr Leu Lys Ser Leu
                1205                1210                1215    
Ser Gln Gln Ile Glu Asn Ile Arg Ser Pro Glu Gly Ser Arg Lys Asn
            1220                1225                1230        
Pro Ala Arg Thr Cys Arg Asp Leu Lys Met Cys His Ser Asp Trp Lys
        1235                1240                1245            
Ser Gly Glu Tyr Trp Ile Asp Pro Asn Gln Gly Cys Asn Leu Asp Ala
    1250                1255                1260                
Ile Lys Val Phe Cys Asn Met Glu Thr Gly Glu Thr Cys Val Tyr Pro
1265                1270                1275               1280 
Thr Gln Pro Ser Val Ala Gln Lys Asn Trp Tyr Ile Ser Lys Asn Pro
                1285                1290                1295    
Lys Glu Lys Arg His Val Trp Tyr Gly Glu Ser Met Thr Gly Gly Phe
            1300                1305                1310        
Gln Phe Glu Tyr Gly Gly Gln Gly Ser Asp Pro Ala Asp Val Ala Ile
        1315                1320                1325            
Gln Leu Thr Phe Leu Arg Leu Met Ser Thr Glu Ala Ser Gln Asn Ile
    1330                1335                1340                
Thr Tyr His Cys Lys Asn Ser Val Ala Tyr Met Asp Gln Gln Thr Gly
1345                1350                1355               1360 
Asn Leu Lys Lys Ala Leu Leu Leu Gln Gly Ser Asn Glu Ile Glu Ile
                1365                1370                1375    
Arg Ala Glu Gly Asn Ser Arg Phe Thr Tyr Ser Val Thr Tyr Asp Gly
            1380                1385                1390        
Cys Thr Ser His Thr Gly Ala Trp Gly Lys Thr Val Ile Glu Tyr Lys
        1395                1400                1405            
Thr Thr Lys Thr Ser Arg Leu Pro Ile Ile Asp Val Ala Pro Leu Asp
    1410                1415                1420                
Val Gly Ala Pro Asp Gln Glu Phe Gly Phe Asp Val Gly Pro Ala Cys
1425                1430                1435               1440 
Phe Leu Gly Ser Glu Asn Leu Tyr Phe Gln Gly Arg Lys Gly Glu Glu
                1445                1450                1455    
Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val
            1460                1465                1470        
Asn Gly His Lys Phe Ser Val Arg Gly Glu Gly Glu Gly Asp Ala Thr
        1475                1480                1485            
Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro
    1490                1495                1500                
Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys
1505                1510                1515               1520 
Phe Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser
                1525                1530                1535    
Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Ser Phe Lys Asp
            1540                1545                1550        
Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr
        1555                1560                1565            
Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly
    1570                1575                1580                
Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His Asn Val
1585                1590                1595               1600 
Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala Asn Phe Lys
                1605                1610                1615    
Ile Arg His Asn Val Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr
            1620                1625                1630        
Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn
        1635                1640                1645            
His Tyr Leu Ser Thr Gln Ser Val Leu Ser Lys Asp Pro Asn Glu Lys
    1650                1655                1660                
Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr
1665                1670                1675               1680 
His Gly Met Asp Glu Leu Tyr Lys Gly Ser His His His His His His
                1685                1690                1695    
Leu Gly Arg Gly Arg Arg Ser Lys Leu
            1700                1705


<210> 63
<211> 4080
<212> DNA
<213> Artificial Sequence

<220> 
<223> BtCOL1A2-TEV-GFP-HIS-ePTS1

<400> 63
atgcaatcct tacaagaggc aactgcaaga aagggcccaa gtggagatag aggaccacgc 60
ggagaaaggg gtccaccagg cccaccaggc agagatggtg atgacggcat cccaggccct 120
cctggccccc ctggccctcc tggcccccct ggtcttggcg ggaactttgc tgctcagttt 180
gatgcaaaag gaggtggccc tggaccaatg gggctgatgg gacctcgcgg ccctcctggg 240
gcttctggag cccctggccc tcaaggtttc cagggacctc cgggtgagcc tggtgaacct 300
ggtcagactg gtcctgcagg tgctcgtggc ccgcctggcc ctcctggcaa ggctggtgag 360
gatggtcacc ctggaaaacc tggacgacct ggtgagagag gggttgttgg accacagggt 420
gctcgtggct ttcctggaac tcctggactc cctggcttca agggcattag gggtcacaat 480
ggtctggatg gattgaaggg acagcctggt gctccaggtg tgaagggtga acctggtgcc 540
cctggtgaaa atggaactcc aggtcaaacg ggagcccgtg gtcttcctgg tgagagagga 600
cgtgttggtg cccctggccc agctggtgcc cgtggaagtg atggaagtgt gggtcctgtg 660
ggccctgctg gtcccattgg gtctgctggc cctccaggct tcccaggtgc tcctggcccc 720
aagggtgaac tcggacctgt tggtaaccct ggccctgctg gtcccgcggg tccccgtggt 780
gaagtgggtc tcccaggcct ttctggccct gtcggacctc ctggaaaccc cggagccaat 840
gggcttcctg gcgctaaggg tgctgctggc cttcccggtg ttgctggggc tcccggcctc 900
cctggacccc ggggtattcc tggccctgtt ggcgctgctg gtgctactgg cgccagagga 960
cttgttggtg agcccggccc agctggttcg aaaggagaga gcggcaacaa gggcgagcct 1020
ggtgctgttg ggcagccagg tcctcctggc cccagtggtg aagaaggaaa gagaggctcc 1080
actggagaaa tcggacccgc tggcccccca ggacctcctg ggctgagggg aaatcctggc 1140
tcccgtggtc tacctggagc tgacggcaga gctggtgtca tgggtcctgc tggtagccgt 1200
ggtgcaactg gccctgctgg tgtgcgaggt cccaatggag attctggtcg ccctggagag 1260
cctggcctca tgggaccccg aggtttccca ggttcccctg gaaatatcgg cccagctggt 1320
aaagaaggtc ctgtgggtct ccctggtatt gacggcagac ctgggcccat tggcccagcg 1380
ggagcaagag gagagcctgg caacattgga ttccctggac ccaaaggccc cagtggtgat 1440
cctggcaaag ctggtgaaaa aggtcatgct ggtcttgctg gtgctcgggg cgctccaggt 1500
cccgatggca acaacggtgc tcagggaccc cctggactac agggtgtcca aggtggaaaa 1560
ggtgaacagg gtcctgctgg tcctccaggc ttccagggtc tgcctggccc tgcaggcaca 1620
gctggtgaag ctggcaaacc aggagaaagg ggtatccctg gtgaatttgg tctccctggc 1680
cctgctggtg caagagggga gcgggggccc ccaggtgaaa gtggtgctgc tgggcctact 1740
gggcctattg gaagccgagg tccttctgga cccccagggc ctgatggaaa caagggtgaa 1800
ccgggtgtgg ttggcgctcc aggcactgct ggcccatctg gtcctagcgg actcccagga 1860
gagaggggtg cggctggcat tcctggaggc aagggagaaa agggtgaaac tggtctcaga 1920
ggtgacattg gtagccctgg tagagatggt gctcgtggtg ctcctggtgc tattggtgct 1980
cctggccctg ctggagccaa tggggaccgg ggtgaagctg gtcccgctgg ccctgctggc 2040
cctgctggtc ctcgtggtag ccctggtgaa cgtggtgagg tcggtcccgc tggccccaac 2100
ggatttgctg gtcctgctgg tgctgctggt caacctggtg ctaaaggaga gagaggaacc 2160
aaaggaccca agggtgaaaa tggtcctgtt ggtcccacag gccccgttgg agctgccggt 2220
ccgtctggtc caaatggccc acctggtcct gctggaagtc gtggtgatgg agggccccct 2280
ggggctactg gtttccctgg tgctgctgga cggactggtc cccctggacc ctctggtatc 2340
tctggccccc ctggcccccc tggtcctgct ggtaaagaag ggcttcgtgg gcctcgtggt 2400
gaccaaggtc cagttggtcg aagtggagag acaggtgcct ctggccctcc tggctttgtt 2460
ggtgagaagg gtccctctgg agagcctggt actgctgggc ctcctggaac cccaggtcca 2520
caaggccttc ttggtgctcc tggttttctg ggtctcccag gctctagagg tgagcgtggt 2580
ctaccaggtg tcgctggatc tgtgggtgaa cctggccccc tcggcatcgc aggcccacct 2640
ggggcccgtg gtccccctgg taatgtcggt aatcctggcg tcaatggtgc tcctggtgaa 2700
gccggtcgtg acggcaaccc tgggaatgac ggtcccccag gccgcgatgg tcaacccgga 2760
cacaaggggg agcgtggtta ccccggtaac gcaggtcctg ttggtgctgc cggtgctcct 2820
ggccctcaag gccctgtggg tcccgttggt aaacacggaa accgtggtga accgggtcct 2880
gccggtgctg ttggtcctgc tggtgccgtt ggcccaagag gtcccagtgg cccacaaggt 2940
attcgaggtg acaagggaga gcctggtgat aagggtccca gaggtcttcc tggcttaaag 3000
ggacacaatg ggttgcaagg tctcccgggt cttgctggtc atcatggcga tcaaggtgct 3060
cccggtgctg tgggtcccgc tggtcccagg ggccctgctg gtccttctgg ccccgctggc 3120
aaagacggtc gcattggaca gcctggtgca gtcggacctg ctggcattcg tggctctcag 3180
ggtagccaag gtcctgctgg ccctcctggt ccccctggcc ctcctggacc tcctggccca 3240
agtggtggtg gttacgagtt tggttttgat ggagacttct acagggctgg ttctgagaat 3300
ctttattttc agggccgtaa aggcgaagag ctgttcactg gtgtcgtccc tattctggtg 3360
gaactggatg gtgatgtcaa cggtcataag ttttccgtgc gtggcgaggg tgaaggtgac 3420
gcaactaatg gtaaactgac gctgaagttc atctgtacta ctggtaaact gccggttcct 3480
tggccgactc tggtaacgac gctgacttat ggtgttcagt gctttgctcg ttatccggac 3540
catatgaagc agcatgactt cttcaagtcc gccatgccgg aaggctatgt gcaggaacgc 3600
acgatttcct ttaaggatga cggcacgtac aaaacgcgtg cggaagtgaa atttgaaggc 3660
gataccctgg taaaccgcat tgagctgaaa ggcattgact ttaaagagga cggcaatatc 3720
ctgggccata agctggaata caattttaac agccacaatg tttacatcac cgccgataaa 3780
caaaaaaatg gcattaaagc gaattttaaa attcgccaca acgtggagga tggcagcgtg 3840
cagctggctg atcactacca gcaaaacact ccaatcggtg atggtcctgt tctgctgcca 3900
gacaatcact atctgagcac gcaaagcgtt ctgtctaaag atccgaacga gaaacgcgat 3960
catatggttc tgctggagtt cgtaaccgca gcgggcatca cgcatggtat ggatgaactg 4020
tacaaaggtt ctcatcatca tcatcatcac ttgggaagag gtagaagatc caaattgtaa 4080


<210> 64
<211> 1359
<212> PRT
<213> Artificial Sequence

<220> 
<223> BtCOL1A2-TEV-GFP-HIS-ePTS1

<400> 64
Met Gln Ser Leu Gln Glu Ala Thr Ala Arg Lys Gly Pro Ser Gly Asp
 1               5                  10                  15      
Arg Gly Pro Arg Gly Glu Arg Gly Pro Pro Gly Pro Pro Gly Arg Asp
            20                  25                  30          
Gly Asp Asp Gly Ile Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly
        35                  40                  45              
Pro Pro Gly Leu Gly Gly Asn Phe Ala Ala Gln Phe Asp Ala Lys Gly
    50                  55                  60                  
Gly Gly Pro Gly Pro Met Gly Leu Met Gly Pro Arg Gly Pro Pro Gly
65                  70                  75                  80  
Ala Ser Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly Glu
                85                  90                  95      
Pro Gly Glu Pro Gly Gln Thr Gly Pro Ala Gly Ala Arg Gly Pro Pro
            100                 105                 110         
Gly Pro Pro Gly Lys Ala Gly Glu Asp Gly His Pro Gly Lys Pro Gly
        115                 120                 125             
Arg Pro Gly Glu Arg Gly Val Val Gly Pro Gln Gly Ala Arg Gly Phe
    130                 135                 140                 
Pro Gly Thr Pro Gly Leu Pro Gly Phe Lys Gly Ile Arg Gly His Asn
145                 150                 155                 160 
Gly Leu Asp Gly Leu Lys Gly Gln Pro Gly Ala Pro Gly Val Lys Gly
                165                 170                 175     
Glu Pro Gly Ala Pro Gly Glu Asn Gly Thr Pro Gly Gln Thr Gly Ala
            180                 185                 190         
Arg Gly Leu Pro Gly Glu Arg Gly Arg Val Gly Ala Pro Gly Pro Ala
        195                 200                 205             
Gly Ala Arg Gly Ser Asp Gly Ser Val Gly Pro Val Gly Pro Ala Gly
    210                 215                 220                 
Pro Ile Gly Ser Ala Gly Pro Pro Gly Phe Pro Gly Ala Pro Gly Pro
225                 230                 235                 240 
Lys Gly Glu Leu Gly Pro Val Gly Asn Pro Gly Pro Ala Gly Pro Ala
                245                 250                 255     
Gly Pro Arg Gly Glu Val Gly Leu Pro Gly Leu Ser Gly Pro Val Gly
            260                 265                 270         
Pro Pro Gly Asn Pro Gly Ala Asn Gly Leu Pro Gly Ala Lys Gly Ala
        275                 280                 285             
Ala Gly Leu Pro Gly Val Ala Gly Ala Pro Gly Leu Pro Gly Pro Arg
    290                 295                 300                 
Gly Ile Pro Gly Pro Val Gly Ala Ala Gly Ala Thr Gly Ala Arg Gly
305                 310                 315                 320 
Leu Val Gly Glu Pro Gly Pro Ala Gly Ser Lys Gly Glu Ser Gly Asn
                325                 330                 335     
Lys Gly Glu Pro Gly Ala Val Gly Gln Pro Gly Pro Pro Gly Pro Ser
            340                 345                 350         
Gly Glu Glu Gly Lys Arg Gly Ser Thr Gly Glu Ile Gly Pro Ala Gly
        355                 360                 365             
Pro Pro Gly Pro Pro Gly Leu Arg Gly Asn Pro Gly Ser Arg Gly Leu
    370                 375                 380                 
Pro Gly Ala Asp Gly Arg Ala Gly Val Met Gly Pro Ala Gly Ser Arg
385                 390                 395                 400 
Gly Ala Thr Gly Pro Ala Gly Val Arg Gly Pro Asn Gly Asp Ser Gly
                405                 410                 415     
Arg Pro Gly Glu Pro Gly Leu Met Gly Pro Arg Gly Phe Pro Gly Ser
            420                 425                 430         
Pro Gly Asn Ile Gly Pro Ala Gly Lys Glu Gly Pro Val Gly Leu Pro
        435                 440                 445             
Gly Ile Asp Gly Arg Pro Gly Pro Ile Gly Pro Ala Gly Ala Arg Gly
    450                 455                 460                 
Glu Pro Gly Asn Ile Gly Phe Pro Gly Pro Lys Gly Pro Ser Gly Asp
465                 470                 475                 480 
Pro Gly Lys Ala Gly Glu Lys Gly His Ala Gly Leu Ala Gly Ala Arg
                485                 490                 495     
Gly Ala Pro Gly Pro Asp Gly Asn Asn Gly Ala Gln Gly Pro Pro Gly
            500                 505                 510         
Leu Gln Gly Val Gln Gly Gly Lys Gly Glu Gln Gly Pro Ala Gly Pro
        515                 520                 525             
Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Thr Ala Gly Glu Ala
    530                 535                 540                 
Gly Lys Pro Gly Glu Arg Gly Ile Pro Gly Glu Phe Gly Leu Pro Gly
545                 550                 555                 560 
Pro Ala Gly Ala Arg Gly Glu Arg Gly Pro Pro Gly Glu Ser Gly Ala
                565                 570                 575     
Ala Gly Pro Thr Gly Pro Ile Gly Ser Arg Gly Pro Ser Gly Pro Pro
            580                 585                 590         
Gly Pro Asp Gly Asn Lys Gly Glu Pro Gly Val Val Gly Ala Pro Gly
        595                 600                 605             
Thr Ala Gly Pro Ser Gly Pro Ser Gly Leu Pro Gly Glu Arg Gly Ala
    610                 615                 620                 
Ala Gly Ile Pro Gly Gly Lys Gly Glu Lys Gly Glu Thr Gly Leu Arg
625                 630                 635                 640 
Gly Asp Ile Gly Ser Pro Gly Arg Asp Gly Ala Arg Gly Ala Pro Gly
                645                 650                 655     
Ala Ile Gly Ala Pro Gly Pro Ala Gly Ala Asn Gly Asp Arg Gly Glu
            660                 665                 670         
Ala Gly Pro Ala Gly Pro Ala Gly Pro Ala Gly Pro Arg Gly Ser Pro
        675                 680                 685             
Gly Glu Arg Gly Glu Val Gly Pro Ala Gly Pro Asn Gly Phe Ala Gly
    690                 695                 700                 
Pro Ala Gly Ala Ala Gly Gln Pro Gly Ala Lys Gly Glu Arg Gly Thr
705                 710                 715                 720 
Lys Gly Pro Lys Gly Glu Asn Gly Pro Val Gly Pro Thr Gly Pro Val
                725                 730                 735     
Gly Ala Ala Gly Pro Ser Gly Pro Asn Gly Pro Pro Gly Pro Ala Gly
            740                 745                 750         
Ser Arg Gly Asp Gly Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala
        755                 760                 765             
Ala Gly Arg Thr Gly Pro Pro Gly Pro Ser Gly Ile Ser Gly Pro Pro
    770                 775                 780                 
Gly Pro Pro Gly Pro Ala Gly Lys Glu Gly Leu Arg Gly Pro Arg Gly
785                 790                 795                 800 
Asp Gln Gly Pro Val Gly Arg Ser Gly Glu Thr Gly Ala Ser Gly Pro
                805                 810                 815     
Pro Gly Phe Val Gly Glu Lys Gly Pro Ser Gly Glu Pro Gly Thr Ala
            820                 825                 830         
Gly Pro Pro Gly Thr Pro Gly Pro Gln Gly Leu Leu Gly Ala Pro Gly
        835                 840                 845             
Phe Leu Gly Leu Pro Gly Ser Arg Gly Glu Arg Gly Leu Pro Gly Val
    850                 855                 860                 
Ala Gly Ser Val Gly Glu Pro Gly Pro Leu Gly Ile Ala Gly Pro Pro
865                 870                 875                 880 
Gly Ala Arg Gly Pro Pro Gly Asn Val Gly Asn Pro Gly Val Asn Gly
                885                 890                 895     
Ala Pro Gly Glu Ala Gly Arg Asp Gly Asn Pro Gly Asn Asp Gly Pro
            900                 905                 910         
Pro Gly Arg Asp Gly Gln Pro Gly His Lys Gly Glu Arg Gly Tyr Pro
        915                 920                 925             
Gly Asn Ala Gly Pro Val Gly Ala Ala Gly Ala Pro Gly Pro Gln Gly
    930                 935                 940                 
Pro Val Gly Pro Val Gly Lys His Gly Asn Arg Gly Glu Pro Gly Pro
945                 950                 955                 960 
Ala Gly Ala Val Gly Pro Ala Gly Ala Val Gly Pro Arg Gly Pro Ser
                965                 970                 975     
Gly Pro Gln Gly Ile Arg Gly Asp Lys Gly Glu Pro Gly Asp Lys Gly
            980                 985                 990         
Pro Arg Gly Leu Pro Gly Leu Lys Gly His Asn Gly Leu Gln Gly Leu
        995                 1000                1005            
Pro Gly Leu Ala Gly His His Gly Asp Gln Gly Ala Pro Gly Ala Val
    1010                1015                1020                
Gly Pro Ala Gly Pro Arg Gly Pro Ala Gly Pro Ser Gly Pro Ala Gly
1025                1030                1035               1040 
Lys Asp Gly Arg Ile Gly Gln Pro Gly Ala Val Gly Pro Ala Gly Ile
                1045                1050                1055    
Arg Gly Ser Gln Gly Ser Gln Gly Pro Ala Gly Pro Pro Gly Pro Pro
            1060                1065                1070        
Gly Pro Pro Gly Pro Pro Gly Pro Ser Gly Gly Gly Tyr Glu Phe Gly
        1075                1080                1085            
Phe Asp Gly Asp Phe Tyr Arg Ala Gly Ser Glu Asn Leu Tyr Phe Gln
    1090                1095                1100                
Gly Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1105                1110                1115               1120 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
                1125                1130                1135    
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
            1140                1145                1150        
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
        1155                1160                1165            
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
    1170                1175                1180                
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
1185                1190                1195               1200 
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
                1205                1210                1215    
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
            1220                1225                1230        
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
        1235                1240                1245            
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
    1250                1255                1260                
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
1265                1270                1275               1280 
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
                1285                1290                1295    
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
            1300                1305                1310        
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
        1315                1320                1325            
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Ser
    1330                1335                1340                
His His His His His His Leu Gly Arg Gly Arg Arg Ser Lys Leu
1345                1350                1355                


<210> 65
<211> 1092
<212> DNA
<213> Artificial Sequence

<220> 
<223> BtCOL1A1 403-0P-TEV-GFP-HIS-ePTS1

<400> 65
atgggatttg ctggccctaa aggagccgca ggagaggctg gtaaagcagg cgaacgtggc 60
gtagctggcc cagctggtgc tgtgggtcca gttggtaaag atggagaggc cggtgcccaa 120
ggtccagccg gtcctgtagg cccagcaggt gaaagaggag aacaaggtcc cgcaggctca 180
gctggattcc aaggattagc aggacctgcc ggcccagttg gcgaagccgg taaagccgga 240
gagcaaggcg tggctggtga cctaggtgca gttggcccta gtggcgccag gggagaaaga 300
ggttctgaga atctttattt tcagggccgt aaaggcgaag agctgttcac tggtgtcgtc 360
cctattctgg tggaactgga tggtgatgtc aacggtcata agttttccgt gcgtggcgag 420
ggtgaaggtg acgcaactaa tggtaaactg acgctgaagt tcatctgtac tactggtaaa 480
ctgccggttc cttggccgac tctggtaacg acgctgactt atggtgttca gtgctttgct 540
cgttatccgg accatatgaa gcagcatgac ttcttcaagt ccgccatgcc ggaaggctat 600
gtgcaggaac gcacgatttc ctttaaggat gacggcacgt acaaaacgcg tgcggaagtg 660
aaatttgaag gcgataccct ggtaaaccgc attgagctga aaggcattga ctttaaagag 720
gacggcaata tcctgggcca taagctggaa tacaatttta acagccacaa tgtttacatc 780
accgccgata aacaaaaaaa tggcattaaa gcgaatttta aaattcgcca caacgtggag 840
gatggcagcg tgcagctggc tgatcactac cagcaaaaca ctccaatcgg tgatggtcct 900
gttctgctgc cagacaatca ctatctgagc acgcaaagcg ttctgtctaa agatccgaac 960
gagaaacgcg atcatatggt tctgctggag ttcgtaaccg cagcgggcat cacgcatggt 1020
atggatgaac tgtacaaagg ttctcatcat catcatcatc acttgggaag aggtagaaga 1080
tccaaattgt aa                                                     1092

<210> 66
<211> 363
<212> PRT
<213> Artificial Sequence

<220> 
<223> BtCOL1A1 403-0P-TEV-GFP-HIS-ePTS1

<400> 66
Met Gly Phe Ala Gly Pro Lys Gly Ala Ala Gly Glu Ala Gly Lys Ala
 1               5                  10                  15      
Gly Glu Arg Gly Val Ala Gly Pro Ala Gly Ala Val Gly Pro Val Gly
            20                  25                  30          
Lys Asp Gly Glu Ala Gly Ala Gln Gly Pro Ala Gly Pro Val Gly Pro
        35                  40                  45              
Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Ala Gly Phe Gln
    50                  55                  60                  
Gly Leu Ala Gly Pro Ala Gly Pro Val Gly Glu Ala Gly Lys Ala Gly
65                  70                  75                  80  
Glu Gln Gly Val Ala Gly Asp Leu Gly Ala Val Gly Pro Ser Gly Ala
                85                  90                  95      
Arg Gly Glu Arg Gly Ser Glu Asn Leu Tyr Phe Gln Gly Arg Lys Gly
            100                 105                 110         
Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly
        115                 120                 125             
Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu Gly Glu Gly Asp
    130                 135                 140                 
Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys
145                 150                 155                 160 
Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val
                165                 170                 175     
Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe
            180                 185                 190         
Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Ser Phe
        195                 200                 205             
Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly
    210                 215                 220                 
Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu
225                 230                 235                 240 
Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His
                245                 250                 255     
Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala Asn
            260                 265                 270         
Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val Gln Leu Ala Asp
        275                 280                 285             
His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro
    290                 295                 300                 
Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser Lys Asp Pro Asn
305                 310                 315                 320 
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly
                325                 330                 335     
Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Ser His His His His
            340                 345                 350         
His His Leu Gly Arg Gly Arg Arg Ser Lys Leu
        355                 360             


<210> 67
<211> 1092
<212> DNA
<213> Artificial Sequence

<220> 
<223> BtCOL1A1 403-11P-TEV-GFP-HIS-ePTS1

<400> 67
atgggatttc ctggccctaa gggagccgca ggagagcccg gtaaagcagg cgaaagaggc 60
gtacctggtc cacccggtgc tgtgggtcca gctggtaaag atggagaggc cggtgcccaa 120
ggtcctcctg gtcctgctgg cccagcaggt gaaagaggag aacaaggtcc cgcaggctca 180
cctggattcc aaggattacc aggtccagcc ggaccacctg gcgaagccgg taaacccgga 240
gagcaaggcg tgcctggtga cctaggtgca ccaggaccta gtggcgccag gggagaaaga 300
ggttctgaga atctttattt tcagggccgt aaaggcgaag agctgttcac tggtgtcgtc 360
cctattctgg tggaactgga tggtgatgtc aacggtcata agttttccgt gcgtggcgag 420
ggtgaaggtg acgcaactaa tggtaaactg acgctgaagt tcatctgtac tactggtaaa 480
ctgccggttc cttggccgac tctggtaacg acgctgactt atggtgttca gtgctttgct 540
cgttatccgg accatatgaa gcagcatgac ttcttcaagt ccgccatgcc ggaaggctat 600
gtgcaggaac gcacgatttc ctttaaggat gacggcacgt acaaaacgcg tgcggaagtg 660
aaatttgaag gcgataccct ggtaaaccgc attgagctga aaggcattga ctttaaagag 720
gacggcaata tcctgggcca taagctggaa tacaatttta acagccacaa tgtttacatc 780
accgccgata aacaaaaaaa tggcattaaa gcgaatttta aaattcgcca caacgtggag 840
gatggcagcg tgcagctggc tgatcactac cagcaaaaca ctccaatcgg tgatggtcct 900
gttctgctgc cagacaatca ctatctgagc acgcaaagcg ttctgtctaa agatccgaac 960
gagaaacgcg atcatatggt tctgctggag ttcgtaaccg cagcgggcat cacgcatggt 1020
atggatgaac tgtacaaagg ttctcatcat catcatcatc acttgggaag aggtagaaga 1080
tccaaattgt aa                                                     1092

<210> 68
<211> 363
<212> PRT
<213> Artificial Sequence

<220> 
<223> BtCOL1A1 403-11P-TEV-GFP-HIS-ePTS1

<400> 68
Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly Lys Ala
 1               5                  10                  15      
Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro Ala Gly
            20                  25                  30          
Lys Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala Gly Pro
        35                  40                  45              
Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly Phe Gln
    50                  55                  60                  
Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro Gly
65                  70                  75                  80  
Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly Ala
                85                  90                  95      
Arg Gly Glu Arg Gly Ser Glu Asn Leu Tyr Phe Gln Gly Arg Lys Gly
            100                 105                 110         
Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly
        115                 120                 125             
Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu Gly Glu Gly Asp
    130                 135                 140                 
Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys
145                 150                 155                 160 
Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val
                165                 170                 175     
Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe
            180                 185                 190         
Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Ser Phe
        195                 200                 205             
Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly
    210                 215                 220                 
Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu
225                 230                 235                 240 
Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His
                245                 250                 255     
Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala Asn
            260                 265                 270         
Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val Gln Leu Ala Asp
        275                 280                 285             
His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro
    290                 295                 300                 
Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser Lys Asp Pro Asn
305                 310                 315                 320 
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly
                325                 330                 335     
Ile Thr His Gly Met Asp Glu Leu Tyr Lys Gly Ser His His His His
            340                 345                 350         
His His Leu Gly Arg Gly Arg Arg Ser Lys Leu
        355                 360             


<210> 69
<211> 81
<212> DNA
<213> Artificial Sequence

<220> 
<223> T4 fibritin foldon domain

<400> 69
ggttacatcc ccgaagctcc tcgtgacggc caggcttacg tcaggaaaga tggcgagtgg 60
gttcttttgt ccacttttct g                                           81

<210> 70
<211> 27
<212> PRT
<213> Artificial Sequence

<220> 
<223> T4 fibritin foldon domain

<400> 70
Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys
 1               5                  10                  15      
Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu
            20                  25          

<210> 71
<211> 99
<212> PRT
<213> Artificial Sequence

<220> 
<223> Collagen substrate 1

<400> 71
Gly Phe Ala Gly Pro Lys Gly Ala Ala Gly Glu Ala Gly Lys Ala Gly
 1               5                  10                  15
Glu Arg Gly Val Ala Gly Pro Ala Gly Ala Val Gly Pro Val Gly Lys
            20                  25                  30
Asp Gly Glu Ala Gly Ala Gln Gly Pro Ala Gly Pro Val Gly Pro Ala
        35                  40                  45
Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Ala Gly Phe Gln Gly
    50                  55                  60
Leu Ala Gly Pro Ala Gly Pro Val Gly Glu Ala Gly Lys Ala Gly Glu
65                  70                  75                  80
Gln Gly Val Ala Gly Asp Leu Gly Ala Val Gly Pro Ser Gly Ala Arg
                85                  90                  95
Gly Glu Arg


<210> 72
<211> 99
<212> PRT
<213> Artificial Sequence

<220> 
<223> Collagen substrate 2

<400> 72
Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly Lys Ala Gly
 1               5                  10                  15
Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro Ala Gly Lys
            20                  25                  30
Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala Gly Pro Ala
        35                  40                  45
Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly Phe Gln Gly
    50                  55                  60
Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys Pro Gly Glu
65                  70                  75                  80
Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser Gly Ala Arg
                85                  90                  95
Gly Glu Arg


<210> 73
<211> 6481
<212> DNA
<213> Artificial Sequence

<220> 
<223> OgPDI Construct

<400> 73
ttgcttgtgt gcagttgcgg tgtttttcgg gggaggtttc gtttattcgc ttttcaaata 60
taaccagggc cgtcaatcag tttctcactg gaaacgcaat tccttcacag ctggagtgaa 120
aagccttgct caggtaagtc aacgtgaaga gatttacaaa taacgagata ctaatatcga 180
ctcaatcggc tatagaggga atttatgcac aatcgatacc gtgataatgt gccagaattc 240
aatagctttt tcagctcatt ttcaggagat gaaaacgagt tgattgatgc actggacgcg 300
caacagaatg aaaactaata gaccatttag cagtctatat caaaggtgaa caattaagtt 360
acatcttatt acccggcatt aacgttccgg gtaaccaaac ttataaggcc gaaaattttt 420
gcgttgaggc gaactgaata aggtgggaat tgacttgaaa aattttcttc agcaggtttt 480
tcaaagactg aatactataa atatgatatg caatggtcaa tggaccagtg tcggctcagc 540
cggcttgtac tacaccatca aggctgatag tatgtgcgta gacatacact atacggacgg 600
cttcattcaa ccttcctgtc agggcctaca ggttattgga ccatgcaacc gttatcagaa 660
cggccctagg gattttgtag cctgccagac atctggaggt agtggccacc ccatctgtat 720
tcagtccacc aacggcaaca ttgagctgtg cgctaactgt tactgtcctc aaggttctga 780
gaatctttat tttcagggcc gtaaaggcga agagctgttc actggtgtcg tccctattct 840
ggtggaactg gatggtgatg tcaacggtca taagttttcc gtgcgtggcg agggtgaagg 900
tgacgcaact aatggtaaac tgacgctgaa gttcatctgt actactggta aactgccggt 960
tccttggccg actctggtaa cgacgctgac ttatggtgtt cagtgctttg ctcgttatcc 1020
ggaccatatg aagcagcatg acttcttcaa gtccgccatg ccggaaggct atgtgcagga 1080
acgcacgatt tcctttaagg atgacggcac gtacaaaacg cgtgcggaag tgaaatttga 1140
aggcgatacc ctggtaaacc gcattgagct gaaaggcatt gactttaaag aggacggcaa 1200
tatcctgggc cataagctgg aatacaattt taacagccac aatgtttaca tcaccgccga 1260
taaacaaaaa aatggcatta aagcgaattt taaaattcgc cacaacgtgg aggatggcag 1320
cgtgcagctg gctgatcact accagcaaaa cactccaatc ggtgatggtc ctgttctgct 1380
gccagacaat cactatctga gcacgcaaag cgttctgtct aaagatccga acgagaaacg 1440
cgatcatatg gttctgctgg agttcgtaac cgcagcgggc atcacgcatg gtatggatga 1500
actgtacaaa ggttctcatc atcatcatca tcacttggga agaggtagaa gatccaaatt 1560
gtaactcgag agcttttgat taagccttct agtccaaaaa acacgttttt ttgtcattta 1620
tttcattttc ttagaatagt ttagtttatt cattttatag tcacgaatgt tttatgattc 1680
tatatagggt tgcaaacaag catttttcat tttatgttaa aacaatttca ggtttacctt 1740
ttattctgct tgtggtgacg cgtgtatccg cccgctcttt tggtcaccca tgtatgctga 1800
cggggtcatc acggctcatc atgcgccaaa caaatgtgtg caatacacgc tcggatgact 1860
gcatgatgac cgcactgact ggggacagca gatccaccta agcctgtgag agaagcagac 1920
acccgacaga tcaaggcagt taaacgcctt gccaacaggg agttcttcag agacatggag 1980
gctcaaaacg aaattattga cagcctagac atcaatagtc atacaacaga aagcgaccac 2040
ccaactttgg ctgataatag cgtataaaca atgcatactt tgtacgttca aaatacaatg 2100
cagtagatat atttatgcat attacatata atacatatca cataggaagc aacaggcgcg 2160
ttggactttt aattttcgag gaccgcgaat ccttacatca cacccaatcc cccacaagtg 2220
atcccccaca caccatagct tcaaaatgtt tctactcctt ttttactctt ccagattttc 2280
tcggactccg cgcatcgccg taccacttca aaacacccaa gcacagcata ctaaatttcc 2340
cctctttctt cctctagggt gtcgttaatt acccgtacta aaggtttgga aaagaaaaaa 2400
gacaccgcct cgtttctttt tcttcgtcga aaaaggcaat aaaaattttt atcacgtttc 2460
tttttcttga aaattttttt ttttgatttt tttctctttc gatgacctcc cattgatatt 2520
taagttaata aacggtcatc aatttctcaa gtttcagttt catttttctt gttctattac 2580
aacttttttt acttcttgct cattagaaag aaagcatagc aatctaatct aagttttaat 2640
tacaaaagat ctatggtgtc caaaggagag gagttaatca aggaaaacat gagaatgaaa 2700
gttgtcatgg agggctccgt taatggtcac caattcaagt gtacagggga aggtgaaggt 2760
aatccttaca tgggtacaca aactatgaga attaaagtaa ttgaaggcgg accactacca 2820
tttgcatttg acattctggc aacgtcattc atgtacggat cacgaacttt catcaagtac 2880
cctaaaggta taccagactt tttcaagcaa tcttttccag agggttttac atgggaaagg 2940
gttacaagat acgaagatgg gggtgtcgtc acagttatgc aagatacttc attagaagat 3000
ggctgccttg tctatcatgt gcaagtaaga ggggtgaatt ttccttctaa cggacctgtg 3060
atgcagaaaa agaccaaagg ttgggaacca aatactgaaa tgatgtaccc agctgatgga 3120
ggtttgagag gctacacaca catggcgctt aaagttgatg gtggaggtca tttgtcttgt 3180
agttttgtta ccacttatcg ttctaaaaag actgttggca atatcaaaat gccaggaata 3240
catgctgtag accacagact agaaagactc gaagagagcg ataacgaaat gttcgttgta 3300
cagagagagc atgccgtagc caaatttgct ggcttaggcg gtggtatgga tgaattgtat 3360
aagggttctg ctgtggccaa aggtgacgcc gacgaagccg ccattgcgtc gccggattcc 3420
gctgttgtga agttgacagc agaatccttc gagtcgttta tcaaggagaa cccgctcgtt 3480
ttggctgagt tctttgcgcc atggtgtggc cactgcaagc gtcttggacc agagtttagc 3540
gctgctgccg acaaacttgt cgagaaagac atcaagttgg cccagattga ttgcacccaa 3600
gagagagatc tatgtgcgga ctatggtatc cgtggttatc catctctcaa ggtcttcaga 3660
ggcaacaaca cgccatccga gtaccagggc caaagagaac aagatgcaat tgtcagctac 3720
atgatcaagc aagccctacc tccagtgtcg ttgcttgagg atacggctga tctgctggac 3780
gctctggccg atctgagcga accaatgatc ttgcaagttc tgccacctga ctcgaagtct 3840
tccggcaacg aaacgttcca ttcgttggcc aaccgtctta gaaacgactt caggttcgtg 3900
tctacctcca accctgagta tgttgagaaa tacgtcaagg aaaagtccac tccaacctac 3960
gttgttttca gaccgggtga aaagattgag gacgcatctg ttctcaccaa caagactata 4020
gacgaagagg gattgcagag attcattagt gttgagacta agcctctttt cggcgaggtc 4080
accggtgcca cgttccaggc ctacatggac tccaaacttc ctttagcata ctttttctat 4140
gaagaggagt ctcagaaggc tgctgtcgca gacgaaatca ctaagttggc caagaaatat 4200
agaggcgaga tcaatttcgc cggactggag gccaagaaat acggaatgca cgctaaaaac 4260
ctcaacatgc aagaaaagtt cccactgttt gccattcacg atctgcaagg cgacctaaag 4320
tacggcatcc cacaagataa ggatctggac ttctctgaga ttcctaaatt tgtcgaaaac 4380
ttcaagaagg gcaagctgaa gccaattgtc aagagcgagc ctattccaga gactcaagag 4440
gaggctgtct accacttggt cggctacgag cacgacaaga tcgtcaacca aaagaaggac 4500
gttctggtcg agtactatgc tccatggtgt ggtcactgca agagacttgc ccctacatat 4560
gaggagctgg ctgctatcta caagaacgac accgctgcta gtgccaaggt cgtgatcgcc 4620
aagattgacc acaccgctaa cgatgttgcg ggcgtcgaga tcaccggata ccctaccatt 4680
ttcctttatc cagctgacgg ttctggtccg gtcaattacg agggacaaag aactttggag 4740
tccctagctt ctttcattca agagaagggt acctttggtg ttgacggttt ggccatcaga 4800
ggcgctaaga gcggcggagc tgataaaccg gagtccgaca ctaaggacag tactggatcc 4860
ttgggaagag gtagaagatc caaattgtaa ctcgagtggc gcgaatttct tatgatttat 4920
gatttttatt attaaataag ttataaaaaa aataagtgta tacaaatttt aaagtgactc 4980
ttaggtttta aaacgaaaat tcttattctt gagtaactct ttcctgtagg tcaggttgct 5040
ttctcaggta tagcatgagg tcgctcttat tgaccacacc tctaccggca tgccgagcaa 5100
atgcctgcaa atcgctcccc atttcgctgg aaatctgctc gtcagtggtg ctcacactga 5160
cgaatcatgt acagatcata ccgatgactg cctggcgact cacaactaag caagacagcc 5220
ggaaccagcg ccggcgaaca ccactgcata tatggcatat cacaacagtc cacgtctcaa 5280
gcagttacag agatgttacg aaccactagt gcactgcagt acagtttagc ttgcctcgtc 5340
cccgccgggt cacccggcca gcgacatgga ggcccagaat accctccttg acagtcttga 5400
cgtgcgcagc tcaggggcat gatgtgactg tcgcccgtac atttagccca tacatcccca 5460
tgtataatca tttgcatcca tacattttga tggccgcacg gcgcgaagca aaaattacgg 5520
ctcctcgctc cagacctgcg agcagggaaa cgctcccctc acagacgcgt tgaattgtcc 5580
ccacgccgcg cccctgtaga gaaatataaa aggttaggat ttgccactga ggttcttctt 5640
tcatatactt ccttttaaaa tcttgctagg atacagttct cacatcacat ccgaacataa 5700
acaaaaatgg gtactacctt agatgataca gcctacagat acagaacatc agtccctggt 5760
gatgctgaag caattgaggc tttagacggt tcattcacca ccgacaccgt ctttagagta 5820
accgccaccg gtgatggatt taccttaaga gaagtcccag tcgaccctcc attaactaaa 5880
gtctttccag atgatgaatc tgatgacgaa agcgacgacg gagaagatgg tgacccagat 5940
tcaagaactt tcgtagcata cggtgatgac ggtgatttgg ctggttttgt agtcgtttct 6000
tattcaggtt ggaatagaag gttgaccgtt gaagatatag aagtcgcccc agagcataga 6060
ggtcatggtg taggaagagc tttgatgggt ttggctacag aatttgcaag agagagagga 6120
gccggtcatt tatggttaga agttactaat gttaacgccc ctgctatcca tgcttataga 6180
agaatgggtt tcacattatg tggtttagat actgctttat atgatggaac agcatctgac 6240
ggtgaacagg ccttgtatat gtctatgcct tgcccttaaa gtaactgaca ataaaaagat 6300
tcttgttttc aagaacttgt catttgtata gtttttttat attgtagttg ttctatttta 6360
atcaaatgtt agcgtgattt atattttttt tcgcctcgac atcatctgcc cagatgcgaa 6420
gttaagtgcg cagaaagtaa tatcatgcgt caatcgtatg tgaatgctgg tcgctatact 6480
g                                                                 6481

<210> 74
<211> 54
<212> PRT
<213> Artificial Sequence

<220> 
<223> Human insulin precursor

<400> 74
Met Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu
 1               5                  10                  15
Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Ala Ala
            20                  25                  30
Lys Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln
        35                  40                  45
Phe Glu Asn Tyr Cys Asn
    50

<210> 75
<211> 166
<212> PRT
<213> Artificial Sequence

<220> 
<223> Alpha interferon

<400> 75
Met Lys Tyr Thr Ser Tyr Ile Leu Ala Phe Gln Leu Cys Ile Val Leu
 1               5                  10                  15
Gly Ser Leu Gly Cys Tyr Cys Gln Asp Pro Tyr Val Lys Glu Ala Glu
            20                  25                  30
Asn Leu Lys Lys Tyr Phe Asn Ala Gly His Ser Asp Val Ala Asp Asn
        35                  40                  45
Gly Thr Leu Phe Leu Gly Ile Leu Lys Asn Trp Lys Glu Glu Ser Asp
    50                  55                  60
Arg Lys Ile Met Gln Ser Gln Ile Val Ser Phe Tyr Phe Lys Leu Phe
65                  70                  75                  80
Lys Asn Phe Lys Asp Asp Gln Ser Ile Gln Lys Ser Val Glu Thr Ile
                85                  90                  95
Lys Glu Asp Met Asn Val Lys Phe Phe Asn Ser Asn Lys Lys Lys Arg
            100                 105                 110
Asp Asp Phe Glu Lys Leu Thr Asn Tyr Ser Val Thr Asp Leu Asn Val
        115                 120                 125
Gln Arg Lys Ala Ile His Glu Leu Ile Gln Val Met Ala Glu Leu Ser
    130                 135                 140
Pro Ala Ala Lys Thr Gly Lys Arg Lys Arg Ser Gln Met Leu Phe Arg
145                 150                 155                 160
Gly Arg Arg Ala Ser Gln
                165                                    

<210> 76
<211> 89
<212> PRT
<213> Artificial Sequence

<220> 
<223> Mapacalcine

<400> 76
Ile Cys Asn Gly Gln Trp Thr Ser Val Gly Ser Ala Gly Leu Tyr Tyr
 1               5                  10                  15
Thr Ile Lys Ala Asp Ser Met Cys Val Asp Ile His Tyr Thr Asp Gly
            20                  25                  30
Phe Ile Gln Pro Ser Cys Gln Gly Leu Gln Val Ile Gly Pro Cys Asn
        35                  40                  45
Arg Tyr Gln Asn Gly Pro Arg Asp Phe Val Ala Cys Gln Thr Ser Gly
    50                  55                  60
Gly Ser Gly His Pro Ile Cys Ile Gln Ser Thr Asn Gly Asn Ile Glu
65                  70                  75                  80
Leu Cys Ala Asn Cys Tyr Cys Pro Gln                                       
                85

<210> 77
<211> 496
<212> PRT
<213> Artificial Sequence

<220> 
<223> OgPDI

<400> 77
Met Ala Val Ala Lys Gly Asp Ala Asp Glu Ala Ala Ile Ala Ser Pro
 1               5                  10                  15
Asp Ser Ala Val Val Lys Leu Thr Ala Glu Ser Phe Glu Ser Phe Ile
            20                  25                  30
Lys Glu Asn Pro Leu Val Leu Ala Glu Phe Phe Ala Pro Trp Cys Gly
        35                  40                  45
His Cys Lys Arg Leu Gly Pro Glu Phe Ser Ala Ala Ala Asp Lys Leu
    50                  55                  60
Val Glu Lys Asp Ile Lys Leu Ala Gln Ile Asp Cys Thr Gln Glu Arg
65                  70                  75                  80
Asp Leu Cys Ala Asp Tyr Gly Ile Arg Gly Tyr Pro Ser Leu Lys Val
                85                  90                  95
Phe Arg Gly Asn Asn Thr Pro Ser Glu Tyr Gln Gly Gln Arg Glu Gln
            100                 105                 110
Asp Ala Ile Val Ser Tyr Met Ile Lys Gln Ala Leu Pro Pro Val Ser
        115                 120                 125
Leu Leu Glu Asp Thr Ala Asp Leu Leu Asp Ala Leu Ala Asp Leu Ser
    130                 135                 140
Glu Pro Met Ile Leu Gln Val Leu Pro Pro Asp Ser Lys Ser Ser Gly
145                 150                 155                 160
Asn Glu Thr Phe His Ser Leu Ala Asn Arg Leu Arg Asn Asp Phe Arg
                165                 170                 175
Phe Val Ser Thr Ser Asn Pro Glu Tyr Val Glu Lys Tyr Val Lys Glu
            180                 185                 190
Lys Ser Thr Pro Thr Tyr Val Val Phe Arg Pro Gly Glu Lys Ile Glu
        195                 200                 205
Asp Ala Ser Val Leu Thr Asn Lys Thr Ile Asp Glu Glu Gly Leu Gln
    210                 215                 220 
Arg Phe Ile Ser Val Glu Thr Lys Pro Leu Phe Gly Glu Val Thr Gly
225                 230                 235                 240
Ala Thr Phe Gln Ala Tyr Met Asp Ser Lys Leu Pro Leu Ala Tyr Phe
                245                 250                 255
Phe Tyr Glu Glu Glu Ser Gln Lys Ala Ala Val Ala Asp Glu Ile Thr
            260                 265                 270
Lys Leu Ala Lys Lys Tyr Arg Gly Glu Ile Asn Phe Ala Gly Leu Glu
        275                 280                 285
Ala Lys Lys Tyr Gly Met His Ala Lys Asn Leu Asn Met Gln Glu Lys
    290                 295                 300
Phe Pro Leu Phe Ala Ile His Asp Leu Gln Gly Asp Leu Lys Tyr Gly
305                 310                 315                 320
Ile Pro Gln Asp Lys Asp Leu Asp Phe Ser Glu Ile Pro Lys Phe Val
                325                 330                 335
Glu Asn Phe Lys Lys Gly Lys Leu Lys Pro Ile Val Lys Ser Glu Pro
            340                 345                 350
Ile Pro Glu Thr Gln Glu Glu Ala Val Tyr His Leu Val Gly Tyr Glu
        355                 360                 365
His Asp Lys Ile Val Asn Gln Lys Lys Asp Val Leu Val Glu Tyr Tyr
    370                 375                 380
Ala Pro Trp Cys Gly His Cys Lys Arg Leu Ala Pro Thr Tyr Glu Glu
385                 390                 395                 400
Leu Ala Ala Ile Tyr Lys Asn Asp Thr Ala Ala Ser Ala Lys Val Val
                405                 410                 415
Ile Ala Lys Ile Asp His Thr Ala Asn Asp Val Ala Gly Val Glu Ile
            420                 425                 430
Thr Gly Tyr Pro Thr Ile Phe Leu Tyr Pro Ala Asp Gly Ser Gly Pro
        435                 440                 445
Val Asn Tyr Glu Gly Gln Arg Thr Leu Glu Ser Leu Ala Ser Phe Ile
    450                 455                 460
Gln Glu Lys Gly Thr Phe Gly Val Asp Gly Leu Ala Ile Arg Asp Ala
465                 470                 475                 480
Lys Ser Gly Gly Ala Asp Lys Pro Glu Ser Asp Thr Lys Asp Ser Thr
                485                 490                 495

<210> 78
<211> 675
<212> DNA
<213> Artificial Sequence

<220> 
<223> Human beta-Casein II

<400> 78
atgaaagtcc ttattttagc ttgccttgtc gcattggctc tggcaagaga gacgattgaa 60
tcactaagta gttccgaaga aagtatcacc gaatataaaa aggtcgagaa ggtgaagcat 120
gaagaccagc aacagggcga agacgagcat caagacaaga tttaccctag tttccaacca 180
cagcctttaa tttatccctt cgtggaacca ataccatatg gcttcctgcc acaaaatatc 240
ctgcccttag cccaacccgc cgtcgttctg ccagtgcctc aacctgagat catggaagtt 300
ccaaaagcca aggatactgt ttatactaaa ggacgtgtga tgcctgtttt aaaatctccc 360
accattcctt tctttgatcc ccaaatccca aaacttactg accttgagaa cctacatcta 420
cccctaccac ttttacagcc actaatgcaa caggtgcctc agcctattcc tcagacccta 480
gctctaccac cacagcccct ttggtctgtc ccccaaccca aggttcttcc catacctcaa 540
caagtagttc catacccaca acgtgctgtc cctgtgcagg ctctgctact gaaccaggaa 600
ttgttactga atcctaccca ccaaatctac ccagtgactc agcccttagc cccagtacat 660
aatcccatca gtgtt                                                  675

<210> 79
<211> 645
<212> DNA
<213> Artificial Sequence

<220> 
<223> Casein Kinase II

<400> 79
atgtcatcct ccgaggaagt cagttggatc tcatggttct gcggtctgag gggcaacgag 60
tttttctgtg aggtagatga agactatatt caagacaagt tcaatctgac gggacttaat 120
gaacaggttc ctcactacag acaagcacta gacatgatat tagacctgga gcctgacgaa 180
gaactagaag ataaccccaa tcagtcagat ctaatcgaac aagccgcaga gatgttgtat 240
ggcttgatac acgccagata catattaact aaccgtggta ttgcacagat gttggaaaag 300
tatcagcaag gtgattttgg atattgcccc agagtatatt gcgagaacca acctatgtta 360
cccataggac tttctgatat tcctggagag gctatggtga aattgtactg ccctaaatgt 420
atggatgttt acactcctaa atcttcccgt catcatcata cggatggcgc ttattttgga 480
actggttttc cccacatgtt gttcatggtc caccctgagt ataggccaaa aagacctgca 540
aatcaatttg ttcctagact ttatggattt aagatacatc caatggctta ccaactgcag 600
ttacaagctg ctagtaattt taaatctcca gtcaaaacca ttaga                 645

<210> 80
<211> 1155
<212> DNA
<213> Artificial Sequence

<220> 
<223> Ovalbumin

<400> 80
ggttctatcg gagcagctag tatggaattc tgtttcgacg tgttcaagga attaaaagtc 60
catcatgcta atgaaaacat attctactgc ccaattgcca ttatgtcagc cctggccatg 120
gtgtacctag gtgccaaaga ttccacgaga actcaaataa ataaggttgt tagattcgac 180
aagttgccag gtttcggtga tagtattgaa gcccagtgcg gaacgtctgt taacgttcac 240
agttccctaa gagacatttt aaatcaaatc acaaagccca acgacgtgta ttcattttcc 300
ttagcctcca ggctgtacgc cgaggaaaga tatccaattt tgcccgaata cctgcagtgc 360
gtaaaagagc tgtatagagg cggacttgaa ccaataaatt tccagaccgc tgctgaccaa 420
gcccgtgagt tgataaactc ctgggtcgag tcccaaacta atggaatcat acgtaacgtg 480
ctgcaaccaa gtagtgttga ctcacagacc gcaatggttt tggttaacgc tatcgtattt 540
aaaggtttgt gggagaaagc attcaaagac gaagacacac aagccatgcc tttcagagtg 600
acggagcagg agagtaagcc agtacagatg atgtaccaga ttggcctttt cagagttgct 660
tccatggcct cagagaagat gaaaatcctg gagcttccat ttgcttctgg tactatgtcc 720
atgctagtgc ttctgcccga cgaagtgtct ggattggagc aattagaatc tatcataaac 780
tttgaaaagt taacagagtg gacttcttca aacgtaatgg aggaacgtaa gattaaagtt 840
tacctaccac gtatgaaaat ggaagaaaag tataatttga cctctgttct gatggcaatg 900
ggcataaccg acgtattcag ttcttcagca aatttatccg gcatatcctc tgcagaatct 960
ctaaagatat ctcaggccgt acacgccgct cacgcagaaa ttaacgaggc cggacgtgag 1020
gtagtaggat ccgctgaggc cggtgtggac gctgcatctg tgtctgagga gtttagagcc 1080
gaccaccctt ttctattctg tattaaacat atagccacta atgccgttct tttcttcggt 1140
aggtgcgtct cacca                                                  1155


<210> 81
<211> 540
<212> DNA
<213> Artificial Sequence

<220> 
<223> NatB complex Naa20

<400> 81
atgaccgata caagaaaatt taaggcaaca gatttatttt cctttaacaa cataaatctg 60
gacccactta ccgaaacatt taacatatca ttttaccttt catatcttaa caagtggccc 120
tccttatgcg tagtacaaga gtccgatctt tcagacccca cgttgatggg ttatattatg 180
ggcaagtccg aaggaacagg caaagagtgg cacacgcatg ttaccgccat cacagtggcc 240
ccaaattcac gtagattggg ccttgccagg acaatgatgg attacttgga gacggtcggt 300
aactctgaga acgccttctt cgtggacctg ttcgtcaggg catccaacgc cctagcaatt 360
gacttttaca aaggattggg ctactctgtc taccgtagag tgattggtta ttacagtaac 420
cctcatggca aagatgaaga ctctttcgac atgagaaaac cattgtctag ggatgtaaac 480
agggagtcaa tcagagaaaa cggtgagaac tttaagtgct cacccgcaga tgtgagtttt 540

<210> 82
<211> 2433
<212> DNA
<213> Artificial Sequence

<220> 
<223> NatB complex Naa25

<400> 82
atgaggagga gtggctccaa ggagagtact attgtatact cagctttgag tcttgctcag 60
gccggaagag gacccgaagc actggccttg cttgagcccc tgaagtcaac tccaatcaat 120
tctcttgagt tattggatat catacaggct gtatacgatg atcaaaagaa aggagaagaa 180
tccttcgttt tctgggagaa gttccttcag acttatggta agcaggagaa aaatttactg 240
gcttacttca aggcttctat tagaattaaa tctctgagtc accaacgtaa ggctgcagtg 300
gagctgcaaa agaactttcc aagtaggaaa cacacgttat gggttattag tagtctttat 360
ttactgtcca aaaaatccga gaacgaagtt gagcagaggc tgctgaaggc tctagctgaa 420
aaaaccgcta aattaatttt tgaaaaacca accggatata ttgattcttg tgaggaattc 480
catttgtatc tggacgtact acttctagta ggtgataaag atagggctct tgacgcttta 540
attcaccaag acgcagatag attcgtcgac gctgatgctg acctactgct tcgtaaatta 600
gaactattag caagttgtgc aagatgggat tcactgttta ctttctccct gagtttgttc 660
cagactggta acacggactg gaaagtctgt aaagcactac tagattctgc ctccaacgat 720
gatagtaagt tagtaccatt gaaggattgc atacttaaag cattatctac gtcttctact 780
aaaagaaatt tacacctgct atggattgaa gcatccgcac gtttctttcc cgaggaacac 840
gagtcagcac tattaggcta cataaagaaa ctttacatga agcccattgt cttcgaggat 900
cttaggccat atctgctgaa actaaacgtc gatgcacagc accgtttgtt ggacgctttc 960
aagctagctg atttgggcga gtcaaatgag tcacagaagg tcgataaatt atacgctgag 1020
gttcttctgt tgaaaatcca cttcctgcta ttcgagagtt tcacagccga gagtgtggta 1080
gactacgttc gtcgttgttt cgttgccttt gagaaaggac tttcactgtc taaaggactt 1140
ttgccaacgg acttcactca tggatatgag gctcttctgt tagcagtaca ttcccttatt 1200
tatatgtggg aaggtaacaa ggatttaaaa ccagcagaaa agcaggcatt aattttcgac 1260
gctatttgcc tgttagaaaa gggtataaca tacagtcaac ataatttcca cctgaaactt 1320
cccctgataa ggttatacct actgcttgac ggaggattcc ccgcagcagc aaaagtttac 1380
gatactatga gtataaagca aattcagaac gatacattag atcactattt actgaccagg 1440
gctaccacat attacccctc ttccgtcacg tcacattata taaattcatc cttaaagata 1500
tatggctcta acgagttcga aaccccagaa atgatttcta tggcatatga ggacggcgca 1560
tacagtcaga tcgaggacat gcgtaatttt agatctaggc ttgaccattc cacctggaag 1620
agtatatccc tagtcgaaag ggccaggata cactatctta ccgcatttaa gcctcctaaa 1680
cagtacctac ccaagtgttc cagtcctaaa gataaccgtg acctaaaggt gttcgctgat 1740
tacggatcag acaagcttcc taccgtggag gaaagtctaa ggaactcccc caagcccgat 1800
acgttgtgga tccacctaac tgtaatcggt cattccttag ttcaggatag tattgtgaat 1860
ggcgattttg agaaggccgt tctgtcagcc aaagaaatgg aagtcttgtg tgaaaataac 1920
gatctgtcta agcaactaac atcagaagag atcgtgcaca tgaagctact aatccaatta 1980
ggacttttaa gtgtgaaggt taagaatgga gattatgaaa actcctcttt tgagactatc 2040
gagaacctta tagaaagttt cgattatgaa aacagtactc ccctaagtca gttgacaaag 2100
tataccgaga tcatcaatga tttaatcacc tgcttgaact catttttgta tcatgtaagt 2160
gctactaaga aaaaggaatt cacacgtcaa taccagttgc tgaaaaatat aagttccaac 2220
aaacttggat caatctctgg tatcaccaaa cataagaaga aagctgccag aaagtacgtt 2280
tccgagctgt tgagtaattc ttggctaagt aacctatctg agacccaggt cccttacgat 2340
ccaaagtttg caaagcaagt gggtgagggt atgatcgact cttatataca gacaacggac 2400
gcagtgtcaa aattaccaaa gttcgtgaag ttt                              2433