﻿               SEQUENCE LISTING

<110> Universität Hamburg

<120> Flavonoide-type compounds bearing an O-rhamnosyl residue

<130> Y2387 PCT S3

<150> EP 16 15 1613.3  
<151> 2016-01-15

<160> 79

<170> BiSSAP 1.3

<210> 1
<211> 376
<212> PRT
<213> Artificial Sequence

<220> 
<223> variable sequence of glycosyl transferase

<220> 
<221> UNSURE
<222> 1..20
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 21
<223> Lys = Arg

<220> 
<221> UNSURE
<222> 26..27
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 29
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 34
<223> Asn = Ser

<220> 
<221> UNSURE
<222> 38
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 39
<223> Leu = Ile

<220> 
<221> UNSURE
<222> 41..46
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 48
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 53
<223> Tyr = Phe

<220> 
<221> UNSURE
<222> 54..84
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 85
<223> Phe = Tyr or Leu

<220> 
<221> VARIANT
<222> 87
<223> Glu = Asp

<220> 
<221> UNSURE
<222> 89..99
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 102
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> UNSURE
<222> 103..104
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 105
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> UNSURE
<222> 107..108
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 110..111
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 113
<223> Tyr = Phe

<220> 
<221> UNSURE
<222> 114
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> UNSURE
<222> 115
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 117..123
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 124
<223> Phe = Trp

<220> 
<221> UNSURE
<222> 127
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 128..130
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> UNSURE
<222> 131
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 132
<223> Asp = Glu

<220> 
<221> UNSURE
<222> 133..134
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 136..139
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 141..155
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 158
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 160
<223> Xaa = Asn, Cys, Gln, Gly, Ser, Thr or Tyr

<220> 
<221> UNSURE
<222> 161..163
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 165
<223> Pro = Ala

<220> 
<221> UNSURE
<222> 167
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 169
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 171..172
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 174..178
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 180
<223> Lys = Arg

<220> 
<221> UNSURE
<222> 181..229
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 232
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 233
<223> Gly = Cys

<220> 
<221> UNSURE
<222> 234
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 235
<223> Pro = Lys

<220> 
<221> VARIANT
<222> 238
<223> Glu = Asp

<220> 
<221> UNSURE
<222> 240
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 242..280
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 285
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> VARIANT
<222> 287
<223> Lys = Arg

<220> 
<221> UNSURE
<222> 288..290
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 292..294
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> VARIANT
<222> 301
<223> Arg = Lys

<220> 
<221> UNSURE
<222> 302..305
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 308..309
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> UNSURE
<222> 314..329
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 331
<223> Glu = Asp

<220> 
<221> UNSURE
<222> 337..338
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 339
<223> Val = Ile

<220> 
<221> UNSURE
<222> 342..343
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 346
<223> Tyr = Phe

<220> 
<221> VARIANT
<222> 347
<223> Ile = Val

<220> 
<221> VARIANT
<222> 348
<223> Thr = Ser

<220> 
<221> VARIANT
<222> 352
<223> Tyr = Phe

<220> 
<221> VARIANT
<222> 356
<223> Met = Leu

<220> 
<221> UNSURE
<222> 358
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 360
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 361
<223> Asn = His

<220> 
<221> UNSURE
<222> 362
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 365
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> UNSURE
<222> 367
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 370
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<400> 1
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  15      
Xaa Xaa Xaa Xaa Lys Ile Leu Phe Ala Xaa Xaa Pro Xaa Asp Gly His 
            20                  25                  30          
Phe Asn Pro Leu Thr Xaa Leu Ala Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa 
        35                  40                  45              
Asp Val Arg Trp Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
    50                  55                  60                  
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65                  70                  75                  80  
Xaa Xaa Xaa Xaa Phe Pro Glu Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
                85                  90                  95      
Xaa Xaa Xaa Phe Asp Xaa Xaa Xaa Xaa Phe Xaa Xaa Arg Xaa Xaa Glu 
            100                 105                 110         
Tyr Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Pro Phe Xaa Xaa 
        115                 120                 125             
Xaa Xaa Xaa Asp Xaa Xaa Phe Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa 
    130                 135                 140                 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Leu Xaa Glu Xaa 
145                 150                 155                 160 
Xaa Xaa Xaa Leu Pro Pro Xaa Gly Xaa Gly Xaa Xaa Pro Xaa Xaa Xaa 
                165                 170                 175     
Xaa Xaa Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
            180                 185                 190         
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
        195                 200                 205             
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
    210                 215                 220                 
Xaa Xaa Xaa Xaa Xaa Leu Gln Xaa Gly Xaa Pro Gly Phe Glu Tyr Xaa 
225                 230                 235                 240 
Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
                245                 250                 255     
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
            260                 265                 270         
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Gln Gly Thr Xaa Glu Lys Xaa 
        275                 280                 285             
Xaa Xaa Lys Xaa Xaa Xaa Pro Thr Leu Glu Ala Phe Arg Xaa Xaa Xaa 
    290                 295                 300                 
Xaa Leu Val Xaa Xaa Thr Thr Gly Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305                 310                 315                 320 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ile Glu Asp Phe Ile Pro Phe 
                325                 330                 335     
Xaa Xaa Val Met Pro Xaa Xaa Asp Val Tyr Ile Thr Asn Gly Gly Tyr 
            340                 345                 350         
Gly Gly Val Met Leu Xaa Ile Xaa Asn Xaa Leu Pro Xaa Val Xaa Ala 
        355                 360                 365             
Gly Xaa His Glu Gly Lys Asn Glu 
    370                 375     

<210> 2
<211> 1380
<212> DNA
<213> Artificial Sequence

<220> 
<223> variable sequence of glycosyl transferase

<220> 
<221> unsure
<222> 1..60
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 61..63
<223> /replace="mgr"

<220> 
<221> unsure
<222> 75..81
<223> /replace="a, t, g, or c"

<220> 
<221> unsure
<222> 84..87
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 93
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 100-102
<223> /note="tcn - n can be any of t, c, g or a"
      /replace="agy"
      /replace="tcn"

<220> 
<221> unsure
<222> 105
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 111-114
<223> /replace="t, g, a or c"

<220> 
<221> variation
<222> 115-117
<223> /replace="ath"

<220> 
<221> unsure
<222> 120-138
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 141
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 150
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 157-159
<223> /replace="tty"

<220> 
<221> unsure
<222> 160-207
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 208-210
<223> /replace="tay"
      /replace="ytr"

<220> 
<221> unsure
<222> 213
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 214-216
<223> /replace="gay"

<220> 
<221> unsure
<222> 220-297
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 304-315
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 319-324
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 329-333
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 337-339
<223> /replace="tty"

<220> 
<221> unsure
<222> 340-345
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 349-369
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 370-372
<223> /replace="tgg"

<220> 
<221> unsure
<222> 375
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 379-393
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 394-396
<223> /replace="gar"

<220> 
<221> unsure
<222> 397-402
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 406-417
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 421-465
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 468
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 472-474
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 481-489
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 493-495
<223> /note="n at position 495 = a, t, g or c"
      /replace="gcn"

<220> 
<221> unsure
<222> 498-501
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 504-507
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 510-516
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 519-534
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 537
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 538-540
<223> /replace="mgr"

<220> 
<221> unsure
<222> 541-687
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 694-696
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 697-699
<223> /note="n at position 699 = a, t, g or c"
      /replace="tgy"

<220> 
<221> unsure
<222> 700-702
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 703-705
<223> /note="n at position 705 = a, t, g or c"
      /replace="aar"

<220> 
<221> unsure
<222> 708
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 712-714
<223> /replace="gay"

<220> 
<221> unsure
<222> 718-720
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 724-840
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 843
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 849
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 852-855
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 862-870
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 871-873
<223> /replace="mgr"

<220> 
<221> unsure
<222> 874-882
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 885
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 888
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 897
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 901-903
<223> /replace="aar"

<220> 
<221> unsure
<222> 904-915
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 921-927
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 930
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 933
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 936
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 939-987
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 991-993
<223> /replace="gay"

<220> 
<221> unsure
<222> 999
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1005
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1008-1014
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 1015..1017
<223> /note="n at position 1017 = a,t, g or c"
      /replace="ath"

<220> 
<221> unsure
<222> 1023-1029
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1035
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 1036-1038
<223> /replace="tty"

<220> 
<221> variation
<222> 1039-1041
<223> /note="n on position 1041 = a, t, g or c"
      /replace="gtn"

<220> 
<221> variation
<222> 1042-1044
<223> /note="n on position 1043 = a, t, g or c"
      /replace="tcn"
      /replace="agy"

<220> 
<221> unsure
<222> 1050
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1053
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 1054-1056
<223> /replace="tty"

<220> 
<221> unsure
<222> 1059
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1062
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1065
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 1066-1067
<223> /replace="ytr"

<220> 
<221> unsure
<222> 1072-1074
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1078-1080
<223> /replace="a, t, g or c"

<220> 
<221> variation
<222> 1081-1083
<223> /replace="cay"

<220> 
<221> unsure
<222> 1084-1086
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1092-1095
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1098-1101
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1104
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1107-1110
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1119
<223> /replace="a, t, g or c"

<220> 
<221> unsure
<222> 1129-1380
<223> /replace="a, t, g or c"

<400> 2
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn      60

aarathytrt tygcnnnnnn nccnnnngay ggncayttya ayccnytrac nnnnytrgcn     120

nnnnnnnnnn nnnnnnnngg ntgtgaygtn mgrtggtayn nnnnnnnnnn nnnnnnnnnn     180

nnnnnnnnnn nnnnnnnnnn nnnnnnntty ccngarmgrn nnnnnnnnnn nnnnnnnnnn     240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnntty     300

gaynnnnnnn nnnnnttynn nnnnmgrnnn nnngartayn nnnnngaynn nnnnnnnnnn     360

nnnnnnnnnt tyccnttynn nnnnnnnnnn nnngaynnnn nnttynnnnn nnnnnnntty     420

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccnyt rnnngaragy     480

nnnnnnnnny trccnccnnn nggnnnnggn nnnnnnccnn nnnnnnnnnn nnnnggnaar     540

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn     600

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn     660

nnnnnnnnnn nnnnnnnnnn nnnnnnnytr carnnnggnn nnccnggntt ygartaynnn     720

mgrnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn     780

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn     840

acncarggna cnnnngaraa rnnnnnnnnn aarnnnnnnn nnccnacnyt rgargcntty     900

mgrnnnnnnn nnnnnytrgt nnnnnnnacn acnggnggnn nnnnnnnnnn nnnnnnnnnn     960

nnnnnnnnnn nnnnnnnnnn nnnnnnnath gargayttna thccnttnnn nnnngtnatg    1020

ccnnnnnnng aygtntayat hacnaayggn ggntayggng gngtnatgyt rnnnathnnn    1080

aaynnnytrc cnnnngtnnn ngcnggnnnn caygarggna araaygarnn nnnnnnnnnn    1140

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn    1200

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn    1260

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn    1320

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn    1380


<210> 3
<211> 459
<212> PRT
<213> Artificial Sequence

<220> 
<223> GTC

<400> 3
Met Ser Asn Leu Phe Ser Ser Gln Thr Asn Leu Ala Ser Val Lys Pro 
1               5                   10                  15      
Leu Lys Gly Arg Lys Ile Leu Phe Ala Asn Phe Pro Ala Asp Gly His 
            20                  25                  30          
Phe Asn Pro Leu Thr Gly Leu Ala Val His Leu Gln Trp Leu Gly Cys 
        35                  40                  45              
Asp Val Arg Trp Tyr Thr Ser Asn Lys Tyr Ala Asp Lys Leu Arg Arg 
    50                  55                  60                  
Leu Asn Ile Pro His Phe Pro Phe Arg Lys Ala Met Asp Ile Ala Asp 
65                  70                  75                  80  
Leu Glu Asn Met Phe Pro Glu Arg Asp Ala Ile Lys Gly Gln Val Ala 
                85                  90                  95      
Lys Leu Lys Phe Asp Ile Ile Asn Ala Phe Ile Leu Arg Gly Pro Glu 
            100                 105                 110         
Tyr Tyr Val Asp Leu Gln Glu Ile His Lys Ser Phe Pro Phe Asp Val 
        115                 120                 125             
Met Val Ala Asp Cys Ala Phe Thr Gly Ile Pro Phe Val Thr Asp Lys 
    130                 135                 140                 
Met Asp Ile Pro Val Val Ser Val Gly Val Phe Pro Leu Thr Glu Thr 
145                 150                 155                 160 
Ser Lys Asp Leu Pro Pro Ala Gly Leu Gly Ile Thr Pro Ser Phe Ser 
                165                 170                 175     
Leu Pro Gly Lys Phe Lys Gln Ser Ile Leu Arg Ser Val Ala Asp Leu 
            180                 185                 190         
Val Leu Phe Arg Glu Ser Asn Lys Val Met Arg Lys Met Leu Thr Glu 
        195                 200                 205             
His Gly Ile Asp His Leu Tyr Thr Asn Val Phe Asp Leu Met Val Lys 
    210                 215                 220                 
Lys Ser Thr Leu Leu Leu Gln Ser Gly Thr Pro Gly Phe Glu Tyr Tyr 
225                 230                 235                 240 
Arg Ser Asp Leu Gly Lys Asn Ile Arg Phe Ile Gly Ser Leu Leu Pro 
                245                 250                 255     
Tyr Gln Ser Lys Lys Gln Thr Thr Ala Trp Ser Asp Glu Arg Leu Asn 
            260                 265                 270         
Arg Tyr Glu Lys Ile Val Val Val Thr Gln Gly Thr Val Glu Lys Asn 
        275                 280                 285             
Ile Glu Lys Ile Leu Val Pro Thr Leu Glu Ala Phe Arg Asp Thr Asp 
    290                 295                 300                 
Leu Leu Val Ile Ala Thr Thr Gly Gly Ser Gly Thr Ala Glu Leu Lys 
305                 310                 315                 320 
Lys Arg Tyr Pro Gln Gly Asn Leu Ile Ile Glu Asp Phe Ile Pro Phe 
                325                 330                 335     
Gly Asp Ile Met Pro Tyr Ala Asp Val Tyr Ile Thr Asn Gly Gly Tyr 
            340                 345                 350         
Gly Gly Val Met Leu Gly Ile Glu Asn Gln Leu Pro Leu Val Val Ala 
        355                 360                 365             
Gly Ile His Glu Gly Lys Asn Glu Ile Asn Ala Arg Ile Gly Tyr Phe 
    370                 375                 380                 
Glu Leu Gly Ile Asn Leu Lys Thr Glu Trp Pro Lys Pro Glu Gln Met 
385                 390                 395                 400 
Lys Lys Ala Ile Asp Glu Val Ile Gly Asn Lys Lys Tyr Lys Glu Asn 
                405                 410                 415     
Ile Thr Lys Leu Ala Lys Glu Phe Ser Asn Tyr His Pro Asn Glu Leu 
            420                 425                 430         
Cys Ala Gln Tyr Ile Ser Glu Val Leu Gln Lys Thr Gly Arg Leu Tyr 
        435                 440                 445             
Ile Ser Ser Lys Lys Glu Glu Glu Lys Ile Tyr 
    450                 455                 

<210> 4
<211> 1380
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTC

<400> 4
atgagtaatt tattttcttc acaaacgaac cttgcatctg taaaacccct gaaaggcagg      60

aaaatacttt ttgccaactt cccggcagat gggcatttta atccattgac aggactggct     120

gttcacttac aatggctggg ttgtgatgta cgctggtaca cttccaataa atatgcagac     180

aaactgcgaa gattgaatat tccgcatttt cctttcagaa aagctatgga tatagctgac     240

ctggagaata tgtttccgga gcgtgatgcc attaaaggcc aggtagccaa actgaagttc     300

gacataatca atgcttttat tcttcgcggg ccggaatact atgttgacct gcaggagata     360

cataaaagtt ttccatttga cgtaatggtc gctgattgcg cttttacagg aattcctttt     420

gtaacagata aaatggatat acctgttgtt tctgtaggtg tgttccctct taccgaaaca     480

tcgaaagatc ttcctcccgc cggcctcggg attacgcctt ccttttcttt acccggaaaa     540

tttaaacaaa gcatactacg gtcggtggct gacctggtct tattccgcga gtccaataaa     600

gtaatgagaa aaatgctgac cgaacatggc attgatcatc tctatacaaa tgtatttgac     660

ctgatggtaa aaaaatcaac gctgctattg caaagcggaa caccgggttt tgaatattac     720

cgcagtgatc tgggaaaaaa tatccgtttc attggttcat tattacccta ccagtcaaaa     780

aaacaaacaa ctgcatggtc tgatgaaaga ctgaacaggt atgaaaaaat tgtggtggtg     840

acacagggca ctgttgaaaa gaatattgaa aagatcctcg tgcccactct ggaagccttt     900

agggatacag acttattggt aatagccaca acgggtggaa gtggtacagc tgagttgaaa     960

aaaagatatc ctcaaggcaa cctgatcatc gaagatttta ttccctttgg cgatatcatg    1020

ccttatgcgg atgtatatat taccaatgga ggatatggtg gtgtaatgct gggtatcgaa    1080

aaccaattgc cattggtagt agcgggtatt catgaaggga aaaatgagat caatgcaagg    1140

ataggatact ttgaactggg aattaacctg aaaaccgaat ggcctaaacc ggaacagatg    1200

aaaaaagcca tagatgaagt gatcggcaac aaaaaatata aagagaatat aacaaaattg    1260

gcaaaagaat tcagcaatta ccatcccaat gaactatgcg ctcagtatat aagcgaagta    1320

ttacaaaaaa caggcaggct ttatatcagc agtaaaaagg aagaagaaaa gatatactaa    1380


<210> 5
<211> 440
<212> PRT
<213> Artificial Sequence

<220> 
<223> GTD

<400> 5
Met Thr Lys Tyr Lys Asn Glu Leu Thr Gly Lys Arg Ile Leu Phe Gly 
1               5                   10                  15      
Thr Val Pro Gly Asp Gly His Phe Asn Pro Leu Thr Gly Leu Ala Lys 
            20                  25                  30          
Tyr Leu Gln Glu Leu Gly Cys Asp Val Arg Trp Tyr Ala Ser Asp Val 
        35                  40                  45              
Phe Lys Cys Lys Leu Glu Lys Leu Ser Ile Pro His Tyr Gly Phe Lys 
    50                  55                  60                  
Lys Ala Trp Asp Val Asn Gly Val Asn Val Asn Glu Ile Leu Pro Glu 
65                  70                  75                  80  
Arg Gln Lys Leu Thr Asp Pro Ala Glu Lys Leu Ser Phe Asp Leu Ile 
                85                  90                  95      
His Ile Phe Gly Asn Arg Ala Pro Glu Tyr Tyr Glu Asp Ile Leu Glu 
            100                 105                 110         
Ile His Glu Ser Phe Pro Phe Asp Val Phe Ile Ala Asp Ser Cys Phe 
        115                 120                 125             
Ser Ala Ile Pro Leu Val Ser Lys Leu Met Ser Ile Pro Val Val Ala 
    130                 135                 140                 
Val Gly Val Ile Pro Leu Ala Glu Glu Ser Val Asp Leu Ala Pro Tyr 
145                 150                 155                 160 
Gly Thr Gly Leu Pro Pro Ala Ala Thr Glu Glu Gln Arg Ala Met Tyr 
                165                 170                 175     
Phe Gly Met Lys Asp Ala Leu Ala Asn Val Val Phe Lys Thr Ala Ile 
            180                 185                 190         
Asp Ser Phe Ser Ala Ile Leu Asp Arg Tyr Gln Val Pro His Glu Lys 
        195                 200                 205             
Ala Ile Leu Phe Asp Thr Leu Ile Arg Gln Ser Asp Leu Phe Leu Gln 
    210                 215                 220                 
Ile Gly Ala Lys Ala Phe Glu Tyr Asp Arg Ser Asp Leu Gly Glu Asn 
225                 230                 235                 240 
Val Arg Phe Val Gly Ala Leu Leu Pro Tyr Ser Glu Ser Lys Ser Arg 
                245                 250                 255     
Gln Pro Trp Phe Asp Gln Lys Leu Leu Gln Tyr Gly Arg Ile Val Leu 
            260                 265                 270         
Val Thr Gln Gly Thr Val Glu His Asp Ile Asn Lys Ile Leu Val Pro 
        275                 280                 285             
Thr Leu Glu Ala Phe Lys Asn Ser Glu Thr Leu Val Ile Ala Thr Thr 
    290                 295                 300                 
Gly Gly Asn Gly Thr Ala Glu Leu Arg Ala Arg Phe Pro Phe Glu Asn 
305                 310                 315                 320 
Leu Ile Ile Glu Asp Phe Ile Pro Phe Asp Asp Val Met Pro Arg Ala 
                325                 330                 335     
Asp Val Tyr Val Thr Asn Gly Gly Tyr Gly Gly Thr Leu Leu Ser Ile 
            340                 345                 350         
His Asn Gln Leu Pro Met Val Ala Ala Gly Val His Glu Gly Lys Asn 
        355                 360                 365             
Glu Val Cys Ser Arg Ile Gly His Phe Gly Cys Gly Ile Asn Leu Glu 
    370                 375                 380                 
Thr Glu Thr Pro Thr Pro Asp Gln Ile Arg Glu Ser Val His Lys Ile 
385                 390                 395                 400 
Leu Ser Asn Asp Ile Phe Lys Lys Asn Val Phe Arg Ile Ser Thr His 
                405                 410                 415     
Leu Asp Val Asp Ala Asn Glu Lys Ser Ala Gly His Ile Leu Asp Leu 
            420                 425                 430         
Leu Glu Glu Arg Val Val Cys Gly 
        435                 440 

<210> 6
<211> 1323
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTD

<400> 6
atgacgaaat acaaaaatga attaacaggt aaaagaatac tctttggtac cgttcccgga      60

gacggtcatt ttaatcccct taccgggctt gctaaatatt tacaggaatt agggtgcgat     120

gtcaggtggt atgcttctga tgttttcaaa tgcaagcttg aaaaattgtc gataccacat     180

tatggcttca aaaaagcatg ggatgtcaac ggtgtgaatg taaacgagat cctgccggag     240

cgacaaaaat taacagatcc cgccgaaaaa ctgagctttg acttgatcca cattttcgga     300

aaccgggcac ctgagtatta tgaggatatt ctcgaaatac acgaatcgtt cccattcgat     360

gtgttcattg ctgacagctg cttttccgcg attccgttag ttagcaagct gatgagcatc     420

cccgttgttg ccgttggcgt aattcctctg gcggaagaat ctgttgatct ggcgccttat     480

ggaacaggat tgccgcctgc cgcgacggag gagcaacgtg cgatgtattt tggtatgaaa     540

gatgctttgg ccaacgttgt tttcaaaact gccattgact ctttttcggc cattctggac     600

cggtaccagg taccgcacga aaaagcaatt ttattcgata cattgatccg tcaatccgac     660

ttgtttctgc aaattggcgc aaaagcattt gagtatgacc gcagcgacct gggcgaaaat     720

gtccgttttg tcggcgcatt gctgccgtac tcggaaagta aatcccggca gccctggttt     780

gatcagaaac ttttacaata tggcaggatt gtgctggtta cccagggcac tgttgagcac     840

gatatcaaca agatacttgt acccacgctg gaagctttca aaaattctga gacgctggta     900

attgccacaa caggcggtaa tgggacagcg gaattgcgcg cgcgttttcc tttcgaaaac     960

ctgatcatcg aagatttcat tccgtttgac gatgtgatgc ccagagcaga cgtttatgtt    1020

accaatggtg gctatggagg caccttgctc agcatacata atcagttgcc aatggtagcg    1080

gcgggcgtgc atgagggtaa aaatgaagtt tgctcacgta tcggccactt cggctgtggg    1140

attaatctgg aaacggaaac acctacccca gatcagatac gcgaaagtgt ccacaaaatc    1200

ctgtctaatg acatcttcaa aaagaatgtc ttcaggattt cgacgcactt ggatgtggat    1260

gcgaatgaaa aaagcgcggg tcacattctt gacttgttgg aagagcgggt tgtttgcggt    1320

taa                                                                  1323


<210> 7
<211> 441
<212> PRT
<213> Artificial Sequence

<220> 
<223> GTF

<400> 7
Met Thr Thr Lys Lys Ile Leu Phe Ala Thr Met Pro Met Asp Gly His 
1               5                   10                  15      
Phe Asn Pro Leu Thr Gly Leu Ala Val His Leu His Asn Gln Gly His 
            20                  25                  30          
Asp Val Arg Trp Tyr Val Gly Gly His Tyr Gly Ala Lys Val Lys Lys 
        35                  40                  45              
Leu Gly Leu Ile His Tyr Pro Tyr His Lys Ala Gln Val Ile Asn Gln 
    50                  55                  60                  
Glu Asn Leu Asp Glu Val Phe Pro Glu Arg Gln Lys Ile Lys Gly Thr 
65                  70                  75                  80  
Val Pro Arg Leu Arg Phe Asp Leu Asn Asn Val Phe Leu Leu Arg Ala 
                85                  90                  95      
Pro Glu Phe Ile Thr Asp Val Thr Ala Ile His Lys Ser Phe Pro Phe 
            100                 105                 110         
Asp Leu Leu Ile Cys Asp Thr Met Phe Ser Ala Ala Pro Met Leu Arg 
        115                 120                 125             
His Ile Leu Asn Val Pro Val Ala Ala Val Gly Ile Val Pro Leu Ser 
    130                 135                 140                 
Glu Thr Ser Lys Glu Leu Pro Pro Ala Gly Leu Gly Met Glu Pro Ala 
145                 150                 155                 160 
Thr Gly Phe Phe Gly Arg Leu Lys Gln Asp Phe Leu Arg Phe Met Thr 
                165                 170                 175     
Thr Arg Ile Leu Phe Lys Pro Cys Asp Asp Leu Tyr Asn Glu Ile Arg 
            180                 185                 190         
Gln Arg Tyr Asn Met Glu Pro Ala Arg Asp Phe Val Phe Asp Ser Phe 
        195                 200                 205             
Ile Arg Thr Ala Asp Leu Tyr Leu Gln Ser Gly Val Pro Gly Phe Glu 
    210                 215                 220                 
Tyr Lys Arg Ser Lys Met Ser Ala Asn Val Arg Phe Val Gly Pro Leu 
225                 230                 235                 240 
Leu Pro Tyr Ser Ser Gly Ile Lys Pro Asn Phe Ala His Ala Ala Lys 
                245                 250                 255     
Leu Lys Gln Tyr Lys Lys Val Ile Leu Ala Thr Gln Gly Thr Val Glu 
            260                 265                 270         
Arg Asp Pro Glu Lys Ile Leu Val Pro Thr Leu Glu Ala Phe Lys Asp 
        275                 280                 285             
Thr Asp His Leu Val Val Ile Thr Thr Gly Gly Ser Lys Thr Ala Glu 
    290                 295                 300                 
Leu Arg Ala Arg Tyr Pro Gln Lys Asn Val Ile Ile Glu Asp Phe Ile 
305                 310                 315                 320 
Asp Phe Asn Leu Ile Met Pro His Ala Asp Val Tyr Val Thr Asn Ser 
                325                 330                 335     
Gly Phe Gly Gly Val Met Leu Ser Ile Gln His Gly Leu Pro Met Val 
            340                 345                 350         
Ala Ala Gly Val His Glu Gly Lys Asn Glu Ile Ala Ala Arg Ile Gly 
        355                 360                 365             
Tyr Phe Lys Leu Gly Met Asn Leu Lys Thr Glu Thr Pro Thr Pro Asp 
    370                 375                 380                 
Gln Ile Arg Thr Ser Val Glu Thr Val Leu Thr Asp Gln Thr Tyr Arg 
385                 390                 395                 400 
Arg Asn Leu Ala Arg Leu Arg Thr Glu Phe Ala Gln Tyr Asp Pro Met 
                405                 410                 415     
Ala Leu Ser Glu Arg Tyr Ile Asn Glu Leu Leu Ala Lys Gln Pro Arg 
            420                 425                 430         
Lys Gln His Glu Ala Val Glu Ala Ile 
        435                 440     

<210> 8
<211> 1326
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTF

<400> 8
atgacaacta aaaaaatcct gtttgccacc atgccaatgg atggccactt caaccccctg      60

actggtctgg ctgttcattt gcataaccag ggtcacgacg tacgctggta cgtgggcgga     120

cactacggtg ccaaagtgaa aaagctgggc ctgattcatt acccttacca taaagcccag     180

gttatcaatc aggagaatct ggacgaggtt ttccctgaac gtcagaagat caaagggacc     240

gtaccccggc tgcgctttga cctcaacaat gtcttcctgc tgcgcgctcc cgaattcatt     300

accgacgtta cggccatcca caaatcattc ccattcgatc tgctcatatg cgacaccatg     360

ttctcagcgg ctcccatgct gcgccatatt ctgaacgttc cggtagcggc cgtaggcatt     420

gtgcccctga gtgaaacctc gaaagaactg ccaccggccg gcctgggtat ggagcctgct     480

accggtttct ttgggcggct gaagcaggac ttcctgcgct ttatgactac ccgtatcctc     540

ttcaagccct gcgacgattt gtacaacgag atccggcagc gctataacat ggaaccagcc     600

cgtgattttg tcttcgactc gtttatccgc accgccgatt tgtacctgca aagtggtgta     660

ccgggctttg aatacaaacg gagcaagatg agtgctaacg tccggtttgt cggcccgctt     720

ctcccctact ccagcggtat taagccaaac tttgcccatg cggccaaact gaagcagtat     780

aaaaaggtaa ttctggccac gcagggcacg gtagaacgcg atccggagaa gattctggtg     840

ccgacgctcg aagcgttcaa agacaccgat cacctggtcg tcataacaac gggcggttct     900

aaaacggccg agttgcgcgc ccggtatccg cagaaaaatg tcatcatcga agacttcatt     960

gactttaacc tcatcatgcc ccatgccgac gtatacgtaa ccaattcggg tttcggcgga    1020

gtgatgctga gcattcagca tggcctgcca atggtagctg ccggtgttca cgagggtaaa    1080

aacgagattg cagcccgcat tggctatttc aaactgggga tgaatctgaa gacagaaacc    1140

cctacgccgg accagatccg gacaagcgtc gaaacggttc tgaccgatca gacctaccgc    1200

cggaacttag cccggttgcg gacggagttc gctcagtacg acccaatggc gttgagtgag    1260

cgatatatca acgagctgct ggccaaacaa ccgcgcaagc aacacgaagc cgtagaagca    1320

atctaa                                                               1326


<210> 9
<211> 454
<212> PRT
<213> Segetibacter koreensis

<220> 
<223> GT sequence

<400> 9
Met Lys Tyr Ile Ser Ser Ile Gln Pro Gly Thr Lys Ile Leu Phe Ala 
1               5                   10                  15      
Asn Phe Pro Ala Asp Gly His Phe Asn Pro Leu Thr Gly Leu Ala Val 
            20                  25                  30          
His Leu Lys Asn Ile Gly Cys Asp Val Arg Trp Tyr Thr Ser Lys Thr 
        35                  40                  45              
Tyr Ala Glu Lys Ile Ala Arg Leu Asp Ile Pro Phe Tyr Gly Leu Gln 
    50                  55                  60                  
Arg Ala Val Asp Val Ser Ala His Ala Glu Ile Asn Asp Val Phe Pro 
65                  70                  75                  80  
Glu Arg Lys Lys Tyr Lys Gly Gln Val Ser Lys Leu Lys Phe Asp Met 
                85                  90                  95      
Ile Asn Ala Phe Ile Leu Arg Ser Thr Glu Tyr Tyr Glu Asp Ile Leu 
            100                 105                 110         
Glu Ile Tyr Glu Glu Phe Pro Phe Gln Leu Met Ile Ala Asp Ile Thr 
        115                 120                 125             
Phe Gly Ala Ile Pro Phe Val Glu Glu Lys Met Asn Ile Pro Val Ile 
    130                 135                 140                 
Ser Ile Ser Val Val Pro Leu Pro Glu Thr Ser Lys Asp Leu Ala Pro 
145                 150                 155                 160 
Ser Gly Leu Gly Ile Thr Pro Ser Tyr Ser Phe Phe Gly Lys Ile Lys 
                165                 170                 175     
Gln Ser Phe Leu Arg Phe Ile Ala Asp Glu Leu Leu Phe Ala Gln Pro 
            180                 185                 190         
Thr Lys Val Met Trp Gly Leu Leu Ala Gln His Gly Ile Asp Ala Gly 
        195                 200                 205             
Lys Ala Asn Ile Phe Asp Ile Leu Ile Gln Lys Ser Thr Leu Val Leu 
    210                 215                 220                 
Gln Ser Gly Thr Pro Gly Phe Glu Tyr Lys Arg Ser Asp Leu Ser Ser 
225                 230                 235                 240 
His Val His Phe Ile Gly Pro Leu Leu Pro Tyr Thr Lys Lys Lys Glu 
                245                 250                 255     
Arg Glu Ser Trp Tyr Asn Glu Lys Leu Ser His Tyr Asp Lys Val Ile 
            260                 265                 270         
Leu Val Thr Gln Gly Thr Ile Glu Lys Asp Ile Glu Lys Leu Ile Val 
        275                 280                 285             
Pro Thr Leu Glu Ala Phe Lys Asn Ser Asp Cys Leu Val Ile Ala Thr 
    290                 295                 300                 
Thr Gly Gly Ala Tyr Thr Glu Glu Leu Arg Lys Arg Tyr Pro Glu Glu 
305                 310                 315                 320 
Asn Ile Ile Ile Glu Asp Phe Ile Pro Phe Asp Asp Val Met Pro Tyr 
                325                 330                 335     
Ala Asp Val Tyr Val Ser Asn Gly Gly Tyr Gly Gly Val Leu Leu Ser 
            340                 345                 350         
Ile Gln His Gln Leu Pro Met Val Val Ala Gly Val His Glu Gly Lys 
        355                 360                 365             
Asn Glu Ile Asn Ala Arg Val Gly Tyr Phe Asp Leu Gly Ile Asn Leu 
    370                 375                 380                 
Lys Thr Glu Arg Pro Thr Val Leu Gln Leu Arg Lys Ser Val Asp Ala 
385                 390                 395                 400 
Val Leu Gln Ser Asp Ser Tyr Ala Lys Asn Val Lys Arg Leu Gly Lys 
                405                 410                 415     
Glu Phe Lys Gln Tyr Asp Pro Asn Glu Ile Cys Glu Lys Tyr Val Ala 
            420                 425                 430         
Gln Leu Leu Glu Asn Gln Ile Ser Tyr Lys Glu Lys Ala Asn Ser Tyr 
        435                 440                 445             
Gln Ala Glu Val Leu Val 
    450                 

<210> 10
<211> 1365
<212> DNA
<213> Segetibacter koreensis

<220> 
<223> GT sequence

<400> 10
atgaaatata tttcatcgat acaaccggga acaaaaatat tatttgccaa tttccctgcc      60

gatggtcact tcaatccgct gacaggattg gctgttcatt taaaaaatat tgggtgcgat     120

gtgcgttggt acacttcaaa gacatatgcc gaaaaaattg ccaggttaga tatacctttt     180

tatggtttgc aaagagccgt agatgtaagt gcccatgcgg aaatcaacga cgtttttccc     240

gaaaggaaaa aatacaaagg ccaggtaagc aagttgaaat ttgatatgat aaacgccttc     300

attctgcgct ctacggaata ttatgaagac atattggaaa tatacgagga atttcctttt     360

cagttaatga ttgctgacat cactttcggc gctattcctt ttgtagaaga aaaaatgaat     420

attccggtta tttccatcag cgttgttccg cttcccgaaa cctcaaaaga tctggctccc     480

tccggccttg gtatcacccc ttcttattcg ttttttggca aaataaaaca gagcttttta     540

cgctttattg ccgacgaatt actttttgcg caacccacta aagtaatgtg gggccttttg     600

gcccaacatg gaattgatgc ggggaaagcc aacatatttg acatacttat acaaaaatca     660

acactggtac tacaaagcgg cactccgggt tttgaataca agagaagtga cttaagcagt     720

catgtgcatt ttattggtcc gctgctgcct tacacaaaaa agaaagaaag agaaagctgg     780

tacaatgaaa agttaagcca ctacgataaa gttattcttg taacacaagg cacaattgaa     840

aaagatattg agaagcttat tgtgccaact cttgaagcat ttaaaaactc cgattgcctc     900

gttattgcta ctactggcgg tgcctatact gaagagttga gaaaacgtta ccccgaggaa     960

aatataatta tagaagattt tatccctttt gatgatgtaa tgccttatgc agacgtatat    1020

gtttcaaacg ggggatatgg cggagttctt ttatctatac aacatcaact gcctatggta    1080

gtggctggtg tacatgaagg aaaaaatgag attaatgcaa gagtgggata ttttgatttg    1140

ggcattaatc ttaagaccga aagacctacc gtacttcaat taagaaaaag tgttgacgca    1200

gtcttacaaa gtgattcata cgcgaagaat gtaaaacggc ttggtaaaga attcaaacaa    1260

tatgatccga atgaaatatg tgaaaaatat gtagcgcaac tgctggaaaa tcaaatttct    1320

tataaagaaa aagcaaatag ctaccaggcc gaagttttgg tttaa                    1365


<210> 11
<211> 447
<212> PRT
<213> Flavihumibacter solisilvae

<220> 
<223> GT sequence

<400> 11
Met Asn His Lys His Ser Arg Lys Ile Leu Met Ala Asn Val Pro Ala 
1               5                   10                  15      
Asp Gly His Phe Asn Pro Leu Thr Gly Ile Ala Val His Leu Lys Gln 
            20                  25                  30          
Gln Gly Tyr Asp Val Arg Trp Tyr Gly Ser Asp Val Tyr Ser Lys Lys 
        35                  40                  45              
Ala Ala Lys Leu Gly Ile Pro Tyr Phe Pro Phe Ser Lys Ala Leu Glu 
    50                  55                  60                  
Val Asn Ser Glu Asn Ala Glu Glu Val Phe Pro Glu Arg Lys Arg Ile 
65                  70                  75                  80  
Asn Ser Lys Ile Gly Lys Leu Asn Phe Asp Leu Gln Asn Phe Phe Val 
                85                  90                  95      
Arg Arg Ala Pro Glu Tyr Tyr Ala Asp Leu Ile Asp Ile His Arg Glu 
            100                 105                 110         
Phe Pro Phe Asp Leu Leu Ile Ala Asp Cys Met Phe Thr Ala Ile Pro 
        115                 120                 125             
Phe Val Lys Glu Leu Met Gln Ile Pro Val Leu Ser Ile Gly Ile Ala 
    130                 135                 140                 
Pro Leu Leu Glu Ser Ser Arg Asp Leu Ala Pro Tyr Gly Leu Gly Leu 
145                 150                 155                 160 
His Pro Ala Arg Ser Trp Ala Gly Lys Phe Arg Gln Ala Gly Leu Arg 
                165                 170                 175     
Trp Val Ala Asp Asn Ile Leu Phe Arg Lys Ser Ile Asn Val Met Tyr 
            180                 185                 190         
Asp Leu Phe Glu Glu Tyr Asn Ile Pro His Asn Gly Glu Asn Phe Phe 
        195                 200                 205             
Asp Met Gly Val Arg Lys Ala Ser Leu Phe Leu Gln Ser Gly Thr Pro 
    210                 215                 220                 
Gly Phe Glu Tyr Asn Arg Ser Asp Leu Ser Glu His Ile Arg Phe Ile 
225                 230                 235                 240 
Gly Ala Leu Leu Pro Tyr Ala Gly Glu Arg Lys Glu Glu Pro Trp Phe 
                245                 250                 255     
Asp Ser Arg Leu Asn Lys Phe Asp Arg Val Ile Leu Val Thr Gln Gly 
            260                 265                 270         
Thr Val Glu Arg Asp Val Thr Lys Ile Ile Val Pro Val Leu Lys Ala 
        275                 280                 285             
Phe Arg Asp Ser Asn Tyr Leu Val Val Ala Thr Thr Gly Gly Asn Gly 
    290                 295                 300                 
Thr Lys Leu Leu Arg Glu Gln Tyr Lys Ala Asp Asn Ile Ile Ile Glu 
305                 310                 315                 320 
Asp Phe Ile Pro Phe Thr Asp Ile Met Pro Tyr Thr Asp Val Tyr Val 
                325                 330                 335     
Thr Asn Gly Gly Tyr Gly Gly Val Met Leu Gly Ile Glu Asn Gln Leu 
            340                 345                 350         
Pro Leu Val Val Ala Gly Val His Glu Gly Lys Asn Glu Ile Asn Ala 
        355                 360                 365             
Arg Ile Gly Tyr Phe Arg Leu Gly Ile Asp Leu Arg Asn Glu Arg Pro 
    370                 375                 380                 
Thr Pro Glu Gln Met Arg Asn Ala Ile Glu Lys Val Ile Ala Asn Gly 
385                 390                 395                 400 
Glu Tyr Arg Arg Asn Val Gln Ala Leu Ala Arg Glu Phe Lys Thr Tyr 
                405                 410                 415     
Ala Pro Leu Glu Leu Thr Glu Arg Phe Val Thr Glu Leu Leu Leu Ser 
            420                 425                 430         
Arg Arg His Lys Leu Val Pro Val Asn Asp Asp Ala Leu Ile Tyr 
        435                 440                 445         

<210> 12
<211> 1344
<212> DNA
<213> Flavihumibacter solisilvae

<220> 
<223> GT sequence

<400> 12
atgaatcaca aacattccag gaagatcctg atggccaacg tgcctgcgga tggccacttt      60

aatccgctga ccggcatcgc ggttcacctg aagcagcagg gctacgatgt acgctggtat     120

ggctcggatg tttacagcaa aaaagccgca aaactgggta ttccttattt tcctttcagc     180

aaggctcttg aagtaaacag cgaaaatgcc gaagaggtct ttccggaaag aaaacgcatt     240

aacagcaaga ttggcaagct gaattttgat ctgcagaact tctttgttcg ccgcgcaccg     300

gaatattatg ctgacctgat cgacattcac cgcgagttcc cttttgacct gctgatcgct     360

gactgtatgt ttactgccat accgtttgtt aaggaactca tgcagattcc tgtgctgtcg     420

atcggaattg cgccactgct ggaatcttcc cgcgacctgg caccgtatgg cctgggcctt     480

catcctgccc gcagctgggc cggcaagttt cgccaggcag gcttacgctg ggttgcagac     540

aatatccttt tccgcaaatc catcaacgtc atgtatgacc tttttgaaga gtataatatc     600

ccgcacaacg gggagaattt ctttgacatg ggtgtaagaa aagcttccct gttcctccag     660

agcggaacac cgggatttga atataaccgc agcgacctga gtgaacatat ccgtttcatc     720

ggcgcattgc ttccttacgc cggagaaaga aaagaagagc cctggttcga cagccgcctg     780

aacaaatttg accgggtgat cctggttacc cagggaactg tggaacgtga tgtgacaaag     840

atcattgtgc cggtactgaa agccttccgt gacagtaact acctcgtggt agccactacc     900

ggcggcaatg gaaccaaatt gctgcgggag caatacaagg cagataatat catcatcgag     960

gattttattc ctttcactga tatcatgccc tatacggatg tatacgttac caatggtggt    1020

tatggtggtg taatgctggg gatagaaaac cagcttccac ttgttgttgc aggcgttcac    1080

gaagggaaaa atgagatcaa tgcaagaata ggctatttca ggcttggtat agacctgcgc    1140

aacgaaagac cgacaccgga acagatgcgc aatgccattg aaaaagtcat tgcaaacggt    1200

gaatatcgca ggaatgtgca ggcactggcc cgcgaattca aaacctacgc accgcttgaa    1260

ttaacggaaa ggtttgtgac agaactgctg ctcagcaggc gacataaact ggttccggta    1320

aacgacgatg cgcttattta ctaa                                           1344


<210> 13
<211> 463
<212> PRT
<213> Cesiribacter andamanensis

<220> 
<223> GT sequence

<400> 13
Met Glu Thr Ser Gln Lys Gly Gly Thr Gln Ser Pro Lys Pro Phe Arg 
1               5                   10                  15      
Arg Ile Leu Phe Ala Asn Cys Pro Ala Asp Gly His Phe Asn Pro Leu 
            20                  25                  30          
Ile Pro Leu Ala Glu Phe Leu Lys Gln Gln Gly His Asp Val Arg Trp 
        35                  40                  45              
Tyr Ser Ser Arg Leu Tyr Ala Asp Lys Ile Ser Arg Met Gly Ile Pro 
    50                  55                  60                  
His Tyr Pro Phe Lys Lys Ala Leu Glu Phe Asp Thr His Asp Trp Glu 
65                  70                  75                  80  
Gly Ser Phe Pro Glu Arg Ser Lys His Lys Ser Gln Val Gly Lys Leu 
                85                  90                  95      
Arg Phe Asp Leu Glu His Val Phe Ile Arg Arg Gly Pro Glu Tyr Phe 
            100                 105                 110         
Glu Asp Ile Arg Asp Leu His Gln Glu Phe Pro Phe Asp Val Leu Val 
        115                 120                 125             
Ala Glu Ile Ser Phe Thr Gly Ile Ala Phe Ile Arg His Leu Met His 
    130                 135                 140                 
Lys Pro Val Ile Ala Val Gly Ile Phe Pro Asn Ile Ala Ser Ser Arg 
145                 150                 155                 160 
Asp Leu Pro Pro Tyr Gly Leu Gly Met Arg Pro Ala Ser Gly Phe Leu 
                165                 170                 175     
Gly Arg Lys Lys Gln Asp Leu Leu Arg Phe Leu Thr Asp Lys Leu Val 
            180                 185                 190         
Phe Gly Lys Gln Asn Glu Leu Asn Arg Gln Ile Leu Arg Ser Trp Gly 
        195                 200                 205             
Ile Glu Ala Pro Gly His Leu Asn Leu Phe Asp Leu Gln Thr Gln His 
    210                 215                 220                 
Ala Ser Val Val Leu Gln Asn Gly Thr Pro Gly Phe Glu Tyr Thr Arg 
225                 230                 235                 240 
Ser Asp Leu Ser Pro Asn Leu Val Phe Ala Gly Pro Leu Leu Pro Leu 
                245                 250                 255     
Val Lys Lys Val Arg Glu Asp Leu Pro Leu Gln Glu Lys Leu Arg Lys 
            260                 265                 270         
Tyr Lys Asn Val Ile Leu Val Thr Gln Gly Thr Ala Glu Gln Asn Thr 
        275                 280                 285             
Glu Lys Ile Leu Ala Pro Thr Leu Glu Ala Phe Lys Asp Ser Thr Trp 
    290                 295                 300                 
Leu Val Val Ala Thr Thr Gly Gly Ala Gly Thr Glu Ala Leu Arg Ala 
305                 310                 315                 320 
Arg Tyr Pro Gln Glu Asn Phe Leu Ile Glu Asp Tyr Ile Pro Phe Asp 
                325                 330                 335     
Gln Ile Met Pro Asn Ala Asp Val Tyr Val Ser Asn Gly Gly Phe Gly 
            340                 345                 350         
Gly Val Leu Gln Ala Ile Ser His Gln Leu Pro Met Val Val Ala Gly 
        355                 360                 365             
Val His Glu Gly Lys Asn Glu Ile Cys Ala Arg Val Gly Tyr Phe Lys 
    370                 375                 380                 
Leu Gly Leu Asp Leu Lys Thr Glu Thr Pro Lys Pro Ala Gln Ile Arg 
385                 390                 395                 400 
Ala Ala Val Glu Gln Val Leu Gln Asp Pro Gln Tyr Arg His Lys Val 
                405                 410                 415     
Gln Ala Leu Ser Ala Glu Phe Arg Gln Tyr Asn Pro Gln Gln Leu Cys 
            420                 425                 430         
Glu His Trp Val Gln Arg Leu Thr Gly Gly Arg Arg Ala Ala Ala Pro 
        435                 440                 445             
Ala Pro Gln Ser Ala Gly Gly Gln Leu Leu Ser Leu Thr Leu Asn 
    450                 455                 460             

<210> 14
<211> 1392
<212> DNA
<213> Cesiribacter andamanensis

<220> 
<223> GT sequence

<400> 14
atggaaactt cacaaaaagg cgggactcag tcacccaaac cattcagaag aattcttttt      60

gccaactgcc cggccgacgg gcactttaat ccgctcattc cactggcgga attcctcaag     120

cagcaggggc atgatgtgcg ctggtactcc tcccgcctgt atgccgataa gatttcgcgc     180

atgggcattc cccattatcc ttttaaaaag gcgcttgaat ttgacaccca cgactgggaa     240

gggagctttc ccgagcgcag caaacacaaa agccaggtag gcaagctgcg cttcgatctg     300

gagcatgtgt tcattcgccg cggccctgag tactttgaag atattcgaga cctccaccag     360

gagtttccct ttgatgtgct ggtggccgag atcagcttta ccggtattgc attcatccgc     420

cacctgatgc acaagccggt gattgcggtg ggcatttttc ccaacatcgc atcttcgcgc     480

gacttgcctc cctatgggct gggcatgcgt cctgctagcg ggtttctggg tagaaaaaag     540

caagacctgc tgcgctttct taccgacaag ctggtgtttg gaaaacagaa cgagctgaat     600

cggcagattc tccgcagctg gggaattgag gcccccgggc accttaacct gtttgacctg     660

cagacgcagc atgcctcggt ggttttgcag aacggaaccc cgggttttga gtacacccgc     720

agcgacctga gtcccaacct ggtatttgca ggccccctgt tgccgttggt gaaaaaagtg     780

cgggaagatc tacccctgca ggagaagctc aggaagtaca aaaacgtaat tctggtaacc     840

cagggcactg ccgagcaaaa taccgaaaag attctggcgc ccacactgga agcctttaaa     900

gacagcacct ggctggtggt ggcaaccaca ggaggagcgg gcaccgaggc gctgagggcc     960

aggtatcccc aggagaattt cctgatcgaa gattatattc cttttgatca gatcatgccc    1020

aatgccgatg tatatgtatc gaacggaggc tttggaggcg tcctgcaggc catttcacac    1080

caactgccca tggtagtggc aggggtacat gagggtaaaa atgagatctg tgcccgggtg    1140

ggctatttta agctggggct cgacctgaag acggaaaccc ccaaaccagc ccagataaga    1200

gcggcggtag agcaggtgct gcaagacccc cagtaccgcc acaaggtgca ggccctgagt    1260

gctgaattcc ggcaatacaa tccacaacag ctgtgcgagc actgggtgca gcgcctgaca    1320

ggcggacgta gagcggctgc acccgcacct cagtcggctg gcgggcagct actttccctg    1380

acgctgaact aa                                                        1392


<210> 15
<211> 450
<212> PRT
<213> Niabella aurantiaca

<220> 
<223> GT sequence

<400> 15
Met Tyr Thr Lys Thr Ala Asn Thr Thr Asn Ala Ala Ala Pro Leu His 
1               5                   10                  15      
Gly Gly Glu Lys Lys Lys Ile Leu Phe Ala Asn Ile Pro Ala Asp Gly 
            20                  25                  30          
His Phe Asn Pro Leu Thr Gly Leu Ala Val Arg Leu Lys Lys Ala Gly 
        35                  40                  45              
His Asp Val Arg Trp Tyr Thr Gly Ala Ser Tyr Ala Pro Arg Ile Glu 
    50                  55                  60                  
Gln Leu Gly Ile Pro Phe Tyr Leu Phe Asn Lys Ala Lys Glu Val Thr 
65                  70                  75                  80  
Val His Asn Ile Asp Glu Val Phe Pro Glu Arg Lys Thr Ile Arg Asn 
                85                  90                  95      
His Val Lys Lys Val Ile Phe Asp Ile Cys Thr Tyr Phe Ile Glu Arg 
            100                 105                 110         
Gly Thr Glu Phe Tyr Glu Asp Ile Lys Asp Ile Asn Lys Ser Phe Asp 
        115                 120                 125             
Phe Asp Val Leu Ile Cys Asp Ser Ala Phe Thr Gly Met Ser Phe Val 
    130                 135                 140                 
Lys Glu Lys Leu Asn Lys His Ala Val Ala Ile Gly Ile Leu Pro Leu 
145                 150                 155                 160 
Cys Ala Ser Ser Lys Gln Leu Pro Pro Pro Ile Met Gly Leu Thr Pro 
                165                 170                 175     
Ala Lys Thr Leu Ala Gly Lys Ala Val His Ser Phe Leu Arg Phe Leu 
            180                 185                 190         
Thr Asn Lys Val Leu Phe Lys Lys Pro His Ala Leu Ile Asn Glu Gln 
        195                 200                 205             
Tyr Arg Arg Ala Gly Met Leu Thr Asn Gly Lys Asn Leu Phe Asp Leu 
    210                 215                 220                 
Gln Ile Asp Lys Ala Thr Leu Phe Leu Gln Ser Cys Thr Pro Gly Phe 
225                 230                 235                 240 
Glu Tyr Gln Arg Ala His Met Ser Arg His Ile His Phe Ile Gly Pro 
                245                 250                 255     
Leu Leu Pro Ser His Ser Asp Ala Pro Ala Pro Phe His Phe Glu Asp 
            260                 265                 270         
Lys Leu His Gln Tyr Ala Lys Val Leu Leu Val Thr Gln Gly Thr Phe 
        275                 280                 285             
Glu Gly Asp Val Arg Lys Leu Ile Val Pro Ala Ile Glu Ala Phe Lys 
    290                 295                 300                 
Asn Ser Arg His Leu Val Val Val Thr Thr Ala Gly Trp His Thr His 
305                 310                 315                 320 
Lys Leu Arg Gln Arg Tyr Lys Ala Phe Ala Asn Val Val Ile Glu Asp 
                325                 330                 335     
Phe Ile Pro Phe Ser Gln Ile Met Pro Phe Ala Asp Val Phe Ile Ser 
            340                 345                 350         
Asn Gly Gly Tyr Gly Gly Val Met Gln Ser Ile Ser Asn Lys Leu Pro 
        355                 360                 365             
Met Val Val Ala Gly Ile His Glu Gly Lys Asn Glu Ile Cys Ala Arg 
    370                 375                 380                 
Val Gly Tyr Phe Lys Thr Gly Ile Asn Met Arg Thr Glu His Pro Lys 
385                 390                 395                 400 
Pro Glu Lys Ile Lys Thr Ala Val Asn Glu Ile Leu Ser Asn Pro Leu 
                405                 410                 415     
Tyr Arg Lys Ser Val Glu Arg Leu Ser Lys Glu Phe Ser Glu Tyr Asp 
            420                 425                 430         
Pro Leu Ala Leu Cys Glu Lys Phe Val Asn Ala Leu Pro Val Leu Gln 
        435                 440                 445             
Lys Pro 
    450 

<210> 16
<211> 1353
<212> DNA
<213> Niabella aurantiaca

<220> 
<223> GT sequence

<400> 16
atgtacacaa aaacagcaaa cacaaccaat gccgctgctc ccttacacgg cggtgaaaaa      60

aagaaaatct tatttgccaa catccctgcc gacgggcatt tcaaccctct aacgggatta     120

gccgttcggc tcaaaaaagc agggcatgat gtccgctggt acaccggcgc cagctatgca     180

ccccgtatcg aacagctggg cattcccttc tatcttttta acaaggcaaa agaggtaacc     240

gttcacaaca ttgacgaagt atttcccgaa aggaaaacga tccggaatca tgtaaagaaa     300

gtcatctttg atatctgcac gtattttatc gaacgcggaa cagaatttta tgaagacata     360

aaggacatca ataaaagttt cgatttcgac gtgctgatct gcgacagcgc ttttaccggt     420

atgtcgttcg taaaagaaaa actaaacaag catgcagtag ccatcggcat cctcccttta     480

tgtgcctctt cgaaacagct acccccgccc atcatgggac ttacaccggc caaaaccctg     540

gcaggaaaag ccgtgcattc gtttttgcgt tttcttacca ataaagtatt gtttaaaaag     600

ccccacgcgc tgatcaacga acaataccgc cgtgcaggca tgctgaccaa tggcaaaaac     660

ctgtttgatc tgcagatcga taaggcaaca ctgtttttac aaagctgtac cccggggttt     720

gaataccaac gcgcgcatat gagccggcat atccatttta taggcccttt actgccctcc     780

catagtgatg cccctgcccc attccatttt gaagacaaac tgcatcagta tgcaaaagtg     840

ctgctggtaa cgcagggaac ctttgaagga gatgtgcgca agctgatcgt gcccgcaatt     900

gaagccttta aaaacagccg ccacctggtg gtggtaacaa cggccggatg gcatacccat     960

aaactgcgcc agcggtataa agcatttgcc aatgttgtta ttgaagactt tattccgttc    1020

agccagatca tgccttttgc cgatgtattc atttcaaacg gtggttacgg cggtgtgatg    1080

caaagcataa gcaataagct gccaatggta gtggccggca tacacgaagg gaaaaacgaa    1140

atatgtgccc gggtgggata ttttaaaaca ggcatcaata tgcgcacgga acatcccaaa    1200

ccggaaaaaa taaaaacagc tgtgaacgag atcctgagca acccccttta ccggaaaagc    1260

gtggaacggc tttcgaagga attttcggag tacgacccgt tggccctttg tgaaaaattc    1320

gtcaacgctt tacccgtcct tcagaaacca tag                                 1353


<210> 17
<211> 441
<212> PRT
<213> Spirosoma radiotolerans

<220> 
<223> GT sequence

<400> 17
Met Ile Thr Pro Gln Arg Ile Leu Phe Ala Thr Met Pro Met Asp Gly 
1               5                   10                  15      
His Phe Ser Pro Leu Thr Gly Leu Ala Val His Leu Ser Asn Leu Gly 
            20                  25                  30          
His Asp Val Arg Trp Tyr Val Gly Gly Glu Tyr Gly Glu Lys Val Arg 
        35                  40                  45              
Lys Leu Lys Leu His His Tyr Pro Phe Val Asn Ala Arg Thr Ile Asn 
    50                  55                  60                  
Gln Glu Asn Leu Glu Arg Glu Phe Pro Glu Arg Ala Ala Leu Lys Gly 
65                  70                  75                  80  
Ser Ile Ala Arg Leu Arg Phe Asp Ile Lys Gln Val Phe Leu Leu Arg 
                85                  90                  95      
Ala Pro Glu Phe Val Glu Asp Met Lys Asp Ile Tyr Gln Thr Trp Pro 
            100                 105                 110         
Phe Thr Leu Val Val His Asp Val Ala Phe Ile Gly Gly Ser Phe Ile 
        115                 120                 125             
Lys Gln Leu Leu Pro Val Lys Thr Val Ala Val Gly Val Val Pro Leu 
    130                 135                 140                 
Thr Glu Ser Asp Asp Tyr Leu Pro Pro Ser Gly Leu Gly Arg Gln Pro 
145                 150                 155                 160 
Met Arg Gly Ile Ala Gly Arg Trp Ile Gln His Leu Met Arg Tyr Met 
                165                 170                 175     
Val Gln Gln Val Met Phe Lys Pro Ile Asn Val Leu His Asn Gln Leu 
            180                 185                 190         
Arg Gln Val Tyr Gly Leu Pro Pro Glu Pro Asp Ser Val Phe Asp Ser 
        195                 200                 205             
Ile Val Arg Ser Ala Asp Val Tyr Leu Gln Ser Gly Val Pro Ser Phe 
    210                 215                 220                 
Glu Tyr Pro Arg Lys Arg Ile Ser Ala Asn Val Gln Phe Val Gly Pro 
225                 230                 235                 240 
Leu Leu Pro Tyr Ala Lys Gly Gln Lys His Pro Phe Ile Gln Ala Lys 
                245                 250                 255     
Lys Ala Leu Gln Tyr Lys Lys Val Ile Leu Val Thr Gln Gly Thr Ile 
            260                 265                 270         
Glu Arg Asp Val Gln Lys Ile Ile Val Pro Thr Leu Glu Ala Phe Lys 
        275                 280                 285             
Asn Glu Pro Thr Thr Leu Val Ile Val Thr Thr Gly Gly Ser Gln Thr 
    290                 295                 300                 
Ser Glu Leu Arg Ala Arg Phe Pro Gln Glu Asn Phe Ile Ile Asp Asp 
305                 310                 315                 320 
Phe Ile Asp Phe Asn Ala Val Met Pro Tyr Ala Ser Val Tyr Val Thr 
                325                 330                 335     
Asn Gly Gly Tyr Gly Gly Val Met Leu Ala Leu Gln His Asn Leu Pro 
            340                 345                 350         
Ile Val Val Ala Gly Ile His Glu Gly Lys Asn Glu Ile Ala Ala Arg 
        355                 360                 365             
Ile Asp Tyr Cys Lys Val Gly Ile Asp Leu Lys Thr Glu Thr Pro Ser 
    370                 375                 380                 
Pro Thr Arg Ile Arg His Ala Val Glu Thr Val Leu Thr Asn Asp Met 
385                 390                 395                 400 
Tyr Arg Gln Asn Val Arg Gln Met Gly Gln Glu Phe Ser Gln Tyr Gln 
                405                 410                 415     
Pro Thr Glu Leu Ala Glu Gln Tyr Ile Asn Ala Leu Leu Ile Gln Glu 
            420                 425                 430         
Lys Ser Ser Arg Leu Ala Val Val Ala 
        435                 440     

<210> 18
<211> 1326
<212> DNA
<213> Spirosoma radiotolerans

<220> 
<223> GT sequence

<400> 18
atgatcacac cccaacgcat tttgtttgct accatgccaa tggatggcca ttttagtcct      60

ctcaccggtc ttgccgttca cttaagtaac cttggccacg atgtccgctg gtatgtgggc     120

ggtgagtacg gcgaaaaagt acggaagctt aagttgcacc attatccatt cgtgaacgcc     180

cgaaccatca atcaggaaaa tctggagcgt gagtttccgg aacgggccgc ccttaagggt     240

tcgattgccc ggctacggtt cgatattaag caggtgtttc tgcttcgtgc tccggaattc     300

gttgaggata tgaaagatat ctaccagacg tggccgttca ctctggtagt acatgatgta     360

gccttcattg ggggctcgtt cattaagcaa ctattgcccg ttaaaaccgt ggcggtaggc     420

gtagtacccc tcacggagtc ggacgattac ctgccgccgt ctggtctggg caggcaaccc     480

atgcgcggca tagctggccg ctggattcag catctgatgc gctacatggt gcagcaggtt     540

atgttcaaac ccatcaatgt cctgcacaat caacttcgac aggtctatgg tctgccgcct     600

gagccggact ccgtgttcga ttcgatcgta cgttctgccg atgtttatct ccaaagtggc     660

gtacccagct ttgagtaccc tcgcaaacgg ataagtgcca atgttcagtt tgtggggccg     720

ctgctcccct acgccaaagg tcaaaagcac ccgtttatac aggcaaaaaa agcgttgcag     780

tacaaaaaag ttattttagt aactcagggg acgatagagc gggatgtcca aaaaatcatt     840

gtaccaaccc tggaagcttt taaaaatgag cctactacgc tggtgatcgt cacaactggt     900

ggctcccaaa cgagtgagtt gcgtgcgcgt tttccgcagg aaaatttcat tattgatgac     960

tttatcgatt ttaatgcggt tatgccctat gccagtgtgt atgtaacaaa cgggggctat    1020

ggcggggtaa tgcttgcgct gcaacacaac ctgccgattg tcgtcgcggg aattcacgag    1080

ggtaaaaacg agattgcagc ccgcattgat tactgtaagg taggcataga cctgaagact    1140

gagacgccca gccccacccg cattcgccat gccgtcgaaa ctgtattgac caatgacatg    1200

taccggcaga atgtccgtca aatggggcaa gagttcagtc agtatcaacc aactgaactg    1260

gcggaacaat acatcaatgc gcttttaata caagagaaaa gctcccggct ggccgttgtg    1320

gcctag                                                               1326


<210> 19
<211> 440
<212> PRT
<213> Fibrella aestuarina

<220> 
<223> GT sequence

<400> 19
Met Asn Pro Gln Arg Ile Leu Phe Ala Thr Met Pro Phe Asp Gly His 
1               5                   10                  15      
Phe Ser Pro Leu Thr Asn Leu Ala Val His Leu Ser Gln Leu Gly His 
            20                  25                  30          
Asp Val Arg Trp Phe Val Gly Gly His Tyr Gly Gln Lys Val Thr Gln 
        35                  40                  45              
Leu Gly Leu His His Tyr Pro Tyr Val Lys Thr Arg Thr Val Asn Gln 
    50                  55                  60                  
Glu Asn Leu Asp Gln Leu Phe Pro Glu Arg Ala Thr Ile Lys Gly Ala 
65                  70                  75                  80  
Ile Ala Arg Ile Arg Phe Asp Leu Gly Gln Ile Phe Leu Leu Arg Val 
                85                  90                  95      
Pro Glu Gln Ile Asp Asp Leu Arg Ala Ile Tyr Asp Glu Trp Pro Phe 
            100                 105                 110         
Asp Leu Ile Val Gln Asp Leu Gly Phe Val Gly Gly Thr Phe Leu Arg 
        115                 120                 125             
Glu Leu Leu Pro Val Lys Val Val Gly Val Gly Val Val Pro Leu Thr 
    130                 135                 140                 
Glu Ser Asp Asp Trp Val Pro Pro Thr Ser Leu Gly Met Lys Pro Gln 
145                 150                 155                 160 
Ser Gly Arg Val Gly Arg Leu Val Ser Arg Leu Leu Asn Tyr Leu Val 
                165                 170                 175     
Gln Asp Val Met Leu Lys Pro Ala Asn Asp Leu His Asn Glu Leu Arg 
            180                 185                 190         
Ala Gln Tyr Gly Leu Arg Pro Val Pro Gly Phe Ile Phe Asp Ala Thr 
        195                 200                 205             
Val Arg Gln Ala Asp Leu Tyr Leu Gln Ser Gly Val Pro Gly Phe Glu 
    210                 215                 220                 
Phe Pro Arg Lys Arg Ile Ser Pro Asn Val Arg Phe Ile Gly Pro Met 
225                 230                 235                 240 
Leu Pro Tyr Ser Arg Ala Asn Arg Gln Pro Phe Glu Gln Ala Ile Lys 
                245                 250                 255     
Thr Leu Ala Tyr Lys Arg Val Val Leu Val Thr Gln Gly Thr Val Glu 
            260                 265                 270         
Arg Asn Val Glu Lys Ile Ile Val Pro Thr Leu Glu Ala Tyr Lys Lys 
        275                 280                 285             
Asp Pro Asp Thr Leu Val Ile Val Thr Thr Gly Gly Ser Gly Thr Leu 
    290                 295                 300                 
Ala Leu Arg Lys Arg Tyr Pro Gln Ala Asn Phe Ile Ile Glu Asp Phe 
305                 310                 315                 320 
Ile Asp Phe Asn Ala Val Met Pro Tyr Val Ser Val Tyr Val Thr Asn 
                325                 330                 335     
Gly Gly Tyr Gly Gly Val Met Leu Ala Leu Gln His Lys Leu Pro Ile 
            340                 345                 350         
Val Ala Ala Gly Val His Glu Gly Lys Asn Glu Ile Ala Ala Arg Ile 
        355                 360                 365             
Gly Tyr Cys Gln Val Gly Val Asp Leu Arg Thr Glu Thr Pro Thr Pro 
    370                 375                 380                 
Asp Gln Ile Arg Arg Ala Val Ala Thr Ile Leu Gly Asp Glu Thr Tyr 
385                 390                 395                 400 
Arg Arg Gln Val Arg Arg Leu Ser Asp Glu Phe Gly Arg Tyr Asn Pro 
                405                 410                 415     
Asn Gln Leu Ala Glu Gln Tyr Ile Asn Glu Leu Leu Ala Gln Ser Val 
            420                 425                 430         
Gly Glu Pro Val Ala Ala Leu Ser 
        435                 440 

<210> 20
<211> 1323
<212> DNA
<213> Fibrella aestuarina

<220> 
<223> GT sequence

<400> 20
atgaatcccc aacgcatcct cttcgccacc atgccattcg acgggcactt tagccccctc      60

accaacctgg ccgttcacct tagccaactc gggcacgatg tgcgctggtt tgtgggtggg     120

cattacggcc agaaagtaac gcagctgggc ctgcaccatt acccgtacgt gaaaacgcgc     180

accgtcaatc aggaaaatct ggatcagctc ttccccgaac gggccaccat caaaggcgcc     240

attgcccgca tccgtttcga cctgggccag attttcctgc ttcgtgtgcc cgaacagatc     300

gacgacctca gggcgattta cgacgaatgg ccgtttgacc tcattgtgca ggatctgggc     360

tttgtggggg gtacgttcct gcgcgagctg ctgccggtga aggtagtggg cgtgggcgtg     420

gtgccactca ccgaatccga cgactgggtg cccccgacca gcctgggcat gaaaccgcag     480

tcgggccggg tgggccggct ggtaagtcgg ctgctcaact acctggtgca ggacgttatg     540

ctgaagcccg ccaatgacct gcacaacgag ttaagggcgc agtacggcct tcggccggtg     600

ccgggtttta tctttgatgc caccgttcgg caggccgatc tgtacctgca aagcggcgtg     660

ccgggttttg aatttccccg taagcgcatc agccccaacg tgcggttcat cgggcccatg     720

ctgccctaca gccgggcaaa caggcagccg tttgagcagg ccatcaaaac gctggcctat     780

aagcgggtgg tgctcgtcac gcaggggacc gtcgagcgga acgtggagaa gatcatcgtg     840

cccacgctgg aagcctacaa aaaagatccc gatacgctgg tgattgtgac caccggcggc     900

tcaggtacgt tggcgttgcg gaaacggtac ccacaggcca attttatcat cgaagacttt     960

atcgatttca acgccgtgat gccctacgtg agtgtgtacg tgaccaacgg cgggtatggc    1020

ggcgtgatgc tggcgctgca acacaagctc ccgattgtgg cggcgggcgt gcatgaaggc    1080

aaaaacgaaa tcgccgcccg gatcggctac tgccaggtgg gtgtcgacct gcgcaccgaa    1140

acgcccaccc ccgaccagat tcgccgggcg gtggccacca tcctgggcga cgaaacctac    1200

cggcgtcagg tacgtcggtt gagcgacgag tttggccggt ataaccctaa tcaactggcc    1260

gaacagtaca tcaacgagct actggcccag tcggtggggg agcccgttgc cgccctgtcg    1320

tga                                                                  1323


<210> 21
<211> 434
<212> PRT
<213> Aquimarina macrocephali

<220> 
<223> GT sequence

<400> 21
Met Thr Arg Met Ser Gln Lys Lys Ile Leu Phe Ala Cys Ile Pro Ala 
1               5                   10                  15      
Asp Gly His Phe Asn Pro Met Thr Ala Ile Ala Ile His Leu Lys Thr 
            20                  25                  30          
Lys Gly Tyr Asp Val Arg Trp Tyr Thr Gly Glu Gly Tyr Lys Asn Thr 
        35                  40                  45              
Leu His Arg Ile Gly Ile Pro Tyr Leu Pro Phe Gln Asn Ala Gln Glu 
    50                  55                  60                  
Leu Lys Ile Glu Glu Ile Asp Lys Met Tyr Pro Asp Arg Lys Met Leu 
65                  70                  75                  80  
Lys Gly Ile Ala His Ile Lys Phe Asp Ile Ile Asn Leu Phe Ile Asn 
                85                  90                  95      
Arg Met Lys Gly Tyr Tyr Glu Asp Ile Ala Glu Ile His Gln Val Phe 
            100                 105                 110         
Pro Phe Asp Ile Leu Val Cys Asp Asn Thr Phe Pro Gly Ser Ile Val 
        115                 120                 125             
Lys Lys Lys Leu Asn Ile Pro Ile Ala Ser Ile Gly Val Val Pro Leu 
    130                 135                 140                 
Ala Leu Ser Ala Pro Asp Leu Pro Leu Tyr Gly Ile Gly His Gln Pro 
145                 150                 155                 160 
Ala Thr Thr Phe Phe Gly Lys Arg Lys Gln Asn Phe Ile Lys Leu Met 
                165                 170                 175     
Ala Asp Lys Leu Ile Phe Asp Glu Thr Lys Val Val Tyr Asn Gln Leu 
            180                 185                 190         
Leu Arg Ser Leu Asp Leu Ser Glu Glu Glu Asn Leu Thr Ile Phe Asp 
        195                 200                 205             
Ile Ala Pro Leu Gln Ser Asp Val Phe Leu Gln Asn Gly Ile Pro Glu 
    210                 215                 220                 
Ile Asp Tyr Pro Arg Tyr Ser Leu Pro Glu Ser Ile Lys Tyr Val Gly 
225                 230                 235                 240 
Ala Leu Gln Val Gln Thr Asn Asn Asn Asn Asn Gln Lys Leu Lys Lys 
                245                 250                 255     
Asp Trp Ser Ala Ile Leu Asp Thr Ser Lys Lys Ile Ile Leu Val Ser 
            260                 265                 270         
Gln Gly Thr Val Glu Lys Asn Leu Asp Lys Leu Ile Ile Pro Ser Leu 
        275                 280                 285             
Glu Ala Phe Lys Asp Ser Asp Tyr Ile Val Leu Val Ala Thr Gly Tyr 
    290                 295                 300                 
Thr Asp Thr Lys Gly Leu Gln Lys Arg Tyr Pro Gln Gln His Phe Tyr 
305                 310                 315                 320 
Ile Glu Asp Phe Ile Ala Tyr Asp Ala Val Met Pro His Ile Asp Val 
                325                 330                 335     
Phe Ile Met Asn Gly Gly Tyr Gly Ser Ala Leu Leu Ser Ile Lys His 
            340                 345                 350         
Gly Val Pro Met Ile Thr Ala Gly Val Asn Glu Gly Lys Asn Glu Ile 
        355                 360                 365             
Cys Ser Arg Met Asp Tyr Ser Gly Val Gly Ile Asp Leu Lys Thr Glu 
    370                 375                 380                 
Lys Pro Arg Ala Val Thr Ile Gln Asn Ala Thr Glu Arg Ile Leu Gly 
385                 390                 395                 400 
Thr Asp Lys Tyr Leu Asp Thr Ile Gln Lys Ile Gln Gln Arg Met Asn 
                405                 410                 415     
Ser Tyr Asn Thr Leu Asp Ile Cys Glu Gln His Ile Ser Arg Leu Ile 
            420                 425                 430         
Ser Glu 
        

<210> 22
<211> 1305
<212> DNA
<213> Aquimarina macrocephali

<220> 
<223> GT sequence

<400> 22
atgacacgaa tgtcccaaaa aaaaattctt ttcgcttgta tacctgcaga cggtcatttt      60

aatcctatga cagctatagc tattcatcta aaaacaaaag ggtatgatgt aagatggtat     120

actggggagg gctataaaaa cacactacac agaataggga taccttattt accgttccaa     180

aatgcgcagg agcttaaaat tgaggagata gataaaatgt atccagatcg aaaaatgcta     240

aaaggaatcg cacatattaa gttcgatatt attaatctgt ttattaatag aatgaaaggg     300

tactatgaag atatcgcaga gatacatcaa gtttttccgt ttgatatttt ggtatgtgac     360

aacacttttc ccgggtctat tgttaagaaa aaacttaata tcccaattgc tagtatagga     420

gttgtgcctt tagcactttc tgcacctgat cttccattat acggcattgg tcatcagcct     480

gctacaactt ttttcggtaa gagaaaacag aactttataa aactaatggc agataaactc     540

atttttgatg aaacaaaagt agtatataat caattattac gctcattgga tttatccgaa     600

gaagaaaatc taactatttt tgatatagct ccattacaat cggatgtttt tttgcaaaac     660

ggaattcctg agatcgatta tccaaggtat agtcttcccg aatccataaa atacgttgga     720

gcactacaag tacagaccaa caataacaac aatcaaaagt taaaaaagga ctggagtgct     780

attttagata cgtcaaaaaa aatcatatta gtatctcagg gaaccgtaga aaaaaatctt     840

gacaagctta ttattccttc tttagaagct tttaaagact cagattacat agtactggta     900

gctactggtt ataccgacac taaaggttta caaaaacgat accctcagca gcatttttat     960

atcgaagatt tcatagccta tgatgctgta atgccacata tagatgtctt tatcatgaat    1020

ggaggatatg gcagtgcttt actaagtatt aaacacggtg taccaatgat taccgctggg    1080

gttaacgaag gtaaaaatga aatctgttcc cgaatggatt attctggagt cggtattgat    1140

ctaaaaacag aaaaaccacg agcagtcaca atacaaaatg caactgaaag aatattaggt    1200

acagataaat atttagacac tatacagaaa atacaacagc gtatgaattc ttataacaca    1260

ttagatatct gcgaacaaca tatctcccgt cttatttcag aataa                    1305


<210> 23
<211> 452
<212> PRT
<213> Artificial Sequence

<220> 
<223> Chimera 1

<400> 23
Met Thr Lys Tyr Lys Asn Glu Leu Thr Gly Lys Arg Ile Leu Phe Gly 
1               5                   10                  15      
Thr Val Pro Gly Asp Gly His Phe Asn Pro Leu Thr Gly Leu Ala Lys 
            20                  25                  30          
Tyr Leu Gln Glu Leu Gly Cys Asp Val Arg Trp Tyr Ala Ser Asp Val 
        35                  40                  45              
Phe Lys Cys Lys Leu Glu Lys Leu Ser Ile Pro His Tyr Gly Phe Lys 
    50                  55                  60                  
Lys Ala Trp Asp Val Asn Gly Val Asn Val Asn Glu Ile Leu Pro Glu 
65                  70                  75                  80  
Arg Gln Lys Leu Thr Asp Pro Ala Glu Lys Leu Ser Phe Asp Leu Ile 
                85                  90                  95      
His Ile Phe Gly Asn Arg Ala Pro Glu Tyr Tyr Glu Asp Ile Leu Glu 
            100                 105                 110         
Ile His Glu Ser Phe Pro Phe Asp Val Phe Ile Ala Asp Ser Cys Phe 
        115                 120                 125             
Ser Ala Ile Pro Leu Val Ser Lys Leu Met Ser Ile Pro Val Val Ala 
    130                 135                 140                 
Val Gly Val Ile Pro Leu Ala Glu Glu Ser Val Asp Leu Ala Pro Tyr 
145                 150                 155                 160 
Gly Thr Gly Leu Pro Pro Ala Ala Thr Glu Glu Gln Arg Ala Met Tyr 
                165                 170                 175     
Phe Gly Met Lys Asp Ala Leu Ala Asn Val Val Phe Lys Thr Ala Ile 
            180                 185                 190         
Asp Ser Phe Ser Ala Ile Leu Asp Arg Tyr Gln Val Pro His Glu Lys 
        195                 200                 205             
Ala Ile Leu Phe Asp Thr Leu Ile Arg Gln Ser Asp Leu Phe Leu Gln 
    210                 215                 220                 
Ile Gly Ala Lys Ala Phe Glu Tyr Asp Arg Ser Asp Leu Gly Lys Asn 
225                 230                 235                 240 
Ile Arg Phe Ile Gly Ser Leu Leu Pro Tyr Gln Ser Lys Lys Gln Thr 
                245                 250                 255     
Thr Ala Trp Ser Asp Glu Arg Leu Asn Arg Tyr Glu Lys Ile Val Val 
            260                 265                 270         
Val Thr Gln Gly Thr Val Glu Lys Asn Ile Glu Lys Ile Leu Val Pro 
        275                 280                 285             
Thr Leu Glu Ala Phe Arg Asp Thr Asp Leu Leu Val Ile Ala Thr Thr 
    290                 295                 300                 
Gly Gly Ser Gly Thr Ala Glu Leu Lys Lys Arg Tyr Pro Gln Gly Asn 
305                 310                 315                 320 
Leu Ile Ile Glu Asp Phe Ile Pro Phe Gly Asp Ile Met Pro Tyr Ala 
                325                 330                 335     
Asp Val Tyr Ile Thr Asn Gly Gly Tyr Gly Gly Val Met Leu Gly Ile 
            340                 345                 350         
Glu Asn Gln Leu Pro Leu Val Val Ala Gly Ile His Glu Gly Lys Asn 
        355                 360                 365             
Glu Ile Asn Ala Arg Ile Gly Tyr Phe Glu Leu Gly Ile Asn Leu Lys 
    370                 375                 380                 
Thr Glu Trp Pro Lys Pro Glu Gln Met Lys Lys Ala Ile Asp Glu Val 
385                 390                 395                 400 
Ile Gly Asn Lys Lys Tyr Lys Glu Asn Ile Thr Lys Leu Ala Lys Glu 
                405                 410                 415     
Phe Ser Asn Tyr His Pro Asn Glu Leu Cys Ala Gln Tyr Ile Ser Glu 
            420                 425                 430         
Val Leu Gln Lys Thr Gly Arg Leu Tyr Ile Ser Ser Lys Lys Glu Glu 
        435                 440                 445             
Glu Lys Ile Tyr 
    450         

<210> 24
<211> 1359
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chimera 1

<400> 24
atgacgaaat acaaaaatga attaacaggt aaaagaatac tctttggtac cgttcccgga      60

gacggtcatt ttaatcccct taccgggctt gctaaatatt tacaggaatt agggtgcgat     120

gtcaggtggt atgcttctga tgttttcaaa tgcaagcttg aaaaattgtc gataccacat     180

tatggcttca aaaaagcatg ggatgtcaac ggtgtgaatg taaacgagat cctgccggag     240

cgacaaaaat taacagatcc cgccgaaaaa ctgagctttg acttgatcca cattttcgga     300

aaccgggcac ctgagtatta tgaggatatt ctcgaaatac acgaatcgtt cccattcgat     360

gtgttcattg ctgacagctg cttttccgcg attccgttag ttagcaagct gatgagcatc     420

cccgttgttg ccgttggcgt aattcctctg gcggaagaat ctgttgatct ggcgccttat     480

ggaacaggat tgccgcctgc cgcgacggag gagcaacgtg cgatgtattt tggtatgaaa     540

gatgctttgg ccaacgttgt tttcaaaact gccattgact ctttttcggc cattctggac     600

cggtaccagg taccgcacga aaaagcaatt ttattcgata cattgatccg tcaatccgac     660

ttgtttctgc aaattggcgc aaaagcattt gagtatgacc gcagtgatct gggaaaaaat     720

atccgtttca ttggttcatt attaccctac cagtcaaaaa aacaaacaac tgcatggtct     780

gatgaaagac tgaacaggta tgaaaaaatt gtggtggtga cacagggcac tgttgaaaag     840

aatattgaaa agatcctcgt gcccactctg gaagccttta gggatacaga cttattggta     900

atagccacaa cgggtggaag tggtacagct gagttgaaaa aaagatatcc tcaaggcaac     960

ctgatcatcg aagattttat tccctttggc gatatcatgc cttatgcgga tgtatatatt    1020

accaatggag gatatggtgg tgtaatgctg ggtatcgaaa accaattgcc attggtagta    1080

gcgggtattc atgaagggaa aaatgagatc aatgcaagga taggatactt tgaactggga    1140

attaacctga aaaccgaatg gcctaaaccg gaacagatga aaaaagccat agatgaagtg    1200

atcggcaaca aaaaatataa agagaatata acaaaattgg caaaagaatt cagcaattac    1260

catcccaatg aactatgcgc tcagtatata agcgaagtat tacaaaaaac aggcaggctt    1320

tatatcagca gtaaaaagga agaagaaaag atatactaa                           1359


<210> 25
<211> 447
<212> PRT
<213> Artificial Sequence

<220> 
<223> Chimera 2

<400> 25
Met Ser Asn Leu Phe Ser Ser Gln Thr Asn Leu Ala Ser Val Lys Pro 
1               5                   10                  15      
Leu Lys Gly Arg Lys Ile Leu Phe Ala Asn Phe Pro Ala Asp Gly His 
            20                  25                  30          
Phe Asn Pro Leu Thr Gly Leu Ala Val His Leu Gln Trp Leu Gly Cys 
        35                  40                  45              
Asp Val Arg Trp Tyr Thr Ser Asn Lys Tyr Ala Asp Lys Leu Arg Arg 
    50                  55                  60                  
Leu Asn Ile Pro His Phe Pro Phe Arg Lys Ala Met Asp Ile Ala Asp 
65                  70                  75                  80  
Leu Glu Asn Met Phe Pro Glu Arg Asp Ala Ile Lys Gly Gln Val Ala 
                85                  90                  95      
Lys Leu Lys Phe Asp Ile Ile Asn Ala Phe Ile Leu Arg Gly Pro Glu 
            100                 105                 110         
Tyr Tyr Val Asp Leu Gln Glu Ile His Lys Ser Phe Pro Phe Asp Val 
        115                 120                 125             
Met Val Ala Asp Cys Ala Phe Thr Gly Ile Pro Phe Val Thr Asp Lys 
    130                 135                 140                 
Met Asp Ile Pro Val Val Ser Val Gly Val Phe Pro Leu Thr Glu Thr 
145                 150                 155                 160 
Ser Lys Asp Leu Pro Pro Ala Gly Leu Gly Ile Thr Pro Ser Phe Ser 
                165                 170                 175     
Leu Pro Gly Lys Phe Lys Gln Ser Ile Leu Arg Ser Val Ala Asp Leu 
            180                 185                 190         
Val Leu Phe Arg Glu Ser Asn Lys Val Met Arg Lys Met Leu Thr Glu 
        195                 200                 205             
His Gly Ile Asp His Leu Tyr Thr Asn Val Phe Asp Leu Met Val Lys 
    210                 215                 220                 
Lys Ser Thr Leu Leu Leu Gln Ser Gly Thr Pro Gly Phe Glu Tyr Tyr 
225                 230                 235                 240 
Arg Ser Asp Leu Gly Lys Asn Ile Arg Phe Ile Gly Ser Leu Leu Pro 
                245                 250                 255     
Tyr Gln Ser Lys Lys Gln Thr Thr Ala Trp Ser Asp Glu Arg Leu Asn 
            260                 265                 270         
Arg Tyr Glu Lys Ile Val Val Val Thr Gln Gly Thr Val Glu Lys Asn 
        275                 280                 285             
Ile Glu Lys Ile Leu Val Pro Thr Leu Glu Ala Phe Arg Asp Thr Asp 
    290                 295                 300                 
Leu Leu Val Ile Ala Thr Thr Gly Gly Ser Gly Thr Ala Glu Leu Lys 
305                 310                 315                 320 
Lys Arg Tyr Pro Gln Gly Asn Leu Ile Ile Glu Asp Phe Ile Pro Phe 
                325                 330                 335     
Asp Asp Val Met Pro Arg Ala Asp Val Tyr Val Thr Asn Gly Gly Tyr 
            340                 345                 350         
Gly Gly Thr Leu Leu Ser Ile His Asn Gln Leu Pro Met Val Ala Ala 
        355                 360                 365             
Gly Val His Glu Gly Lys Asn Glu Val Cys Ser Arg Ile Gly His Phe 
    370                 375                 380                 
Gly Cys Gly Ile Asn Leu Glu Thr Glu Thr Pro Thr Pro Asp Gln Ile 
385                 390                 395                 400 
Arg Glu Ser Val His Lys Ile Leu Ser Asn Asp Ile Phe Lys Lys Asn 
                405                 410                 415     
Val Phe Arg Ile Ser Thr His Leu Asp Val Asp Ala Asn Glu Lys Ser 
            420                 425                 430         
Ala Gly His Ile Leu Asp Leu Leu Glu Glu Arg Val Val Cys Gly 
        435                 440                 445         

<210> 26
<211> 1344
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chimera 2

<400> 26
atgagtaatt tattttcttc acaaacgaac cttgcatctg taaaacccct gaaaggcagg      60

aaaatacttt ttgccaactt cccggcagat gggcatttta atccattgac aggactggct     120

gttcacttac aatggctggg ttgtgatgta cgctggtaca cttccaataa atatgcagac     180

aaactgcgaa gattgaatat tccgcatttt cctttcagaa aagctatgga tatagctgac     240

ctggagaata tgtttccgga gcgtgatgcc attaaaggcc aggtagccaa actgaagttc     300

gacataatca atgcttttat tcttcgcggg ccggaatact atgttgacct gcaggagata     360

cataaaagtt ttccatttga cgtaatggtc gctgattgcg cttttacagg aattcctttt     420

gtaacagata aaatggatat acctgttgtt tctgtaggtg tgttccctct taccgaaaca     480

tcgaaagatc ttcctcccgc cggcctcggg attacgcctt ccttttcttt acccggaaaa     540

tttaaacaaa gcatactacg gtcggtggct gacctggtct tattccgcga gtccaataaa     600

gtaatgagaa aaatgctgac cgaacatggc attgatcatc tctatacaaa tgtatttgac     660

ctgatggtaa aaaaatcaac gctgctattg caaagcggaa caccgggttt tgaatattac     720

cgcagtgatc tgggaaaaaa tatccgtttc attggttcat tattacccta ccagtcaaaa     780

aaacaaacaa ctgcatggtc tgatgaaaga ctgaacaggt atgaaaaaat tgtggtggtg     840

acacagggca ctgttgaaaa gaatattgaa aagatcctcg tgcccactct ggaagccttt     900

agggatacag acttattggt aatagccaca acgggtggaa gtggtacagc tgagttgaaa     960

aaaagatatc ctcaaggcaa cctgatcatc gaagatttca ttccgtttga cgatgtgatg    1020

cccagagcag acgtttatgt taccaatggt ggctatggag gcaccttgct cagcatacat    1080

aatcagttgc caatggtagc ggcgggcgtg catgagggta aaaatgaagt ttgctcacgt    1140

atcggccact tcggctgtgg gattaatctg gaaacggaaa cacctacccc agatcagata    1200

cgcgaaagtg tccacaaaat cctgtctaat gacatcttca aaaagaatgt cttcaggatt    1260

tcgacgcact tggatgtgga tgcgaatgaa aaaagcgcgg gtcacattct tgacttgttg    1320

gaagagcggg ttgtttgcgg ttaa                                           1344


<210> 27
<211> 1380
<212> DNA
<213> Artificial Sequence

<220> 
<223> codon optimized GTC sequence

<400> 27
atgtcaaacc tgttctcatc tcaaacaaac ctggcctcgg taaaaccgtt aaaaggtcgt      60

aaaatccttt tcgcaaattt tcccgctgat ggacacttta atccgttaac tgggttagca     120

gtccatttac aatggcttgg ttgcgatgtg cgttggtaca cttcaaataa gtacgccgat     180

aagcttcgtc gccttaacat ccctcacttc ccttttcgta aggccatgga tattgctgac     240

ttagaaaaca tgtttcctga gcgtgatgcc atcaaaggac aggtcgcaaa actgaagttc     300

gacattatta atgctttcat tctgcgcggc cctgagtact acgtcgactt acaagaaatt     360

cataaatcct ttccctttga cgttatggtc gctgattgcg cgtttacggg aatcccgttc     420

gtaactgaca aaatggatat tcccgtcgta tcggtcgggg tctttccact gaccgagact     480

tctaaagatt tgcctccggc cggattgggt attactccct cgttttcctt gccaggtaag     540

ttcaagcaat cgattttacg cagtgtggcc gatttggtgt tatttcgtga gagcaataag     600

gtcatgcgca aaatgttgac tgagcatggt attgaccacc tttacacaaa cgtatttgat     660

cttatggtta aaaaatcaac gttactgttg cagtcaggga ctccgggctt cgagtattac     720

cgtagtgatc ttggtaagaa tattcgtttt atcggaagct tgcttcccta tcagagcaaa     780

aaacagacta ctgcttggag tgatgagcgt ctgaatcgct atgaaaaaat cgtcgtagtc     840

actcagggaa ctgtagagaa aaacatcgaa aagattttgg tgccaaccct tgaggctttc     900

cgcgacactg acctgcttgt gatcgcgacg acgggaggtt caggaaccgc tgaattgaaa     960

aaacgttacc ctcagggcaa cttaatcatt gaggacttca ttccatttgg tgacattatg    1020

ccatacgctg atgtatatat caccaatggt ggttacggcg gagttatgct tggcatcgaa    1080

aatcaactgc cccttgtcgt agccgggatc cacgaaggaa agaacgagat caacgcacgt    1140

attgggtact ttgagcttgg aatcaatctg aaaacggagt ggccgaagcc agagcagatg    1200

aaaaaagcga ttgacgaagt tatcggtaat aagaagtaca aagagaatat cacaaaactg    1260

gcgaaggaat tctcaaacta ccatcctaac gaattgtgcg cccaatacat ctctgaagtc    1320

ttacagaaga ccggccgctt gtacatttcg tccaagaagg aagaagaaaa gatttactaa    1380


<210> 28
<211> 1323
<212> DNA
<213> Artificial Sequence

<220> 
<223> Codon optimized GTD sequence

<400> 28
atgaccaaat acaaaaatga gttgaccggc aaacgtattt tgtttggaac cgtgcctgga      60

gatggacatt tcaacccctt aacaggctta gccaagtacc tgcaagaact gggctgcgat     120

gtacgctggt atgcatctga tgtatttaag tgcaaactgg agaagctgag catccctcac     180

tatgggttca agaaggcttg ggatgtaaat ggagtaaatg ttaatgaaat tcttccggag     240

cgtcaaaagc tgaccgaccc tgcggaaaag ctgagtttcg accttatcca catttttgga     300

aatcgcgctc ctgaatatta cgaggacatc ttggaaattc acgagagttt tcctttcgac     360

gtcttcatcg ccgactcctg cttcagtgct attcccttag tttccaagct tatgtctatt     420

cctgtcgtgg cagtaggggt gatcccgctg gcagaagaga gtgtggactt agcaccatac     480

ggaactggcc tgccgccagc tgcgacagaa gagcagcgcg ccatgtattt cggcatgaag     540

gacgcacttg ccaacgtggt gttcaaaaca gccattgact cgttttccgc cattttagat     600

cgttatcaag tgcctcacga gaaagcgatc ttatttgata ctcttattcg tcaaagcgat     660

ttgtttttgc aaatcggagc caaagctttc gagtatgacc gcagcgattt gggggaaaac     720

gtgcgtttcg ttggagccct gctgccttat tcggagagca aaagtcgtca accctggttc     780

gatcaaaagt tgttacaata tgggcgcatt gtcttggtca ctcaggggac ggtggaacat     840

gatattaata agattctggt tcctacttta gaggcattta aaaactcgga aaccctggtc     900

atcgcgacaa caggaggaaa tggtacagca gaattacgtg cgcgctttcc cttcgaaaac     960

ttgatcattg aggatttcat tccgttcgac gacgtgatgc cccgcgcgga tgtatatgtc    1020

accaatggag gctatggtgg cacgctgctt tcaattcaca accaacttcc gatggttgca    1080

gccggggtcc atgagggcaa aaatgaggtg tgttcccgta tcgggcactt tggctgtggg    1140

atcaatctgg agacggagac gccgacacca gatcagattc gtgaatcagt tcataaaatc    1200

ctgtcgaacg acattttcaa gaaaaacgtt tttcgtattt caactcattt ggacgtcgat    1260

gctaacgaga aaagcgccgg tcatatcttg gatctgttgg aggagcgtgt cgtttgtggg    1320

taa                                                                  1323


<210> 29
<211> 1326
<212> DNA
<213> Artificial Sequence

<220> 
<223> Codon optimized GTF sequence

<400> 29
atgacgacca agaagattct tttcgcaact atgcctatgg acggtcattt caatcctctg      60

acagggcttg cggtgcactt gcataaccaa ggtcatgatg tccgctggta cgtcggcgga     120

cattatggcg caaaggttaa aaaattagga ttaattcatt atccctatca caaggctcaa     180

gtcattaatc aagaaaatct ggacgaagtc ttcccggagc gtcaaaagat caaaggcact     240

gtaccacgtt tacgtttcga tcttaataat gtgttcttgc tgcgcgctcc cgaatttatt     300

accgatgtca ctgcgattca caaatcgttt ccttttgacc tgctgatctg tgataccatg     360

ttctcggcgg ctccaatgtt acgccacatt ttgaatgtac ccgtcgcagc ggtgggtatt     420

gtgccattgt cagaaacttc caaggaactg ccgccagcgg ggttggggat ggagccggcg     480

acaggattct ttggacgttt gaagcaggat ttcttacgtt tcatgaccac tcgtatcctt     540

tttaagccgt gcgacgattt atacaacgag atccgccagc gctacaacat ggagcccgcc     600

cgcgattttg tctttgactc cttcatccgt acggcggatc tgtacctgca gtcaggcgtc     660

cctggatttg agtacaagcg ctcaaagatg tcggcgaatg tgcgtttcgt cggaccctta     720

ctgccctata gttcagggat caagcctaat tttgcccatg ccgctaaatt gaaacagtac     780

aaaaaggtca tcttagccac ccagggaaca gtcgagcgtg accctgagaa aatcttagta     840

ccaactcttg aagctttcaa ggacaccgat catctggttg tgattacgac cggaggctcg     900

aagacagcgg agctgcgcgc tcgttaccct cagaagaacg tgattatcga ggatttcatt     960

gactttaact taatcatgcc tcatgcagat gtttacgtca ccaactctgg ttttggtggt    1020

gtgatgcttt ccattcagca tggtttgcca atggtagctg caggagttca cgaggggaag    1080

aacgaaattg ctgctcgcat tgggtatttc aaattaggga tgaacttgaa aaccgaaacg    1140

ccgacacccg accagatccg tacaagtgta gagactgttt tgacggacca aacctatcgt    1200

cgcaacttag cgcgtttacg cacggaattc gctcaatacg acccaatggc actgtcagaa    1260

cgctatatta acgagttgct tgcgaagcag ccacgcaaac agcatgaggc agtagaagcg    1320

atttaa                                                               1326


<210> 30
<211> 1365
<212> DNA
<213> Segetibacter koreensis

<220> 
<223> Codon optimized GT sequence

<400> 30
atgaaatata tcagctccat tcagcccggc acaaaaattt tattcgcaaa ctttccggct      60

gacggacact tcaatccatt gacgggcttg gcagtgcact tgaaaaatat tggctgtgat     120

gtccgttggt acaccagtaa aacctatgcc gagaagatcg ctcgcctgga tatcccattt     180

tatggactgc agcgtgcagt tgatgtatct gcccacgcag agattaatga cgtgtttcct     240

gaacgcaaga agtacaaggg acaagtttca aaattgaagt ttgatatgat caatgcgttt     300

attctgcgca gtacagagta ttacgaggac attttagaaa tttatgagga gtttcctttc     360

cagcttatga tcgccgacat taccttcggc gcgattccct tcgttgaaga aaaaatgaac     420

attccagtaa tctccatttc ggttgtaccg ttacctgaaa cgtcgaaaga tcttgccccg     480

agtggcctgg gaattacgcc atcatactcg ttctttggta aaattaagca atcgttctta     540

cgtttcattg ccgacgagct tttattcgcg caacctacca aggtcatgtg gggtttatta     600

gcccaacacg gaattgacgc ggggaaagct aacatctttg acatcttgat ccaaaagagt     660

acgctggtat tgcagagtgg tacgccaggg tttgagtaca aacgttctga tttgagctcc     720

cacgtgcatt tcatcggccc gctgttaccc tacactaaga agaaggaacg cgaatcatgg     780

tataatgaaa aattgtctca ttatgataag gtcattttgg taacccaggg gacaatcgaa     840

aaagatattg agaaattaat tgtaccgact ttggaggcct ttaagaattc cgattgcctg     900

gtgattgcga cgacgggtgg ggcttacact gaagaattgc gcaaacgcta tcctgaggaa     960

aatattatca tcgaagactt cattccgttt gacgacgtaa tgccgtatgc cgacgtttac    1020

gtttcgaacg gaggctatgg tggcgtattg ttatcaattc aacaccaact gcctatggtc    1080

gtcgcaggag ttcatgaggg taaaaacgag atcaatgcgc gtgtaggtta ctttgacctt    1140

ggcatcaatt tgaagaccga gcgccccacc gttcttcaat tgcgcaagag tgtagatgcc    1200

gtgttacagt ccgacagtta tgcgaaaaac gtgaagcgtc tgggaaagga gtttaagcaa    1260

tacgatccta atgaaatctg cgaaaagtac gtagcgcaac tgcttgagaa tcaaatcagc    1320

tacaaggaga aggcgaattc ctatcaggcc gaagttctgg tttaa                    1365


<210> 31
<211> 1344
<212> DNA
<213> Flavihumibacter solisilvae

<220> 
<223> Codon optimized GT sequence

<400> 31
atgaatcata agcattcgcg taagatcctg atggcgaacg ttcccgccga tggtcatttt      60

aatcccctga ctggaattgc ggtccacctt aagcagcaag gctacgatgt acgttggtat     120

ggatcagacg tttatagcaa aaaggcggcc aaattaggga ttccgtattt cccattttca     180

aaagcgttgg aagttaattc agaaaatgca gaagaagtgt tccccgaacg taagcgcatc     240

aattcgaaga ttggaaaatt aaatttcgat ttgcagaatt tctttgttcg tcgtgcgcca     300

gagtactatg ccgatcttat tgatattcac cgtgagtttc ctttcgactt gttaattgcg     360

gattgtatgt tcacagcgat cccttttgtt aaggagttga tgcagatccc ggtgctgtct     420

attggcattg cgccattgct tgaatcatcg cgtgatttgg ctccgtacgg attgggtctg     480

catccggctc gtagctgggc ggggaaattc cgtcaagcgg gactgcgctg ggttgctgat     540

aacatccttt ttcgtaaatc aatcaatgtt atgtacgacc tgttcgagga atataatatt     600

cctcacaatg gagaaaactt tttcgacatg ggcgttcgta aagcttcact gttcctgcaa     660

tcgggtacgc cgggttttga gtacaatcgc agcgatttat ctgagcatat ccgcttcatc     720

ggagcacttc ttccgtacgc tggtgaacgc aaggaggaac cctggttcga cagtcgcctg     780

aacaaattcg accgtgtcat tctggttaca caagggactg ttgaacgtga cgttacaaaa     840

atcatcgtac cagtgttgaa agccttccgt gattcgaatt acttggttgt cgcgacgact     900

gggggaaatg gtacaaagct tcttcgtgag cagtataagg ctgacaatat cattatcgag     960

gacttcattc catttaccga tattatgccc tatactgatg tttacgtaac taacgggggc    1020

tacggtggag tgatgttagg aatcgaaaat caattacctt tagtagtggc aggtgtgcac    1080

gaggggaaga atgagatcaa tgcccgcatc gggtatttcc gcttaggcat tgatctgcgt    1140

aatgaacgtc ccacccccga acaaatgcgt aacgcgattg aaaaagtaat cgcaaacgga    1200

gaatatcgtc gcaacgtcca agcgcttgca cgtgagttta aaacatacgc tcccttggag    1260

ttgaccgagc gtttcgtcac agaactgttg ttgtcacgtc gccacaaatt ggtccccgtc    1320

aacgatgacg ctttgatcta ctaa                                           1344


<210> 32
<211> 1392
<212> DNA
<213> Cesiribacter andamanensis

<220> 
<223> Codon optimized GT sequence

<400> 32
atggagacga gtcaaaaagg aggaacgcag tcgccaaagc ccttccgccg tatcttattt      60

gcaaattgtc ctgcggatgg gcatttcaac cctttaattc ctttggctga gtttttgaag     120

caacaaggtc atgacgtacg ctggtatagc tcgcgtttat atgcggataa gatttcacgt     180

atgggcatcc cgcactaccc attcaaaaag gcgctggaat ttgacaccca cgattgggaa     240

ggcagctttc cagaacgtag caagcataag tcgcaagtag gcaagttacg ttttgatctg     300

gaacatgtct tcatccgtcg cgggcccgaa tactttgagg atattcgcga tttacaccag     360

gagtttcctt tcgatgtttt agtggcagaa atcagcttta cggggatcgc atttatccgc     420

catctgatgc acaagcccgt gatcgcagtc ggcattttcc cgaacattgc ttcctcacgc     480

gacttacctc catacggact gggcatgcgt ccagcttctg gatttttggg tcgtaagaaa     540

caggacttac tgcgtttttt aaccgacaag ttggtcttcg gtaagcaaaa tgagttaaac     600

cgtcaaattc ttcgctcatg gggcatcgag gctcctggcc acctgaatct ttttgacctg     660

cagacacagc acgcgtctgt agttcttcag aatggtaccc ctggatttga gtacacccgt     720

tccgatctga gcccaaactt ggtatttgct gggcctctgc tgcctcttgt caaaaaggtg     780

cgcgaagatt tgccgttgca ggagaaattg cgcaaatata aaaacgtcat cctggtgaca     840

caggggaccg ctgaacagaa cacagaaaag atcttagctc ccacccttga agcattcaaa     900

gactccactt ggcttgtcgt ggcgacaact ggcggagcgg ggaccgaagc tttacgcgct     960

cgctatccac aagaaaattt cttaatcgag gactatattc ccttcgatca gatcatgcca    1020

aacgcggatg tttatgtgtc gaatgggggg ttcgggggtg tgcttcaggc gatctcacat    1080

cagcttccga tggtggtggc cggcgtacac gagggtaaaa atgagatttg cgcccgcgtg    1140

ggttacttca aattgggact tgatctgaag accgagaccc cgaagcctgc tcaaattcgc    1200

gcagcggtag aacaagttct tcaagatcca cagtaccgcc ataaggttca ggcgttgtca    1260

gccgaattcc gccaatataa cccgcagcaa ttatgcgaac attgggtgca acgtttaacg    1320

gggggccgcc gtgccgccgc ccccgccccg cagtccgccg ggggccagtt attgagtttg    1380

acccttaatt aa                                                        1392


<210> 33
<211> 1353
<212> DNA
<213> Niabella aurantiaca

<220> 
<223> Codon optimized GT sequence

<400> 33
atgtatacaa aaaccgcgaa cacgaccaac gcggcagcgc cattacacgg aggcgaaaag      60

aaaaagattt tgtttgcaaa catcccagca gatgggcact tcaacccgtt gacgggactg     120

gctgtccgcc ttaaaaaagc gggccacgat gtgcgctggt atacgggggc gtcgtatgca     180

ccccgcatcg agcaactggg gattcctttt tatttattca acaaagccaa agaagttaca     240

gttcataata ttgatgaagt attcccagaa cgtaaaacga tccgcaatca cgtcaaaaaa     300

gtcatcttcg atatctgtac ttactttatc gaacgtggga ccgaattcta tgaagatatt     360

aaagatatca acaagagctt cgacttcgat gttcttattt gcgatagtgc ctttacggga     420

atgtcctttg taaaagaaaa attaaataag catgcagtcg caattggcat tttgcccctt     480

tgcgcttcgt ctaaacagct gcccccccca attatggggt taactccggc gaagaccctg     540

gcaggaaagg ctgtgcactc gttccttcgc tttcttacta acaaggtatt gtttaaaaag     600

ccgcatgcct taatcaacga gcagtatcgt cgcgcgggaa tgctgacgaa cggtaagaac     660

ttattcgatt tgcagattga taaagctaca ttattcttgc aatcctgcac cccaggcttc     720

gaataccaac gcgctcatat gtctcgccat atccatttca tcggcccatt attgccgtca     780

cactcggatg cgcctgcacc atttcacttt gaagacaaac ttcatcagta cgctaaggta     840

ctgttggtga ctcaaggcac attcgaaggt gacgttcgca agcttattgt tcctgcaatt     900

gaagcgttca aaaattcgcg ccatttggta gtcgtcacaa cggcgggctg gcacacccat     960

aagctgcgtc agcgctataa agccttcgcg aatgttgtta ttgaagattt cattcccttc    1020

tcccaaatca tgccatttgc agacgtcttc attagtaacg gtggatatgg tggtgtaatg    1080

cagtccattt caaataaact gcctatggtg gttgctggga ttcatgaggg taaaaatgaa    1140

atctgcgctc gcgtgggtta tttcaagacc ggaattaaca tgcgtaccga gcatccaaaa    1200

ccggaaaaga ttaaaaccgc agtaaatgag attctttcta atccgttgta tcgcaaatca    1260

gtggaacgtc tgagtaaaga gttctccgaa tatgacccct tagcgttatg cgaaaagttc    1320

gtcaacgctc ttcccgtctt acagaagccc tag                                 1353


<210> 34
<211> 1326
<212> DNA
<213> Spirosoma radiotolerans

<220> 
<223> Codon optimized GT sequence

<400> 34
atgatcactc cacagcgcat tttgtttgcg acgatgccga tggacggtca tttttctccc      60

ctgacgggtc ttgccgtgca cttatcgaat ttagggcacg atgttcgctg gtacgtgggc     120

ggagagtatg gcgaaaaggt gcgcaagttg aagttgcacc attatccctt tgtcaacgct     180

cgcacaatta atcaagagaa tcttgagcgt gaattccctg agcgcgccgc gttaaagggt     240

agcattgccc gtcttcgttt tgacatcaag caggtttttc tgttgcgtgc accagaattc     300

gtggaagata tgaaagatat ttaccaaacc tggcccttta cacttgtggt tcacgacgtc     360

gcctttattg gtggaagctt tattaaacag ttgttacccg taaaaacagt agcggttgga     420

gtcgtgccac ttactgaatc ggatgattac ttaccaccct ccggtcttgg ccgccaaccg     480

atgcgcggaa tcgccggtcg ctggatccaa catctgatgc gctacatggt tcagcaagtc     540

atgtttaagc caatcaacgt cctgcataac caacttcgtc aggtctatgg tctgcctccg     600

gaaccggaca gtgtctttga cagtatcgtg cgctctgccg atgtgtactt gcagtccggc     660

gtaccgtctt ttgagtatcc acgcaagcgt atctcagcta atgttcaatt tgtgggccct     720

ctgcttccgt atgctaaagg acaaaaacac ccctttattc aggccaaaaa agccttgcag     780

tacaagaagg ttattctggt aactcaaggt actattgaac gcgatgtgca aaaaattatc     840

gtcccgacgc tggaggcatt taagaacgaa ccaacaactt tggtcatcgt aacaaccggg     900

ggttcccaga ctagcgagct gcgtgcgcgc tttccacaag agaatttcat tatcgacgac     960

ttcattgatt ttaatgcagt aatgccatac gcgagcgttt acgtcactaa tgggggctat    1020

ggtggtgtta tgttagctct gcaacacaac ttgccgattg ttgtagcggg aatccatgaa    1080

ggaaagaacg agattgctgc ccgcattgat tactgcaagg tcggtatcga cctgaagact    1140

gagaccccta gtccgacacg cattcgtcac gcggtggaga ctgttttgac caatgacatg    1200

taccgtcaaa atgttcgcca gatggggcag gaattttcgc agtaccaacc tactgagtta    1260

gctgaacaat acattaatgc actgctgatc caggagaaat caagccgttt ggcagttgta    1320

gcctag                                                               1326


<210> 35
<211> 1323
<212> DNA
<213> Fibrella aestuarina

<220> 
<223> Codon optimized GT sequence

<400> 35
atgaatcccc agcgcattct tttcgccacg atgcccttcg acggacactt ctctccactt      60

actaatttgg ccgttcacct ttcacagctg ggacacgacg tccgttggtt cgtgggcggg     120

cactacggtc agaaagtaac gcagttaggg ttacaccact atccctacgt aaaaacccgc     180

accgttaacc aggagaatct ggatcaattg ttccctgagc gtgccacaat taaaggcgcc     240

attgcccgta ttcgtttcga tttaggacaa atctttctgc ttcgtgttcc tgaacagatc     300

gacgatttgc gtgcgattta tgacgaatgg cccttcgatc ttatcgtaca agacttgggg     360

ttcgtcggtg gcacattttt acgtgagctt ttacccgtga aagttgtggg ggtgggcgtc     420

gtaccgttaa ctgagtcgga tgattgggta ccccctactt cattaggtat gaagccccaa     480

tccggtcgcg tgggacgttt agtgtcgcgt cttttaaatt atcttgttca ggacgtgatg     540

ctgaagcccg ctaacgactt acacaatgaa ttgcgcgcgc agtacggact gcgccccgtg     600

cccggcttca tttttgatgc aactgttcgt caggcagact tataccttca gagcggggta     660

ccaggatttg aatttcctcg caaacgcatt tcaccgaacg tacgttttat cggacccatg     720

ttaccctatt cccgcgctaa tcgtcaacca tttgaacagg cgatcaaaac acttgcgtac     780

aaacgcgtgg tgttggtaac tcaaggaaca gtagagcgca acgtcgagaa gattatcgtt     840

ccaacgcttg aggcgtataa gaaagatcca gataccttag tgatcgtaac taccggtggc     900

tcgggtacgc ttgcattacg taaacgttac ccacaagcta attttatcat tgaagacttt     960

attgacttta acgcagtaat gccctacgtc agcgtttacg taaccaacgg cggctatggg    1020

ggagtcatgt tggctttgca gcataaattg cctattgtgg ccgcgggagt gcatgaaggg    1080

aagaatgaga tcgctgcgcg tattgggtac tgtcaggtgg gcgtcgatct tcgtaccgag    1140

actccgactc ccgatcaaat tcgtcgtgcc gttgctacaa ttctgggaga tgagacttac    1200

cgccgccaag tccgtcgtct gagcgacgag ttcggtcgct ataacccaaa ccaacttgcg    1260

gagcagtata ttaacgaatt gcttgctcaa tcggttgggg aacccgttgc cgcgctgagc    1320

tga                                                                  1323


<210> 36
<211> 1305
<212> DNA
<213> Aquimarina macrocephali

<220> 
<223> Codon optimized GT sequence

<400> 36
atgacgcgca tgagtcagaa gaagatttta ttcgcttgca ttcccgcaga cggccatttt      60

aatccaatga cggctatcgc aatccattta aaaaccaagg gatacgacgt acgctggtat     120

accggggagg ggtataaaaa cacgttgcac cgcattggca tcccctatct tcccttccaa     180

aacgcgcaag agctgaagat tgaggaaatt gacaaaatgt acccggatcg taagatgttg     240

aagggcattg cacacattaa gttcgacatc atcaatttgt tcatcaaccg catgaagggt     300

tactatgagg atatcgccga gattcaccaa gtttttccat ttgatatttt ggtgtgtgac     360

aatacgttcc ccgggtccat tgttaagaag aagttgaata tccccattgc gtcgatcgga     420

gtggtccccc tggccttatc agcaccagac ttaccgttat acggaattgg tcatcagccg     480

gctactacgt tcttcggaaa gcgtaaacaa aattttatca aacttatggc agacaagttg     540

atcttcgacg aaactaaggt tgtatataac cagctgcttc gttccttgga tctgtcggag     600

gaggaaaacc ttacaatctt cgatattgcc cccttacagt ctgatgtatt cttacagaac     660

ggcatccccg agatcgacta cccccgctat tccttaccag agtccattaa gtacgtggga     720

gcgctgcaag tccagactaa taacaacaat aatcaaaagc tgaagaagga ttggagcgcg     780

attttggata catcaaaaaa gatcatcctg gttagccagg gaacagtaga aaaaaacctg     840

gacaaactta ttatccccag tttagaagcg ttcaaagaca gcgattatat tgtactggtg     900

gctacgggtt acactgacac aaaaggtttg caaaaacgtt atccgcagca acacttttat     960

atcgaagatt tcattgccta tgacgccgtc atgcctcata ttgatgtctt tatcatgaac    1020

ggcggttatg gatcggcact gttgagcatt aagcatggtg tcccgatgat tacggcaggc    1080

gtgaatgagg ggaaaaacga aatctgttca cgcatggatt attcaggtgt tggaatcgac    1140

ctgaagacag aaaagcctcg tgccgttaca atccaaaacg ccacagaacg cattttaggg    1200

acggacaagt acctggacac gattcagaag attcagcaac gtatgaactc ctacaataca    1260

ttagacattt gcgagcagca catctcgcgc ctgatttcgg agtaa                    1305


<210> 37
<211> 1359
<212> DNA
<213> Artificial Sequence

<220> 
<223> Codon optimized sequence of Chimera 1

<400> 37
atgaccaaat acaaaaatga gttgacaggc aaacgtattc ttttcggtac agttcccggt      60

gatggacact ttaacccatt aacaggctta gctaaatatt tgcaggaatt agggtgtgac     120

gtgcgttggt atgcttcgga tgtcttcaag tgcaagttag aaaaacttag tatccctcat     180

tatggattta aaaaagcatg ggacgttaat ggcgtaaatg ttaacgaaat cctgcctgaa     240

cgtcaaaaat tgaccgatcc cgctgaaaag ttaagtttcg atctgatcca tatttttggt     300

aaccgcgcgc ccgagtacta cgaggacatt cttgaaattc atgagagttt tccctttgac     360

gtctttattg ctgatagttg cttttcggca attcccttgg tgtctaaatt gatgagcatt     420

ccagtagtag cggtcggggt gattcctttg gccgaagagt ctgtcgatct tgccccatac     480

ggtactggat taccgccggc agccacggaa gagcaacgtg ctatgtactt tggcatgaaa     540

gatgcacttg caaacgtcgt gttcaaaact gcaattgaca gcttttccgc catcctggac     600

cgctaccagg tgccccatga aaaggcaatc ctgttcgata ccttgatccg tcagtccgat     660

cttttccttc aaatcggtgc taaggctttt gaatacgatc gtagcgactt ggggaagaat     720

attcgcttta ttggtagctt acttccttat cagtcgaaga agcaaacgac agcctggagt     780

gacgagcgtt tgaaccgcta cgagaaaatc gtggtcgtga cccagggaac tgttgaaaag     840

aatattgaaa aaatcttagt gccgacattg gaggccttcc gcgatacgga tttgctggta     900

atcgctacaa ctggtgggtc cggtactgct gagttaaaga aacgttaccc tcaggggaac     960

ttaattatcg aagatttcat ccccttcgga gatatcatgc catatgcgga tgtctacatc    1020

acgaatggag ggtacggtgg agttatgttg ggcattgaga atcaactgcc gttagtcgta    1080

gcggggatcc acgaggggaa gaacgaaatt aacgcacgca ttgggtactt cgagttggga    1140

attaacttaa aaactgaatg gcctaagccc gaacaaatga aaaaggccat cgacgaagta    1200

attggtaaca aaaaatataa ggagaacatc acgaaacttg ctaaggagtt ctcaaactac    1260

cacccaaacg aattatgcgc acagtacatc tctgaagtat tgcagaagac cggtcgtctg    1320

tacatctcgt cgaagaagga ggaagaaaag atctactaa                           1359


<210> 38
<211> 1344
<212> DNA
<213> Artificial Sequence

<220> 
<223> Codon optimized sequence of Chimera 2

<400> 38
atgtccaacc ttttttcgtc ccagacgaat cttgccagcg taaaaccttt aaaagggcgc      60

aagattcttt ttgcaaattt tcccgccgac gggcatttca atcctcttac aggcttggcc     120

gtacacttac aatggcttgg gtgtgacgtg cgttggtata cttcaaacaa gtatgccgac     180

aagctgcgtc gtttgaatat cccgcatttt ccttttcgca aagccatgga cattgctgac     240

cttgagaaca tgtttccaga gcgcgacgcc atcaaggggc aagtcgctaa attgaaattc     300

gatatcatta acgcatttat cctgcgcggt ccggagtatt acgtagattt gcaggagatt     360

cacaagtcat ttccattcga tgtcatggtt gctgattgcg cctttacagg aattccattc     420

gtcacagaca aaatggatat ccccgtggtc tcggtaggcg tatttccctt aaccgagacc     480

agcaaagatc ttccacccgc agggttgggg atcactccat ccttctccct tcctggaaag     540

ttcaagcaaa gcattcttcg ctcggttgcc gacttagtct tattccgcga atctaataaa     600

gttatgcgca aaatgttgac ggaacatggc attgaccatc tttatactaa tgtgttcgac     660

ttgatggtca aaaaaagcac cctgttactg caaagcggga cgccgggttt tgaatattac     720

cgcagcgatc tgggcaagaa tatccgcttt atcggctccc ttcttccgta tcaatctaag     780

aaacagacaa ccgcatggag cgatgagcgt ctgaaccgct atgaaaagat tgtcgttgtc     840

acccaaggga ccgtcgaaaa aaatattgag aaaatcttgg ttcctacctt agaggcattt     900

cgtgacactg atcttttagt gatcgcaacc acaggtggta gcggaacagc agagttaaaa     960

aagcgctacc cccaaggaaa tcttatcatt gaagatttca ttccgtttga cgacgttatg    1020

cctcgcgccg atgtatacgt gactaatgga ggatacggag gtacgttact gtctatccat    1080

aatcagctgc caatggtcgc cgccggcgtt cacgaaggca agaatgaagt atgttcccgt    1140

attgggcatt ttggatgtgg aatcaatttg gaaaccgaaa ccccaacccc tgaccagatt    1200

cgcgagtcag ttcacaaaat cttgtcaaat gacatcttca agaagaatgt attccgcatt    1260

agcacacatt tggatgtcga tgcgaacgag aaaagcgccg ggcacatttt agacttactg    1320

gaagaacgcg tagtatgtgg ataa                                           1344


<210> 39
<211> 29
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTC-Ndel-for

<400> 39
catatgagta atttattttc ttcacaaac                                      29


<210> 40
<211> 27
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTC-BamHI-rev

<400> 40
ggatccttag tatatctttt cttcttc                                        27


<210> 41
<211> 28
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTF_XhoI_for

<400> 41
ctcgagatga cgaaatacaa aaatgaat                                       28


<210> 42
<211> 25
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTF_BamHI_rev

<400> 42
ggatccttaa ccgcaaacaa cccgc                                          25


<210> 43
<211> 29
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTL_XhoI_for

<400> 43
ctcgagatga caactaaaaa aatcctgtt                                      29


<210> 44
<211> 26
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTL_BamHI_rev

<400> 44
ggatccttag attgcttcta cggctt                                         26


<210> 45
<211> 20
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO. 1

<220> 
<221> VARIANT
<222> 1
<223> Lys = Arg

<220> 
<221> UNSURE
<222> 6..7
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 9
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 14
<223> Asn = Ser

<220> 
<221> UNSURE
<222> 18
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 19
<223> Leu = Ile

<400> 45
Lys Ile Leu Phe Ala Xaa Xaa Pro Xaa Asp Gly His Phe Asn Pro Leu 
1               5                   10                  15      
Thr Xaa Leu Ala 
            20  

<210> 46
<211> 7
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO. 1

<220> 
<221> UNSURE
<222> 2
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 7
<223> Tyr = Phe

<400> 46
Gly Xaa Asp Val Arg Trp Tyr 
1               5           

<210> 47
<211> 4
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO: 1

<220> 
<221> VARIANT
<222> 1
<223> Phe = Tyr or Leu

<220> 
<221> VARIANT
<222> 3
<223> Glu = Asp

<400> 47
Phe Pro Glu Arg 
1               

<210> 48
<211> 17
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO: 1

<220> 
<221> UNSURE
<222> 3
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro or Val

<220> 
<221> UNSURE
<222> 4..5
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 6
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro or Val

<220> 
<221> UNSURE
<222> 8..9
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 11..12
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 14
<223> Tyr = Phe

<220> 
<221> UNSURE
<222> 15
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro or Val

<220> 
<221> UNSURE
<222> 16
<223> Xaa = any amino acid

<400> 48
Phe Asp Xaa Xaa Xaa Xaa Phe Xaa Xaa Arg Xaa Xaa Glu Tyr Xaa Xaa 
1               5                   10                  15      
Asp 
    

<210> 49
<211> 17
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO. 1

<220> 
<221> VARIANT
<222> 1
<223> Phe = Trp

<220> 
<221> UNSURE
<222> 4
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 5..7
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro or Val

<220> 
<221> UNSURE
<222> 8
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 9
<223> Asp = Glu

<220> 
<221> UNSURE
<222> 10..11
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 13..16
<223> Xaa = any amino acid

<400> 49
Phe Pro Phe Xaa Xaa Xaa Xaa Xaa Asp Xaa Xaa Phe Xaa Xaa Xaa Xaa 
1               5                   10                  15      
Phe 
    

<210> 50
<211> 25
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO: 1

<220> 
<221> UNSURE
<222> 3
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 5
<223> Xaa = Asn, Cys, Gln, Gly, Ser, Thr or Tyr

<220> 
<221> UNSURE
<222> 6..8
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 10
<223> Phe = Ala

<220> 
<221> UNSURE
<222> 12
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 14
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 16..17
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 19..23
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 25
<223> Lys = Arg

<400> 50
Pro Leu Xaa Glu Xaa Xaa Xaa Xaa Leu Pro Pro Xaa Gly Xaa Gly Xaa 
1               5                   10                  15      
Xaa Pro Xaa Xaa Xaa Xaa Xaa Gly Lys 
            20                  25  

<210> 51
<211> 12
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO. 1

<220> 
<221> UNSURE
<222> 3
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 5
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 6
<223> Xaa = Phe or Lys

<220> 
<221> UNSURE
<222> 11
<223> Xaa = any amino acid

<400> 51
Leu Gln Xaa Gly Xaa Xaa Gly Phe Glu Tyr Xaa Arg 
1               5                   10          

<210> 52
<211> 21
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO: 1

<220> 
<221> UNSURE
<222> 5
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> VARIANT
<222> 7
<223> Lys = Arg

<220> 
<221> UNSURE
<222> 8..10
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 12..14
<223> Xaa = Ala, Ile, Leu, met, Phe, Pro, Trp or Val

<220> 
<221> VARIANT
<222> 21
<223> Arg = Lys

<400> 52
Thr Gln Gly Thr Xaa Glu Lys Xaa Xaa Xaa Lys Xaa Xaa Xaa Pro Thr 
1               5                   10                  15      
Leu Glu Ala Phe Arg 
            20      

<210> 53
<211> 8
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO: 1

<220> 
<221> UNSURE
<222> 3..4
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<400> 53
Leu Val Xaa Xaa Thr Thr Gly Gly 
1               5               

<210> 54
<211> 47
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO: 1

<220> 
<221> VARIANT
<222> 2
<223> Glu = Asp

<220> 
<221> UNSURE
<222> 8..9
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 10
<223> Val = Ile

<220> 
<221> UNSURE
<222> 13..14
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 17
<223> Tyr = Phe

<220> 
<221> VARIANT
<222> 18
<223> Ile = Val

<220> 
<221> VARIANT
<222> 19
<223> Thr = Ser

<220> 
<221> VARIANT
<222> 23
<223> Tyr = Phe

<220> 
<221> VARIANT
<222> 27
<223> Met = Leu

<220> 
<221> UNSURE
<222> 29
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 31
<223> Xaa = any amino acid

<220> 
<221> VARIANT
<222> 32
<223> Asn = His

<220> 
<221> UNSURE
<222> 33
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 36
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<220> 
<221> UNSURE
<222> 38
<223> Xaa = any amino acid

<220> 
<221> UNSURE
<222> 41
<223> Xaa = Ala, Ile, Leu, Met, Phe, Pro, Trp or Val

<400> 54
Ile Glu Asp Phe Ile Pro Phe Xaa Xaa Val Met Pro Xaa Xaa Asp Val 
1               5                   10                  15      
Tyr Ile Thr Asn Gly Gly Tyr Gly Gly Val Met Leu Xaa Ile Xaa Asn 
            20                  25                  30          
Xaa Leu Pro Xaa Val Xaa Ala Gly Xaa His Glu Gly Lys Asn Glu 
        35                  40                  45          

<210> 55
<211> 6
<212> PRT
<213> Artificial Sequence

<220> 
<223> partial sequence of SEQ ID NO. 1

<400> 55
His Glu Gly Lys Asn Glu 
1               5       

<210> 56
<211> 464
<212> PRT
<213> Artificial Sequence

<220> 
<223> Chimera 1 frameshift

<400> 56
Met Thr Lys Tyr Lys Asn Glu Leu Thr Gly Lys Arg Ile Leu Phe Gly 
1               5                   10                  15      
Thr Val Pro Gly Asp Gly His Phe Asn Pro Leu Thr Gly Leu Ala Lys 
            20                  25                  30          
Tyr Leu Gln Glu Leu Gly Cys Asp Val Arg Trp Tyr Ala Ser Asp Val 
        35                  40                  45              
Phe Lys Cys Lys Leu Glu Lys Leu Ser Ile Pro His Tyr Gly Phe Lys 
    50                  55                  60                  
Lys Ala Trp Asp Val Asn Gly Val Asn Val Asn Glu Ile Leu Pro Glu 
65                  70                  75                  80  
Arg Gln Lys Leu Thr Asp Pro Ala Glu Lys Leu Ser Phe Asp Leu Ile 
                85                  90                  95      
His Ile Phe Gly Asn Arg Ala Pro Glu Tyr Tyr Glu Asp Ile Leu Glu 
            100                 105                 110         
Ile His Glu Ser Phe Pro Phe Asp Val Phe Ile Ala Asp Ser Cys Phe 
        115                 120                 125             
Ser Ala Ile Pro Leu Val Ser Lys Leu Met Ser Ile Pro Val Val Ala 
    130                 135                 140                 
Val Gly Val Ile Pro Leu Ala Glu Glu Ser Val Asp Leu Ala Pro Tyr 
145                 150                 155                 160 
Gly Thr Gly Leu Pro Pro Ala Ala Thr Glu Glu Gln Arg Ala Met Tyr 
                165                 170                 175     
Phe Gly Met Lys Asp Ala Leu Ala Asn Val Val Phe Lys Thr Ala Ile 
            180                 185                 190         
Asp Ser Phe Ser Ala Ile Leu Asp Arg Tyr Gln Val Pro His Glu Lys 
        195                 200                 205             
Ala Ile Leu Phe Asp Thr Leu Ile Arg Gln Ser Asp Leu Phe Leu Gln 
    210                 215                 220                 
Ile Gly Ala Lys Ala Phe Glu Tyr Asp Arg Ser Asp Leu Gly Lys Asn 
225                 230                 235                 240 
Ile Arg Phe Ile Gly Ser Leu Leu Pro Tyr Gln Ser Lys Lys Gln Thr 
                245                 250                 255     
Thr Ala Trp Ser Asp Glu Arg Leu Asn Arg Tyr Glu Lys Ile Val Val 
            260                 265                 270         
Val Thr Gln Gly Thr Val Glu Lys Asn Ile Glu Lys Ile Leu Val Pro 
        275                 280                 285             
Thr Leu Glu Ala Phe Arg Asp Thr Asp Leu Leu Val Ile Ala Thr Thr 
    290                 295                 300                 
Gly Gly Ser Gly Thr Ala Glu Leu Lys Lys Arg Tyr Pro Gln Gly Asn 
305                 310                 315                 320 
Leu Ile Ile Glu Asp Phe Ile Pro Phe Gly Asp Ile Met Pro Tyr Ala 
                325                 330                 335     
Asp Val Tyr Ile Thr Asn Gly Gly Tyr Gly Gly Val Met Leu Gly Ile 
            340                 345                 350         
Glu Asn Gln Leu Pro Leu Val Val Ala Gly Ile His Glu Gly Lys Asn 
        355                 360                 365             
Glu Ile Asn Ala Arg Ile Gly Tyr Phe Glu Leu Gly Ile Asn Leu Lys 
    370                 375                 380                 
Thr Glu Trp Pro Lys Pro Glu Gln Met Lys Lys Ala Ile Asp Glu Val 
385                 390                 395                 400 
Ile Gly Asn Lys Lys Tyr Lys Glu Asn Ile Thr Lys Leu Ala Lys Glu 
                405                 410                 415     
Phe Ser Asn Tyr His Pro Asn Glu Leu Cys Ala Gln Tyr Ile Ser Glu 
            420                 425                 430         
Val Leu Gln Lys Gln Ala Gly Phe Ile Ser Ala Val Lys Arg Lys Lys 
        435                 440                 445             
Lys Arg Tyr Thr Lys Asp Pro Ala Ala Asn Lys Ala Arg Lys Glu Ala 
    450                 455                 460                 



<210> 57
<211> 1395
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chimera 1 frameshift

<400> 57
atgacgaaat acaaaaatga attaacaggt aaaagaatac tctttggtac cgttcccgga      60

gacggtcatt ttaatcccct taccgggctt gctaaatatt tacaggaatt agggtgcgat     120

gtcaggtggt atgcttctga tgttttcaaa tgcaagcttg aaaaattgtc gataccacat     180

tatggcttca aaaaagcatg ggatgtcaac ggtgtgaatg taaacgagat cctgccggag     240

cgacaaaaat taacagatcc cgccgaaaaa ctgagctttg acttgatcca cattttcgga     300

aaccgggcac ctgagtatta tgaggatatt ctcgaaatac acgaatcgtt cccattcgat     360

gtgttcattg ctgacagctg cttttccgcg attccgttag ttagcaagct gatgagcatc     420

cccgttgttg ccgttggcgt aattcctctg gcggaagaat ctgttgatct ggcgccttat     480

ggaacaggat tgccgcctgc cgcgacggag gagcaacgtg cgatgtattt tggtatgaaa     540

gatgctttgg ccaacgttgt tttcaaaact gccattgact ctttttcggc cattctggac     600

cggtaccagg taccgcacga aaaagcaatt ttattcgata cattgatccg tcaatccgac     660

ttgtttctgc aaattggcgc aaaagcattt gagtatgacc gcagtgatct gggaaaaaat     720

atccgtttca ttggttcatt attaccctac cagtcaaaaa aacaaacaac tgcatggtct     780

gatgaaagac tgaacaggta tgaaaaaatt gtggtggtga cacagggcac tgttgaaaag     840

aatattgaaa agatcctcgt gcccactctg gaagccttta gggatacaga cttattggta     900

atagccacaa cgggtggaag tggtacagct gagttgaaaa aaagatatcc tcaaggcaac     960

ctgatcatcg aagattttat tccctttggc gatatcatgc cttatgcgga tgtatatatt    1020

accaatggag gatatggtgg tgtaatgctg ggtatcgaaa accaattgcc attggtagta    1080

gcgggtattc atgaagggaa aaatgagatc aatgcaagga taggatactt tgaactggga    1140

attaacctga aaaccgaatg gcctaaaccg gaacagatga aaaaagccat agatgaagtg    1200

atcggcaaca aaaaatataa agagaatata acaaaattgg caaaagaatt cagcaattac    1260

catcccaatg aactatgcgc tcagtatata agcgaagtat tacaaaaaca ggcaggcttt    1320

atatcagcag taaaaaggaa gaagaaaaga tatactaagg atccggctgc taacaaagcc    1380

cgaaaggaag cgtag                                                     1395


<210> 58
<211> 452
<212> PRT
<213> Artificial Sequence

<220> 
<223> Chimera 3

<400> 58
Met Thr Lys Tyr Lys Asn Glu Leu Thr Gly Lys Arg Ile Leu Phe Gly 
1               5                   10                  15      
Thr Val Pro Gly Asp Gly His Phe Asn Pro Leu Thr Gly Leu Ala Lys 
            20                  25                  30          
Tyr Leu Gln Glu Leu Gly Cys Asp Val Arg Trp Tyr Ala Ser Asp Val 
        35                  40                  45              
Phe Lys Cys Lys Leu Glu Lys Leu Ser Ile Pro His Tyr Gly Phe Lys 
    50                  55                  60                  
Lys Ala Trp Asp Val Asn Gly Val Asn Val Asn Glu Ile Leu Pro Glu 
65                  70                  75                  80  
Arg Gln Lys Leu Thr Asp Pro Ala Glu Lys Leu Ser Phe Asp Leu Ile 
                85                  90                  95      
His Ile Phe Gly Asn Arg Ala Pro Glu Tyr Tyr Glu Asp Ile Leu Glu 
            100                 105                 110         
Ile His Glu Ser Phe Pro Phe Asp Val Phe Ile Ala Asp Ser Cys Phe 
        115                 120                 125             
Ser Ala Ile Pro Leu Val Ser Lys Leu Met Ser Ile Pro Val Val Ala 
    130                 135                 140                 
Val Gly Val Ile Pro Leu Ala Glu Glu Ser Val Asp Leu Ala Pro Tyr 
145                 150                 155                 160 
Gly Thr Gly Leu Pro Pro Ala Ala Thr Glu Glu Gln Arg Ala Met Tyr 
                165                 170                 175     
Phe Gly Met Lys Asp Ala Leu Ala Asn Val Val Phe Lys Thr Ala Ile 
            180                 185                 190         
Asp Ser Phe Ser Ala Ile Leu Asp Arg Tyr Gln Val Pro His Glu Lys 
        195                 200                 205             
Ala Ile Leu Phe Asp Thr Leu Ile Arg Gln Ser Asp Leu Phe Leu Gln 
    210                 215                 220                 
Ile Gly Ala Lys Ala Phe Glu Tyr Asp Arg Ser Asp Leu Gly Glu Asn 
225                 230                 235                 240 
Val Arg Phe Val Gly Ala Leu Leu Pro Tyr Ser Glu Ser Lys Ser Arg 
                245                 250                 255     
Gln Pro Trp Phe Asp Gln Lys Leu Leu Gln Tyr Gly Arg Ile Val Leu 
            260                 265                 270         
Val Thr Gln Gly Thr Val Glu His Asp Ile Asn Lys Ile Leu Val Pro 
        275                 280                 285             
Thr Leu Glu Ala Phe Lys Asn Ser Glu Thr Leu Val Ile Ala Thr Thr 
    290                 295                 300                 
Gly Gly Asn Gly Thr Ala Glu Leu Arg Ala Arg Phe Pro Gln Gly Asn 
305                 310                 315                 320 
Leu Ile Ile Glu Asp Phe Ile Pro Phe Gly Asp Ile Met Pro Tyr Ala 
                325                 330                 335     
Asp Val Tyr Ile Thr Asn Gly Gly Tyr Gly Gly Val Met Leu Gly Ile 
            340                 345                 350         
Glu Asn Gln Leu Pro Leu Val Val Ala Gly Ile His Glu Gly Lys Asn 
        355                 360                 365             
Glu Ile Asn Ala Arg Ile Gly Tyr Phe Glu Leu Gly Ile Asn Leu Lys 
    370                 375                 380                 
Thr Glu Trp Pro Lys Pro Glu Gln Met Lys Lys Ala Ile Asp Glu Val 
385                 390                 395                 400 
Ile Gly Asn Lys Lys Tyr Lys Glu Asn Ile Thr Lys Leu Ala Lys Glu 
                405                 410                 415     
Phe Ser Asn Tyr His Pro Asn Glu Leu Cys Ala Gln Tyr Ile Ser Glu 
            420                 425                 430         
Val Leu Gln Lys Thr Gly Arg Leu Tyr Ile Ser Ser Lys Lys Glu Glu 
        435                 440                 445             
Glu Lys Ile Tyr 
    450         

<210> 59
<211> 1359
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chimera 3

<400> 59
atgacgaaat acaaaaatga attaacaggt aaaagaatac tctttggtac cgttcccgga      60

gacggtcatt ttaatcccct taccgggctt gctaaatatt tacaggaatt agggtgcgat     120

gtcaggtggt atgcttctga tgttttcaaa tgcaagcttg aaaaattgtc gataccacat     180

tatggcttca aaaaagcatg ggatgtcaac ggtgtgaatg taaacgagat cctgccggag     240

cgacaaaaat taacagatcc cgccgaaaaa ctgagctttg acttgatcca cattttcgga     300

aaccgggcac ctgagtatta tgaggatatt ctcgaaatac acgaatcgtt cccattcgat     360

gtgttcattg ctgacagctg cttttccgcg attccgttag ttagcaagct gatgagcatc     420

cccgttgttg ccgttggcgt aattcctctg gcggaagaat ctgttgatct ggcgccttat     480

ggaacaggat tgccgcctgc cgcgacggag gagcaacgtg cgatgtattt tggtatgaaa     540

gatgctttgg ccaacgttgt tttcaaaact gccattgact ctttttcggc cattctggac     600

cggtaccagg taccgcacga aaaagcaatt ttattcgata cattgatccg tcaatccgac     660

ttgtttctgc aaattggcgc aaaagcattt gagtatgacc gcagcgacct gggcgaaaat     720

gtccgttttg tcggcgcatt gctgccgtac tcggaaagta aatcccggca gccctggttt     780

gatcagaaac ttttacaata tggcaggatt gtgctggtta cccagggcac tgttgagcac     840

gatatcaaca agatacttgt acccacgctg gaagctttca aaaattctga gacgctggta     900

attgccacaa caggcggtaa tgggacagcg gaattgcgcg cgcgttttcc tcaaggcaac     960

ctgatcatcg aagattttat tccctttggc gatatcatgc cttatgcgga tgtatatatt    1020

accaatggag gatatggtgg tgtaatgctg ggtatcgaaa accaattgcc attggtagta    1080

gcgggtattc atgaagggaa aaatgagatc aatgcaagga taggatactt tgaactggga    1140

attaacctga aaaccgaatg gcctaaaccg gaacagatga aaaaagccat agatgaagtg    1200

atcggcaaca aaaaatataa agagaatata acaaaattgg caaaagaatt cagcaattac    1260

catcccaatg aactatgcgc tcagtatata agcgaagtat tacaaaaaac aggcaggctt    1320

tatatcagca gtaaaaagga agaagaaaag atatactaa                           1359


<210> 60
<211> 1359
<212> DNA
<213> Artificial Sequence

<220> 
<223> Codon-optimized nucleotide sequence of Chimera 3 (optimized for
      E. coli)

<400> 60
atgaccaaat acaaaaatga gttgaccggc aaacgtattt tgtttggaac cgtgcctgga      60

gatggacatt tcaacccctt aacaggctta gccaagtacc tgcaagaact gggctgcgat     120

gtacgctggt atgcatctga tgtatttaag tgcaaactgg agaagctgag catccctcac     180

tatgggttca agaaggcttg ggatgtaaat ggagtaaatg ttaatgaaat tcttccggag     240

cgtcaaaagc tgaccgaccc tgcggaaaag ctgagtttcg accttatcca catttttgga     300

aatcgcgctc ctgaatatta cgaggacatc ttggaaattc acgagagttt tcctttcgac     360

gtcttcatcg ccgactcctg cttcagtgct attcccttag tttccaagct tatgtctatt     420

cctgtcgtgg cagtaggggt gatcccgctg gcagaagaga gtgtggactt agcaccatac     480

ggaactggcc tgccgccagc tgcgacagaa gagcagcgcg ccatgtattt cggcatgaag     540

gacgcacttg ccaacgtggt gttcaaaaca gccattgact cgttttccgc cattttagat     600

cgttatcaag tgcctcacga gaaagcgatc ttatttgata ctcttattcg tcaaagcgat     660

ttgtttttgc aaatcggagc caaagctttc gagtatgacc gcagcgattt gggggaaaac     720

gtgcgtttcg ttggagccct gctgccttat tcggagagca aaagtcgtca accctggttc     780

gatcaaaagt tgttacaata tgggcgcatt gtcttggtca ctcaggggac ggtggaacat     840

gatattaata agattctggt tcctacttta gaggcattta aaaactcgga aaccctggtc     900

atcgcgacaa caggaggaaa tggtacagca gaattacgtg cgcgctttcc tcagggcaac     960

ttaatcattg aggacttcat tccatttggt gacattatgc catacgctga tgtatatatc    1020

accaatggtg gttacggcgg agttatgctt ggcatcgaaa atcaactgcc ccttgtcgta    1080

gccggcatcc acgaaggaaa gaacgagatc aacgcacgta ttgggtactt tgagcttgga    1140

atcaatctga aaacggagtg gccgaagcca gagcagatga aaaaagcgat tgacgaagtt    1200

atcggtaata agaagtacaa agagaatatc acaaaactgg cgaaggaatt ctcaaactac    1260

catcctaacg aattgtgcgc ccaatacatc tctgaagtct tacagaagac cggccgcttg    1320

tacatttcgt ccaagaagga agaagaaaag atttactaa                           1359


<210> 61
<211> 452
<212> PRT
<213> Artificial Sequence

<220> 
<223> Chimera 4

<400> 61
Met Thr Lys Tyr Lys Asn Glu Leu Thr Gly Lys Arg Ile Leu Phe Gly 
1               5                   10                  15      
Thr Val Pro Gly Asp Gly His Phe Asn Pro Leu Thr Gly Leu Ala Lys 
            20                  25                  30          
Tyr Leu Gln Glu Leu Gly Cys Asp Val Arg Trp Tyr Ala Ser Asp Val 
        35                  40                  45              
Phe Lys Cys Lys Leu Glu Lys Leu Ser Ile Pro His Tyr Gly Phe Lys 
    50                  55                  60                  
Lys Ala Trp Asp Val Asn Gly Val Asn Val Asn Glu Ile Leu Pro Glu 
65                  70                  75                  80  
Arg Gln Lys Leu Thr Asp Pro Ala Glu Lys Leu Ser Phe Asp Leu Ile 
                85                  90                  95      
His Ile Phe Gly Asn Arg Ala Pro Glu Tyr Tyr Glu Asp Ile Leu Glu 
            100                 105                 110         
Ile His Glu Ser Phe Pro Phe Asp Val Phe Ile Ala Asp Ser Cys Phe 
        115                 120                 125             
Ser Ala Ile Pro Leu Val Ser Lys Leu Met Ser Ile Pro Val Val Ala 
    130                 135                 140                 
Val Gly Val Ile Pro Leu Ala Glu Glu Ser Val Asp Leu Ala Pro Tyr 
145                 150                 155                 160 
Gly Thr Gly Leu Pro Pro Ala Ala Thr Glu Glu Gln Arg Ala Met Tyr 
                165                 170                 175     
Phe Gly Met Lys Asp Ala Leu Ala Asn Val Val Phe Lys Thr Ala Ile 
            180                 185                 190         
Asp Ser Phe Ser Ala Ile Leu Asp Arg Tyr Gln Val Pro His Glu Lys 
        195                 200                 205             
Ala Ile Leu Phe Asp Thr Leu Ile Arg Gln Ser Asp Leu Phe Leu Gln 
    210                 215                 220                 
Ile Gly Ala Lys Ala Phe Glu Tyr Asp Arg Ser Asp Leu Gly Glu Asn 
225                 230                 235                 240 
Val Arg Phe Val Gly Ala Leu Leu Pro Tyr Ser Glu Ser Lys Ser Arg 
                245                 250                 255     
Gln Pro Trp Phe Asp Gln Lys Leu Leu Gln Tyr Gly Gln Ile Val Val 
            260                 265                 270         
Val Thr Gln Gly Thr Val Glu Lys Asn Ile Glu Lys Ile Leu Val Pro 
        275                 280                 285             
Thr Leu Glu Ala Phe Arg Asp Thr Asp Leu Leu Val Ile Ala Thr Thr 
    290                 295                 300                 
Gly Gly Ser Gly Thr Ala Glu Leu Lys Lys Arg Tyr Pro Gln Gly Asn 
305                 310                 315                 320 
Leu Ile Ile Glu Asp Phe Ile Pro Phe Gly Asp Ile Met Pro Tyr Ala 
                325                 330                 335     
Asp Val Tyr Ile Thr Asn Gly Gly Tyr Gly Gly Val Met Leu Gly Ile 
            340                 345                 350         
Glu Asn Gln Leu Pro Leu Val Val Ala Gly Ile His Glu Gly Lys Asn 
        355                 360                 365             
Glu Ile Asn Ala Arg Ile Gly Tyr Phe Glu Leu Gly Ile Asn Leu Lys 
    370                 375                 380                 
Thr Glu Trp Pro Lys Pro Glu Gln Met Lys Lys Ala Ile Asp Glu Val 
385                 390                 395                 400 
Ile Gly Asn Lys Lys Tyr Lys Glu Asn Ile Thr Lys Leu Ala Lys Glu 
                405                 410                 415     
Phe Ser Asn Tyr His Pro Asn Glu Leu Cys Ala Gln Tyr Ile Ser Glu 
            420                 425                 430         
Val Leu Gln Lys Thr Gly Arg Leu Tyr Ile Ser Ser Lys Lys Glu Glu 
        435                 440                 445             
Glu Lys Ile Tyr 
    450         

<210> 62
<211> 1359
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chimera 4

<400> 62
atgacgaaat acaaaaatga attaacaggt aaaagaatac tctttggtac cgttcccgga      60

gacggtcatt ttaatcccct taccgggctt gctaaatatt tacaggaatt agggtgcgat     120

gtcaggtggt atgcttctga tgttttcaaa tgcaagcttg aaaaattgtc gataccacat     180

tatggcttca aaaaagcatg ggatgtcaac ggtgtgaatg taaacgagat cctgccggag     240

cgacaaaaat taacagatcc cgccgaaaaa ctgagctttg acttgatcca cattttcgga     300

aaccgggcac ctgagtatta tgaggatatt ctcgaaatac acgaatcgtt cccattcgat     360

gtgttcattg ctgacagctg cttttccgcg attccgttag ttagcaagct gatgagcatc     420

cccgttgttg ccgttggcgt aattcctctg gcggaagaat ctgttgatct ggcgccttat     480

ggaacaggat tgccgcctgc cgcgacggag gagcaacgtg cgatgtattt tggtatgaaa     540

gatgctttgg ccaacgttgt tttcaaaact gccattgact ctttttcggc cattctggac     600

cggtaccagg taccgcacga aaaagcaatt ttattcgata cattgatccg tcaatccgac     660

ttgtttctgc aaattggcgc aaaagcattt gagtatgacc gcagcgacct gggcgaaaat     720

gtccgttttg tcggcgcatt gctgccgtac tcggaaagta aatcccggca gccctggttt     780

gatcagaaac ttttacaata tggcaaaatt gtggtggtga cacagggcac tgttgaaaag     840

aatattgaaa agatcctcgt gcccactctg gaagccttta gggatacaga cttattggta     900

atagccacaa cgggtggaag tggtacagct gagttgaaaa aaagatatcc tcaaggcaac     960

ctgatcatcg aagattttat tccctttggc gatatcatgc cttatgcgga tgtatatatt    1020

accaatggag gatatggtgg tgtaatgctg ggtatcgaaa accaattgcc attggtagta    1080

gcgggtattc atgaagggaa aaatgagatc aatgcaagga taggatactt tgaactggga    1140

attaacctga aaaccgaatg gcctaaaccg gaacagatga aaaaagccat agatgaagtg    1200

atcggcaaca aaaaatataa agagaatata acaaaattgg caaaagaatt cagcaattac    1260

catcccaatg aactatgcgc tcagtatata agcgaagtat tacaaaaaac aggcaggctt    1320

tatatcagca gtaaaaagga agaagaaaag atatactaa                           1359


<210> 63
<211> 1362
<212> DNA
<213> Artificial Sequence

<220> 
<223> Codon-optimized nucleotide sequence of chimera 4 (optimized for
      E. coli)

<400> 63
atgaccaaat acaaaaatga gttgaccggc aaacgtattt tgtttggaac cgtgcctgga      60

gatggacatt tcaacccctt aacaggctta gccaagtacc tgcaagaact gggctgcgat     120

gtacgctggt atgcatctga tgtatttaag tgcaaactgg agaagctgag catccctcac     180

tatgggttca agaaggcttg ggatgtaaat ggagtaaatg ttaatgaaat tcttccggag     240

cgtcaaaagc tgaccgaccc tgcggaaaag ctgagtttcg accttatcca catttttgga     300

aatcgcgctc ctgaatatta cgaggacatc ttggaaattc acgagagttt tcctttcgac     360

gtcttcatcg ccgactcctg cttcagtgct attcccttag tttccaagct tatgtctatt     420

cctgtcgtgg cagtaggggt gatcccgctg gcagaagaga gtgtggactt agcaccatac     480

ggaactggcc tgccgccagc tgcgacagaa gagcagcgcg ccatgtattt cggcatgaag     540

gacgcacttg ccaacgtggt gttcaaaaca gccattgact cgttttccgc cattttagat     600

cgttatcaag tgcctcacga gaaagcgatc ttatttgata ctcttattcg tcaaagcgat     660

ttgtttttgc aaatcggagc caaagctttc gagtatgacc gcagcgattt gggggaaaac     720

gtgcgtttcg ttggagccct gctgccttat tcggagagca aaagtcgtca accctggttc     780

gatcaaaagt tgttacaata tgggcgcaaa atcgtcgtag tcactcaggg aactgtagag     840

aaaaacatcg aaaagatttt ggtgccaacc cttgaggctt tccgcgacac tgacctgctt     900

gtgatcgcga cgacgggagg ttcaggaacc gctgaattga aaaaacgtta ccctcagggc     960

aacttaatca ttgaggactt cattccattt ggtgacatta tgccatacgc tgatgtatat    1020

atcaccaatg gtggttacgg cggagttatg cttggcatcg aaaatcaact gccccttgtc    1080

gtagccggca tccacgaagg aaagaacgag atcaacgcac gtattgggta ctttgagctt    1140

ggaatcaatc tgaaaacgga gtggccgaag ccagagcaga tgaaaaaagc gattgacgaa    1200

gttatcggta ataagaagta caaagagaat atcacaaaac tggcgaagga attctcaaac    1260

taccatccta acgaattgtg cgcccaatac atctctgaag tcttacagaa gaccggccgc    1320

ttgtacattt cgtccaagaa ggaagaagaa aagatttact aa                       1362


<210> 64
<211> 38
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTSopt_pET_fw

<400> 64
gggaattcca tatgatgaaa tatatcagct ccattcag                             38


<210> 65
<211> 33
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTSopt_pET_rv

<400> 65
cgggatcctt aaaccagaac ttcggcctga tag                                  33


<210> 66
<211> 59
<212> DNA
<213> Artificial Sequence

<220> 
<223> Bridge_P1_pETGTD

<400> 66
gcggccatat cgacgacgac gacaagcata tgacgaaata caaaaatgaa ttaacaggt      59


<210> 67
<211> 51
<212> DNA
<213> Artificial Sequence

<220> 
<223> Bridge_P1_pETGTD

<400> 67
ggaagaagaa aagatatact aaggatccgg ctgctaacaa agcccgaaag g              51


<210> 68
<211> 26
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chim_P1_D_Nde_for

<400> 68
catatgacga aatacaaaaa tgaatt                                         26


<210> 69
<211> 20
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chim_P1_D_rev

<400> 69
gcggtcatac tcaaatgatt                                                20


<210> 70
<211> 21
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chim_P1_C_for

<400> 70
agtgatctgg gaaaaaatat c                                              21


<210> 71
<211> 29
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chim_P1_C_Bam_rev

<400> 71
ggatccttag tatatctttt cttcttcct                                      29


<210> 72
<211> 33
<212> DNA
<213> Artificial Sequence

<220> 
<223> GTDopt_pEt_fw

<400> 72
gggaattcca tatgatgacc aaatacaaaa atg                                  33


<210> 73
<211> 33
<212> DNA
<213> Artificial Sequence

<220> 
<223> Chim3_pET_rv

<400> 73
cgggatcctt agtaaatctt ttcttcttcc ttc                                  33


<210> 74
<211> 28
<212> DNA
<213> Artificial Sequence

<220> 
<223> 1r-Chim3-opt-o(Chim3-opt)

<400> 74
tgccctgagg aaagcgcgca cgtaattc                                       28


<210> 75
<211> 28
<212> DNA
<213> Artificial Sequence

<220> 
<223> 2f-Chim3-opt-o(Chim3-opt)

<400> 75
tgcgcgcttt cctcagggca acttaatc                                       28


<210> 76
<211> 40
<212> DNA
<213> Artificial Sequence

<220> 
<223> 1f-Assembly-o(Vec)

<400> 76
tgacgataag gatcgatggg gatccatgac caaatacaaa                           40


<210> 77
<211> 43
<212> DNA
<213> Artificial Sequence

<220> 
<223> 1r-Assembly-o(Vec)

<400> 77
tatggtacca gctgcagatc tcgagttagt aaatcttttc ttc                       43


<210> 78
<211> 32
<212> DNA
<213> Artificial Sequence

<220> 
<223> 1r-Chim4_GTD-o(Chim4_GTC)

<400> 78
cgattttgcg cccatattgt aacaactttt ga                                   32


<210> 79
<211> 28
<212> DNA
<213> Artificial Sequence

<220> 
<223> 2f-Chim4_GTC-o(Chim4_GTD)

<400> 79
acaatatggg cgcaaaatcg tcgtagtc                                       28


