                               SEQUENCE LISTING

<110> SYNLOGIC OPERATING COMPANY, INC.
 
<120> MICROORGANISMS ENGINEERED TO REDUCE HYPERPHENYLALANINEMIA

<130> 12671.0026-00304

<140> PCT/US2021/063976
<141> 2021-12-17

<150> 63/132,627
<151> 2020-12-31

<160> 27    

<170> PatentIn version 3.5

<210> 1
<211> 532
<212> PRT
<213> Photorhabdus luminescens

<400> 1
Met Lys Ala Lys Asp Val Gln Pro Thr Ile Ile Ile Asn Lys Asn Gly 
1               5                   10                  15      


Leu Ile Ser Leu Glu Asp Ile Tyr Asp Ile Ala Ile Lys Gln Lys Lys 
            20                  25                  30          


Val Glu Ile Ser Thr Glu Ile Thr Glu Leu Leu Thr His Gly Arg Glu 
        35                  40                  45              


Lys Leu Glu Glu Lys Leu Asn Ser Gly Glu Val Ile Tyr Gly Ile Asn 
    50                  55                  60                  


Thr Gly Phe Gly Gly Asn Ala Asn Leu Val Val Pro Phe Glu Lys Ile 
65                  70                  75                  80  


Ala Glu His Gln Gln Asn Leu Leu Thr Phe Leu Ser Ala Gly Thr Gly 
                85                  90                  95      


Asp Tyr Met Ser Lys Pro Cys Ile Lys Ala Ser Gln Phe Thr Met Leu 
            100                 105                 110         


Leu Ser Val Cys Lys Gly Trp Ser Ala Thr Arg Pro Ile Val Ala Gln 
        115                 120                 125             


Ala Ile Val Asp His Ile Asn His Asp Ile Val Pro Leu Val Pro Arg 
    130                 135                 140                 


Tyr Gly Ser Val Gly Ala Ser Gly Asp Leu Ile Pro Leu Ser Tyr Ile 
145                 150                 155                 160 


Ala Arg Ala Leu Cys Gly Ile Gly Lys Val Tyr Tyr Met Gly Ala Glu 
                165                 170                 175     


Ile Asp Ala Ala Glu Ala Ile Lys Arg Ala Gly Leu Thr Pro Leu Ser 
            180                 185                 190         


Leu Lys Ala Lys Glu Gly Leu Ala Leu Ile Asn Gly Thr Arg Val Met 
        195                 200                 205             


Ser Gly Ile Ser Ala Ile Thr Val Ile Lys Leu Glu Lys Leu Phe Lys 
    210                 215                 220                 


Ala Ser Ile Ser Ala Ile Ala Leu Ala Val Glu Ala Leu Leu Ala Ser 
225                 230                 235                 240 


His Glu His Tyr Asp Ala Arg Ile Gln Gln Val Lys Asn His Pro Gly 
                245                 250                 255     


Gln Asn Ala Val Ala Ser Ala Leu Arg Asn Leu Leu Ala Gly Ser Thr 
            260                 265                 270         


Gln Val Asn Leu Leu Ser Gly Val Lys Glu Gln Ala Asn Lys Ala Cys 
        275                 280                 285             


Arg His Gln Glu Ile Thr Gln Leu Asn Asp Thr Leu Gln Glu Val Tyr 
    290                 295                 300                 


Ser Ile Arg Cys Ala Pro Gln Val Leu Gly Ile Val Pro Glu Ser Leu 
305                 310                 315                 320 


Ala Thr Ala Arg Lys Ile Leu Glu Arg Glu Val Ile Ser Ala Asn Asp 
                325                 330                 335     


Asn Pro Leu Ile Asp Pro Glu Asn Gly Asp Val Leu His Gly Gly Asn 
            340                 345                 350         


Phe Met Gly Gln Tyr Val Ala Arg Thr Met Asp Ala Leu Lys Leu Asp 
        355                 360                 365             


Ile Ala Leu Ile Ala Asn His Leu His Ala Ile Val Ala Leu Met Met 
    370                 375                 380                 


Asp Asn Arg Phe Ser Arg Gly Leu Pro Asn Ser Leu Ser Pro Thr Pro 
385                 390                 395                 400 


Gly Met Tyr Gln Gly Phe Lys Gly Val Gln Leu Ser Gln Thr Ala Leu 
                405                 410                 415     


Val Ala Ala Ile Arg His Asp Cys Ala Ala Ser Gly Ile His Thr Leu 
            420                 425                 430         


Ala Thr Glu Gln Tyr Asn Gln Asp Ile Val Ser Leu Gly Leu His Ala 
        435                 440                 445             


Ala Gln Asp Val Leu Glu Met Glu Gln Lys Leu Arg Asn Ile Val Ser 
    450                 455                 460                 


Met Thr Ile Leu Val Val Cys Gln Ala Ile His Leu Arg Gly Asn Ile 
465                 470                 475                 480 


Ser Glu Ile Ala Pro Glu Thr Ala Lys Phe Tyr His Ala Val Arg Glu 
                485                 490                 495     


Ile Ser Ser Pro Leu Ile Thr Asp Arg Ala Leu Asp Glu Asp Ile Ile 
            500                 505                 510         


Arg Ile Ala Asp Ala Ile Ile Asn Asp Gln Leu Pro Leu Pro Glu Ile 
        515                 520                 525             


Met Leu Glu Glu 
    530         


<210> 2
<211> 532
<212> PRT
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      polypeptide"

<400> 2
Met Lys Ala Lys Asp Val Gln Pro Thr Ile Ile Ile Asn Lys Asn Gly 
1               5                   10                  15      


Leu Ile Ser Leu Glu Asp Ile Tyr Asp Ile Ala Ile Lys Gln Lys Lys 
            20                  25                  30          


Val Glu Ile Ser Thr Glu Ile Thr Glu Leu Leu Thr His Gly Arg Glu 
        35                  40                  45              


Lys Leu Glu Glu Lys Leu Asn Ser Gly Glu Val Ile Tyr Gly Ile Asn 
    50                  55                  60                  


Thr Gly Phe Gly Gly Asn Ala Asn Leu Val Val Pro Phe Glu Lys Ile 
65                  70                  75                  80  


Ala Glu His Gln Gln Asn Leu Leu Thr Phe Leu Gly Ala Gly Thr Gly 
                85                  90                  95      


Asp Tyr Met Ser Lys Pro Cys Ile Lys Ala Ser Gln Phe Thr Met Leu 
            100                 105                 110         


Leu Ser Val Cys Lys Gly Trp Ser Ala Thr Arg Pro Ile Val Ala Gln 
        115                 120                 125             


Ala Ile Val Asp Met Ile Asn His Asp Ile Val Pro Leu Val Pro Arg 
    130                 135                 140                 


Tyr Gly Ser Val Gly Ala Ser Gly Asp Leu Ile Pro Leu Ser Tyr Ile 
145                 150                 155                 160 


Ala Arg Ala Leu Cys Gly Lys Gly Lys Val Tyr Tyr Met Gly Ala Glu 
                165                 170                 175     


Ile Asp Ala Ala Glu Ala Ile Lys Arg Ala Gly Leu Thr Pro Leu Ser 
            180                 185                 190         


Leu Lys Ala Lys Glu Gly Leu Ala Leu Ile Asn Gly Thr Arg Val Met 
        195                 200                 205             


Ser Gly Ile Ser Ala Ile Thr Val Ile Lys Leu Glu Lys Leu Phe Lys 
    210                 215                 220                 


Ala Ser Ile Ser Ala Ile Ala Leu Ala Val Glu Ala Leu Leu Ala Ser 
225                 230                 235                 240 


His Glu His Tyr Asp Ala Arg Ile Gln Gln Val Lys Asn His Pro Gly 
                245                 250                 255     


Gln Asn Ala Val Ala Ser Ala Leu Arg Asn Leu Leu Ala Gly Ser Thr 
            260                 265                 270         


Gln Val Asn Leu Leu Ser Gly Val Lys Glu Gln Ala Asn Lys Ala Cys 
        275                 280                 285             


Arg His Gln Glu Ile Thr Gln Leu Asn Asp Thr Leu Gln Glu Val Tyr 
    290                 295                 300                 


Ser Ile Arg Cys Ala Pro Gln Val Leu Gly Ile Val Pro Glu Ser Leu 
305                 310                 315                 320 


Ala Thr Ala Arg Lys Ile Leu Glu Arg Glu Val Ile Ser Ala Asn Asp 
                325                 330                 335     


Asn Pro Leu Ile Asp Pro Glu Asn Gly Asp Val Leu His Gly Gly Asn 
            340                 345                 350         


Phe Met Gly Gln Tyr Val Ala Arg Thr Met Asp Ala Leu Lys Leu Asp 
        355                 360                 365             


Ile Ala Leu Ile Ala Asn His Leu His Ala Ile Val Ala Leu Met Met 
    370                 375                 380                 


Asp Asn Arg Phe Ser Arg Gly Leu Pro Asn Ser Leu Ser Pro Thr Pro 
385                 390                 395                 400 


Gly Met Tyr Gln Gly Phe Lys Gly Val Gln Leu Ser Gln Thr Ala Leu 
                405                 410                 415     


Val Ala Ala Ile Arg His Asp Cys Ala Ala Ser Gly Ile His Thr Ile 
            420                 425                 430         


Ala Thr Glu Gln Tyr Asn Gln Asp Ile Val Ser Leu Gly Leu His Ala 
        435                 440                 445             


Ala Gln Asp Val Leu Glu Met Glu Gln Lys Leu Arg Asn Ile Val Ser 
    450                 455                 460                 


Met Thr Ile Leu Val Ala Cys Gln Ala Ile His Leu Arg Gly Asn Ile 
465                 470                 475                 480 


Ser Glu Ile Ala Pro Glu Thr Ala Lys Phe Tyr His Ala Val Arg Glu 
                485                 490                 495     


Ile Ser Ser Pro Leu Ile Thr Asp Arg Ala Leu Asp Glu Asp Ile Ile 
            500                 505                 510         


Arg Ile Ala Asp Ala Ile Ile Asn Asp Gln Leu Pro Leu Pro Glu Ile 
        515                 520                 525             


Met Leu Glu Glu 
    530         


<210> 3
<211> 532
<212> PRT
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      polypeptide"

<400> 3
Met Lys Ala Lys Asp Val Gln Pro Thr Ile Ile Ile Asn Lys Asn Gly 
1               5                   10                  15      


Leu Ile Ser Leu Glu Asp Ile Tyr Asp Ile Ala Ile Lys Gln Lys Lys 
            20                  25                  30          


Val Glu Ile Ser Thr Glu Ile Thr Glu Leu Leu Thr His Gly Arg Glu 
        35                  40                  45              


Lys Leu Glu Glu Lys Leu Asn Ser Gly Glu Val Ile Tyr Gly Ile Asn 
    50                  55                  60                  


Thr Gly Phe Gly Gly Asn Ala Asn Leu Val Val Pro Phe Glu Lys Ile 
65                  70                  75                  80  


Ala Glu His Gln Gln Asn Leu Leu Thr Phe Leu Gly Ala Gly Thr Gly 
                85                  90                  95      


Asp Tyr Met Ser Lys Pro Cys Ile Lys Ala Ser Gln Phe Thr Met Leu 
            100                 105                 110         


Leu Ser Val Cys Lys Gly Trp Ser Ala Thr Arg Pro Ile Val Ala Gln 
        115                 120                 125             


Ala Ile Val Asp Phe Ile Asn His Asp Ile Val Pro Leu Val Pro Arg 
    130                 135                 140                 


Tyr Gly Ser Val Gly Ala Ser Gly Asp Leu Ile Pro Leu Ser Tyr Ile 
145                 150                 155                 160 


Ala Arg Ala Leu Cys Gly Ile Gly Lys Val Tyr Tyr Met Gly Ala Glu 
                165                 170                 175     


Ile Asp Ala Ala Glu Ala Ile Lys Arg Ala Gly Leu Thr Pro Leu Ser 
            180                 185                 190         


Leu Lys Ala Lys Glu Gly Leu Ala Leu Ile Asn Gly Thr Arg Val Met 
        195                 200                 205             


Ser Gly Ile Ser Ala Ile Thr Val Ile Lys Leu Glu Lys Leu Phe Lys 
    210                 215                 220                 


Ala Ser Ile Ser Ala Ile Ala Leu Ala Val Glu Ala Leu Leu Ala Ser 
225                 230                 235                 240 


His Glu His Tyr Asp Ala Arg Ile Gln Gln Val Lys Asn His Pro Gly 
                245                 250                 255     


Gln Asn Ala Val Ala Ser Ala Leu Arg Asn Leu Leu Ala Gly Ser Thr 
            260                 265                 270         


Gln Val Asn Leu Leu Ser Gly Val Lys Glu Gln Ala Asn Lys Ala Cys 
        275                 280                 285             


Arg His Gln Glu Ile Thr Gln Leu Asn Asp Thr Leu Gln Glu Val Tyr 
    290                 295                 300                 


Ser Ile Arg Cys Ala Pro Gln Val Leu Gly Ile Val Pro Glu Ser Leu 
305                 310                 315                 320 


Ala Thr Ala Arg Lys Ile Leu Glu Arg Glu Val Ile Ser Ala Asn Asp 
                325                 330                 335     


Asn Pro Leu Ile Asp Pro Glu Asn Gly Asp Val Leu His Gly Gly Asn 
            340                 345                 350         


Phe Met Gly Gln Tyr Val Ala Arg Thr Met Asp Ala Leu Lys Leu Asp 
        355                 360                 365             


Ile Ala Leu Ile Ala Asn His Leu His Ala Ile Val Ala Leu Met Met 
    370                 375                 380                 


Asp Asn Arg Phe Ser Arg Gly Leu Pro Asn Ser Leu Ser Pro Thr Pro 
385                 390                 395                 400 


Gly Met Tyr Gln Gly Phe Lys Gly Val Gln Leu Ser Gln Thr Ala Leu 
                405                 410                 415     


Val Ala Ala Ile Arg His Asp Cys Ala Ala Ser Gly Ile His Thr Leu 
            420                 425                 430         


Ser Thr Glu Gln Tyr Asn Gln Asp Ile Val Ser Leu Gly Leu His Ala 
        435                 440                 445             


Ala Gln Asp Val Leu Glu Met Glu Gln Lys Leu Arg Asn Ile Val Ser 
    450                 455                 460                 


Met Thr Ile Leu Val Ala Cys Gln Ala Ile His Leu Arg Gly Asn Ile 
465                 470                 475                 480 


Ser Glu Ile Ala Pro Glu Thr Ala Lys Phe Tyr His Ala Val Arg Glu 
                485                 490                 495     


Ile Ser Ser Pro Leu Ile Thr Asp Arg Ala Leu Asp Glu Asp Ile Ile 
            500                 505                 510         


Arg Ile Ala Asp Ala Ile Ile Asn Asp Gln Leu Pro Leu Pro Glu Ile 
        515                 520                 525             


Met Leu Glu Glu 
    530         


<210> 4
<211> 532
<212> PRT
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      polypeptide"

<400> 4
Met Lys Ala Lys Asp Val Gln Pro Thr Ile Ile Ile Asn Lys Asn Gly 
1               5                   10                  15      


Leu Ile Ser Leu Glu Asp Ile Tyr Asp Ile Ala Ile Lys Gln Lys Lys 
            20                  25                  30          


Val Glu Ile Ser Thr Glu Ile Thr Glu Leu Leu Thr His Gly Arg Glu 
        35                  40                  45              


Lys Leu Glu Glu Lys Leu Asn Ser Gly Glu Val Ile Tyr Gly Ile Asn 
    50                  55                  60                  


Thr Gly Phe Gly Gly Asn Ala Asn Leu Val Val Pro Phe Glu Lys Ile 
65                  70                  75                  80  


Ala Glu His Gln Gln Asn Leu Leu Thr Phe Leu Gly Ala Gly Thr Gly 
                85                  90                  95      


Asp Tyr Met Ser Lys Pro Cys Ile Lys Ala Ser Gln Phe Thr Met Leu 
            100                 105                 110         


Leu Ser Val Cys Lys Gly Trp Ser Ala Thr Arg Pro Ile Val Ala Gln 
        115                 120                 125             


Ala Ile Val Asp Phe Ile Asn His Asp Ile Val Pro Leu Val Pro Arg 
    130                 135                 140                 


Tyr Gly Ser Val Gly Ala Ser Gly Asp Leu Ile Pro Leu Ser Tyr Ile 
145                 150                 155                 160 


Ala Arg Ala Leu Cys Gly Ile Gly Lys Val Tyr Tyr Met Gly Ala Glu 
                165                 170                 175     


Ile Asp Ala Ala Glu Ala Ile Lys Arg Ala Gly Leu Thr Pro Leu Ser 
            180                 185                 190         


Leu Lys Ala Lys Glu Gly Leu Ala Leu Ile Asn Gly Thr Arg Val Met 
        195                 200                 205             


Ser Gly Ile Ser Ala Ile Thr Val Ile Lys Leu Glu Lys Leu Phe Lys 
    210                 215                 220                 


Ala Ser Ile Ser Ala Ile Ala Leu Ala Val Glu Ala Leu Leu Ala Ser 
225                 230                 235                 240 


His Glu His Tyr Asp Ala Arg Ile Gln Gln Val Lys Asn His Pro Gly 
                245                 250                 255     


Gln Asn Ala Val Ala Ser Thr Leu Arg Asn Leu Leu Ala Gly Ser Thr 
            260                 265                 270         


Gln Val Asn Leu Leu Ser Gly Val Lys Glu Gln Ala Asn Lys Ala Cys 
        275                 280                 285             


Arg His Gln Glu Ile Thr Gln Leu Asn Asp Thr Leu Gln Glu Val Tyr 
    290                 295                 300                 


Ser Ile Arg Cys Ala Pro Gln Val Leu Gly Ile Val Pro Glu Ser Leu 
305                 310                 315                 320 


Ala Thr Ala Arg Lys Ile Leu Glu Arg Glu Val Ile Ser Ala Asn Asp 
                325                 330                 335     


Asn Pro Leu Ile Asp Pro Glu Asn Gly Asp Val Leu His Gly Gly Asn 
            340                 345                 350         


Phe Met Gly Gln Tyr Val Ala Arg Thr Met Asp Ala Leu Lys Leu Asp 
        355                 360                 365             


Ile Ala Leu Ile Ala Asn His Leu His Ala Ile Val Ala Leu Met Met 
    370                 375                 380                 


Asp Asn Arg Phe Ser Arg Gly Leu Pro Asn Ser Leu Ser Pro Thr Pro 
385                 390                 395                 400 


Gly Met Tyr Gln Gly Phe Lys Gly Val Gln Leu Ser Gln Thr Ala Leu 
                405                 410                 415     


Val Ala Ala Ile Arg His Asp Cys Ala Ala Ser Gly Ile His Thr Leu 
            420                 425                 430         


Ala Thr Glu Gln Tyr Asn Gln Asp Ile Val Ser Leu Gly Leu His Ala 
        435                 440                 445             


Ala Gln Asp Val Leu Glu Met Glu Gln Lys Leu Arg Asn Ile Val Ser 
    450                 455                 460                 


Met Thr Ile Leu Val Ala Cys Gln Ala Ile His Leu Arg Gly Asn Ile 
465                 470                 475                 480 


Ser Glu Ile Ala Pro Glu Thr Ala Lys Phe Tyr His Ala Val Arg Glu 
                485                 490                 495     


Ile Ser Ser Pro Leu Ile Thr Asp Arg Ala Leu Asp Glu Asp Ile Ile 
            500                 505                 510         


Arg Ile Ala Asp Ala Ile Ile Asn Asp Gln Leu Pro Leu Pro Glu Ile 
        515                 520                 525             


Met Leu Glu Glu 
    530         


<210> 5
<211> 46
<212> DNA
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      primer"

<400> 5
gtacataggg ctttgaacgc ttgctcgtgt aggctggagc tgcttc                      46


<210> 6
<211> 46
<212> DNA
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      primer"

<400> 6
gtacataggg ctttgaacgc ttgctccggc tgacatggga attagc                      46


<210> 7
<211> 2753
<212> DNA
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      polynucleotide"

<400> 7
tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa       60

cgcgcgggga gaggcggttt gcgtattggg cgccagggtg gtttttcttt tcaccagtga      120

gactggcaac agctgattgc ccttcaccgc ctggccctga gagagttgca gcaagcggtc      180

cacgctggtt tgccccagca ggcgaaaatc ctgtttgatg gtggttaacg gcgggatata      240

acatgagcta tcttcggtat cgtcgtatcc cactaccgag atatccgcac caacgcgcag      300

cccggactcg gtaatggcgc gcattgcgcc cagcgccatc tgatcgttgg caaccagcat      360

cgcagtggga acgatgccct cattcagcat ttgcatggtt tgttgaaaac cggacatggc      420

actccagtcg ccttcccgtt ccgctatcgg ctgaatttga ttgcgagtga gatatttatg      480

ccagccagcc agacgcagac gcgccgagac agaacttaat gggcccgcta acagcgcgat      540

ttgctggtga cccaatgcga ccagatgctc cacgcccagt cgcgtaccgt cctcatggga      600

gaaaataata ctgttgatgg gtgtctggtc agagacatca agaaataacg ccggaacatt      660

agtgcaggca gcttccacag caatggcatc ctggtcatcc agcggatagt taatgatcag      720

cccactgacg cgttgcgcga gaagattgtg caccgccgct ttacaggctt cgacgccgct      780

tcgttctacc atcgacacca ccacgctggc acccagttga tcggcgcgag atttaatcgc      840

cgcgacaatt tgcgacggcg cgtgcagggc cagactggag gtggcaacgc caatcagcaa      900

cgactgtttg cccgccagtt gttgtgccac gcggttggga atgtaattca gctccgccat      960

cgccgcttcc actttttccc gcgttttcgc agaaacgtgg ctggcctggt tcaccacgcg     1020

ggaaacggtc tgataagaga caccggcata ctctgcgaca tcgtataacg ttactggttt     1080

catattcacc accctgaatt gactctcttc cgggcgctat catgccatac cgcgaaaggt     1140

tttgcgccat tcgatggcgc gccgcttcgt caggccacat agctttcttg ttctgatcgg     1200

aacgatcgtt ggctgtgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag     1260

cgctcacaat tagctgtcac cggatgtgct ttccggtctg atgagtccgt gaggacgaaa     1320

cagcctctac aaataatttt gtttaatagc ttcggtaagt ttacggagga tttgtcatga     1380

aaaacgctag tacggtaagt gaggatacag cctcgaacca agaaccaacg ttgcaccgtg     1440

ggttgcacaa ccgtcacatc cagcttatcg cacttggcgg cgctattgga acggggttat     1500

tccttggaat tggaccggcc attcaaatgg ccgggccagc ggttttgttg gggtatggtg     1560

tagcggggat cattgcattt cttattatgc gtcagttggg agaaatggtg gttgaagaac     1620

ctgtatccgg cagcttcgcg catttcgcgt acaagtattg gggtccattt gctggctttc     1680

tgagtggctg gaattattgg gtgatgtttg tcctggttgg aatggcggaa cttactgcgg     1740

ccggcattta tatgcagtac tggtttcctg atgtgcctac gtggatctgg gccgcggctt     1800

tctttattat tatcaatgca gtcaatctgg tcaacgtgcg cttgtatggt gagacggagt     1860

tctggttcgc attaattaag gtattagcta ttatcggaat gattggcttt gggttatggt     1920

tgctgtttag cgggcacggc ggtgagaaag cctctatcga taacctttgg cgctacggtg     1980

ggttctttgc tacaggatgg aacgggttaa tcttgagtct tgcggtcatc atgttcagtt     2040

ttggtggcct tgaattgatt ggtatcacgg cagcagaggc gcgtgaccca gaaaaaagca     2100

tccccaaagc cgttaatcag gtggtgtacc gcatcttatt attttacatt ggttcactgg     2160

tcgtgttgtt ggctctgtac ccatgggttg aggtgaaatc taactcatcc cccttcgtca     2220

tgatctttca taaccttgat tcaaatgtgg tcgccagcgc gttaaacttt gtaatcctgg     2280

tggcaagcct ttccgtgtac aattcagggg tctattctaa tagtcgtatg ttgttcgggc     2340

tttcggtcca aggaaacgcg ccgaaattcc tgacacgcgt tagtcgtcgt ggtgtgccca     2400

ttaatagcct gatgctgagt ggtgcaatca cttctttagt cgtgcttatt aactatttac     2460

tgcctcagaa ggcattcggg ttattaatgg ctttagttgt cgcaacgtta ttgttaaact     2520

ggatcatgat ctgtttagca cacctgcgtt tccgtgcggc tatgcgtcgc cagggtcgtg     2580

aaacccagtt caaggcctta ctttatccct ttggtaatta cttgtgcatt gcatttttag     2640

gcatgatttt actgctgatg tgtactatgg atgatatgcg cctgtccgca atccttttac     2700

ccgtctggat tgtttttctt tttatggcat tcaaaacact tcgtcgcaag taa            2753


<210> 8
<211> 2975
<212> DNA
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      polynucleotide"

<400> 8
tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa       60

cgcgcgggga gaggcggttt gcgtattggg cgccagggtg gtttttcttt tcaccagtga      120

gactggcaac agctgattgc ccttcaccgc ctggccctga gagagttgca gcaagcggtc      180

cacgctggtt tgccccagca ggcgaaaatc ctgtttgatg gtggttaacg gcgggatata      240

acatgagcta tcttcggtat cgtcgtatcc cactaccgag atatccgcac caacgcgcag      300

cccggactcg gtaatggcgc gcattgcgcc cagcgccatc tgatcgttgg caaccagcat      360

cgcagtggga acgatgccct cattcagcat ttgcatggtt tgttgaaaac cggacatggc      420

actccagtcg ccttcccgtt ccgctatcgg ctgaatttga ttgcgagtga gatatttatg      480

ccagccagcc agacgcagac gcgccgagac agaacttaat gggcccgcta acagcgcgat      540

ttgctggtga cccaatgcga ccagatgctc cacgcccagt cgcgtaccgt cctcatggga      600

gaaaataata ctgttgatgg gtgtctggtc agagacatca agaaataacg ccggaacatt      660

agtgcaggca gcttccacag caatggcatc ctggtcatcc agcggatagt taatgatcag      720

cccactgacg cgttgcgcga gaagattgtg caccgccgct ttacaggctt cgacgccgct      780

tcgttctacc atcgacacca ccacgctggc acccagttga tcggcgcgag atttaatcgc      840

cgcgacaatt tgcgacggcg cgtgcagggc cagactggag gtggcaacgc caatcagcaa      900

cgactgtttg cccgccagtt gttgtgccac gcggttggga atgtaattca gctccgccat      960

cgccgcttcc actttttccc gcgttttcgc agaaacgtgg ctggcctggt tcaccacgcg     1020

ggaaacggtc tgataagaga caccggcata ctctgcgaca tcgtataacg ttactggttt     1080

catattcacc accctgaatt gactctcttc cgggcgctat catgccatac cgcgaaaggt     1140

tttgcgccat tcgatggcgc gccgcttcgt caggccacat agctttcttg ttctgatcgg     1200

aacgatcgtt ggctgtgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag     1260

cgctcacaat tagctgtcac cggatgtgct ttccggtctg atgagtccgt gaggacgaaa     1320

cagcctctac aaataatttt gtttaatagc ttcggtaagt ttacggagga tttgtcatga     1380

aagctaaaga tgttcagcca accattatta ttaataaaaa tggccttatc tctttggaag     1440

atatctatga cattgcgata aaacaaaaaa aagtagaaat atcaacggag atcactgaac     1500

ttttgacgca tggtcgtgaa aaattagagg aaaaattaaa ttcaggagag gttatatatg     1560

gaatcaatac aggatttgga gggaatgcca atttagttgt gccatttgag aaaatcgcag     1620

agcatcagca aaatctgtta acttttcttg gcgctggtac tggggactat atgtccaaac     1680

cttgtattaa agcgtcacaa tttactatgt tactttctgt ttgcaaaggt tggtctgcaa     1740

ccagaccaat tgtcgctcaa gcaattgttg atatgattaa tcatgacatt gttcctctgg     1800

ttcctcgcta tggctcagtg ggtgcaagcg gtgatttaat tcctttatct tatattgcac     1860

gagcattatg tggtaagggc aaagtttatt atatgggcgc agaaattgac gctgctgaag     1920

caattaaacg tgcagggttg acaccattat cgttaaaagc caaagaaggt cttgctctga     1980

ttaacggcac ccgggtaatg tcaggaatca gtgcaatcac cgtcattaaa ctggaaaaac     2040

tatttaaagc ctcaatttct gcgattgccc ttgctgttga agcattactt gcatctcatg     2100

aacattatga tgcccggatt caacaagtaa aaaatcatcc tggtcaaaac gcggtggcaa     2160

gtgcattgcg taatttattg gcaggttcaa cgcaggttaa tctattatct ggggttaaag     2220

aacaagccaa taaagcttgt cgtcatcaag aaattaccca actaaatgat accttacagg     2280

aagtttattc aattcgctgt gcaccacaag tattaggtat agtgccagaa tctttagcta     2340

ccgctcggaa aatattggaa cgggaagtta tctcagctaa tgataatcca ttgatagatc     2400

cagaaaatgg cgatgttcta cacggtggaa attttatggg gcaatatgtc gcccgaacaa     2460

tggatgcatt aaaactggat attgctttaa ttgccaatca tcttcacgcc attgtggctc     2520

ttatgatgga taaccgtttc tctcgtggat tacctaattc actgagtccg acacccggca     2580

tgtatcaagg ttttaaaggc gtccaacttt ctcaaaccgc tttagttgct gcaattcgcc     2640

atgattgtgc tgcatcaggt attcatacca ttgccacaga acaatacaat caagatattg     2700

tcagtttagg tctgcatgcc gctcaagatg ttttagagat ggagcagaaa ttacgcaata     2760

ttgtttcaat gacaattctg gtagcctgtc aggccattca tcttcgcggc aatattagtg     2820

aaattgcgcc tgaaactgct aaattttacc atgcagtacg cgaaatcagt tctcctttga     2880

tcactgatcg tgcgttggat gaagatataa tccgcattgc ggatgcaatt attaatgatc     2940

aacttcctct gccagaaatc atgctggaag aataa                                2975


<210> 9
<211> 117
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 9
atccccatca ctcttgatgg agatcaattc cccaagctgc tagagcgtta ccttgccctt       60

aaacattagc aatgtcgatt tatcagaggg ccgacaggct cccacaggag aaaaccg         117


<210> 10
<211> 108
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 10
ctcttgatcg ttatcaattc ccacgctgtt tcagagcgtt accttgccct taaacattag       60

caatgtcgat ttatcagagg gccgacaggc tcccacagga gaaaaccg                   108


<210> 11
<211> 290
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 11
gtcagcataa caccctgacc tctcattaat tgttcatgcc gggcggcact atcgtcgtcc       60

ggccttttcc tctcttactc tgctacgtac atctatttct ataaatccgt tcaatttgtc      120

tgttttttgc acaaacatga aatatcagac aattccgtga cttaagaaaa tttatacaaa      180

tcagcaatat accccttaag gagtatataa aggtgaattt gatttacatc aataagcggg      240

gttgctgaat cgttaaggta ggcggtaata gaaaagaaat cgaggcaaaa                 290


<210> 12
<211> 433
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 12
cggcccgatc gttgaacata gcggtccgca ggcggcactg cttacagcaa acggtctgta       60

cgctgtcgtc tttgtgatgt gcttcctgtt aggtttcgtc agccgtcacc gtcagcataa      120

caccctgacc tctcattaat tgctcatgcc ggacggcact atcgtcgtcc ggccttttcc      180

tctcttcccc cgctacgtgc atctatttct ataaacccgc tcattttgtc tattttttgc      240

acaaacatga aatatcagac aattccgtga cttaagaaaa tttatacaaa tcagcaatat      300

acccattaag gagtatataa aggtgaattt gatttacatc aataagcggg gttgctgaat      360

cgttaaggta ggcggtaata gaaaagaaat cgaggcaaaa atgtttgttt aactttaaga      420

aggagatata cat                                                         433


<210> 13
<211> 290
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 13
gtcagcataa caccctgacc tctcattaat tgctcatgcc ggacggcact atcgtcgtcc       60

ggccttttcc tctcttcccc cgctacgtgc atctatttct ataaacccgc tcattttgtc      120

tattttttgc acaaacatga aatatcagac aattccgtga cttaagaaaa tttatacaaa      180

tcagcaatat acccattaag gagtatataa aggtgaattt gatttacatc aataagcggg      240

gttgctgaat cgttaaggta ggcggtaata gaaaagaaat cgaggcaaaa                 290


<210> 14
<211> 173
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 14
atttcctctc atcccatccg gggtgagagt cttttccccc gacttatggc tcatgcatgc       60

atcaaaaaag atgtgagctt gatcaaaaac aaaaaatatt tcactcgaca ggagtattta      120

tattgcgccc gttacgtggg cttcgactgt aaatcagaaa ggagaaaaca cct             173


<210> 15
<211> 305
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 15
gtcagcataa caccctgacc tctcattaat tgttcatgcc gggcggcact atcgtcgtcc       60

ggccttttcc tctcttactc tgctacgtac atctatttct ataaatccgt tcaatttgtc      120

tgttttttgc acaaacatga aatatcagac aattccgtga cttaagaaaa tttatacaaa      180

tcagcaatat accccttaag gagtatataa aggtgaattt gatttacatc aataagcggg      240

gttgctgaat cgttaaggat ccctctagaa ataattttgt ttaactttaa gaaggagata      300

tacat                                                                  305


<210> 16
<211> 180
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 16
catttcctct catcccatcc ggggtgagag tcttttcccc cgacttatgg ctcatgcatg       60

catcaaaaaa gatgtgagct tgatcaaaaa caaaaaatat ttcactcgac aggagtattt      120

atattgcgcc cggatccctc tagaaataat tttgtttaac tttaagaagg agatatacat      180


<210> 17
<211> 199
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 17
agttgttctt attggtggtg ttgctttatg gttgcatcgt agtaaatggt tgtaacaaaa       60

gcaatttttc cggctgtctg tatacaaaaa cgccgtaaag tttgagcgaa gtcaataaac      120

tctctaccca ttcagggcaa tatctctctt ggatccctct agaaataatt ttgtttaact      180

ttaagaagga gatatacat                                                   199


<210> 18
<211> 207
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 18
agttgttctt attggtggtg ttgctttatg gttgcatcgt agtaaatggt tgtaacaaaa       60

gcaatttttc cggctgtctg tatacaaaaa cgccgcaaag tttgagcgaa gtcaataaac      120

tctctaccca ttcagggcaa tatctctctt ggatccaaag tgaactctag aaataatttt      180

gtttaacttt aagaaggaga tatacat                                          207


<210> 19
<211> 390
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 19
tcgtctttgt gatgtgcttc ctgttaggtt tcgtcagccg tcaccgtcag cataacaccc       60

tgacctctca ttaattgctc atgccggacg gcactatcgt cgtccggcct tttcctctct      120

tcccccgcta cgtgcatcta tttctataaa cccgctcatt ttgtctattt tttgcacaaa      180

catgaaatat cagacaattc cgtgacttaa gaaaatttat acaaatcagc aatataccca      240

ttaaggagta tataaaggtg aatttgattt acatcaataa gcggggttgc tgaatcgtta      300

aggtagaaat gtgatctagt tcacatttgc ggtaatagaa aagaaatcga ggcaaaaatg      360

tttgtttaac tttaagaagg agatatacat                                       390


<210> 20
<211> 200
<212> DNA
<213> Unknown

<220>
<221> source
<223> /note="Description of Unknown: 
      FNR-responsive regulatory region sequence"

<400> 20
agttgttctt attggtggtg ttgctttatg gttgcatcgt agtaaatggt tgtaacaaaa       60

gcaatttttc cggctgtctg tatacaaaaa cgccgcaaag tttgagcgaa gtcaataaac      120

tctctaccca ttcagggcaa tatctctcaa atgtgatcta gttcacattt tttgtttaac      180

tttaagaagg agatatacat                                                  200


<210> 21
<211> 567
<212> PRT
<213> Anabaena variabilis

<400> 21
Met Lys Thr Leu Ser Gln Ala Gln Ser Lys Thr Ser Ser Gln Gln Phe 
1               5                   10                  15      


Ser Phe Thr Gly Asn Ser Ser Ala Asn Val Ile Ile Gly Asn Gln Lys 
            20                  25                  30          


Leu Thr Ile Asn Asp Val Ala Arg Val Ala Arg Asn Gly Thr Leu Val 
        35                  40                  45              


Ser Leu Thr Asn Asn Thr Asp Ile Leu Gln Gly Ile Gln Ala Ser Cys 
    50                  55                  60                  


Asp Tyr Ile Asn Asn Ala Val Glu Ser Gly Glu Pro Ile Tyr Gly Val 
65                  70                  75                  80  


Thr Ser Gly Phe Gly Gly Met Ala Asn Val Ala Ile Ser Arg Glu Gln 
                85                  90                  95      


Ala Ser Glu Leu Gln Thr Asn Leu Val Trp Phe Leu Lys Thr Gly Ala 
            100                 105                 110         


Gly Asn Lys Leu Pro Leu Ala Asp Val Arg Ala Ala Met Leu Leu Arg 
        115                 120                 125             


Ala Asn Ser His Met Arg Gly Ala Ser Gly Ile Arg Leu Glu Leu Ile 
    130                 135                 140                 


Lys Arg Met Glu Ile Phe Leu Asn Ala Gly Val Thr Pro Tyr Val Tyr 
145                 150                 155                 160 


Glu Phe Gly Ser Ile Gly Ala Ser Gly Asp Leu Val Pro Leu Ser Tyr 
                165                 170                 175     


Ile Thr Gly Ser Leu Ile Gly Leu Asp Pro Ser Phe Lys Val Asp Phe 
            180                 185                 190         


Asn Gly Lys Glu Met Asp Ala Pro Thr Ala Leu Arg Gln Leu Asn Leu 
        195                 200                 205             


Ser Pro Leu Thr Leu Leu Pro Lys Glu Gly Leu Ala Met Met Asn Gly 
    210                 215                 220                 


Thr Ser Val Met Thr Gly Ile Ala Ala Asn Cys Val Tyr Asp Thr Gln 
225                 230                 235                 240 


Ile Leu Thr Ala Ile Ala Met Gly Val His Ala Leu Asp Ile Gln Ala 
                245                 250                 255     


Leu Asn Gly Thr Asn Gln Ser Phe His Pro Phe Ile His Asn Ser Lys 
            260                 265                 270         


Pro His Pro Gly Gln Leu Trp Ala Ala Asp Gln Met Ile Ser Leu Leu 
        275                 280                 285             


Ala Asn Ser Gln Leu Val Arg Asp Glu Leu Asp Gly Lys His Asp Tyr 
    290                 295                 300                 


Arg Asp His Glu Leu Ile Gln Asp Arg Tyr Ser Leu Arg Cys Leu Pro 
305                 310                 315                 320 


Gln Tyr Leu Gly Pro Ile Val Asp Gly Ile Ser Gln Ile Ala Lys Gln 
                325                 330                 335     


Ile Glu Ile Glu Ile Asn Ser Val Thr Asp Asn Pro Leu Ile Asp Val 
            340                 345                 350         


Asp Asn Gln Ala Ser Tyr His Gly Gly Asn Phe Leu Gly Gln Tyr Val 
        355                 360                 365             


Gly Met Gly Met Asp His Leu Arg Tyr Tyr Ile Gly Leu Leu Ala Lys 
    370                 375                 380                 


His Leu Asp Val Gln Ile Ala Leu Leu Ala Ser Pro Glu Phe Ser Asn 
385                 390                 395                 400 


Gly Leu Pro Pro Ser Leu Leu Gly Asn Arg Glu Arg Lys Val Asn Met 
                405                 410                 415     


Gly Leu Lys Gly Leu Gln Ile Cys Gly Asn Ser Ile Met Pro Leu Leu 
            420                 425                 430         


Thr Phe Tyr Gly Asn Ser Ile Ala Asp Arg Phe Pro Thr His Ala Glu 
        435                 440                 445             


Gln Phe Asn Gln Asn Ile Asn Ser Gln Gly Tyr Thr Ser Ala Thr Leu 
    450                 455                 460                 


Ala Arg Arg Ser Val Asp Ile Phe Gln Asn Tyr Val Ala Ile Ala Leu 
465                 470                 475                 480 


Met Phe Gly Val Gln Ala Val Asp Leu Arg Thr Tyr Lys Lys Thr Gly 
                485                 490                 495     


His Tyr Asp Ala Arg Ala Cys Leu Ser Pro Ala Thr Glu Arg Leu Tyr 
            500                 505                 510         


Ser Ala Val Arg His Val Val Gly Gln Lys Pro Thr Ser Asp Arg Pro 
        515                 520                 525             


Tyr Ile Trp Asn Asp Asn Glu Gln Gly Leu Asp Glu His Ile Ala Arg 
    530                 535                 540                 


Ile Ser Ala Asp Ile Ala Ala Gly Gly Val Ile Val Gln Ala Val Gln 
545                 550                 555                 560 


Asp Ile Leu Pro Cys Leu His 
                565         


<210> 22
<211> 514
<212> PRT
<213> Photorhabdus laumondii

<400> 22
Met Lys Gln Leu Thr Ile Tyr Pro Gly Lys Leu Thr Leu Asp Glu Leu 
1               5                   10                  15      


Arg Gln Val Tyr Leu Gln Pro Val Lys Ile Thr Leu Asp Ser Gln Ile 
            20                  25                  30          


Phe Pro Ala Ile Glu Arg Ser Val Glu Cys Val Asn Ala Ile Leu Ala 
        35                  40                  45              


Glu Asn Arg Thr Ala Tyr Gly Ile Asn Thr Gly Phe Gly Leu Leu Ala 
    50                  55                  60                  


Ser Thr Arg Ile Glu Glu Asp Asn Leu Glu Lys Leu Gln Arg Ser Leu 
65                  70                  75                  80  


Val Val Ser His Ala Ala Gly Val Gly Lys Ala Leu Asp Asp Asn Met 
                85                  90                  95      


Thr Arg Leu Ile Met Val Leu Lys Ile Asn Ser Leu Ser Arg Gly Tyr 
            100                 105                 110         


Ser Gly Ile Arg Leu Ala Val Ile Gln Ala Leu Ile Ala Leu Val Asn 
        115                 120                 125             


Ala Glu Ile Tyr Pro His Ile Pro Cys Lys Gly Ser Val Gly Ala Ser 
    130                 135                 140                 


Gly Asp Leu Ala Pro Leu Ala His Met Ser Leu Leu Leu Leu Gly Glu 
145                 150                 155                 160 


Gly Gln Ala Arg Tyr Gln Gly Glu Trp Leu Pro Ala Lys Glu Ala Leu 
                165                 170                 175     


Ala Lys Ala Asn Leu Gln Pro Ile Thr Leu Ala Ala Lys Glu Gly Leu 
            180                 185                 190         


Ala Leu Leu Asn Gly Thr Gln Val Ser Thr Ala Phe Ala Leu Arg Gly 
        195                 200                 205             


Leu Phe Glu Ala Glu Asp Leu Leu Ala Ala Ala Ile Val Cys Gly Ser 
    210                 215                 220                 


Leu Ser Val Glu Ala Ala Leu Gly Ser Arg Lys Pro Phe Asp Ala Arg 
225                 230                 235                 240 


Val His Val Val Arg Gly Gln Gln Gly Gln Ile Asp Val Ala Ala Leu 
                245                 250                 255     


Tyr Arg His Val Leu Glu Glu Ser Ser Glu Leu Ser Asp Ser His Ile 
            260                 265                 270         


Asn Cys Pro Lys Val Gln Asp Pro Tyr Ser Leu Arg Cys Gln Pro Gln 
        275                 280                 285             


Val Met Gly Ala Cys Leu Thr Gln Leu Arg His Ala Ala Asp Val Ile 
    290                 295                 300                 


Leu Thr Glu Ala Asn Ala Val Ser Asp Asn Pro Leu Val Phe Ala Glu 
305                 310                 315                 320 


Gln Gly Glu Val Ile Ser Gly Gly Asn Phe His Ala Glu Pro Val Ala 
                325                 330                 335     


Met Ala Ser Asp Asn Leu Ala Leu Val Leu Ala Glu Ile Gly Ala Leu 
            340                 345                 350         


Ser Glu Arg Arg Ile Ala Leu Leu Met Asp Ser His Met Ser Gln Leu 
        355                 360                 365             


Pro Pro Phe Leu Val Glu Asn Gly Gly Val Asn Ser Gly Phe Met Ile 
    370                 375                 380                 


Ala Gln Val Thr Ala Ala Ala Leu Ala Ser Glu Asn Lys Ala Leu Ala 
385                 390                 395                 400 


His Pro Ala Ser Val Asp Ser Leu Pro Thr Ser Ala Asn Gln Glu Asp 
                405                 410                 415     


His Val Ser Met Ala Pro Ala Ala Gly Arg Arg Leu Trp Glu Met Ala 
            420                 425                 430         


Glu Asn Thr Arg Gly Ile Leu Ala Ile Glu Trp Leu Ser Ala Cys Gln 
        435                 440                 445             


Gly Ile Asp Phe Arg Asn Gly Leu Lys Ser Ser Pro Ile Leu Glu Glu 
    450                 455                 460                 


Ala Arg Val Ile Leu Arg Ala Lys Val Asp Tyr Tyr Asp Gln Asp Arg 
465                 470                 475                 480 


Phe Phe Ala Pro Asp Ile Asp Ala Ala Val Lys Leu Leu Ala Glu Gln 
                485                 490                 495     


His Leu Ser Ser Leu Leu Pro Ser Gly Gln Ile Leu Gln Arg Lys Asn 
            500                 505                 510         


Asn Arg 
        


<210> 23
<211> 471
<212> PRT
<213> Proteus mirabilis

<400> 23
Met Ala Ile Ser Arg Arg Lys Phe Ile Leu Gly Gly Thr Val Val Ala 
1               5                   10                  15      


Val Ala Ala Gly Ala Gly Val Leu Thr Pro Met Leu Thr Arg Glu Gly 
            20                  25                  30          


Arg Phe Val Pro Gly Thr Pro Arg His Gly Phe Val Glu Gly Thr Gly 
        35                  40                  45              


Gly Pro Leu Pro Lys Gln Asp Asp Val Val Val Ile Gly Ala Gly Ile 
    50                  55                  60                  


Leu Gly Ile Met Thr Ala Ile Asn Leu Ala Glu Arg Gly Leu Ser Val 
65                  70                  75                  80  


Thr Ile Val Glu Lys Gly Asn Ile Ala Gly Glu Gln Ser Ser Arg Phe 
                85                  90                  95      


Tyr Gly Gln Ala Ile Ser Tyr Lys Met Pro Asp Glu Thr Phe Leu Leu 
            100                 105                 110         


His His Leu Gly Lys His Arg Trp Arg Glu Met Asn Ala Lys Val Gly 
        115                 120                 125             


Ile Asp Thr Thr Tyr Arg Thr Gln Gly Arg Val Glu Val Pro Leu Asp 
    130                 135                 140                 


Glu Glu Asp Leu Glu Asn Val Arg Lys Trp Ile Asp Ala Lys Ser Lys 
145                 150                 155                 160 


Asp Val Gly Ser Asp Ile Pro Phe Arg Thr Lys Met Ile Glu Gly Ala 
                165                 170                 175     


Glu Leu Lys Gln Arg Leu Arg Gly Ala Thr Thr Asp Trp Lys Ile Ala 
            180                 185                 190         


Gly Phe Glu Glu Asp Ser Gly Ser Phe Asp Pro Glu Val Ala Thr Phe 
        195                 200                 205             


Val Met Ala Glu Tyr Ala Lys Lys Met Gly Ile Lys Ile Phe Thr Asn 
    210                 215                 220                 


Cys Ala Ala Arg Gly Leu Glu Thr Gln Ala Gly Val Ile Ser Asp Val 
225                 230                 235                 240 


Val Thr Glu Lys Gly Pro Ile Lys Thr Ser Arg Val Val Val Ala Gly 
                245                 250                 255     


Gly Val Gly Ser Arg Leu Phe Met Gln Asn Leu Asn Val Asp Val Pro 
            260                 265                 270         


Thr Leu Pro Ala Tyr Gln Ser Gln Gln Leu Ile Ser Ala Ala Pro Asn 
        275                 280                 285             


Ala Pro Gly Gly Asn Val Ala Leu Pro Gly Gly Ile Phe Phe Arg Asp 
    290                 295                 300                 


Gln Ala Asp Gly Thr Tyr Ala Thr Ser Pro Arg Val Ile Val Ala Pro 
305                 310                 315                 320 


Val Val Lys Glu Ser Phe Thr Tyr Gly Tyr Lys Tyr Leu Pro Leu Leu 
                325                 330                 335     


Ala Leu Pro Asp Phe Pro Val His Ile Ser Leu Asn Glu Gln Leu Ile 
            340                 345                 350         


Asn Ser Phe Met Gln Ser Thr His Trp Asp Leu Asn Glu Glu Ser Pro 
        355                 360                 365             


Phe Glu Lys Tyr Arg Asp Met Thr Ala Leu Pro Asp Leu Pro Glu Leu 
    370                 375                 380                 


Asn Ala Ser Leu Glu Lys Leu Lys Lys Glu Phe Pro Ala Phe Lys Glu 
385                 390                 395                 400 


Ser Thr Leu Ile Asp Gln Trp Ser Gly Ala Met Ala Ile Ala Pro Asp 
                405                 410                 415     


Glu Asn Pro Ile Ile Ser Asp Val Lys Glu Tyr Pro Gly Leu Val Ile 
            420                 425                 430         


Asn Thr Ala Thr Gly Trp Gly Met Thr Glu Ser Pro Val Ser Ala Glu 
        435                 440                 445             


Ile Thr Ala Asp Leu Leu Leu Gly Lys Lys Pro Val Leu Asp Ala Lys 
    450                 455                 460                 


Pro Phe Ser Leu Tyr Arg Phe 
465                 470     


<210> 24
<211> 473
<212> PRT
<213> Proteus mirabilis

<400> 24
Met Asn Ile Ser Arg Arg Lys Leu Leu Leu Gly Val Gly Ala Ala Gly 
1               5                   10                  15      


Val Leu Ala Gly Gly Ala Ala Leu Val Pro Met Val Arg Arg Asp Gly 
            20                  25                  30          


Lys Phe Val Glu Ala Lys Ser Arg Ala Ser Phe Val Glu Gly Thr Gln 
        35                  40                  45              


Gly Ala Leu Pro Lys Glu Ala Asp Val Val Ile Ile Gly Ala Gly Ile 
    50                  55                  60                  


Gln Gly Ile Met Thr Ala Ile Asn Leu Ala Glu Arg Gly Met Ser Val 
65                  70                  75                  80  


Thr Ile Leu Glu Lys Gly Gln Ile Ala Gly Glu Gln Ser Gly Arg Ala 
                85                  90                  95      


Tyr Ser Gln Ile Ile Ser Tyr Gln Thr Ser Pro Glu Ile Phe Pro Leu 
            100                 105                 110         


His His Tyr Gly Lys Ile Leu Trp Arg Gly Met Asn Glu Lys Ile Gly 
        115                 120                 125             


Ala Asp Thr Ser Tyr Arg Thr Gln Gly Arg Val Glu Ala Leu Ala Asp 
    130                 135                 140                 


Glu Lys Ala Leu Asp Lys Ala Gln Ala Trp Ile Lys Thr Ala Lys Glu 
145                 150                 155                 160 


Ala Ala Gly Phe Asp Thr Pro Leu Asn Thr Arg Ile Ile Lys Gly Glu 
                165                 170                 175     


Glu Leu Ser Asn Arg Leu Val Gly Ala Gln Thr Pro Trp Thr Val Ala 
            180                 185                 190         


Ala Phe Glu Glu Asp Ser Gly Ser Val Asp Pro Glu Thr Gly Thr Pro 
        195                 200                 205             


Ala Leu Ala Arg Tyr Ala Lys Gln Ile Gly Val Lys Ile Tyr Thr Asn 
    210                 215                 220                 


Cys Ala Val Arg Gly Ile Glu Thr Ala Gly Gly Lys Ile Ser Asp Val 
225                 230                 235                 240 


Val Ser Glu Lys Gly Ala Ile Lys Thr Ser Gln Val Val Leu Ala Gly 
                245                 250                 255     


Gly Ile Trp Ser Arg Leu Phe Met Gly Asn Met Gly Ile Asp Ile Pro 
            260                 265                 270         


Thr Leu Asn Val Tyr Leu Ser Gln Gln Arg Val Ser Gly Val Pro Gly 
        275                 280                 285             


Ala Pro Arg Gly Asn Val His Leu Pro Asn Gly Ile His Phe Arg Glu 
    290                 295                 300                 


Gln Ala Asp Gly Thr Tyr Ala Val Ala Pro Arg Ile Phe Thr Ser Ser 
305                 310                 315                 320 


Ile Val Lys Asp Ser Phe Leu Leu Gly Pro Lys Phe Met His Leu Leu 
                325                 330                 335     


Gly Gly Gly Glu Leu Pro Leu Glu Phe Ser Ile Gly Glu Asp Leu Phe 
            340                 345                 350         


Asn Ser Phe Lys Met Pro Thr Ser Trp Asn Leu Asp Glu Lys Thr Pro 
        355                 360                 365             


Phe Glu Gln Phe Arg Val Ala Thr Ala Thr Gln Asn Thr Gln His Leu 
    370                 375                 380                 


Asp Ala Val Phe Gln Arg Met Lys Thr Glu Phe Pro Val Phe Glu Lys 
385                 390                 395                 400 


Ser Glu Val Val Glu Arg Trp Gly Ala Val Val Ser Pro Thr Phe Asp 
                405                 410                 415     


Glu Leu Pro Ile Ile Ser Glu Val Lys Glu Tyr Pro Gly Leu Val Ile 
            420                 425                 430         


Asn Thr Ala Thr Val Trp Gly Met Thr Glu Gly Pro Ala Ala Gly Glu 
        435                 440                 445             


Val Thr Ala Asp Ile Val Met Gly Lys Lys Pro Val Ile Asp Pro Thr 
    450                 455                 460                 


Pro Phe Ser Leu Asp Arg Phe Lys Lys 
465                 470             


<210> 25
<211> 471
<212> PRT
<213> Proteus vulgaris

<400> 25
Met Ala Ile Ser Arg Arg Lys Phe Ile Ile Gly Gly Thr Val Val Ala 
1               5                   10                  15      


Val Ala Ala Gly Ala Gly Ile Leu Thr Pro Met Leu Thr Arg Glu Gly 
            20                  25                  30          


Arg Phe Val Pro Gly Thr Pro Arg His Gly Phe Val Glu Gly Thr Glu 
        35                  40                  45              


Gly Ala Leu Pro Lys Gln Ala Asp Val Val Val Val Gly Ala Gly Ile 
    50                  55                  60                  


Leu Gly Ile Met Thr Ala Ile Asn Leu Val Glu Arg Gly Leu Ser Val 
65                  70                  75                  80  


Val Ile Val Glu Lys Gly Asn Ile Ala Gly Glu Gln Ser Ser Arg Phe 
                85                  90                  95      


Tyr Gly Gln Ala Ile Ser Tyr Lys Met Pro Asp Glu Thr Phe Leu Leu 
            100                 105                 110         


His His Leu Gly Lys His Arg Trp Arg Glu Met Asn Ala Lys Val Gly 
        115                 120                 125             


Ile Asp Thr Thr Tyr Arg Thr Gln Gly Arg Val Glu Val Pro Leu Asp 
    130                 135                 140                 


Glu Glu Asp Leu Val Asn Val Arg Lys Trp Ile Asp Glu Arg Ser Lys 
145                 150                 155                 160 


Asn Val Gly Ser Asp Ile Pro Phe Lys Thr Arg Ile Ile Glu Gly Ala 
                165                 170                 175     


Glu Leu Asn Gln Arg Leu Arg Gly Ala Thr Thr Asp Trp Lys Ile Ala 
            180                 185                 190         


Gly Phe Glu Glu Asp Ser Gly Ser Phe Asp Pro Glu Val Ala Thr Phe 
        195                 200                 205             


Val Met Ala Glu Tyr Ala Lys Lys Met Gly Val Arg Ile Tyr Thr Gln 
    210                 215                 220                 


Cys Ala Ala Arg Gly Leu Glu Thr Gln Ala Gly Val Ile Ser Asp Val 
225                 230                 235                 240 


Val Thr Glu Lys Gly Ala Ile Lys Thr Ser Gln Val Val Val Ala Gly 
                245                 250                 255     


Gly Val Trp Ser Arg Leu Phe Met Gln Asn Leu Asn Val Asp Val Pro 
            260                 265                 270         


Thr Leu Pro Ala Tyr Gln Ser Gln Gln Leu Ile Ser Gly Ser Pro Thr 
        275                 280                 285             


Ala Pro Gly Gly Asn Val Ala Leu Pro Gly Gly Ile Phe Phe Arg Glu 
    290                 295                 300                 


Gln Ala Asp Gly Thr Tyr Ala Thr Ser Pro Arg Val Ile Val Ala Pro 
305                 310                 315                 320 


Val Val Lys Glu Ser Phe Thr Tyr Gly Tyr Lys Tyr Leu Pro Leu Leu 
                325                 330                 335     


Ala Leu Pro Asp Phe Pro Val His Ile Ser Leu Asn Glu Gln Leu Ile 
            340                 345                 350         


Asn Ser Phe Met Gln Ser Thr His Trp Asn Leu Asp Glu Val Ser Pro 
        355                 360                 365             


Phe Glu Gln Phe Arg Asn Met Thr Ala Leu Pro Asp Leu Pro Glu Leu 
    370                 375                 380                 


Asn Ala Ser Leu Glu Lys Leu Lys Ala Glu Phe Pro Ala Phe Lys Glu 
385                 390                 395                 400 


Ser Lys Leu Ile Asp Gln Trp Ser Gly Ala Met Ala Ile Ala Pro Asp 
                405                 410                 415     


Glu Asn Pro Ile Ile Ser Glu Val Lys Glu Tyr Pro Gly Leu Val Ile 
            420                 425                 430         


Asn Thr Ala Thr Gly Trp Gly Met Thr Glu Ser Pro Val Ser Ala Glu 
        435                 440                 445             


Leu Thr Ala Asp Leu Leu Leu Gly Lys Lys Pro Val Leu Asp Pro Lys 
    450                 455                 460                 


Pro Phe Ser Leu Tyr Arg Phe 
465                 470     


<210> 26
<211> 452
<212> PRT
<213> Homo sapiens

<400> 26
Met Ser Thr Ala Val Leu Glu Asn Pro Gly Leu Gly Arg Lys Leu Ser 
1               5                   10                  15      


Asp Phe Gly Gln Glu Thr Ser Tyr Ile Glu Asp Asn Cys Asn Gln Asn 
            20                  25                  30          


Gly Ala Ile Ser Leu Ile Phe Ser Leu Lys Glu Glu Val Gly Ala Leu 
        35                  40                  45              


Ala Lys Val Leu Arg Leu Phe Glu Glu Asn Asp Val Asn Leu Thr His 
    50                  55                  60                  


Ile Glu Ser Arg Pro Ser Arg Leu Lys Lys Asp Glu Tyr Glu Phe Phe 
65                  70                  75                  80  


Thr His Leu Asp Lys Arg Ser Leu Pro Ala Leu Thr Asn Ile Ile Lys 
                85                  90                  95      


Ile Leu Arg His Asp Ile Gly Ala Thr Val His Glu Leu Ser Arg Asp 
            100                 105                 110         


Lys Lys Lys Asp Thr Val Pro Trp Phe Pro Arg Thr Ile Gln Glu Leu 
        115                 120                 125             


Asp Arg Phe Ala Asn Gln Ile Leu Ser Tyr Gly Ala Glu Leu Asp Ala 
    130                 135                 140                 


Asp His Pro Gly Phe Lys Asp Pro Val Tyr Arg Ala Arg Arg Lys Gln 
145                 150                 155                 160 


Phe Ala Asp Ile Ala Tyr Asn Tyr Arg His Gly Gln Pro Ile Pro Arg 
                165                 170                 175     


Val Glu Tyr Met Glu Glu Gly Lys Lys Thr Trp Gly Thr Val Phe Lys 
            180                 185                 190         


Thr Leu Lys Ser Leu Tyr Lys Thr His Ala Cys Tyr Glu Tyr Asn His 
        195                 200                 205             


Ile Phe Pro Leu Leu Glu Lys Tyr Cys Gly Phe His Glu Asp Asn Ile 
    210                 215                 220                 


Pro Gln Leu Glu Asp Val Ser Gln Phe Leu Gln Thr Cys Thr Gly Phe 
225                 230                 235                 240 


Arg Leu Arg Pro Val Ala Gly Leu Leu Ser Ser Arg Asp Phe Leu Gly 
                245                 250                 255     


Gly Leu Ala Phe Arg Val Phe His Cys Thr Gln Tyr Ile Arg His Gly 
            260                 265                 270         


Ser Lys Pro Met Tyr Thr Pro Glu Pro Asp Ile Cys His Glu Leu Leu 
        275                 280                 285             


Gly His Val Pro Leu Phe Ser Asp Arg Ser Phe Ala Gln Phe Ser Gln 
    290                 295                 300                 


Glu Ile Gly Leu Ala Ser Leu Gly Ala Pro Asp Glu Tyr Ile Glu Lys 
305                 310                 315                 320 


Leu Ala Thr Ile Tyr Trp Phe Thr Val Glu Phe Gly Leu Cys Lys Gln 
                325                 330                 335     


Gly Asp Ser Ile Lys Ala Tyr Gly Ala Gly Leu Leu Ser Ser Phe Gly 
            340                 345                 350         


Glu Leu Gln Tyr Cys Leu Ser Glu Lys Pro Lys Leu Leu Pro Leu Glu 
        355                 360                 365             


Leu Glu Lys Thr Ala Ile Gln Asn Tyr Thr Val Thr Glu Phe Gln Pro 
    370                 375                 380                 


Leu Tyr Tyr Val Ala Glu Ser Phe Asn Asp Ala Lys Glu Lys Val Arg 
385                 390                 395                 400 


Asn Phe Ala Ala Thr Ile Pro Arg Pro Phe Ser Val Arg Tyr Asp Pro 
                405                 410                 415     


Tyr Thr Gln Arg Ile Glu Val Leu Asp Asn Thr Gln Gln Leu Lys Ile 
            420                 425                 430         


Leu Ala Asp Ser Ile Asn Ser Glu Ile Gly Ile Leu Cys Ser Ala Leu 
        435                 440                 445             


Gln Lys Ile Lys 
    450         


<210> 27
<211> 4637
<212> DNA
<213> Artificial Sequence

<220>
<221> source
<223> /note="Description of Artificial Sequence: Synthetic
      polynucleotide"

<400> 27
tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa       60

cgcgcgggga gaggcggttt gcgtattggg cgccagggtg gtttttcttt tcaccagtga      120

gactggcaac agctgattgc ccttcaccgc ctggccctga gagagttgca gcaagcggtc      180

cacgctggtt tgccccagca ggcgaaaatc ctgtttgatg gtggttaacg gcgggatata      240

acatgagcta tcttcggtat cgtcgtatcc cactaccgag atatccgcac caacgcgcag      300

cccggactcg gtaatggcgc gcattgcgcc cagcgccatc tgatcgttgg caaccagcat      360

cgcagtggga acgatgccct cattcagcat ttgcatggtt tgttgaaaac cggacatggc      420

actccagtcg ccttcccgtt ccgctatcgg ctgaatttga ttgcgagtga gatatttatg      480

ccagccagcc agacgcagac gcgccgagac agaacttaat gggcccgcta acagcgcgat      540

ttgctggtga cccaatgcga ccagatgctc cacgcccagt cgcgtaccgt cctcatggga      600

gaaaataata ctgttgatgg gtgtctggtc agagacatca agaaataacg ccggaacatt      660

agtgcaggca gcttccacag caatggcatc ctggtcatcc agcggatagt taatgatcag      720

cccactgacg cgttgcgcga gaagattgtg caccgccgct ttacaggctt cgacgccgct      780

tcgttctacc atcgacacca ccacgctggc acccagttga tcggcgcgag atttaatcgc      840

cgcgacaatt tgcgacggcg cgtgcagggc cagactggag gtggcaacgc caatcagcaa      900

cgactgtttg cccgccagtt gttgtgccac gcggttggga atgtaattca gctccgccat      960

cgccgcttcc actttttccc gcgttttcgc agaaacgtgg ctggcctggt tcaccacgcg     1020

ggaaacggtc tgataagaga caccggcata ctctgcgaca tcgtataacg ttactggttt     1080

catattcacc accctgaatt gactctcttc cgggcgctat catgccatac cgcgaaaggt     1140

tttgcgccat tcgatggcgc gccgcttcgt caggccacat agctttcttg ttctgatcgg     1200

aacgatcgtt ggctgtgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag     1260

cgctcacaat tagctgtcac cggatgtgct ttccggtctg atgagtccgt gaggacgaaa     1320

cagcctctac aaataatttt gtttaatagc ttcggtaagt ttacggagga tttgtcatga     1380

aagctaaaga tgttcagcca accattatta ttaataaaaa tggccttatc tctttggaag     1440

atatctatga cattgcgata aaacaaaaaa aagtagaaat atcaacggag atcactgaac     1500

ttttgacgca tggtcgtgaa aaattagagg aaaaattaaa ttcaggagag gttatatatg     1560

gaatcaatac aggatttgga gggaatgcca atttagttgt gccatttgag aaaatcgcag     1620

agcatcagca aaatctgtta acttttcttg gcgctggtac tggggactat atgtccaaac     1680

cttgtattaa agcgtcacaa tttactatgt tactttctgt ttgcaaaggt tggtctgcaa     1740

ccagaccaat tgtcgctcaa gcaattgttg atatgattaa tcatgacatt gttcctctgg     1800

ttcctcgcta tggctcagtg ggtgcaagcg gtgatttaat tcctttatct tatattgcac     1860

gagcattatg tggtaagggc aaagtttatt atatgggcgc agaaattgac gctgctgaag     1920

caattaaacg tgcagggttg acaccattat cgttaaaagc caaagaaggt cttgctctga     1980

ttaacggcac ccgggtaatg tcaggaatca gtgcaatcac cgtcattaaa ctggaaaaac     2040

tatttaaagc ctcaatttct gcgattgccc ttgctgttga agcattactt gcatctcatg     2100

aacattatga tgcccggatt caacaagtaa aaaatcatcc tggtcaaaac gcggtggcaa     2160

gtgcattgcg taatttattg gcaggttcaa cgcaggttaa tctattatct ggggttaaag     2220

aacaagccaa taaagcttgt cgtcatcaag aaattaccca actaaatgat accttacagg     2280

aagtttattc aattcgctgt gcaccacaag tattaggtat agtgccagaa tctttagcta     2340

ccgctcggaa aatattggaa cgggaagtta tctcagctaa tgataatcca ttgatagatc     2400

cagaaaatgg cgatgttcta cacggtggaa attttatggg gcaatatgtc gcccgaacaa     2460

tggatgcatt aaaactggat attgctttaa ttgccaatca tcttcacgcc attgtggctc     2520

ttatgatgga taaccgtttc tctcgtggat tacctaattc actgagtccg acacccggca     2580

tgtatcaagg ttttaaaggc gtccaacttt ctcaaaccgc tttagttgct gcaattcgcc     2640

atgattgtgc tgcatcaggt attcatacca ttgccacaga acaatacaat caagatattg     2700

tcagtttagg tctgcatgcc gctcaagatg ttttagagat ggagcagaaa ttacgcaata     2760

ttgtttcaat gacaattctg gtagcctgtc aggccattca tcttcgcggc aatattagtg     2820

aaattgcgcc tgaaactgct aaattttacc atgcagtacg cgaaatcagt tctcctttga     2880

tcactgatcg tgcgttggat gaagatataa tccgcattgc ggatgcaatt attaatgatc     2940

aacttcctct gccagaaatc atgctggaag aataaaacaa cacccactaa gataactcta     3000

gaaataattt tgtttaactt taagaaggag atatacatat gaaagctaaa gatgttcagc     3060

caaccattat tattaataaa aatggcctta tctctttgga agatatctat gacattgcga     3120

taaaacaaaa aaaagtagaa atatcaacgg agatcactga acttttgacg catggtcgtg     3180

aaaaattaga ggaaaaatta aattcaggag aggttatata tggaatcaat acaggatttg     3240

gagggaatgc caatttagtt gtgccatttg agaaaatcgc agagcatcag caaaatctgt     3300

taacttttct tggcgctggt actggggact atatgtccaa accttgtatt aaagcgtcac     3360

aatttactat gttactttct gtttgcaaag gttggtctgc aaccagacca attgtcgctc     3420

aagcaattgt tgatatgatt aatcatgaca ttgttcctct ggttcctcgc tatggctcag     3480

tgggtgcaag cggtgattta attcctttat cttatattgc acgagcatta tgtggtaagg     3540

gcaaagttta ttatatgggc gcagaaattg acgctgctga agcaattaaa cgtgcagggt     3600

tgacaccatt atcgttaaaa gccaaagaag gtcttgctct gattaacggc acccgggtaa     3660

tgtcaggaat cagtgcaatc accgtcatta aactggaaaa actatttaaa gcctcaattt     3720

ctgcgattgc ccttgctgtt gaagcattac ttgcatctca tgaacattat gatgcccgga     3780

ttcaacaagt aaaaaatcat cctggtcaaa acgcggtggc aagtgcattg cgtaatttat     3840

tggcaggttc aacgcaggtt aatctattat ctggggttaa agaacaagcc aataaagctt     3900

gtcgtcatca agaaattacc caactaaatg ataccttaca ggaagtttat tcaattcgct     3960

gtgcaccaca agtattaggt atagtgccag aatctttagc taccgctcgg aaaatattgg     4020

aacgggaagt tatctcagct aatgataatc cattgataga tccagaaaat ggcgatgttc     4080

tacacggtgg aaattttatg gggcaatatg tcgcccgaac aatggatgca ttaaaactgg     4140

atattgcttt aattgccaat catcttcacg ccattgtggc tcttatgatg gataaccgtt     4200

tctctcgtgg attacctaat tcactgagtc cgacacccgg catgtatcaa ggttttaaag     4260

gcgtccaact ttctcaaacc gctttagttg ctgcaattcg ccatgattgt gctgcatcag     4320

gtattcatac cattgccaca gaacaataca atcaagatat tgtcagttta ggtctgcatg     4380

ccgctcaaga tgttttagag atggagcaga aattacgcaa tattgtttca atgacaattc     4440

tggtagcctg tcaggccatt catcttcgcg gcaatattag tgaaattgcg cctgaaactg     4500

ctaaatttta ccatgcagta cgcgaaatca gttctccttt gatcactgat cgtgcgttgg     4560

atgaagatat aatccgcatt gcggatgcaa ttattaatga tcaacttcct ctgccagaaa     4620

tcatgctgga agaataa                                                    4637


