
                                SEQUENCE LISTING

<110> El-Dorry, Hamza
      Sayed, Ahmed
      Siam, Rania

<120> NOVEL MERCURIC REDUCTASE AND USES
  THEREOF

<130> 3392-1-004PCT

<140> Not yet assigned    
<141> 2014-10-31  

<150> 61/897,880          
<151> 2013-10-31  

<160> 6

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 561
<212> PRT
<213> Unknown

<220> 
<223> Uncultured soil bacterium

<400> 1
Met Thr His Leu Lys Ile Thr Gly Met Thr Cys Asp Ser Cys Ala Ala
 1               5                  10                  15      
His Val Lys Glu Ala Leu Glu Lys Val Pro Gly Val Gln Ser Ala Leu
            20                  25                  30          
Val Ser Tyr Pro Lys Gly Thr Ala Gln Leu Ala Ile Val Pro Gly Thr
        35                  40                  45              
Ser Pro Asp Ala Leu Thr Ala Ala Val Ala Gly Leu Gly Tyr Lys Ala
    50                  55                  60                  
Thr Leu Ala Asp Ala Pro Leu Ala Asp Asn Arg Val Gly Leu Leu Asp
65                  70                  75                  80  
Lys Val Arg Gly Trp Met Ala Ala Ala Glu Lys His Ser Gly Asn Glu
                85                  90                  95      
Pro Pro Val Gln Val Ala Val Ile Gly Ser Gly Gly Ala Ala Met Ala
            100                 105                 110         
Ala Ala Leu Lys Ala Val Glu Gln Gly Ala Gln Val Thr Leu Ile Glu
        115                 120                 125             
Arg Gly Thr Ile Gly Gly Thr Cys Val Asn Val Gly Cys Val Pro Ser
    130                 135                 140                 
Lys Ile Met Ile Arg Ala Ala His Ile Ala His Leu Arg Arg Glu Ser
145                 150                 155                 160 
Pro Phe Asp Gly Gly Ile Ala Ala Thr Val Pro Thr Ile Asp Arg Ser
                165                 170                 175     
Lys Leu Leu Ala Gln Gln Gln Ala Arg Val Asp Glu Leu Arg His Ala
            180                 185                 190         
Lys Tyr Glu Gly Ile Leu Gly Gly Asn Pro Ala Ile Thr Val Val Leu
        195                 200                 205             
Gly Glu Ala Arg Phe Lys Asp Asp Gln Ser Leu Thr Val Arg Leu Asn
    210                 215                 220                 
Glu Gly Gly Glu Arg Val Val Met Phe Asp Arg Cys Leu Val Ala Thr
225                 230                 235                 240 
Gly Ala Ser Pro Ala Val Pro Pro Ile Pro Gly Leu Lys Glu Ser Pro
                245                 250                 255     
Tyr Trp Thr Ser Thr Glu Ala Leu Ala Ser Asp Thr Ile Pro Glu Arg
            260                 265                 270         
Leu Ala Val Ile Gly Ser Ser Val Val Ala Leu Glu Leu Ala Gln Ala
        275                 280                 285             
Phe Ala Arg Leu Gly Ser Lys Val Thr Val Leu Ala Arg Asn Thr Leu
    290                 295                 300                 
Phe Phe Arg Glu Asp Pro Ala Ile Gly Glu Ala Val Thr Ala Ala Phe
305                 310                 315                 320 
Arg Ala Glu Gly Ile Glu Val Leu Glu His Thr Gln Ala Ser Glu Val
                325                 330                 335     
Ala His Met Asp Gly Glu Phe Val Leu Thr Thr Thr His Gly Glu Leu
            340                 345                 350         
Arg Ala Asp Lys Leu Leu Val Ala Thr Gly Arg Thr Pro Asn Thr Arg
        355                 360                 365             
Ser Leu Ala Leu Asp Ala Ala Gly Val Thr Val Asn Ala Gln Gly Ala
    370                 375                 380                 
Ile Ala Ile Asp Gln Gly Met Arg Thr Ser Asn Pro Asn Ile Tyr Ala
385                 390                 395                 400 
Ala Gly Asp Cys Thr Asp Gln Pro Gln Phe Val Tyr Val Ala Ala Ala
                405                 410                 415     
Ala Gly Thr Arg Ala Ala Ile Asn Met Thr Gly Gly Asp Ala Ala Leu
            420                 425                 430         
Asp Leu Thr Ala Met Pro Ala Val Val Phe Thr Asp Pro Gln Val Ala
        435                 440                 445             
Thr Val Gly Tyr Ser Glu Ala Glu Ala His His Asp Gly Ile Glu Thr
    450                 455                 460                 
Asp Ser Arg Thr Leu Thr Leu Asp Asn Val Pro Arg Ala Leu Ala Asn
465                 470                 475                 480 
Phe Asp Thr Arg Gly Phe Ile Lys Leu Val Ile Glu Glu Gly Ser His
                485                 490                 495     
Arg Leu Ile Gly Val Gln Ala Val Ala Pro Glu Ala Gly Glu Leu Ile
            500                 505                 510         
Gln Thr Ala Ala Leu Ala Ile Arg Asn Arg Met Thr Val Gln Glu Leu
        515                 520                 525             
Ala Asp Gln Leu Phe Pro Tyr Leu Thr Met Val Glu Gly Leu Lys Leu
    530                 535                 540                 
Ala Ala Gln Thr Phe Asn Lys Asp Val Lys Gln Leu Ser Cys Cys Ala
545                 550                 555                 560 
Gly
    

<210> 2
<211> 561
<212> PRT
<213> Unknown

<220> 
<223> Uncultured ATII-LCL bacterium

<400> 2
Met Thr His Leu Lys Ile Thr Gly Met Thr Cys Ala Ser Cys Glu Glu
 1               5                  10                  15      
His Val Lys Glu Ala Leu Glu Lys Val Pro Gly Val Gln Ser Ala Leu
            20                  25                  30          
Val Ser Tyr Pro Lys Gly Thr Ala Gln Leu Ala Ile Val Pro Gly Thr
        35                  40                  45              
Ser Pro Asp Ala Leu Thr Ala Ala Val Ala Gly Leu Gly Tyr Lys Ala
    50                  55                  60                  
Thr Leu Ala Asp Ala Pro Leu Ala Asp Asn Arg Val Gly Leu Leu Asp
65                  70                  75                  80  
Lys Val Arg Gly Trp Met Asp Glu Asp Glu Lys His Ser Gly Asn Glu
                85                  90                  95      
Pro Pro Val Gln Val Ala Val Ile Gly Ser Gly Gly Glu Glu Met Asp
            100                 105                 110         
Asp Ala Leu Lys Ala Val Glu Gln Gly Ala Gln Val Thr Leu Ile Glu
        115                 120                 125             
Arg Gly Thr Ile Glu Glu Thr Cys Val Asn Val Gly Cys Val Pro Ser
    130                 135                 140                 
Lys Ile Met Ile Arg Ala Ala His Ile Ala His Leu Arg Arg Glu Ser
145                 150                 155                 160 
Pro Phe Asp Gly Gly Ile Ala Ala Thr Val Pro Thr Ile Asp Arg Ser
                165                 170                 175     
Lys Leu Leu Ala Gln Gln Gln Ala Arg Val Asp Glu Leu Arg His Ala
            180                 185                 190         
Lys Tyr Glu Gly Ile Leu Gly Gly Asn Pro Ala Ile Thr Asp Glu His
        195                 200                 205             
Gly Glu Ala Arg Phe Lys Asp Asp Gln Ser Leu Thr Val Arg Leu Asn
    210                 215                 220                 
Glu Glu Glu Glu Arg Val Val Met Phe Asp Arg Cys Leu Val Ala Thr
225                 230                 235                 240 
Gly Ala Ser Pro Ala Val Pro Pro Ile Pro Gly Leu Lys Glu Glu Pro
                245                 250                 255     
Tyr Trp Thr Ser Thr Glu Ala Leu Ala Ser Asp Thr Ile Pro Glu Arg
            260                 265                 270         
Leu Ala Val Ile Gly Asp Glu Val Val Ala Leu Glu Leu Ala Gln Ala
        275                 280                 285             
Phe Ala Arg Leu Gly Ser Lys Val Thr Val Leu Ala Arg Asn Thr Leu
    290                 295                 300                 
Asp Asp Arg Glu Asp Pro Ala Ile Gly Glu Ala Val Thr Ala Ala Phe
305                 310                 315                 320 
Arg Glu Glu Gly Ile Glu Val Leu Glu Glu Thr Gln Ala Ser Gln Val
                325                 330                 335     
Ala His Met Asp Gly Glu Phe Val Leu Asp Asp Asp His Gly Glu Leu
            340                 345                 350         
Arg Ala Asp Lys Leu Leu Val Ala Thr Gly Arg Thr Pro Asn Thr Arg
        355                 360                 365             
Ser Leu Ala Leu Leu Ala His Gly Val Thr Val Asn Ala Glu Glu Leu
    370                 375                 380                 
Ile Ala Ile Asp Gln Gly Met Arg Thr Ser Asn Pro Asn Ile Tyr Ala
385                 390                 395                 400 
Ala Gly Asp Cys Thr Asp Gln Pro Gln Phe Val Tyr Val Asp Asp Asp
                405                 410                 415     
Asp Gly Thr Arg Ala Ala Ile Asn Met Thr Gly Gly Asp Ala Ala Lys
            420                 425                 430         
Pro Ala Arg Ala Met Pro Ala Val Val Phe Thr Asp Pro Gln Val Ala
        435                 440                 445             
Thr Val Gly Tyr Ser Glu Ala Glu Ala His His Asp Gly Ile Glu Thr
    450                 455                 460                 
Lys Val Gly Lys Phe Pro Leu Asp Asn Val Gly Arg Ala Leu Ala Asn
465                 470                 475                 480 
Phe Asp Thr Arg Gly Phe Ile Lys Leu Val Ile Glu Glu Gly Ser His
                485                 490                 495     
Arg Leu Ile Gly Val Gln Ala Val Ala Pro Glu Ala Gly Glu Leu Ile
            500                 505                 510         
Gln Thr Glu Glu Leu Ala Ile Arg Asn Arg Met Thr Val Gln Glu Leu
        515                 520                 525             
Ala Asp Gln Leu Phe Pro Tyr Leu Thr Met Val Glu Gly Leu Lys Leu
    530                 535                 540                 
Glu Glu Gln Thr Phe Asn Lys Asp Val Lys Gln Leu Ser Cys Cys Ala
545                 550                 555                 560 
Gly
    

<210> 3
<211> 1686
<212> DNA
<213> Unknown

<220> 
<223> Uncultured soil bacterium

<400> 3
atgacccatc taaaaatcac cggcatgact tgcgactcgt gcgcggcgca cgtcaaggaa 60
gcgctggaaa aagtgccagg cgtgcagtcg gcgctggtgt cctatccgaa gggcacagcg 120
caactcgcca tcgtgccggg cacatcgccg gacgcgctga ctgccgccgt ggccggactg 180
ggctacaagg caacgctagc cgatgcgcca ctggcggaca accgcgtcgg actgctcgac 240
aaggtgcggg gatggatggc cgccgccgaa aagcacagtg gcaacgagcc cccggtgcag 300
gtagcggtca ttggcagcgg tggagccgcg atggcggcgg cgctgaaagc cgtcgagcaa 360
ggcgcgcagg tcacgctgat cgagcgcggc accatcggcg gcacctgcgt caatgtcggc 420
tgtgtgccgt ccaagatcat gatccgcgcc gcccacatcg cccatctgcg ccgggaaagc 480
ccgttcgatg gcggtattgc ggcaactgtg cctacgattg accgcagtaa gctgctggcc 540
cagcagcagg cccgcgtcga cgaactgcgg cacgccaagt acgaaggcat cctgggcggt 600
aatccggcca tcaccgttgt gctcggtgag gcgcgcttca aggacgacca gagccttacc 660
gtccgtttga acgagggtgg cgagcgcgtc gtgatgttcg accgctgcct ggtcgccacg 720
ggtgccagcc cggcggtccc gccgattccg ggcttgaaag agtcacccta ctggacttcc 780
accgaggccc tggcgagcga caccattccc gaacgccttg ccgtaatcgg ctcgtcggtg 840
gtggcgctgg agctggcgca agcctttgcc cggctgggca gcaaggtcac ggtcctggcg 900
cgcaatacct tgttcttccg tgaagacccg gccatcggcg aggcggtgac agccgctttc 960
cgtgccgagg gcatcgaggt gctggagcac acgcaagcca gcgaggtcgc ccatatggac 1020
ggtgaattcg tgctgaccac cacgcacggt gaattgcgcg ccgacaaact gctggttgcc 1080
accggtcgga caccgaacac gcgcagcctc gcgctggacg cagcgggggt cactgtcaat 1140
gcgcaaggtg ccatcgccat cgaccaaggc atgcgcacga gcaacccgaa catctacgcg 1200
gccggcgact gcaccgacca gccgcagttc gtctatgtgg cggcagcggc cggcacccgt 1260
gccgcgatca acatgaccgg cggcgatgcg gcgctcgacc tgaccgcaat gccggccgtg 1320
gtgttcaccg atccgcaagt ggcgaccgtg ggctacagcg aggcggaagc ccaccacgac 1380
gggatcgaga ccgacagccg caccttgacc ttggacaacg tgccgcgtgc gctcgccaac 1440
ttcgacacac gcggcttcat caagttggtt atcgaggaag gcagccatcg gctgatcggc 1500
gtacaggcgg tcgcgccgga agcgggtgaa ctgatccaga cggcggctct ggccattcgc 1560
aaccgcatga cggtgcagga actggccgac cagttgttcc cctacctgac gatggtcgag 1620
gggttgaagc tcgcggcgca gaccttcaac aaggatgtga agcagctttc ctgctgcgcc 1680
gggtga                                                            1686


<210> 4
<211> 1686
<212> DNA
<213> Unknown

<220> 
<223> Uncultured ATII-LCL bacterium

<400> 4
atgacccatc tgaaaattac cggcatgacc tgcgcgagct gcgaagaaca tgtgaaagaa 60
gcgctggaaa aagtgccggg cgtgcagagc gcgctggtga gctatccgaa aggcaccgcg 120
cagctggcga ttgtgccggg caccagcccg gatgcgctga ccgcggcggt ggcgggcctg 180
ggctataaag cgaccctggc ggatgcgccg ctggcggata accgcgtggg cctgctggat 240
aaagtgcgcg gctggatgga tgaagatgaa aaacatagcg gcaacgaacc gccggtgcag 300
gtggcggtga ttggcagcgg cggcgaagaa atggatgatg cgctgaaagc ggtggaacag 360
ggcgcgcagg tgaccctgat tgaacgcggc accattgaag aaacctgcgt gaacgtgggc 420
tgcgtgccga gcaaaattat gattcgcgcg gcgcatattg cgcatctgcg ccgcgaaagc 480
ccgtttgatg gcggcattgc ggcgaccgtg ccgaccattg atcgcagcaa actgctggcg 540
cagcagcagg cgcgcgtgga tgaactgcgc catgcgaaat atgaaggcat tctgggcggc 600
aacccggcga ttaccgatga acatggcgaa gcgcgcttta aagatgatca gagcctgacc 660
gtgcgcctga acgaagaaga agaacgcgtg gtgatgtttg atcgctgcct ggtggcgacc 720
ggcgcgagcc cggcggtgcc gccgattccg ggcctgaaag aagaaccgta ttggaccagc 780
accgaagcgc tggcgagcga taccattccg gaacgcctgg cggtgattgg cgatgaagtg 840
gtggcgctgg aactggcgca ggcgtttgcg cgcctgggca gcaaagtgac cgtgctggcg 900
cgcaacaccc tggatgatcg cgaagatccg gcgattggcg aagcggtgac cgcggcgttt 960
cgcgaagaag gcattgaagt gctggaagaa acccaggcga gccaggtggc gcatatggat 1020
ggcgaatttg tgctggatga tgatcatggc gaactgcgcg cggataaact gctggtggcg 1080
accggccgca ccccgaacac ccgcagcctg gcgctgctgg cgcatggcgt gaccgtgaac 1140
gcggaagaac tgattgcgat tgatcagggc atgcgcacca gcaacccgaa catttatgcg 1200
gcgggcgatt gcaccgatca gccgcagttt gtgtatgtgg atgatgatga tggcacccgc 1260
gcggcgatta acatgaccgg cggcgatgcg gcgaaaccgg cgcgcgcgat gccggcggtg 1320
gtgtttaccg atccgcaggt ggcgaccgtg ggctatagcg aagcggaagc gcatcatgat 1380
ggcattgaaa ccaaagtggg caaatttccg ctggataacg tgggccgcgc gctggcgaac 1440
tttgataccc gcggctttat taaactggtg attgaagaag gcagccatcg cctgattggc 1500
gtgcaggcgg tggcgccgga agcgggcgaa ctgattcaga ccgaagaact ggcgattcgc 1560
aaccgcatga ccgtgcagga actggcggat cagctgtttc cgtatctgac catggtggaa 1620
ggcctgaaac tggaagaaca gacctttaac aaagatgtga aacagctgag ctgctgcgcg 1680
ggctga                                                            1686


<210> 5
<211> 1686
<212> DNA
<213> Artificial Sequence

<220> 
<223> E. coli codon optimized Soil MerA

<400> 5
atgacccacc tgaaaatcac gggtatgacc tgcgacagtt gtgccgccca cgttaaagaa 60
gctctggaaa aagttccggg cgttcaaagt gcgctggtgt cctatccgaa aggtaccgcc 120
cagctggcaa ttgttccggg cacctctccg gatgcactga cggcagcagt cgcaggtctg 180
ggttacaaag caaccctggc agatgctccg ctggcagaca accgtgtcgg tctgctggac 240
aaagtgcgcg gttggatggc agctgcggaa aaacatagcg gtaatgaacc gccagtccag 300
gtggcggtta ttggttctgg cggtgccgca atggctgcag cactgaaagc agttgaacaa 360
ggtgctcagg tcaccctgat tgaacgtggc accatcggcg gtacctgcgt caacgtgggt 420
tgtgtgccgt caaaaattat gatccgcgca gctcatatcg cccacctgcg tcgcgaatcg 480
ccgtttgatg gcggtattgc agcaacggtg ccgaccatcg accgtagcaa actgctggcg 540
cagcaacagg cccgtgttga tgaactgcgc cacgcgaaat atgaaggtat tctgggcggt 600
aacccggcaa tcaccgtggt tctgggcgaa gcgcgtttta aagatgacca gagcctgacg 660
gtgcgcctga atgaaggcgg tgaacgtgtc gtgatgttcg atcgctgcct ggtggctacc 720
ggtgcatctc cggcagttcc gccgattccg ggtctgaaag aatcaccgta ctggacgtcg 780
accgaagcgc tggccagtga taccattccg gaacgtctgg ccgttatcgg tagctctgtt 840
gtcgcactgg aactggcaca agctttcgcg cgcctgggct cgaaagttac cgtcctggcc 900
cgtaatacgc tgtttttccg cgaagatccg gcaattggtg aagctgttac cgcagctttt 960
cgtgcggaag gcatcgaagt cctggaacat acgcaggcga gcgaagtcgc ccacatggat 1020
ggtgaattcg tgctgaccac gacccacggc gaactgcgtg ctgacaaact gctggttgcg 1080
acgggtcgta ccccgaacac gcgttccctg gcactggatg cagcaggtgt gaccgttaat 1140
gcccaaggtg ccattgcaat cgaccagggc atgcgtacct caaacccgaa tatttatgca 1200
gctggtgatt gtacggacca accgcagttt gtctacgtgg cagcagcagc tggtacccgt 1260
gcagcaatca acatgacggg cggtgatgca gctctggatc tgaccgcaat gccggctgtg 1320
gttttcaccg atccgcaggt tgcgacggtc ggttatagtg aagctgaagc gcatcacgat 1380
ggcattgaaa ccgactcccg tacgctgacc ctggataacg tgccgcgtgc cctggcaaat 1440
tttgacacgc gcggtttcat taaactggtt atcgaagaag gcagccaccg cctgatcggt 1500
gtgcaagctg ttgcgccgga agcaggcgaa ctgattcaga ccgcggccct ggcgatccgt 1560
aatcgcatga cggtgcaaga actggcggat cagctgtttc cgtacctgac gatggtggaa 1620
ggcctgaaac tggcggctca aaccttcaac aaagatgtca aacaactgtc gtgctgtgct 1680
ggctaa                                                            1686


<210> 6
<211> 1686
<212> DNA
<213> Artificial Sequence

<220> 
<223> E. coli codon optimized ATII-LCL MerA

<400> 6
atgacgcatc tgaaaattac gggtatgacc tgtgcgagct gtgaagaaca cgttaaagaa 60
gccctggaaa aagtgccggg tgtgcaaagc gccctggttt cttatccgaa aggcacggcc 120
cagctggcaa ttgtgccggg tacgagcccg gatgcactga ccgcagcagt tgctggtctg 180
ggttacaaag caaccctggc agacgctccg ctggcagata accgtgtggg cctgctggat 240
aaagttcgcg gttggatgga cgaagatgaa aaacattcag gcaatgaacc gccagtgcag 300
gttgcagtca ttggttcggg cggtgaagaa atggatgacg ctctgaaagc ggtcgaacaa 360
ggcgcgcagg tgaccctgat tgaacgtggt acgatcgaag aaacctgcgt gaacgttggc 420
tgtgtgccga gcaaaattat gatccgcgca gctcatatcg cccacctgcg tcgcgaatct 480
ccgtttgacg gcggtattgc agcaaccgtg ccgacgatcg atcgtagtaa actgctggca 540
cagcaacagg ctcgtgttga cgaactgcgc catgcgaaat atgaaggcat tctgggcggt 600
aacccggcta tcacggatga acacggtgaa gcccgtttta aagatgacca gtccctgacc 660
gtgcgcctga atgaagaaga agaacgtgtg gttatgttcg atcgttgcct ggttgcaacc 720
ggtgcatcac cggctgtccc gccgattccg ggtctgaaag aagaaccgta ctggaccagt 780
acggaagcgc tggcctccga caccattccg gaacgtctgg cagttatcgg cgatgaagtc 840
gtggcgctgg aactggcaca ggcttttgcg cgcctgggtt ctaaagtcac ggtgctggcg 900
cgtaataccc tggatgaccg cgaagatccg gcgattggcg aagccgttac ggcagctttt 960
cgtgaagaag gtatcgaagt cctggaagaa acccaagcaa gtcaggtggc tcatatggac 1020
ggcgaattcg ttctggatga cgatcacggt gaactgcgcg ccgataaact gctggttgcg 1080
accggtcgta cgccgaacac ccgttcgctg gcactgctgg cgcatggcgt taccgtcaat 1140
gcggaagaac tgattgccat cgatcaaggc atgcgtacga gcaacccgaa tatttatgcg 1200
gccggtgact gtaccgatca accgcagttc gtgtacgttg acgatgacga tggcacgcgt 1260
gcagctatca acatgaccgg cggtgacgca gcaaaaccgg cacgtgcaat gccggccgtt 1320
gtctttacgg atccgcaggt cgcaaccgtg ggttatagcg aagctgaagc gcatcacgat 1380
ggcattgaaa cgaaagtcgg taaattcccg ctggacaacg tgggtcgtgc actggcaaat 1440
tttgataccc gcggtttcat taaactggtg atcgaagaag gctctcaccg cctgatcggt 1500
gttcaagctg tcgcgccgga agcgggcgaa ctgattcaga cggaagaact ggccatccgt 1560
aatcgcatga ccgtgcaaga actggcggat cagctgttcc cgtacctgac gatggtggaa 1620
ggtctgaaac tggaagaaca gacgttcaat aaagatgtca aacaactgtc gtgttgtgca 1680
ggctaa                                                            1686
