                         SEQUENCE LISTING

<110>  University of Utah Research Foundation
 
<120>  Metabolic engineering of non-pathogenic Escherichia coli strains
       for the controlled production of low molecular weight heparosan 
       and size-specific heparosan oligosaccharides

<130>  76926-318144

<140>  PCT/US2020/029902
<141>  2020-04-24

<150>  US 62/914,122
<151>  2019-10-11

<150>  US 62/838,432
<151>  2019-04-25

<160>  7     

<170>  PatentIn version 3.5

<210>  1
<211>  6517
<212>  DNA
<213>  Escherichia coli


<220>
<221>  exon
<222>  (319)..(2778)

<400>  1
aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct       60

tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca      120

aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg      180

attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg      240

atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc      300

taacaggagg aattaacc atg gcg gtc tca acc gaa gtt gac cac aac gaa        351
                    Met Ala Val Ser Thr Glu Val Asp His Asn Glu           
                    1               5                   10                

tac aca ggt aac ggc gtt acg aca tca ttt ccg tat acc ttc cgt att        399
Tyr Thr Gly Asn Gly Val Thr Thr Ser Phe Pro Tyr Thr Phe Arg Ile           
            15                  20                  25                    

ttc aaa aaa tcc gac ctg gtt gtt cag gtg tct gac ctt aac ggt aac        447
Phe Lys Lys Ser Asp Leu Val Val Gln Val Ser Asp Leu Asn Gly Asn           
        30                  35                  40                        

gtt aca aaa cta gtg ctg gat gct ggt tat acg gta aca ggg gcg gga        495
Val Thr Lys Leu Val Leu Asp Ala Gly Tyr Thr Val Thr Gly Ala Gly           
    45                  50                  55                            

act tat agt ggc ggt gca gtg gtt ctt ccg tcg ccg ctt gct gct ggc        543
Thr Tyr Ser Gly Gly Ala Val Val Leu Pro Ser Pro Leu Ala Ala Gly           
60                  65                  70                  75            

tgg cga atc acg ata gag cgt gtg ctt gat gtg gtg cag gag act gat        591
Trp Arg Ile Thr Ile Glu Arg Val Leu Asp Val Val Gln Glu Thr Asp           
                80                  85                  90                

ctt cgc aat cag gga aaa ttt ttc ccc gaa gtt cat gag gat gca ttt        639
Leu Arg Asn Gln Gly Lys Phe Phe Pro Glu Val His Glu Asp Ala Phe           
            95                  100                 105                   

gac tac ctg acg atg ctg atc cag cga tgt ttt ggg tgg ttc aga cgt        687
Asp Tyr Leu Thr Met Leu Ile Gln Arg Cys Phe Gly Trp Phe Arg Arg           
        110                 115                 120                       

gca ttg atg aaa cca tct ttg ctt gca aaa tat tac gat gca aag caa        735
Ala Leu Met Lys Pro Ser Leu Leu Ala Lys Tyr Tyr Asp Ala Lys Gln           
    125                 130                 135                           

aac aga ata tct aac ctt gcc gat cca tca ctt gag cag gac gct gta        783
Asn Arg Ile Ser Asn Leu Ala Asp Pro Ser Leu Glu Gln Asp Ala Val           
140                 145                 150                 155           

aat aat cgc tca atg cgt aat tat gtc gat gct gca atc gcc gga gtt        831
Asn Asn Arg Ser Met Arg Asn Tyr Val Asp Ala Ala Ile Ala Gly Val           
                160                 165                 170               

att ggt ggt ttt ggt tgg ttt att cag tat ggt tct gga gcg gta tac        879
Ile Gly Gly Phe Gly Trp Phe Ile Gln Tyr Gly Ser Gly Ala Val Tyr           
            175                 180                 185                   

aga acg ttc cag gat aag atg cgt gat ggt gtc agc att aag gat ttt        927
Arg Thr Phe Gln Asp Lys Met Arg Asp Gly Val Ser Ile Lys Asp Phe           
        190                 195                 200                       

gga gct caa aat gga atc tta aat gat aac aag gat gct ttt aca aaa        975
Gly Ala Gln Asn Gly Ile Leu Asn Asp Asn Lys Asp Ala Phe Thr Lys           
    205                 210                 215                           

tca tta cat tcg ttt agc agt gtt ttt gtt ccg gaa ggg gta ttc aat       1023
Ser Leu His Ser Phe Ser Ser Val Phe Val Pro Glu Gly Val Phe Asn           
220                 225                 230                 235           

aca tct tta gtt tct ctt tca cgt tgt ggc ttg tac gga aca ggt ggg       1071
Thr Ser Leu Val Ser Leu Ser Arg Cys Gly Leu Tyr Gly Thr Gly Gly           
                240                 245                 250               

gga acg ata aaa cag tat gac aga gat ggt aat cat ctg gtt ttt aac       1119
Gly Thr Ile Lys Gln Tyr Asp Arg Asp Gly Asn His Leu Val Phe Asn           
            255                 260                 265                   

atg ccc gat ggt ggc atg ctt agt acg cta aca att atg gga aat aaa       1167
Met Pro Asp Gly Gly Met Leu Ser Thr Leu Thr Ile Met Gly Asn Lys           
        270                 275                 280                       

tca gat gat agt gtg cag gga cac cag gtg tca ttt tca ggt ggc cat       1215
Ser Asp Asp Ser Val Gln Gly His Gln Val Ser Phe Ser Gly Gly His           
    285                 290                 295                           

gat gta tcg gtt aaa aat atc aga ttt aca aat acg cga gga cca gga       1263
Asp Val Ser Val Lys Asn Ile Arg Phe Thr Asn Thr Arg Gly Pro Gly           
300                 305                 310                 315           

ttt agc ttg atc gct tat ccg gat aat ggt att ccg tca ggt tac att       1311
Phe Ser Leu Ile Ala Tyr Pro Asp Asn Gly Ile Pro Ser Gly Tyr Ile           
                320                 325                 330               

gtt aga gat ata aga gga gag tat tta ggg ttc gca aat aat aaa aaa       1359
Val Arg Asp Ile Arg Gly Glu Tyr Leu Gly Phe Ala Asn Asn Lys Lys           
            335                 340                 345                   

gca ggt tgt gtg ctt ttt gat tca tcg caa aat acg cta att gat ggt       1407
Ala Gly Cys Val Leu Phe Asp Ser Ser Gln Asn Thr Leu Ile Asp Gly           
        350                 355                 360                       

gtg ata gcc aga aat tat cct cag ttt ggt gca gtg gaa ctt aaa aca       1455
Val Ile Ala Arg Asn Tyr Pro Gln Phe Gly Ala Val Glu Leu Lys Thr           
    365                 370                 375                           

gca gca aaa tat aac att gtc agc aat gtt att ggt gaa gag tgt cag       1503
Ala Ala Lys Tyr Asn Ile Val Ser Asn Val Ile Gly Glu Glu Cys Gln           
380                 385                 390                 395           

cac gtt gtt tac aat gga act gag acg gaa act gcc cca acg aat aat       1551
His Val Val Tyr Asn Gly Thr Glu Thr Glu Thr Ala Pro Thr Asn Asn           
                400                 405                 410               

atc att agc agt gta atg gct aac aac cca aaa tac gcc gca gta gtt       1599
Ile Ile Ser Ser Val Met Ala Asn Asn Pro Lys Tyr Ala Ala Val Val           
            415                 420                 425                   

gtt ggc aag ggg act ggt aac ctg att tcg gat gtg ctg gtt gat tac       1647
Val Gly Lys Gly Thr Gly Asn Leu Ile Ser Asp Val Leu Val Asp Tyr           
        430                 435                 440                       

tct gaa tcg gac gca aag cag gcg cac ggc gtc acc gtt cag gga aat       1695
Ser Glu Ser Asp Ala Lys Gln Ala His Gly Val Thr Val Gln Gly Asn           
    445                 450                 455                           

aat aat att gcc agt aat att cta atg act ggg tgt gat ggg aaa aat       1743
Asn Asn Ile Ala Ser Asn Ile Leu Met Thr Gly Cys Asp Gly Lys Asn           
460                 465                 470                 475           

gaa tca gga gat ctg cag aca tct aca acc att cgt ttc tta gat gct       1791
Glu Ser Gly Asp Leu Gln Thr Ser Thr Thr Ile Arg Phe Leu Asp Ala           
                480                 485                 490               

gca cgc agt aat tat gcg tca ata ttc ccc atg tat agt tct tcc ggc       1839
Ala Arg Ser Asn Tyr Ala Ser Ile Phe Pro Met Tyr Ser Ser Ser Gly           
            495                 500                 505                   

gtg gtt acc ttc gag gaa ggg tgt atc agg aac ttt gtt gaa att aaa       1887
Val Val Thr Phe Glu Glu Gly Cys Ile Arg Asn Phe Val Glu Ile Lys           
        510                 515                 520                       

cat ccg ggt gac aga aat aat att ctg agt tct gca tca gcg gtg act       1935
His Pro Gly Asp Arg Asn Asn Ile Leu Ser Ser Ala Ser Ala Val Thr           
    525                 530                 535                           

ggt att tcc agt ata gac ggc act aca aat agc aat gtt gtt cac gtc       1983
Gly Ile Ser Ser Ile Asp Gly Thr Thr Asn Ser Asn Val Val His Val           
540                 545                 550                 555           

cct gcg ctt ggt cag tac gtt ggg act atg tca ggg cgt ttt gaa tgg       2031
Pro Ala Leu Gly Gln Tyr Val Gly Thr Met Ser Gly Arg Phe Glu Trp           
                560                 565                 570               

tgg gtt aaa tat ttt aac ctt gct aac cag acg ctt gtt tct gca gat       2079
Trp Val Lys Tyr Phe Asn Leu Ala Asn Gln Thr Leu Val Ser Ala Asp           
            575                 580                 585                   

aaa ttc aga atg ctt gct gaa ggc gat gta tct ctg gct gtg gga ggc       2127
Lys Phe Arg Met Leu Ala Glu Gly Asp Val Ser Leu Ala Val Gly Gly           
        590                 595                 600                       

ggt ata agt tcg caa ttg aaa tta ttc aat agt gat aat act aaa ggc       2175
Gly Ile Ser Ser Gln Leu Lys Leu Phe Asn Ser Asp Asn Thr Lys Gly           
    605                 610                 615                           

act atg tcg cta ata aat gga aat att cga ata tct act gga aat tca       2223
Thr Met Ser Leu Ile Asn Gly Asn Ile Arg Ile Ser Thr Gly Asn Ser           
620                 625                 630                 635           

gaa tat ata cag ttt tct gat tca gcc atg aca cca tcg aca acg aat       2271
Glu Tyr Ile Gln Phe Ser Asp Ser Ala Met Thr Pro Ser Thr Thr Asn           
                640                 645                 650               

act tat tct ctt ggg ttg gct ggt cgt gca tgg tcg ggg gga ttt acc       2319
Thr Tyr Ser Leu Gly Leu Ala Gly Arg Ala Trp Ser Gly Gly Phe Thr           
            655                 660                 665                   

cag tca gcg ttt acg gtg ctg tcc gat gcg cgt ttc aag act gct cca       2367
Gln Ser Ala Phe Thr Val Leu Ser Asp Ala Arg Phe Lys Thr Ala Pro           
        670                 675                 680                       

gag gtt att gat gag aaa ata ctg gac gca tgg gaa aga gtg gaa tgg       2415
Glu Val Ile Asp Glu Lys Ile Leu Asp Ala Trp Glu Arg Val Glu Trp           
    685                 690                 695                           

gtt tca tac cag tac ctt gac agg atc gaa gtg aaa ggt aaa gac gga       2463
Val Ser Tyr Gln Tyr Leu Asp Arg Ile Glu Val Lys Gly Lys Asp Gly           
700                 705                 710                 715           

gca aga tgg cac ttt ggt gca gtt gcg cag cat gtt atc agt gta ttt       2511
Ala Arg Trp His Phe Gly Ala Val Ala Gln His Val Ile Ser Val Phe           
                720                 725                 730               

cag aat gaa ggc ata gat gtg tca cga ctg gca ttt atc tgt tat gac       2559
Gln Asn Glu Gly Ile Asp Val Ser Arg Leu Ala Phe Ile Cys Tyr Asp           
            735                 740                 745                   

aag tgg aat gag acc ccg gca gaa tac agg gat gtg acg gaa gaa gag       2607
Lys Trp Asn Glu Thr Pro Ala Glu Tyr Arg Asp Val Thr Glu Glu Glu           
        750                 755                 760                       

cat tct gca gga gtt tac cca ctt ata cag aca aag gtt ctg gta cgc       2655
His Ser Ala Gly Val Tyr Pro Leu Ile Gln Thr Lys Val Leu Val Arg           
    765                 770                 775                           

gaa gcc gtc gag gct ggt gaa tgt tac ggt atc cgt tat gaa gag gct       2703
Glu Ala Val Glu Ala Gly Glu Cys Tyr Gly Ile Arg Tyr Glu Glu Ala           
780                 785                 790                 795           

ctg att ctg gaa tct gcg atg atg aga cgc agg gtt aaa aag ctg gaa       2751
Leu Ile Leu Glu Ser Ala Met Met Arg Arg Arg Val Lys Lys Leu Glu           
                800                 805                 810               

gag caa gtt ttg caa tta aca ggg aat tggaattcga agcttgggcc             2798
Glu Gln Val Leu Gln Leu Thr Gly Asn                                       
            815                 820                                       

cgaacaaaaa ctcatctcag aagaggatct gaatagcgcc gtcgaccatc atcatcatca     2858

tcattgagtt taaacggtct ccagcttggc tgttttggcg gatgagagaa gattttcagc     2918

ctgatacaga ttaaatcaga acgcagaagc ggtctgataa aacagaattt gcctggcggc     2978

agtagcgcgg tggtcccacc tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc     3038

gatggtagtg tggggtctcc ccatgcgaga gtagggaact gccaggcatc aaataaaacg     3098

aaaggctcag tcgaaagact gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct     3158

cctgagtagg acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg     3218

gtggcgggca ggacgcccgc cataaactgc caggcatcaa attaagcaga aggccatcct     3278

gacggatggc ctttttgcgt ttctacaaac tcttttgttt atttttctaa atacattcaa     3338

atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga     3398

agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc     3458

ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg     3518

gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc     3578

gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat     3638

tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg     3698

acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag     3758

aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa     3818

cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc     3878

gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca     3938

cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc     3998

tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc     4058

tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg     4118

ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta     4178

tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag     4238

gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga     4298

ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc     4358

tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa     4418

agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa     4478

aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc     4538

cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt     4598

agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc     4658

tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac     4718

gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca     4778

gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg     4838

ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag     4898

gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt     4958

ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat     5018

ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc     5078

acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt     5138

gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag     5198

cggaagagcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca     5258

tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc     5318

gctatcgcta cgtgactggg tcatggctgc gccccgacac ccgccaacac ccgctgacgc     5378

gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg     5438

gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa     5498

ttcgcgcgcg aaggcgaagc ggcatgcata atgtgcctgt caaatggacg aagcagggat     5558

tctgcaaacc ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta ccaattatga     5618

caacttgacg gctacatcat tcactttttc ttcacaaccg gcacggaact cgctcgggct     5678

ggccccggtg cattttttaa atacccgcga gaaatagagt tgatcgtcaa aaccaacatt     5738

gcgaccgacg gtggcgatag gcatccgggt ggtgctcaaa agcagcttcg cctggctgat     5798

acgttggtcc tcgcgccagc ttaagacgct aatccctaac tgctggcgga aaagatgtga     5858

cagacgcgac ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa aattgctgtc     5918

tgccaggtga tcgctgatgt actgacaagc ctcgcgtacc cgattatcca tcggtggatg     5978

gagcgactcg ttaatcgctt ccatgcgccg cagtaacaat tgctcaagca gatttatcgc     6038

cagcagctcc gaatagcgcc cttccccttg cccggcgtta atgatttgcc caaacaggtc     6098

gctgaaatgc ggctggtgcg cttcatccgg gcgaaagaac cccgtattgg caaatattga     6158

cggccagtta agccattcat gccagtaggc gcgcggacga aagtaaaccc actggtgata     6218

ccattcgcga gcctccggat gacgaccgta gtgatgaatc tctcctggcg ggaacagcaa     6278

aatatcaccc ggtcggcaaa caaattctcg tccctgattt ttcaccaccc cctgaccgcg     6338

aatggtgaga ttgagaatat aacctttcat tcccagcggt cggtcgataa aaaaatcgag     6398

ataaccgttg gcctcaatcg gcgttaaacc cgccaccaga tgggcattaa acgagtatcc     6458

cggcagcagg ggatcatttt gcgcttcagc catacttttc atactcccgc cattcagag      6517


<210>  2
<211>  6184
<212>  DNA
<213>  Escherichia coli


<220>
<221>  exon
<222>  (319)..(2445)

<400>  2
aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct       60

tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca      120

aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg      180

attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg      240

atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc      300

taacaggagg aattaacc atg gtg atc cag cga tgt ttt ggg tgg ttc aga        351
                    Met Val Ile Gln Arg Cys Phe Gly Trp Phe Arg           
                    1               5                   10                

cgt gca ttg atg aaa cca tct ttg ctt gca aaa tat tac gat gca aag        399
Arg Ala Leu Met Lys Pro Ser Leu Leu Ala Lys Tyr Tyr Asp Ala Lys           
            15                  20                  25                    

caa aac aga ata tct aac ctt gcc gat cca tca ctt gag cag gac gct        447
Gln Asn Arg Ile Ser Asn Leu Ala Asp Pro Ser Leu Glu Gln Asp Ala           
        30                  35                  40                        

gta aat aat cgc tca atg cgt aat tat gtc gat gct gca atc gcc gga        495
Val Asn Asn Arg Ser Met Arg Asn Tyr Val Asp Ala Ala Ile Ala Gly           
    45                  50                  55                            

gtt att ggt ggt ttt ggt tgg ttt att cag tat ggt tct gga gcg gta        543
Val Ile Gly Gly Phe Gly Trp Phe Ile Gln Tyr Gly Ser Gly Ala Val           
60                  65                  70                  75            

tac aga acg ttc cag gat aag atg cgt gat ggt gtc agc att aag gat        591
Tyr Arg Thr Phe Gln Asp Lys Met Arg Asp Gly Val Ser Ile Lys Asp           
                80                  85                  90                

ttt gga gct caa aat gga atc tta aat gat aac aag gat gct ttt aca        639
Phe Gly Ala Gln Asn Gly Ile Leu Asn Asp Asn Lys Asp Ala Phe Thr           
            95                  100                 105                   

aaa tca tta cat tcg ttt agc agt gtt ttt gtt ccg gaa ggg gta ttc        687
Lys Ser Leu His Ser Phe Ser Ser Val Phe Val Pro Glu Gly Val Phe           
        110                 115                 120                       

aat aca tct tta gtt tct ctt tca cgt tgt ggc ttg tac gga aca ggt        735
Asn Thr Ser Leu Val Ser Leu Ser Arg Cys Gly Leu Tyr Gly Thr Gly           
    125                 130                 135                           

ggg gga acg ata aaa cag tat gac aga gat ggt aat cat ctg gtt ttt        783
Gly Gly Thr Ile Lys Gln Tyr Asp Arg Asp Gly Asn His Leu Val Phe           
140                 145                 150                 155           

aac atg ccc gat ggt ggc atg ctt agt acg cta aca att atg gga aat        831
Asn Met Pro Asp Gly Gly Met Leu Ser Thr Leu Thr Ile Met Gly Asn           
                160                 165                 170               

aaa tca gat gat agt gtg cag gga cac cag gtg tca ttt tca ggt ggc        879
Lys Ser Asp Asp Ser Val Gln Gly His Gln Val Ser Phe Ser Gly Gly           
            175                 180                 185                   

cat gat gta tcg gtt aaa aat atc aga ttt aca aat acg cga gga cca        927
His Asp Val Ser Val Lys Asn Ile Arg Phe Thr Asn Thr Arg Gly Pro           
        190                 195                 200                       

gga ttt agc ttg atc gct tat ccg gat aat ggt att ccg tca ggt tac        975
Gly Phe Ser Leu Ile Ala Tyr Pro Asp Asn Gly Ile Pro Ser Gly Tyr           
    205                 210                 215                           

att gtt aga gat ata aga gga gag tat tta ggg ttc gca aat aat aaa       1023
Ile Val Arg Asp Ile Arg Gly Glu Tyr Leu Gly Phe Ala Asn Asn Lys           
220                 225                 230                 235           

aaa gca ggt tgt gtg ctt ttt gat tca tcg caa aat acg cta att gat       1071
Lys Ala Gly Cys Val Leu Phe Asp Ser Ser Gln Asn Thr Leu Ile Asp           
                240                 245                 250               

ggt gtg ata gcc aga aat tat cct cag ttt ggt gca gtg gaa ctt aaa       1119
Gly Val Ile Ala Arg Asn Tyr Pro Gln Phe Gly Ala Val Glu Leu Lys           
            255                 260                 265                   

aca gca gca aaa tat aac att gtc agc aat gtt att ggt gaa gag tgt       1167
Thr Ala Ala Lys Tyr Asn Ile Val Ser Asn Val Ile Gly Glu Glu Cys           
        270                 275                 280                       

cag cac gtt gtt tac aat gga act gag acg gaa act gcc cca acg aat       1215
Gln His Val Val Tyr Asn Gly Thr Glu Thr Glu Thr Ala Pro Thr Asn           
    285                 290                 295                           

aat atc att agc agt gta atg gct aac aac cca aaa tac gcc gca gta       1263
Asn Ile Ile Ser Ser Val Met Ala Asn Asn Pro Lys Tyr Ala Ala Val           
300                 305                 310                 315           

gtt gtt ggc aag ggg act ggt aac ctg att tcg gat gtg ctg gtt gat       1311
Val Val Gly Lys Gly Thr Gly Asn Leu Ile Ser Asp Val Leu Val Asp           
                320                 325                 330               

tac tct gaa tcg gac gca aag cag gcg cac ggc gtc acc gtt cag gga       1359
Tyr Ser Glu Ser Asp Ala Lys Gln Ala His Gly Val Thr Val Gln Gly           
            335                 340                 345                   

aat aat aat att gcc agt aat att cta atg act ggg tgt gat ggg aaa       1407
Asn Asn Asn Ile Ala Ser Asn Ile Leu Met Thr Gly Cys Asp Gly Lys           
        350                 355                 360                       

aat gaa tca gga gat ctg cag aca tct aca acc att cgt ttc tta gat       1455
Asn Glu Ser Gly Asp Leu Gln Thr Ser Thr Thr Ile Arg Phe Leu Asp           
    365                 370                 375                           

gct gca cgc agt aat tat gcg tca ata ttc ccc atg tat agt tct tcc       1503
Ala Ala Arg Ser Asn Tyr Ala Ser Ile Phe Pro Met Tyr Ser Ser Ser           
380                 385                 390                 395           

ggc gtg gtt acc ttc gag gaa ggg tgt atc agg aac ttt gtt gaa att       1551
Gly Val Val Thr Phe Glu Glu Gly Cys Ile Arg Asn Phe Val Glu Ile           
                400                 405                 410               

aaa cat ccg ggt gac aga aat aat att ctg agt tct gca tca gcg gtg       1599
Lys His Pro Gly Asp Arg Asn Asn Ile Leu Ser Ser Ala Ser Ala Val           
            415                 420                 425                   

act ggt att tcc agt ata gac ggc act aca aat agc aat gtt gtt cac       1647
Thr Gly Ile Ser Ser Ile Asp Gly Thr Thr Asn Ser Asn Val Val His           
        430                 435                 440                       

gtc cct gcg ctt ggt cag tac gtt ggg act atg tca ggg cgt ttt gaa       1695
Val Pro Ala Leu Gly Gln Tyr Val Gly Thr Met Ser Gly Arg Phe Glu           
    445                 450                 455                           

tgg tgg gtt aaa tat ttt aac ctt gct aac cag acg ctt gtt tct gca       1743
Trp Trp Val Lys Tyr Phe Asn Leu Ala Asn Gln Thr Leu Val Ser Ala           
460                 465                 470                 475           

gat aaa ttc aga atg ctt gct gaa ggc gat gta tct ctg gct gtg gga       1791
Asp Lys Phe Arg Met Leu Ala Glu Gly Asp Val Ser Leu Ala Val Gly           
                480                 485                 490               

ggc ggt ata agt tcg caa ttg aaa tta ttc aat agt gat aat act aaa       1839
Gly Gly Ile Ser Ser Gln Leu Lys Leu Phe Asn Ser Asp Asn Thr Lys           
            495                 500                 505                   

ggc act atg tcg cta ata aat gga aat att cga ata tct act gga aat       1887
Gly Thr Met Ser Leu Ile Asn Gly Asn Ile Arg Ile Ser Thr Gly Asn           
        510                 515                 520                       

tca gaa tat ata cag ttt tct gat tca gcc atg aca cca tcg aca acg       1935
Ser Glu Tyr Ile Gln Phe Ser Asp Ser Ala Met Thr Pro Ser Thr Thr           
    525                 530                 535                           

aat act tat tct ctt ggg ttg gct ggt cgt gca tgg tcg ggg gga ttt       1983
Asn Thr Tyr Ser Leu Gly Leu Ala Gly Arg Ala Trp Ser Gly Gly Phe           
540                 545                 550                 555           

acc cag tca gcg ttt acg gtg ctg tcc gat gcg cgt ttc aag act gct       2031
Thr Gln Ser Ala Phe Thr Val Leu Ser Asp Ala Arg Phe Lys Thr Ala           
                560                 565                 570               

cca gag gtt att gat gag aaa ata ctg gac gca tgg gaa aga gtg gaa       2079
Pro Glu Val Ile Asp Glu Lys Ile Leu Asp Ala Trp Glu Arg Val Glu           
            575                 580                 585                   

tgg gtt tca tac cag tac ctt gac agg atc gaa gtg aaa ggt aaa gac       2127
Trp Val Ser Tyr Gln Tyr Leu Asp Arg Ile Glu Val Lys Gly Lys Asp           
        590                 595                 600                       

gga gca aga tgg cac ttt ggt gca gtt gcg cag cat gtt atc agt gta       2175
Gly Ala Arg Trp His Phe Gly Ala Val Ala Gln His Val Ile Ser Val           
    605                 610                 615                           

ttt cag aat gaa ggc ata gat gtg tca cga ctg gca ttt atc tgt tat       2223
Phe Gln Asn Glu Gly Ile Asp Val Ser Arg Leu Ala Phe Ile Cys Tyr           
620                 625                 630                 635           

gac aag tgg aat gag acc ccg gca gaa tac agg gat gtg acg gaa gaa       2271
Asp Lys Trp Asn Glu Thr Pro Ala Glu Tyr Arg Asp Val Thr Glu Glu           
                640                 645                 650               

gag cat tct gca gga gtt tac cca ctt ata cag aca aag gtt ctg gta       2319
Glu His Ser Ala Gly Val Tyr Pro Leu Ile Gln Thr Lys Val Leu Val           
            655                 660                 665                   

cgc gaa gcc gtc gag gct ggt gaa tgt tac ggt atc cgt tat gaa gag       2367
Arg Glu Ala Val Glu Ala Gly Glu Cys Tyr Gly Ile Arg Tyr Glu Glu           
        670                 675                 680                       

gct ctg att ctg gaa tct gcg atg atg aga cgc agg gtt aaa aag ctg       2415
Ala Leu Ile Leu Glu Ser Ala Met Met Arg Arg Arg Val Lys Lys Leu           
    685                 690                 695                           

gaa gag caa gtt ttg caa tta aca ggg aat tggaattcga agcttgggcc         2465
Glu Glu Gln Val Leu Gln Leu Thr Gly Asn                                   
700                 705                                                   

cgaacaaaaa ctcatctcag aagaggatct gaatagcgcc gtcgaccatc atcatcatca     2525

tcattgagtt taaacggtct ccagcttggc tgttttggcg gatgagagaa gattttcagc     2585

ctgatacaga ttaaatcaga acgcagaagc ggtctgataa aacagaattt gcctggcggc     2645

agtagcgcgg tggtcccacc tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc     2705

gatggtagtg tggggtctcc ccatgcgaga gtagggaact gccaggcatc aaataaaacg     2765

aaaggctcag tcgaaagact gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct     2825

cctgagtagg acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg     2885

gtggcgggca ggacgcccgc cataaactgc caggcatcaa attaagcaga aggccatcct     2945

gacggatggc ctttttgcgt ttctacaaac tcttttgttt atttttctaa atacattcaa     3005

atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga     3065

agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc     3125

ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg     3185

gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc     3245

gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat     3305

tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg     3365

acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag     3425

aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa     3485

cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc     3545

gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca     3605

cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc     3665

tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc     3725

tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg     3785

ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta     3845

tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag     3905

gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga     3965

ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc     4025

tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa     4085

agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa     4145

aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc     4205

cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt     4265

agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc     4325

tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac     4385

gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca     4445

gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg     4505

ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag     4565

gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt     4625

ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat     4685

ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc     4745

acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt     4805

gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag     4865

cggaagagcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca     4925

tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc     4985

gctatcgcta cgtgactggg tcatggctgc gccccgacac ccgccaacac ccgctgacgc     5045

gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg     5105

gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa     5165

ttcgcgcgcg aaggcgaagc ggcatgcata atgtgcctgt caaatggacg aagcagggat     5225

tctgcaaacc ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta ccaattatga     5285

caacttgacg gctacatcat tcactttttc ttcacaaccg gcacggaact cgctcgggct     5345

ggccccggtg cattttttaa atacccgcga gaaatagagt tgatcgtcaa aaccaacatt     5405

gcgaccgacg gtggcgatag gcatccgggt ggtgctcaaa agcagcttcg cctggctgat     5465

acgttggtcc tcgcgccagc ttaagacgct aatccctaac tgctggcgga aaagatgtga     5525

cagacgcgac ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa aattgctgtc     5585

tgccaggtga tcgctgatgt actgacaagc ctcgcgtacc cgattatcca tcggtggatg     5645

gagcgactcg ttaatcgctt ccatgcgccg cagtaacaat tgctcaagca gatttatcgc     5705

cagcagctcc gaatagcgcc cttccccttg cccggcgtta atgatttgcc caaacaggtc     5765

gctgaaatgc ggctggtgcg cttcatccgg gcgaaagaac cccgtattgg caaatattga     5825

cggccagtta agccattcat gccagtaggc gcgcggacga aagtaaaccc actggtgata     5885

ccattcgcga gcctccggat gacgaccgta gtgatgaatc tctcctggcg ggaacagcaa     5945

aatatcaccc ggtcggcaaa caaattctcg tccctgattt ttcaccaccc cctgaccgcg     6005

aatggtgaga ttgagaatat aacctttcat tcccagcggt cggtcgataa aaaaatcgag     6065

ataaccgttg gcctcaatcg gcgttaaacc cgccaccaga tgggcattaa acgagtatcc     6125

cggcagcagg ggatcatttt gcgcttcagc catacttttc atactcccgc cattcagag      6184


<210>  3
<211>  5938
<212>  DNA
<213>  Escherichia coli


<220>
<221>  exon
<222>  (319)..(2199)

<400>  3
aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct       60

tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca      120

aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg      180

attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg      240

atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc      300

taacaggagg aattaacc atg ggt gat ggt gtc agc att aag gat ttt gga        351
                    Met Gly Asp Gly Val Ser Ile Lys Asp Phe Gly           
                    1               5                   10                

gct caa aat gga atc tta aat gat aac aag gat gct ttt aca aaa tca        399
Ala Gln Asn Gly Ile Leu Asn Asp Asn Lys Asp Ala Phe Thr Lys Ser           
            15                  20                  25                    

tta cat tcg ttt agc agt gtt ttt gtt ccg gaa ggg gta ttc aat aca        447
Leu His Ser Phe Ser Ser Val Phe Val Pro Glu Gly Val Phe Asn Thr           
        30                  35                  40                        

tct tta gtt tct ctt tca cgt tgt ggc ttg tac gga aca ggt ggg gga        495
Ser Leu Val Ser Leu Ser Arg Cys Gly Leu Tyr Gly Thr Gly Gly Gly           
    45                  50                  55                            

acg ata aaa cag tat gac aga gat ggt aat cat ctg gtt ttt aac atg        543
Thr Ile Lys Gln Tyr Asp Arg Asp Gly Asn His Leu Val Phe Asn Met           
60                  65                  70                  75            

ccc gat ggt ggc atg ctt agt acg cta aca att atg gga aat aaa tca        591
Pro Asp Gly Gly Met Leu Ser Thr Leu Thr Ile Met Gly Asn Lys Ser           
                80                  85                  90                

gat gat agt gtg cag gga cac cag gtg tca ttt tca ggt ggc cat gat        639
Asp Asp Ser Val Gln Gly His Gln Val Ser Phe Ser Gly Gly His Asp           
            95                  100                 105                   

gta tcg gtt aaa aat atc aga ttt aca aat acg cga gga cca gga ttt        687
Val Ser Val Lys Asn Ile Arg Phe Thr Asn Thr Arg Gly Pro Gly Phe           
        110                 115                 120                       

agc ttg atc gct tat ccg gat aat ggt att ccg tca ggt tac att gtt        735
Ser Leu Ile Ala Tyr Pro Asp Asn Gly Ile Pro Ser Gly Tyr Ile Val           
    125                 130                 135                           

aga gat ata aga gga gag tat tta ggg ttc gca aat aat aaa aaa gca        783
Arg Asp Ile Arg Gly Glu Tyr Leu Gly Phe Ala Asn Asn Lys Lys Ala           
140                 145                 150                 155           

ggt tgt gtg ctt ttt gat tca tcg caa aat acg cta att gat ggt gtg        831
Gly Cys Val Leu Phe Asp Ser Ser Gln Asn Thr Leu Ile Asp Gly Val           
                160                 165                 170               

ata gcc aga aat tat cct cag ttt ggt gca gtg gaa ctt aaa aca gca        879
Ile Ala Arg Asn Tyr Pro Gln Phe Gly Ala Val Glu Leu Lys Thr Ala           
            175                 180                 185                   

gca aaa tat aac att gtc agc aat gtt att ggt gaa gag tgt cag cac        927
Ala Lys Tyr Asn Ile Val Ser Asn Val Ile Gly Glu Glu Cys Gln His           
        190                 195                 200                       

gtt gtt tac aat gga act gag acg gaa act gcc cca acg aat aat atc        975
Val Val Tyr Asn Gly Thr Glu Thr Glu Thr Ala Pro Thr Asn Asn Ile           
    205                 210                 215                           

att agc agt gta atg gct aac aac cca aaa tac gcc gca gta gtt gtt       1023
Ile Ser Ser Val Met Ala Asn Asn Pro Lys Tyr Ala Ala Val Val Val           
220                 225                 230                 235           

ggc aag ggg act ggt aac ctg att tcg gat gtg ctg gtt gat tac tct       1071
Gly Lys Gly Thr Gly Asn Leu Ile Ser Asp Val Leu Val Asp Tyr Ser           
                240                 245                 250               

gaa tcg gac gca aag cag gcg cac ggc gtc acc gtt cag gga aat aat       1119
Glu Ser Asp Ala Lys Gln Ala His Gly Val Thr Val Gln Gly Asn Asn           
            255                 260                 265                   

aat att gcc agt aat att cta atg act ggg tgt gat ggg aaa aat gaa       1167
Asn Ile Ala Ser Asn Ile Leu Met Thr Gly Cys Asp Gly Lys Asn Glu           
        270                 275                 280                       

tca gga gat ctg cag aca tct aca acc att cgt ttc tta gat gct gca       1215
Ser Gly Asp Leu Gln Thr Ser Thr Thr Ile Arg Phe Leu Asp Ala Ala           
    285                 290                 295                           

cgc agt aat tat gcg tca ata ttc ccc atg tat agt tct tcc ggc gtg       1263
Arg Ser Asn Tyr Ala Ser Ile Phe Pro Met Tyr Ser Ser Ser Gly Val           
300                 305                 310                 315           

gtt acc ttc gag gaa ggg tgt atc agg aac ttt gtt gaa att aaa cat       1311
Val Thr Phe Glu Glu Gly Cys Ile Arg Asn Phe Val Glu Ile Lys His           
                320                 325                 330               

ccg ggt gac aga aat aat att ctg agt tct gca tca gcg gtg act ggt       1359
Pro Gly Asp Arg Asn Asn Ile Leu Ser Ser Ala Ser Ala Val Thr Gly           
            335                 340                 345                   

att tcc agt ata gac ggc act aca aat agc aat gtt gtt cac gtc cct       1407
Ile Ser Ser Ile Asp Gly Thr Thr Asn Ser Asn Val Val His Val Pro           
        350                 355                 360                       

gcg ctt ggt cag tac gtt ggg act atg tca ggg cgt ttt gaa tgg tgg       1455
Ala Leu Gly Gln Tyr Val Gly Thr Met Ser Gly Arg Phe Glu Trp Trp           
    365                 370                 375                           

gtt aaa tat ttt aac ctt gct aac cag acg ctt gtt tct gca gat aaa       1503
Val Lys Tyr Phe Asn Leu Ala Asn Gln Thr Leu Val Ser Ala Asp Lys           
380                 385                 390                 395           

ttc aga atg ctt gct gaa ggc gat gta tct ctg gct gtg gga ggc ggt       1551
Phe Arg Met Leu Ala Glu Gly Asp Val Ser Leu Ala Val Gly Gly Gly           
                400                 405                 410               

ata agt tcg caa ttg aaa tta ttc aat agt gat aat act aaa ggc act       1599
Ile Ser Ser Gln Leu Lys Leu Phe Asn Ser Asp Asn Thr Lys Gly Thr           
            415                 420                 425                   

atg tcg cta ata aat gga aat att cga ata tct act gga aat tca gaa       1647
Met Ser Leu Ile Asn Gly Asn Ile Arg Ile Ser Thr Gly Asn Ser Glu           
        430                 435                 440                       

tat ata cag ttt tct gat tca gcc atg aca cca tcg aca acg aat act       1695
Tyr Ile Gln Phe Ser Asp Ser Ala Met Thr Pro Ser Thr Thr Asn Thr           
    445                 450                 455                           

tat tct ctt ggg ttg gct ggt cgt gca tgg tcg ggg gga ttt acc cag       1743
Tyr Ser Leu Gly Leu Ala Gly Arg Ala Trp Ser Gly Gly Phe Thr Gln           
460                 465                 470                 475           

tca gcg ttt acg gtg ctg tcc gat gcg cgt ttc aag act gct cca gag       1791
Ser Ala Phe Thr Val Leu Ser Asp Ala Arg Phe Lys Thr Ala Pro Glu           
                480                 485                 490               

gtt att gat gag aaa ata ctg gac gca tgg gaa aga gtg gaa tgg gtt       1839
Val Ile Asp Glu Lys Ile Leu Asp Ala Trp Glu Arg Val Glu Trp Val           
            495                 500                 505                   

tca tac cag tac ctt gac agg atc gaa gtg aaa ggt aaa gac gga gca       1887
Ser Tyr Gln Tyr Leu Asp Arg Ile Glu Val Lys Gly Lys Asp Gly Ala           
        510                 515                 520                       

aga tgg cac ttt ggt gca gtt gcg cag cat gtt atc agt gta ttt cag       1935
Arg Trp His Phe Gly Ala Val Ala Gln His Val Ile Ser Val Phe Gln           
    525                 530                 535                           

aat gaa ggc ata gat gtg tca cga ctg gca ttt atc tgt tat gac aag       1983
Asn Glu Gly Ile Asp Val Ser Arg Leu Ala Phe Ile Cys Tyr Asp Lys           
540                 545                 550                 555           

tgg aat gag acc ccg gca gaa tac agg gat gtg acg gaa gaa gag cat       2031
Trp Asn Glu Thr Pro Ala Glu Tyr Arg Asp Val Thr Glu Glu Glu His           
                560                 565                 570               

tct gca gga gtt tac cca ctt ata cag aca aag gtt ctg gta cgc gaa       2079
Ser Ala Gly Val Tyr Pro Leu Ile Gln Thr Lys Val Leu Val Arg Glu           
            575                 580                 585                   

gcc gtc gag gct ggt gaa tgt tac ggt atc cgt tat gaa gag gct ctg       2127
Ala Val Glu Ala Gly Glu Cys Tyr Gly Ile Arg Tyr Glu Glu Ala Leu           
        590                 595                 600                       

att ctg gaa tct gcg atg atg aga cgc agg gtt aaa aag ctg gaa gag       2175
Ile Leu Glu Ser Ala Met Met Arg Arg Arg Val Lys Lys Leu Glu Glu           
    605                 610                 615                           

caa gtt ttg caa tta aca ggg aat tggaattcga agcttgggcc cgaacaaaaa      2229
Gln Val Leu Gln Leu Thr Gly Asn                                           
620                 625                                                   

ctcatctcag aagaggatct gaatagcgcc gtcgaccatc atcatcatca tcattgagtt     2289

taaacggtct ccagcttggc tgttttggcg gatgagagaa gattttcagc ctgatacaga     2349

ttaaatcaga acgcagaagc ggtctgataa aacagaattt gcctggcggc agtagcgcgg     2409

tggtcccacc tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg     2469

tggggtctcc ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag     2529

tcgaaagact gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg     2589

acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg gtggcgggca     2649

ggacgcccgc cataaactgc caggcatcaa attaagcaga aggccatcct gacggatggc     2709

ctttttgcgt ttctacaaac tcttttgttt atttttctaa atacattcaa atatgtatcc     2769

gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag     2829

tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt     2889

tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt     2949

gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga     3009

acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt     3069

tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga     3129

gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag     3189

tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg     3249

accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg     3309

ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt     3369

agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg     3429

gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc     3489

ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg     3549

tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac     3609

ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact     3669

gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa     3729

acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa     3789

aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg     3849

atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc     3909

gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac     3969

tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca     4029

ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt     4089

ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc     4149

ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg     4209

aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc     4269

cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac     4329

gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct     4389

ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc     4449

cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt     4509

tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac     4569

cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg     4629

cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac     4689

tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc gctatcgcta     4749

cgtgactggg tcatggctgc gccccgacac ccgccaacac ccgctgacgc gccctgacgg     4809

gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg gagctgcatg     4869

tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg     4929

aaggcgaagc ggcatgcata atgtgcctgt caaatggacg aagcagggat tctgcaaacc     4989

ctatgctact ccgtcaagcc gtcaattgtc tgattcgtta ccaattatga caacttgacg     5049

gctacatcat tcactttttc ttcacaaccg gcacggaact cgctcgggct ggccccggtg     5109

cattttttaa atacccgcga gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg     5169

gtggcgatag gcatccgggt ggtgctcaaa agcagcttcg cctggctgat acgttggtcc     5229

tcgcgccagc ttaagacgct aatccctaac tgctggcgga aaagatgtga cagacgcgac     5289

ggcgacaagc aaacatgctg tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga     5349

tcgctgatgt actgacaagc ctcgcgtacc cgattatcca tcggtggatg gagcgactcg     5409

ttaatcgctt ccatgcgccg cagtaacaat tgctcaagca gatttatcgc cagcagctcc     5469

gaatagcgcc cttccccttg cccggcgtta atgatttgcc caaacaggtc gctgaaatgc     5529

ggctggtgcg cttcatccgg gcgaaagaac cccgtattgg caaatattga cggccagtta     5589

agccattcat gccagtaggc gcgcggacga aagtaaaccc actggtgata ccattcgcga     5649

gcctccggat gacgaccgta gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc     5709

ggtcggcaaa caaattctcg tccctgattt ttcaccaccc cctgaccgcg aatggtgaga     5769

ttgagaatat aacctttcat tcccagcggt cggtcgataa aaaaatcgag ataaccgttg     5829

gcctcaatcg gcgttaaacc cgccaccaga tgggcattaa acgagtatcc cggcagcagg     5889

ggatcatttt gcgcttcagc catacttttc atactcccgc cattcagag                 5938


<210>  4
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Forward primer for SEQ ID NO: 1


<220>
<221>  primer_bind
<222>  (1)..(24)

<400>  4
aagtccatgg cggtctcaac cgaa                                              24


<210>  5
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Forward primer for SEQ ID NO: 2


<220>
<221>  primer_bind
<222>  (1)..(30)

<400>  5
aagtccatgg tgatccagcg atgttttggg                                        30


<210>  6
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Forward primer for SEQ ID NO: 3


<220>
<221>  primer_bind
<222>  (1)..(31)

<400>  6
aagtccatgg gtgatggtgt cagcattaag g                                      31


<210>  7
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Reverse primer for SEQ ID NOs: 1, 2, and 3


<220>
<221>  primer_bind
<222>  (1)..(32)

<400>  7
aaggaattca attccctgtt aattgcaaaa ct                                     32


