                         SEQUENCE LISTING

<110>  CARIBOU BIOSCIENCES, INC.
       CARTER, Matthew
       DONOHOUE, Paul
 
<120>  NOVEL CRISPR-ASSOCIATED (CAS) PROTEIN

<130>  CBI025.30

<150>  US 62/477,494
<151>  2017-03-28

<150>  US 62/629,641
<151>  2018-02-12

<160>  165

<170>  PatentIn version 3.5

<210>  1
<211>  2862
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Eubacterium siraeum, modified 
       for expression in Escherichia coli

<400>  1
atggggaaga agattcatgc gcgcgattta cgcgaacaac gcaaaacgga tcgcactgag       60

aaatttgcgg atcaaaacaa aaagcgcgag gccgagcgcg ctgttcctaa aaaggacgcc      120

gcagtctcgg ttaagagtgt atcgtccgtg tcttcaaaaa aggacaacgt cactaaaagc      180

atggcgaagg ccgctggtgt aaagtctgta tttgccgtag gtaacacggt atacatgaca      240

tcgttcggcc gcggcaacga cgctgtactg gagcaaaaga tcgtggatac atcccatgaa      300

ccacttaaca tcgacgatcc agcatatcaa ttgaacgttg ttacaatgaa cggttattcc      360

gtcaccggcc accgcggaga gaccgtttct gcagtaacgg acaacccttt acgccgtttc      420

aatggccgca aaaaggacga acctgagcaa tcggttccaa ctgacatgct ttgtcttaaa      480

cctacgttag agaagaagtt cttcggcaag gagtttgacg acaacatcca catccagttg      540

atttataaca ttttagatat tgagaagatc ttagcagttt attcaaccaa tgcaatttac      600

gctttgaaca acatgagcgc cgacgaaaac atcgaaaatt cggatttttt catgaaacgt      660

accacagacg aaacctttga cgactttgaa aagaaaaaag aatctactaa ctcacgcgaa      720

aaggcagact tcgacgcgtt tgaaaaattt attggaaact accgtcttgc gtacttcgcg      780

gatgctttct atgtcaataa aaaaaaccct aagggaaagg ctaagaatgt tctgcgtgaa      840

gataaggagc tttactcggt cttaactctt atcggtaaac tgcgccattg gtgcgtacat      900

agcgaggagg gacgtgcaga gttctggctg tataagttag acgagttaaa agacgatttt      960

aaaaatgtat tggacgtcgt gtacaaccgt cccgtggaag aaatcaacaa ccgctttatt     1020

gagaataaca aagttaatat ccaaattctg gggagcgtgt acaaaaacac agacatcgct     1080

gaacttgtgc gctcgtatta cgaattcttg attaccaaaa aatacaaaaa tatgggcttt     1140

tctattaaga aacttcgtga atcaatgttg gaaggtaaag gttacgcaga caaggaatat     1200

gactccgtcc gtaataagtt gtaccaaatg acagacttca ttctgtatac gggatacatc     1260

aacgaagact cagatcgtgc agacgatctg gtcaataccc tgcgctcttc tctgaaggag     1320

gatgataaga cgactgtata ctgtaaagag gccgactatt tgtggaagaa gtatcgcgaa     1380

tcgatccgtg aggttgcgga tgcactggat ggtgataaca tcaagaagtt gagtaagtcg     1440

aacatcgaga tccaagagga taaacttcgt aagtgcttca ttagttatgc agactccgtt     1500

tcagagttca caaaactgat ctacctgctg acccgcttcc tgagcggaaa ggaaattaat     1560

gacctggtaa ctactcttat caataaattt gataacatcc gctcttttct tgagattatg     1620

gacgagctgg gattagatcg tacgtttacc gccgaatatt cgttctttga aggctcaacg     1680

aaatacttgg cggagcttgt agagttaaat tcttttgtaa aatcttgctc ttttgatatt     1740

aacgccaagc gcacaatgta tcgcgacgcc ttagacattt tggggattga atcggacaag     1800

actgaagagg atattgaaaa gatgattgat aatatccttc agattgatgc gaatggcgac     1860

aagaaactta agaaaaataa tggcctgcgt aacttcattg caagtaacgt tattgacagt     1920

aaccgtttca aatacttagt acgctacggg aaccctaaaa aaatccgcga aacagctaag     1980

tgcaaaccgg ctgttcgctt cgtgttgaac gagatccccg acgcacagat cgagcgctat     2040

tacgaggcat gctgtccaaa gaacacagcc ctttgctcag cgaacaagcg tcgcgagaag     2100

ttagctgaca tgattgccga gattaagttc gagaacttct ctgacgctgg aaattatcaa     2160

aaagctaacg ttacctcgcg cacatcagag gcggaaatca aacgtaaaaa ccaggcgatt     2220

attcgcttgt atttgacggt catgtacatt atgctgaaga acttagtcaa cgtgaacgct     2280

cgttacgtga tcgcatttca ctgtgtggag cgtgatacta agttgtatgc cgaatctgga     2340

ttggaggttg ggaacattga aaagaataaa actaatctta ccatggccgt aatgggagtt     2400

aagcttgaga atggtatcat caagactgag tttgataaat cttttgcgga aaacgcagca     2460

aatcgttacc ttcgtaacgc acgctggtat aaacttatct tagacaattt aaaaaagtca     2520

gaacgcgcgg tagtaaacga atttcgtaac acagtatgtc atttaaacgc catccgcaac     2580

attaacatta acatcaagga gattaaggag gtagaaaatt attttgcctt gtaccactat     2640

ttgatccaaa aacatttgga gaaccgtttc gccgacaaaa aagttgaacg cgatacgggt     2700

gactttattt ccaaattgga agagcataag acgtactgta aggactttgt aaaagcatac     2760

tgtacgccgt ttggatataa tttagtacgt tataagaact tgactattga cggacttttc     2820

gataaaaact accctgggaa ggatgattct gatgaacaga aa                        2862


<210>  2
<211>  2757
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834971, modified for expression in Escherichia coli

<400>  2
atggcaaaga aaaataaaat gaagccgcgc gagttacgcg aggcccagaa gaaagctcgt       60

caattaaaag cggccgagat caacaataac gcagccccag caattgcagc aatgccagcg      120

gccgaagtga ttgcgccggc tgcagagaag aagaagagct cagtcaaggc agcagggatg      180

aagagcatcc ttgttagcga gaacaagatg tacattacat cttttgggaa aggaaactca      240

gcggtattgg aatacgaggt tgataacaac gattacaatc agacgcagtt atcatccaag      300

gacaacagca acatccaact gggtggcgtc aatgaggtca acattacttt ttcaagcaag      360

cacggctttg aaagtggcgt ggaaattaac acttctaatc cgacacaccg ttcaggagaa      420

agttcccctg ttcgtggcga tatgttaggg cttaagtcag aactggaaaa gcgcttcttc      480

ggtaagacct tcgatgataa cattcacatt caacttatct acaacatcct tgatattgaa      540

aagatccttg cagtgtacgt tacgaacatc gtctacgctc tgaataatat gttaggtgtc      600

aaggggtctg aatcccatga tgacttcatt ggttacttgt cgacaaataa tatctacgat      660

gtcttcattg atccagataa tagttccttg agcgacgaca agaaagcaaa cgtacgtaaa      720

agtcttagta aatttaatgc gttgttaaaa actaaacgtc tgggctattt cggattagag      780

gaaccaaaga ccaaagacaa ccgtgtaagc caggcgtata agaagcgtgt gtatcacatg      840

cttgccattg tcgggcaaat tcgtcaatgc gtatttcatg acaaaagcgg tgccaaacgt      900

tttgatcttt attctttcat taacaatatt gatccagagt accgtgacac gcttgattat      960

ttggtagaag agcgcctgaa gtcaattaac aaagacttta ttgaagacaa caaagtaaac     1020

atcagccttt taattgatat gatgaagggt tacgaggcgg acgatatcat tcgcctgtac     1080

tacgacttca ttgtattaaa atctcagaaa aacctggggt tctctattaa gaagttacgt     1140

gagaagatgc tggacgagta tggtttccgt ttcaaagata aacaatacga ttctgttcgt     1200

tccaagatgt ataaattgat ggattttttg cttttttgta actattaccg caatgatatt     1260

gctgcggggg aatctctggt acgtaaactg cgtttttcga tgacagacga tgaaaaggag     1320

ggcatttatg cggacgaagc cgctaaattg tgggggaaat ttcgtaatga ctttgagaat     1380

atcgcggacc acatgaatgg cgatgttatt aaggagttgg gaaaagctga catggatttc     1440

gacgaaaaga tcttggattc tgagaagaaa aacgcttccg acctgctgta tttttcaaaa     1500

atgatttata tgctgacata tttcttagat gggaaagaga ttaacgactt gctgacgact     1560

ctgatttcaa aatttgacaa tatcaaagag tttttgaaaa ttatgaagtc ttctgcagtc     1620

gatgtagagt gtgaacttac agctgggtac aagctgttca atgacagtca acgtatcacc     1680

aacgaattat ttatcgttaa aaatattgcc tccatgcgta agccagccgc aagtgccaag     1740

ctgacaatgt tccgcgatgc actgacgatt ctgggaattg acgataagat tacggatgac     1800

cgtatttcag gaatcttgaa gcttaaagag aagggcaagg gcattcatgg acttcgtaac     1860

ttcatcacca acaacgtgat cgagagtagc cgttttgttt accttatcaa atatgcgaat     1920

gcacaaaaga tccgcgaagt ggcgaaaaac gagaaggtcg taatgttcgt attaggtgga     1980

attccagata cgcaaattga gcgctattat aagtcatgtg tagagttccc ggatatgaac     2040

agctcattag gagtgaaacg ttcagagctg gcgcgcatga ttaagaatat cagttttgac     2100

gatttcaaga acgtgaaaca acaagcgaaa ggacgcgaaa acgtcgcaaa agagcgcgcc     2160

aaggccgtca ttgggttgta cttaacggta atgtacttac ttgtcaaaaa cctggttaat     2220

gttaacgcgc gctatgtcat cgccatccat tgtctggaac gtgatttcgg tctttataag     2280

gagattattc ctgaactggc gtcaaagaac ctgaaaaacg attaccgcat tttatctcag     2340

actctgtgtg aactgtgtga taagtctccc aatttgttct tgaagaagaa tgagcgcctg     2400

cgtaaatgtg ttgaagtcga catcaataat gcagacagct cgatgactcg taaatatcgc     2460

aactgtatcg ctcacttgac tgtcgtccgt gaattaaaag agtacattgg tgatatttgt     2520

accgttgact cttatttcag tatttaccat tatgtaatgc aacgctgtat cacaaagcgt     2580

gaaaacgata ccaagcagga ggaaaaaatc aaatacgaag acgatttgct taagaatcac     2640

ggctatacaa aagacttcgt aaaagcattg aactcacctt tcggatacaa catcccgcgt     2700

tttaaaaatc tttcaattga gcaacttttt gatcgtaacg agtatcttac ggaaaaa        2757


<210>  3
<211>  2754
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus bicirculans, 
       modified for expression in Escherichia coli

<400>  3
atggcgaaaa agaataaaat gaaacctcgc gaattgcgcg aggcacaaaa gaaagcgcgt       60

caattgaaag cagcggagat caacaataac gcagttcccg ccatcgctgc tatgccggcc      120

gctgaggccg ctgcccccgc agcggagaaa aagaagtcat cggtcaaagc ggcagggatg      180

aagtcaatct tagtctccga gaacaagatg tacatcacca gttttggaaa aggtaactcg      240

gcggtcttgg agtacgaggt agacaataat gactataaca aaactcagtt atcctcgaag      300

gataatagca atattgagtt gtgtgatgtg gggaaggtta atatcacgtt cagctctcgt      360

cgtggctttg aatcgggagt cgagattaat acgagtaacc caacccaccg ctccggagag      420

tcgtcgtcag tccgtgggga tatgctgggc ttgaaaagcg agttggaaaa acgttttttt      480

ggcaagaatt tcgacgataa tatccatatt caacttattt acaacatctt ggacatcgag      540

aagatccttg ctgtgtatgt tacgaacatt gtttacgccc tgaataatat gcttggcgaa      600

ggggatgaat ctaactacga ctttatgggg tatttgagca cattcaacac atataaagtc      660

tttacgaatc cgaatggttc aacgctgtct gatgacaaga aagagaacat tcgcaaatca      720

ttatcgaaat ttaatgcttt gttgaaaacg aagcgcttag gttatttcgg gttagaggag      780

cctaaaacaa aggacacgcg cgcatcggag gcttacaaga aacgcgtata tcacatgctg      840

gctatcgttg ggcaaatccg tcagtgcgta tttcatgata agagcggggc caagcgtttc      900

gacctttatt catttattaa taacattgat ccagaatatc gtgaaactct ggattacttg      960

gtcgacgaac gctttgacag tattaataaa ggatttatcc aaggtaataa agtaaacatc     1020

agcttactga tcgatatgat gaagggttac gaggcggatg acatcatccg tctttactac     1080

gatttcattg tccttaaatc gcagaaaaac ctgggcttca gtatcaaaaa gttacgcgaa     1140

aagatgttgg atgagtatgg ctttcgtttc aaagataagc aatacgatag cgttcgcagc     1200

aagatgtata aattaatgga tttcttatta ttctgcaatt actaccgcaa cgacattgca     1260

gcgggcgaat ctcttgtccg caagctgcgc tttagtatga ccgatgatga gaaggagggg     1320

atctacgcag atgaggctgc aaaactgtgg ggcaaatttc gtaacgactt tgagaacatc     1380

gccgaccaca tgaacggtga cgtcattaaa gagttgggga aagcagatat ggactttgat     1440

gaaaagatcc ttgattccga aaagaaaaat gcgtcggatc tgttgtattt tagtaaaatg     1500

atttacatgc ttacgtattt tctggacgga aaagaaatca acgacttact tactacatta     1560

atttcgaagt ttgataacat taaggagttt ttaaaaatca tgaaaagcag tgcagttgac     1620

gttgaatgtg aacttacagc aggttataaa ttatttaatg acagccaacg catcacaaat     1680

gaattgttca tcgtgaagaa tatcgcgtct atgcgcaaac ccgctgcttc ggcgaagctg     1740

acaatgtttc gcgacgcttt aacaatcctg gggatcgacg ataagatcac tgatgatcgt     1800

atttccgaaa tcttaaaatt aaaggagaaa ggaaaaggta tccatggctt acgcaatttt     1860

atcactaata atgtaattga aagtagccgc tttgtgtacc ttatcaagta cgcaaacgca     1920

caaaaaatcc gtgaggtcgc caaaaacgag aaagtcgtta tgtttgtcct gggtgggatt     1980

cccgacacac aaatcgaacg ctactacaaa agttgtgtgg aattcccgga catgaactcg     2040

agtctgggtg ttaagcgtag tgaattggcc cgtatgatca agaatatcag ttttgacgat     2100

ttcaagaatg tgaaacagca ggccaaaggg cgtgagaacg tcgcaaagga acgcgctaaa     2160

gctgtgatcg gtttatatct gaccgtgatg tacttgttgg tgaagaattt ggtgaacgtt     2220

aacgcgcgtt acgttattgc cattcattgc ttagaacgcg actttggact gtataaggag     2280

attattcctg aattagccag caaaaacctg aaaaacgatt atcgtatcct gagccaaacc     2340

ctttgcgaac tttgtgataa aagcccaaac ttgtttttaa aaaaaaatga gcgtttacgc     2400

aaatgcgtgg aggttgatat taataatgct gattcctcga tgacccgcaa ataccgtaac     2460

tgtattgccc atttgacagt agtccgcgag ttgaaggagt acattggaga tatttgcact     2520

gtggacagtt acttcagtat ttaccattat gtaatgcaac gctgcattac aaagcgcgag     2580

aacgacacta agcaggagga aaaaatcaag tacgaggatg atctgctgaa aaatcatggc     2640

tacaccaagg actttgttaa ggccttgaac tctccgttcg ggtataacat tccccgcttc     2700

aaaaatctga gtattgagca gttgtttgat cgtaatgagt atcttacaga gaag           2754


<210>  4
<211>  2766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5608892, modified for expression in Escherichia coli

<400>  4
atggccaaaa agaacaaaat gaagccccgc gaacttcgtg aggcccaaaa gaaagctcgc       60

caattaaaag cagccgagat caacaacaac gcagctccgg ccattgcagc aatgcctgct      120

gcagaagtga ttgcgccagt cgccgaaaag aagaaatcca gtgttaaagc tgcaggtatg      180

aagtctattt tggtttcgga gaacaagatg tatatcacaa gcttcgggaa aggtaatagt      240

gctgttcttg agtatgaagt agataacaac gactataata aaacccaact tagttctaag      300

gataactcta atattgaatt gggggacgtt aatgaggtaa atatcacgtt ctcatcgaag      360

catggctttg gttccggggt ggaaatcaat acctctaatc ccactcatcg ttcgggtgaa      420

tcctccccag tccgtggtga tatgttgggg cttaaatcgg agttagagaa acgcttcttt      480

ggtaaaacct ttgatgataa tattcatatt caattgattt ataacatttt ggatatcgag      540

aagattttgg ctgtatacgt tacaaatatc gtgtatgcac ttaataatat gttgggtatt      600

aaagattctg aatcgtatga tgatttcatg ggctatttga gcgcacgcaa tacctatgaa      660

gtcttcactc atcctgataa aagcaactta agtgataagg ttaaagggaa cattaagaag      720

agtttatcaa agttcaatga cttgttaaag accaagcgcc ttgggtactt cggtcttgag      780

gaaccgaaga ccaaagatac ccgcgcttct gaggcgtata agaagcgcgt ctaccacatg      840

cttgcaatcg taggtcaaat ccgtcagtgt gtgtttcacg acaaatcagg agcgaaacgt      900

ttcgatttgt actccttcat taataacatc gacccagagt atcgcgacac tcttgactac      960

ttagttgagg aacgtttgaa gtcaattaat aaggatttca ttgagggaaa taaagtaaac     1020

attagccttc ttatcgacat gatgaaggga tacgaggccg acgatattat tcgcctgtat     1080

tatgatttta ttgtgttgaa atcacaaaag aatttggggt ttagcattaa aaaattgcgc     1140

gagaagatgt tggaggagta tgggtttcgc tttaaggata aacagtatga ctcagtccgc     1200

tcaaaaatgt ataagttaat ggacttcctg cttttttgta attattaccg taatgacgtc     1260

gccgccggtg aagccctggt tcgtaaattg cgcttctcaa tgactgacga tgagaaggag     1320

ggaatttatg ctgatgaggc tgcgaagtta tgggggaagt ttcgtaacga cttcgaaaat     1380

atcgccgacc acatgaatgg agatgttatc aaggagcttg gcaaggcgga tatggatttt     1440

gatgaaaaga tccttgacag cgaaaagaag aatgcctccg atttgctgta cttttcgaaa     1500

atgatctaca tgcttaccta tttcctggac ggcaaagaga tcaacgatct tttgaccacc     1560

cttatttcta agttcgataa tatcaaagag tttttgaaaa tcatgaagag ttcggcggtc     1620

gatgttgaat gtgaattaac ggccgggtat aaattattta acgactccca acgtattacg     1680

aatgaattat ttatcgttaa aaacatcgct tctatgcgca aaccagcagc gtccgccaaa     1740

cttacgatgt ttcgtgacgc ccttaccatt ttgggaatcg acgataacat cacagatgat     1800

cgcatttctg agatcttgaa gcttaaggaa aagggcaagg gcatccatgg tttacgtaat     1860

tttatcacaa acaacgtgat cgagtcgagt cgttttgtct atctgatcaa gtatgcaaac     1920

gcgcagaaaa ttcgtgaagt ggcaaaaaat gagaaagtag taatgtttgt tttgggtggt     1980

atccctgaca cccagattga gcgctactac aagtcgtgtg tagaattccc tgacatgaat     2040

agcagcttag aagctaaacg ctctgaactt gcgcgcatga ttaaaaatat ctcgttcgat     2100

gacttcaaga acgttaaaca acaggccaaa ggccgtgaga atgttgctaa agaacgcgcg     2160

aaggctgtaa ttggattata ccttactgta atgtatctgt tagtgaaaaa ccttgtgaac     2220

gtcaacgccc gctacgtcat tgcgatccat tgtttggagc gtgactttgg gttatacaag     2280

gagatcatcc cagaactggc ctcaaaaaac ttaaaaaatg actaccgtat tttgagtcag     2340

accttgtgcg aactgtgcga tgaccgtaac gaatcctcga acttgttctt gaagaagaat     2400

aaacgtttgc gcaaatgtgt cgaggtagat atcaacaatg cagacagctc tatgacgcgt     2460

aagtaccgta actgtattgc tcacttaacc gtagttcgtg aacttaaaga atacattgga     2520

gacattcgta cagttgatag ctacttcagt atttatcact atgtaatgca gcgctgtatc     2580

actaagcgtg gggatgatac gaagcaagaa gagaaaatta agtacgaaga tgacctgttg     2640

aaaaaccacg ggtacactaa ggactttgtc aaagctctga attccccgtt cgggtacaat     2700

atccctcgtt ttaagaatct gagtattgaa cagttatttg accgcaacga ataccttacg     2760

gagaag                                                                2766


<210>  5
<211>  2766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp. CAG:57, 
       modified for expression in Escherichia coli

<400>  5
atggctaaaa agaataaaat gaaacctcgc gagttgcgcg aagcccagaa aaaagctcgc       60

cagttaaagg cagcggaaat taataataat gcagcacccg ccatcgcagc gatgcccgca      120

gctgaagtaa tcgcccctgt tgctgaaaag aagaaatcca gcgtgaaagc ggcaggtatg      180

aagtccattt tggtcagcga gaataaaatg tacattacgt cgttcgggaa aggcaactcc      240

gctgtccttg agtatgaagt agacaacaat gactacaaca aaactcaact gtcaagcaaa      300

gacaacagta acatcgaact gggagacgtg aatgaggtga atatcacgtt ttcatcaaaa      360

catgggttcg gaagcggtgt ggaaatcaat acaagcaatc cgacccatcg ctcaggggag      420

tcgtcgcctg ttcgtggaga catgttgggt cttaagtccg agcttgagaa gcgttttttc      480

ggcaagacat tcgatgacaa catccatatt cagttgattt ataatatttt agatatcgaa      540

aagattttag ccgtatatgt gaccaacatt gtttatgcgt taaataacat gttagggatt      600

aaggactcgg aatcgtatga tgatttcatg ggttacttaa gcgctcgtaa tacttatgaa      660

gtcttcactc atcccgataa gagcaatttg agtgataaag tcaagggcaa catcaaaaag      720

tctttgtcga aattcaatga cctgttgaaa actaagcgct tgggttactt cgggttggaa      780

gaaccgaaga ccaaagatac gcgtgccagt gaagcttaca aaaaacgcgt ctatcacatg      840

ctggcaatcg tgggccaaat ccgtcagtgt gtttttcatg acaaaagtgg agctaaacgc      900

tttgatttgt acagcttcat taataacatt gatcctgaat atcgcgacac tttggattat      960

ttagtagaag aacgccttaa atctattaat aaagacttta ttgaagggaa taaggtgaac     1020

atcagcttac tgatcgacat gatgaagggt tacgaggctg acgacattat ccgcttgtat     1080

tatgatttca ttgtattaaa atctcagaaa aacctgggat tcagtattaa gaaattacgc     1140

gagaaaatgc ttgaggagta cggattccgt ttcaaggata aacaatatga ttctgtgcgt     1200

agtaaaatgt acaaacttat ggacttttta ttgttctgta actattaccg taatgacgtt     1260

gccgcaggcg aagccttggt acgtaagtta cgcttcagca tgacagatga cgaaaaggag     1320

ggcatttacg cggatgaagc agcgaagctg tggggtaaat tccgcaacga ttttgaaaat     1380

attgctgacc acatgaatgg tgatgttatc aaagaactgg gaaaagccga tatggatttc     1440

gacgagaaga tcttggacag tgaaaaaaag aatgccagcg atcttttata tttctccaaa     1500

atgatctaca tgcttactta tttccttgac gggaaagaga ttaatgatct gctgaccacg     1560

ctgattagta agttcgacaa cattaaggag tttttaaaga tcatgaaatc gtccgctgtg     1620

gacgtagaat gcgagttgac ggcaggttac aaactgttca acgatagtca acgcatcacc     1680

aatgaacttt tcatcgtcaa aaacattgcc tccatgcgca agcccgcggc tagcgctaaa     1740

ttaacgatgt tccgtgacgc cttgacgatt ttaggcatcg acgacaacat cacggacgat     1800

cgcatttcgg aaatccttaa acttaaggaa aaggggaaag gtatccatgg tctgcgcaat     1860

tttatcacta acaatgtaat tgaatcatca cgcttcgttt acttaatcaa atacgcgaat     1920

gctcaaaaga ttcgtgaagt agccaaggat gaaaaggttg tcatgtttgt cctgggcggg     1980

attccagaca cccaaattga acgttattac aagtcttgtg tggaattccc cgatatgaat     2040

agctccttgg aggccaaacg ctctgagtta gcccgcatga ttaagaacat ttccttcgac     2100

gattttaaaa atgtcaaaca acaggcaaaa ggccgcgaga atgtagccaa ggagcgtgcc     2160

aaggcagtaa tcggattgta tcttactgtc atgtatttgc ttgttaagaa tcttgttaac     2220

gttaacgcgc gctatgtaat cgctattcat tgcttagaac gcgactttgg cctttataag     2280

gagattattc ccgagcttgc atccaaaaat cttaagaacg actaccgtat tttgtcacaa     2340

accttatgcg agttatgcga tgaccgcaac gagtcttcca atctgtttct taaaaaaaac     2400

aaacgtcttc gcaaatgcgt ggaagtggac atcaacaacg ccgacagtag tatgactcgt     2460

aagtatcgta actgtattgc gcacttgact gtagtgcgcg agttgaagga gtatattggg     2520

gatatccgca ccgtggattc atacttcagt atctaccact acgtcatgca acgttgcatc     2580

acgaaacgtg gagacgacac caaacaagag gaaaagatta agtatgaaga cgaccttttg     2640

aagaaccacg gctacaccaa agattttgtt aaggctttga atagtccctt cgggtataac     2700

attccccgtt tcaaaaactt gagcattgaa cagctgttcg accgcaatga atacttgaca     2760

gaaaag                                                                2766


<210>  6
<211>  2799
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus flavefaciens FD-1,
       modified for expression in Escherichia coli

<400>  6
atgaaaaaaa aaatgtctct gcgtgaaaag cgtgaggcgg agaagcaagc aaagaaagcc       60

gcgtattccg ctgctagtaa gaatactgac agcaaacccg cagagaagaa ggcggaaaca      120

cccaagcccg cagaaattat ctcggataac tcgcgcaata aaactgctgt taaagccgcc      180

ggcttgaaat caactatcat cagtggggat aaattataca tgacgtcatt tggtaaggga      240

aatgccgccg tgatcgaaca gaagattgat attaatgact actctttttc tgccatgaag      300

gataccccta gcttagaggt tgataaggcc gagagcaagg agatctcttt ttcctctcac      360

catcccttcg taaagaatga caaattgacc acttacaacc ccctgtacgg cggcaaggac      420

aatccggaaa agccagtggg acgtgacatg ctggggttga aagacaaatt ggaggaacgt      480

tattttggat gcactttcaa tgataatctg cacatccaga tcatctacaa tatcttagac      540

atcgagaaaa tcctggctgt tcatagcgca aatatcacca ccgcactgga tcacatggta      600

gacgaggatg acgaaaaata cttgaactct gactacattg gttacatgaa caccattaat      660

acgtacgacg tatttatgga cccgtcaaag aactcttctt tgtcgccgaa agatcgcaag      720

aacatcgaca actcccgcgc caagtttgag aagttattgt caacgaagcg tttaggatac      780

tttggttttg actatgatgc gaatggcaag gataagaaga agaacgagga gattaagaag      840

cgtctgtacc atcttaccgc gtttgcgggt cagcttcgtc agtggtcctt tcacagcgct      900

ggcaattatc cacgtacatg gctgtacaaa cttgatagtt tggacaaaga ataccttgat      960

acacttgatc actatttcga taaacgcttc aatgacatta atgacgattt cgttacaaag     1020

aacgcgacga atttatatat tcttaaggaa gtttttccgg aggcgaactt taaagatatc     1080

gcagatcttt attacgactt catcgtaatc aaatcccaca aaaatatggg tttctctatt     1140

aaaaaattgc gtgaaaaaat gttagagtgt gatggtgcgg atcgcatcaa agaacaagat     1200

atggacagcg tacgttcaaa gctgtataaa cttattgact tttgcatttt caaatattac     1260

catgagttcc cggaactgtc tgagaagaat gttgatatct tacgtgctgc cgtctccgac     1320

acgaagaaag ataatcttta tagcgacgag gccgcgcgtc tgtggagtat cttcaaggag     1380

aagttcctgg gtttctgtga caaaattgtc gtatgggtga ctggtgaaca tgaaaaagat     1440

atcacttcgg taatcgataa agacgcgtat cgcaaccgta gcaatgtcag ttatttttcg     1500

aaactgatgt atgcgatgtg ctttttcctt gatggtaagg aaattaacga tttattgaca     1560

accctgatta ataaattcga taatatcgca aatcagatca aaacggcaaa ggaacttggt     1620

attaacacag ccttcgtaaa gaattatgac ttttttaacc actcggagaa gtatgtcgac     1680

gaactgaata ttgtgaaaaa catcgctcgc atgaaaaagc ctagtagcaa cgctaaaaaa     1740

gctatgtacc acgatgcatt gacgatcttg gggattcctg aagatatgga tgagaaagcc     1800

ttagatgagg agctggactt gattctggaa aaaaagaccg atccagtaac cgggaagcct     1860

ttgaaaggga aaaacccgct tcgcaacttt atcgctaaca atgtaatcga aaactctcgc     1920

ttcatctatt tgattaagtt ttgcaatccg gaaaacgtac gtaagattgt taataacacc     1980

aaagttacag agtttgtctt gaagcgcatc ccagatgcgc agatcgaacg ctattacaag     2040

tcttgtactg actcggaaat gaacccccca acggaaaaga aaattacgga gttagccggg     2100

aaacttaagg acatgaattt tggaaacttc cgcaacgtgc gtcaaagtgc aaaggagaac     2160

atggaaaagg agcgttttaa agcagtgatt ggtttgtacc ttaccgtagt ctatcgcgtt     2220

gtaaaaaatc tggttgatgt taattcccgc tacatcatgg cgtttcattc gctggagcgc     2280

gacagtcagt tatataatgt ctcggtcgac aacgactacc tggccttaac cgatacgtta     2340

gtaaaagagg gagataattc ccgttcccgt tacttagcgg ggaataaacg cttgcgtgac     2400

tgtgtgaaac aggatattga taatgctaag aaatggttcg tcagtgataa gtacaactct     2460

atcacaaaat accgtaataa cgtagcacat ttaactgcag tacgtaattg cgccgaattt     2520

atcggtgaca ttactaagat cgactcgtat tttgcattat atcactacct tattcagcgt     2580

caactggcta agggtttgga tcacgagcgt tcgggatttg accgcaacta tccgcagtat     2640

gctccacttt ttaagtggca tacttacgtg aaagacgtgg ttaaagcctt aaatgctccc     2700

ttcggataca acatcccacg ctttaagaat ttgtctattg atgctttatt tgatcgcaat     2760

gagatcaaaa agaatgacgg agagaagaag tctgatgat                            2799


<210>  7
<211>  2832
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus albus strain 
       KH2T6, modified for expression in Escherichia coli

<400>  7
atggcaaaga aatccaaggg gatgtcgtta cgtgagaaac gcgaattgga aaaacagaag       60

cgcattcaaa aggctgctgt taactccgtc aacgacactc ctgaaaagac agaagaggct      120

aacgtggtat cagtgaatgt gcgcacttct gccgaaaaca agcactccaa aaagtcagcg      180

gccaaggctt tggggctgaa atctggcttg gtaattggag atgagctgta tctgacatcg      240

ttcggtcgcg gcaacgaagc caagttggaa aagaaaatct caggtgatac ggttgagaaa      300

ttaggtatcg gcgcttttga ggtagctgag cgtgacgagt cgacgctgac gcttgaaagt      360

ggacgcatta aggacaagac ggcgcgtcca aaggacccac gtcacattac ggttgataca      420

caaggtaaat tcaaagagga tatgctgggt attcgcagcg tgttagaaaa aaagattttt      480

gggaagacct ttgacgataa catccatgta caactggcat acaacattct tgatgtcgag      540

aaaattatgg cacagtatgt cagtgatatt gtttatatgc tgcacaacac ggacaagacg      600

gagcgtaatg ataacctgat gggttacatg tcaatccgca acacatacaa gacgttctgt      660

gatacttcaa acttgcctga tgatactaaa caaaaagttg aaaaccaaaa acgtgaattt      720

gataaaatca ttaagagtgg ccgtctgggc tatttcgggg aagcttttat ggtaaatagc      780

ggcaactcta caaaactgcg cccggaaaaa gagatctatc atatttttgc gctgatggcg      840

tcgttacgcc aaagttactt tcatggttat gtcaaagata ccgattacca agggaccact      900

tgggcgtata cactggagga caaactgaag gggccctctc acgagttccg cgagacgatt      960

gacaaaatct ttgacgaggg attttccaaa atctcgaaag atttcggcaa aatgaacaag     1020

gtgaacctgc aaattttgga gcaaatgatc ggggagttgt acgggtccat tgagcgccaa     1080

aacttaactt gtgactacta cgatttcatc cagttaaaga aacataagta tcttggcttt     1140

agcattaaac gtttacgcga gacgatgctt gagactactc ccgcagagtg ctataaggca     1200

gagtgctaca actctgagcg ccagaaactg tacaagttga tcgacttttt aatctacgac     1260

ctttattaca atcgtaagcc cgcacgtatc gaagagatcg tcgataagct gcgtgaatct     1320

gtgaatgatg aagaaaaaga gtctatttac tcagtagagg ctaagtatgt ctatgaaagc     1380

ctttcaaaag tccttgacaa gagcttgaag aatagtgttt ctggggaaac cattaaagac     1440

cttcagaaac gttatgatga tgaaacagct aaccgtattt gggacatctc gcaacattca     1500

atcagtggca acgtcaattg cttctgtaaa ttaatttaca tcatgactct tatgctggac     1560

ggaaaagaaa tcaatgatct gttgacaacg ctggttaaca aattcgataa cattgccagt     1620

ttcattgatg tcatggatga gttaggatta gagcactcat tcactgataa ctataagatg     1680

ttcgctgatt ctaaagctat ttgtctggat ttgcaattta tcaattcatt tgcccgtatg     1740

tcgaagatcg atgacgaaaa gtcgaaacgt caactttttc gtgacgcgct ggttatttta     1800

gatattggta ataaggacga gacatggatt aataactact tagattccga tatctttaag     1860

ctggacaagg aaggtaataa gttaaaggga gcccgccatg attttcgcaa ctttatcgca     1920

aataacgtga ttaagtcttc acgcttcaaa tatttagtga agtattcgag tgcggatggc     1980

atgattaaat taaagacaaa tgagaagctt attgggttcg ttctggataa gttaccagag     2040

acgcaaatcg accgttacta cgagtcttgc gggttagaca atgccgtcgt ggacaaaaaa     2100

gtccgtattg agaagctgag tgggttaatt cgtgatatga agttcgacga tttttctggc     2160

gtaaaaacta gtaacaaagc tggcgacaat gacaagcagg acaaggccaa atatcaggcc     2220

attatttcgt tataccttat ggtgctttac cagatcgtaa agaacatgat ttacgtcaac     2280

tcacgctacg tcattgcttt ccactgttta gaacgcgatt ttgggatgta tggcaaggat     2340

tttggaaaat attaccaggg gtgccgcaag ctgactgatc acttcatcga agagaaatac     2400

atgaaggaag gaaaattggg atgcaacaaa aaagtaggac gctatcttaa aaataatatt     2460

tcctgctgca cggatggact gattaacaca taccgtaacc aggtggatca tttcgcagtg     2520

gttcgcaaaa ttggtaacta tgcggcctat atcaaatcta tcggaagctg gttcgaactt     2580

taccattatg tgattcaacg tattgtgttt gatgagtatc gtttcgcact taacaacaca     2640

gagtccaact ataaaaactc cattatcaaa caccatacgt actgtaaaga tatggtaaag     2700

gcattgaata cgccctttgg ctacgacctg cctcgctaca agaacttgtc gatcggggac     2760

ttgttcgacc gtaacaatta tttaaacaag acgaaggaat cgattgatgc taattcaagc     2820

attgattcac ag                                                         2832


<210>  8
<211>  2901
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus flavefaciens 
       strain XPD3002, modified for expression in Escherichia coli

<400>  8
atgatcgaga aaaaaaaatc ttttgctaag ggcatgggcg ttaagtccac cttggtttca       60

ggttctaagg tatatatgac cactttcgca gagggatccg acgcacgtct ggagaaaatt      120

gtcgaaggag attcgatccg ttcggtgaat gagggggagg cgttctccgc ggagatggcg      180

gacaaaaatg cgggttataa gattggaaac gctaaatttt cccacccgaa aggatacgca      240

gtggtagcca ataaccccct ttacacaggg cctgtgcaac aggacatgtt gggattgaag      300

gagactttgg aaaagcgcta ttttggtgag tccgcagatg gaaacgataa tatctgtatc      360

caggtaattc acaatatctt ggatattgaa aagatccttg ctgagtacat taccaacgct      420

gcctacgccg tgaataatat ctccggctta gacaaggaca ttattggctt tgggaagttc      480

agtaccgtct atacgtatga cgaatttaag gacccagaac accatcgtgc cgccttcaat      540

aataatgata agttgatcaa tgcaattaaa gcccagtacg acgaatttga taacttcttg      600

gataatcccc gcttaggcta cttcgggcaa gctttcttca gtaaggaggg gcgtaactac      660

attattaatt acggcaatga gtgttacgat atccttgcat tactttcggg gcttcgccac      720

tgggttgtac acaataatga ggaagagtca cgcattagcc gcacgtggtt gtataacctt      780

gataagaacc ttgacaatga atacatctct accctgaact acttatatga tcgcattacg      840

aatgagttaa ccaattcatt ctcaaagaat agtgcagcca acgtcaacta tatcgcagag      900

acgctgggta tcaacccggc ggaattcgcc gagcagtatt tccgcttttc aatcatgaag      960

gaacaaaaga atctgggttt caatattacc aagttacgtg aagtaatgtt ggatcgtaag     1020

gatatgtctg agattcgcaa aaaccataaa gtgtttgaca gcatccgtac gaaggtctac     1080

actatgatgg acttcgttat ctaccgctat tacatcgaag aggatgccaa agtggcagcg     1140

gcgaacaaat cccttccaga caacgagaaa agtctttctg agaaagacat ctttgtaatc     1200

aacttgcgcg gttcctttaa tgatgaccag aaagatgcgt tgtactatga tgaagctaat     1260

cgtatttggc gtaagttgga aaacatcatg cataacatta aggagtttcg tgggaacaag     1320

acacgtgagt ataaaaaaaa ggatgctcca cgtcttccgc gcattttgcc tgcaggacgc     1380

gatgtcagtg ctttcagcaa attaatgtat gcactgacaa tgtttctgga cgggaaggaa     1440

atcaatgatc ttctgactac acttattaac aagtttgata atattcagtc cttcttaaag     1500

gttatgcctt tgattggtgt aaacgcgaaa tttgtcgaag agtatgcctt tttcaaggat     1560

agcgcgaaaa ttgccgacga actgcgtctt attaagagtt tcgctcgtat gggggagcca     1620

atcgctgacg cccgccgcgc tatgtacatc gatgctattc gcatcttagg tacaaacttg     1680

tcatacgatg aacttaaagc tttagcagac accttttcgc tggatgaaaa cggaaacaag     1740

ttgaaaaagg ggaagcatgg aatgcgcaat tttattatca ataacgtgat ctcaaataag     1800

cgtttccact atcttatccg ttatggagat ccggcacacc tgcatgaaat tgccaagaat     1860

gaggccgtgg tgaaattcgt tttagggcgc attgctgata ttcagaagaa acaggggcag     1920

aatggaaaga atcaaatcga ccgttactat gagacgtgta ttggcaaaga caaggggaaa     1980

tcggtttcgg aaaaagttga cgccttgacg aagatcatca cgggcatgaa ctacgaccag     2040

tttgacaaaa aacgctcggt aattgaagat accggacgtg agaatgcgga acgtgagaaa     2100

tttaaaaaga tcatctcgtt gtatctgacc gtaatttatc atattttaaa aaatatcgta     2160

aacatcaacg cacgctatgt gatcgggttc cactgtgtag aacgcgacgc tcaactttat     2220

aaagaaaagg ggtatgatat taacttgaaa aagttagagg agaagggatt ctcatcagtc     2280

accaagttgt gcgcgggtat tgacgaaacg gcaccggaca agcgcaaaga cgttgaaaag     2340

gagatggccg aacgcgccaa ggaaagtatc gactcattag aaagcgcaaa tcccaagctg     2400

tatgccaatt atatcaagta tagcgatgag aagaaggcgg aggagtttac gcgccagatc     2460

aaccgtgaaa aggccaaaac tgcattgaat gcctacttgc gcaatacgaa atggaatgtg     2520

atcatccgtg aggacctgct gcgtatcgat aacaaaacat gtactttatt tcgcaataaa     2580

gcggtacatc ttgaagtggc gcgttacgtt cacgcgtata tcaatgacat tgcagaggtt     2640

aattcctatt tccagctgta tcactacatt atgcaacgca ttattatgaa cgagcgttac     2700

gagaaaagca gcggcaaagt atccgaatac tttgacgcag ttaacgatga gaagaaatat     2760

aacgaccgct tactgaaatt gctgtgtgta ccttttgggt attgcatccc ccgttttaaa     2820

aacctgagta tcgaagctct gtttgaccgc aacgaggccg ccaaatttga taaggaaaag     2880

aaaaaggttt cgggaaatag t                                               2901


<210>  9
<211>  2388
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834894, modified for expression in Escherichia coli

<400>  9
atggaaatca acacttcgaa ccccacccat cgcagcggtg aaagtagcag tgttcgtggg       60

gacatgcttg gactgaagtc agagctggag aaacgctttt ttggaaagac cttcgacgat      120

aacattcata ttcaattgat ctacaatatc ttggacattg aaaaaatcct ggccgtgtac      180

gtcactaata ttgtatatgc actgaacaat atgctgggag tgaagggcag tgagagctac      240

gatgacttca tgggctatct gtcagcgcag aatacatatt acatctttac tcatccagat      300

aagtcaaacc tgagtgacaa agtgaaaggc aacattaaaa agagtctgtc caaatttaat      360

gatctgctga aaacaaaacg tttgggttat tttggactgg aggagcccaa aactaaggac      420

aagcgcgtga gcgaagccta caagaaacgt gtttatcata tgctggcaat tgtgggtcag      480

atccgtcaaa gcgtcttcca tgacaagtct aatgaattgg atgagtatct gtactcgttt      540

atcgacatta tcgacagcga atatcgtgac acgctggatt atttggttga tgaacgtttc      600

gatagcatca ataagggctt cgtccagggg aataaggtaa acatctcgtt actgattgac      660

atgatgaagg ggtatgaggc cgatgacatt atccgcttat actatgactt catcgtgttg      720

aaatcccaaa agaaccttgg cttctccatt aaaaaacttc gtgagaagat gcttgatgag      780

tacggtttcc gcttcaagga taaacaatac gattcagtgc gtagcaaaat gtacaagttg      840

atggattttt tattattctg caactattat cgtaacgacg tggtagcggg cgaggctctt      900

gtccgtaaac tgcgcttctc gatgacagat gacgaaaaag aaggcatcta tgccgacgaa      960

gccgagaaat tgtggggcaa gttccgtaat gactttgaga atatcgctga tcatatgaat     1020

ggagacgtta tcaaggaact tggcaaagcc gacatggatt tcgacgagaa gatcctggat     1080

tctgaaaaga agaacgcgtc ggacttgctg tatttttcga agatgatcta tatgcttact     1140

tatttcttgg atggcaaaga aattaacgac ctgttgacca cactgattag caaatttgat     1200

aacattaagg agttccttaa aattatgaag tctagcgcag ttgacgtgga gtgcgagctg     1260

actgcgggat acaaattgtt taacgacagt caacgtatca cgaatgaact tttcattgtg     1320

aagaacattg cgtcgatgcg caagccggct gccagtgcaa agttgaccat gtttcgtgat     1380

gctctgacca tcttaggcat tgatgacaag attaccgatg accgcatttc cgaaattctt     1440

aagttaaaag aaaaagggaa aggaatccat ggtcttcgta actttatcac caacaatgtg     1500

atcgagtcct cgcgttttgt ctacttgatt aaatatgcta acgcacaaaa gattcgcgaa     1560

gtagctaaaa acgaaaaagt tgtgatgttt gttttaggtg gcattcccga tacccagatt     1620

gaacgctact ataaaagctg tgtcgaattc ccggacatga actcatcttt agaggcaaaa     1680

tgttcagagt tagctcgtat gatcaagaat attagtttcg atgacttcaa gaatgtgaaa     1740

cagcaagcaa agggccgcga aaatgtagcc aaagagcgcg ctaaggctgt catcggattg     1800

tatctgacag tcatgtacct tcttgtcaag aatttggtca acgtaaatgc tcgctatgtt     1860

attgctatcc attgtttaga acgcgacttc ggcttatata aagaaattat tccggagttg     1920

gcctcaaaaa acttgaagaa cgattaccgt attttgagtc agaccctgtg cgaactgtgc     1980

gacgaccgcg acgagtcacc taacctgttc ttgaagaaaa acaagcgctt acgtaagtgt     2040

gtggaggtgg acatcaacaa tgcggatagc tccatgaccc gtaaataccg taattgcatt     2100

gcccatctta ccgtggttcg cgaattaaaa gagtatattg gcgatatccg tactgtcgat     2160

tcttatttca gcatctacca ctacgttatg cagcgttgta tcacgaaacg tgaggacgat     2220

accaaacaag aggaaaagat taagtacgaa gacgatctgc tgaaaaacca tgggtatacg     2280

aaggacttcg taaaagcgtt gaactccccc ttcggctata acattcctcg cttcaagaac     2340

ttatctatcg agcaactttt tgaccgtaac gagtatttaa cggagaaa                  2388


<210>  10
<211>  2862
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Eubacterium siraeum, modified 
       for expression in human cells

<400>  10
atgggcaaaa aaatccacgc ccgggacttg agggagcaga gaaaaactga tcgcacagaa       60

aaattcgccg atcaaaacaa aaaaagggaa gctgagagag ccgtccctaa gaaagatgca      120

gcggtctcag tgaaaagcgt gagtagcgtt tccagtaaaa aagacaatgt aaccaagagt      180

atggccaagg cagccggcgt aaagtcagtt ttcgcggtgg gtaacactgt ttacatgaca      240

agttttggtc gaggaaacga cgctgtattg gagcagaaga ttgtggatac aagccatgaa      300

cccctgaaca ttgacgatcc agcctatcaa ctgaatgtgg taaccatgaa cggatactca      360

gttacaggcc ataggggtga gactgtttct gccgttaccg acaacccgtt gaggcgcttt      420

aatggacgaa aaaaagacga gcctgagcag tccgtaccaa ccgatatgct ttgcctgaag      480

cccaccctcg agaaaaaatt ttttgggaag gagttcgatg ataatattca catccagctt      540

atatacaaca ttctcgacat agaaaagatt cttgctgtct actcaacaaa tgcgatttac      600

gcactcaata acatgagcgc cgacgagaat atcgaaaata gcgatttttt catgaaaagg      660

actacggacg agacattcga tgactttgaa aagaaaaaag agtccacaaa cagtagggag      720

aaggcggatt ttgacgcctt cgagaaattt atcggtaact acaggcttgc ctattttgcg      780

gacgcgttct atgtgaataa aaaaaatccc aaaggaaaag caaagaatgt gctcagagag      840

gataaagaac tgtactcagt tttgacgctc atcggtaagc tccgccactg gtgtgtacat      900

tctgaagagg ggagagcgga gttctggctc tataaattgg acgagcttaa ggacgacttc      960

aagaacgttc tcgacgtagt gtacaaccga cctgtggaag agataaataa cagatttatc     1020

gaaaacaata aggtaaacat ccaaatattg ggctccgtct acaaaaacac agatattgcc     1080

gaacttgtca gaagctacta cgagtttttg attaccaaga agtataaaaa catgggattt     1140

tcaattaaga agttgagaga aagcatgctc gagggaaaag gttacgcgga taaagagtat     1200

gacagcgtga ggaacaaact ttaccaaatg acggacttca ttctctacac aggttacata     1260

aatgaggaca gcgacagagc agacgatctt gtaaatacgc ttcgctcttc cctgaaggaa     1320

gacgacaaga ccactgtgta ctgcaaggag gctgattacc tctggaagaa gtaccgagaa     1380

tccattcggg aagtagccga cgcacttgac ggcgacaata ttaaaaagtt gagtaaaagc     1440

aacattgaga ttcaggaaga taagcttcgc aagtgcttca tctcttatgc ggattctgtc     1500

agtgaattca caaagctgat ctacttgctt actagattct tgagtggtaa ggaaattaat     1560

gaccttgtta caactttgat caataagttc gacaatatta gatcctttct cgaaattatg     1620

gatgagcttg gtctggaccg aactttcact gctgagtact cattctttga aggttcaaca     1680

aaatatctgg ctgaattggt tgagctcaac tcctttgtca agagttgtag ctttgacatc     1740

aatgcaaagc gcacgatgta tcgagatgct ttggatatcc tgggaatcga gtctgacaaa     1800

acggaagagg acatcgaaaa aatgatagac aatatcttgc agattgacgc aaatggggat     1860

aaaaaactca aaaagaataa cggcttgcga aattttattg catctaacgt catagacagc     1920

aaccggttca aatacctcgt gcgctatggc aatccaaaaa agattagaga gaccgcaaag     1980

tgcaaaccag cggtccggtt tgtgctgaac gaaattcccg acgcacagat tgaacggtat     2040

tatgaagcat gctgccctaa aaacacggct ctgtgcagcg cgaataaaag aagggaaaag     2100

ttggcggata tgatcgcgga gattaaattc gagaattttt cagatgcagg caactatcaa     2160

aaagcgaacg ttacctcacg gacctcagag gctgagataa agaggaaaaa ccaggccatc     2220

ataagactgt atcttactgt tatgtacatc atgctgaaaa atctcgtaaa tgtgaacgca     2280

cggtacgtaa tagcgttcca ttgcgtcgag cgggatacga agctgtatgc agagtcaggg     2340

ctggaggtag gaaatatcga aaagaacaag acgaacctta ctatggcagt catgggggta     2400

aaactcgaaa acggtattat caagactgaa ttcgacaagt cattcgctga gaacgccgca     2460

aacaggtatc tgaggaacgc gagatggtac aagctgatat tggataatct gaaaaaaagc     2520

gagcgggcgg ttgtaaacga attcagaaac acagtatgcc atttgaatgc tatacgaaac     2580

attaacatta acattaagga aataaaggaa gtcgagaatt attttgcatt gtaccactat     2640

cttatacaaa aacacctcga aaatcgattt gcagacaaga aggttgaaag agataccggg     2700

gattttatct ctaaacttga agagcacaaa acctattgca aagactttgt gaaagcctac     2760

tgcacgccgt tcggctataa cttggtccgc tataaaaact tgaccatcga tggattgttc     2820

gacaaaaact acccggggaa agacgatagt gatgagcaga ag                        2862


<210>  11
<211>  2757
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834971, modified for expression in human cells

<400>  11
atggcaaaaa agaataaaat gaagccgcgg gaacttaggg aagctcagaa aaaggcccga       60

caacttaaag ctgccgagat aaacaacaac gctgcaccgg cgatagccgc catgcctgca      120

gctgaggtga ttgcacctgc tgccgaaaaa aagaaatcaa gcgtgaaagc agccggcatg      180

aaatctatcc tcgtgtccga aaataagatg tatattacgt cttttggaaa agggaatagt      240

gcggttctcg agtacgaagt agataataat gattataatc aaactcaact gtcatccaag      300

gacaatagca atatacaact tggcggggtt aacgaggtta acattacctt ttcaagcaag      360

cacggctttg agtcaggtgt agaaataaat acaagtaacc ccactcatcg ctcaggggaa      420

tcatcacctg tacgcgggga catgctcggg cttaagtcag aactggagaa acgcttcttt      480

ggtaaaacat ttgacgacaa tattcatata cagctgatct ataatattct tgatatagag      540

aaaatcttgg ctgtatacgt cacaaacatc gtatacgcac ttaataatat gctcggggtt      600

aaaggcagcg aaagccatga cgacttcatt ggatacctta gcaccaataa catctacgac      660

gtattcatcg acccagacaa tagcagtctg agcgatgaca agaaggctaa cgtgagaaag      720

tcactctcca aatttaatgc cttgcttaaa acaaagagat tggggtactt tgggcttgaa      780

gagcctaaga cgaaggataa tcgcgtatca caagcctata agaagcgggt ctatcacatg      840

ctggcgatcg tgggtcaaat tcgccaatgt gttttccacg acaagtctgg cgctaagaga      900

ttcgatcttt acagcttcat caacaacatc gaccccgagt accgggacac cctggactac      960

ctcgtggagg aaagactcaa gtcaatcaat aaggatttta ttgaagataa caaggtaaat     1020

atatccctcc tcatagatat gatgaaaggt tacgaggccg atgatatcat tcgactgtat     1080

tacgatttca ttgtactgaa gagtcaaaaa aatctgggct tctcaatcaa aaaactgcgg     1140

gagaaaatgc tggacgagta tggttttagg ttcaaggata agcaatacga cagtgtccgc     1200

agcaagatgt acaagctcat ggattttttg ctcttttgta attactaccg aaatgacata     1260

gctgcaggcg agtctttggt gcgaaaattg cgcttttcca tgacagacga tgaaaaggag     1320

ggcatatatg ccgatgaagc tgctaaattg tggggaaaat ttcggaacga tttcgaaaac     1380

atcgccgacc acatgaatgg agatgtcatc aaggagcttg gtaaagctga tatggacttt     1440

gacgaaaaga tattggacag tgaaaaaaaa aacgctagcg atcttcttta tttttccaag     1500

atgatatata tgctgacgta ttttcttgac ggtaaagaaa taaacgacct gctgactaca     1560

ttgatttcaa aatttgacaa catcaaggaa tttctgaaaa taatgaagag ttccgcggta     1620

gatgtagaat gtgagttgac agccggatac aaattgttca atgatagtca gaggatcacc     1680

aatgagttgt tcattgttaa gaatattgcg tctatgagga aaccagcggc aagtgctaag     1740

ttgacgatgt ttcgagacgc gcttacaatt cttgggatcg atgacaaaat cactgacgac     1800

cggatttcag ggatactgaa gctcaaggaa aagggaaaag gcattcatgg gcttaggaac     1860

tttatcacta acaatgtaat tgaatctagc cggttcgtct acttgatcaa gtacgccaat     1920

gcgcaaaaga ttagagaagt tgccaagaat gaaaaggtcg tgatgttcgt attggggggt     1980

attccagata cacagatcga acgctactac aagtcttgtg ttgagttccc ggacatgaac     2040

tcctctctgg gggtgaagcg ctccgaactg gctcggatga ttaagaacat tagcttcgac     2100

gatttcaaaa acgtcaagca acaagcgaag gggcgcgaaa acgttgccaa ggagagggct     2160

aaagcagtga tcggtcttta tctcacagtg atgtatcttc ttgttaagaa tcttgtcaat     2220

gtcaatgcac ggtatgttat agctatacac tgtctcgaac gagacttcgg tctctacaaa     2280

gaaattattc cagagcttgc aagtaaaaac ctgaaaaatg attatcgcat cttgtcacag     2340

acgttgtgtg agctgtgcga taagtctcca aacctcttcc ttaagaaaaa cgaacgattg     2400

cgaaagtgtg tcgaggtgga tatcaataat gcggactctt ccatgacccg aaaatataga     2460

aactgtattg cgcacttgac cgtagtcaga gaactcaaag agtacatagg ggacatctgt     2520

acggttgact catattttag tatctaccac tatgttatgc aacgctgcat aaccaagagg     2580

gagaatgata cgaagcaaga agaaaagata aagtatgaag atgacctctt gaaaaaccac     2640

ggttatacga aggacttcgt aaaagctctt aactcaccat ttggttacaa tatcccaaga     2700

ttcaagaacc tctcaatcga gcaattgttc gatcgaaatg agtatctgac ggagaaa        2757


<210>  12
<211>  2754
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus bicirculans, 
       modified for expression in human cells

<400>  12
atggcaaaga agaacaaaat gaagccgcgc gagttgcggg aggcccaaaa gaaagctcgc       60

cagctgaagg ccgccgaaat caataacaac gcagtccctg ccatagctgc catgccagca      120

gccgaagccg ccgcaccggc tgcggaaaag aagaagtcct cagtaaaagc tgcgggcatg      180

aaaagtatac ttgtgtcaga gaacaagatg tatatcacca gttttggaaa aggcaactcc      240

gcagtgcttg agtatgaggt agataacaat gattacaaca agacgcagtt gtccagcaaa      300

gataactcaa acattgaact gtgcgacgtt ggcaaggtta atataacttt cagtagtcgc      360

cgcggatttg aatcaggggt ggaaatcaat acttctaacc caactcatcg gtctggggag      420

agctcttcag tacgcgggga tatgttggga cttaaatctg agctcgaaaa gagatttttt      480

ggtaagaact tcgatgataa catccacatc caattgattt ataatatctt ggatatagag      540

aagatactcg cagtatatgt gactaacatc gtctacgcgc ttaacaatat gctcggtgag      600

ggagatgagt ctaactacga ctttatgggc tatctgagca catttaacac ctataaagtg      660

ttcactaatc ccaatggaag tactttgagc gatgacaaga aagaaaacat tcgcaagtca      720

ctctctaagt tcaacgccct cctcaagacc aaacgcttgg ggtattttgg tctggaagaa      780

cccaaaacga aagacactag agcttcagag gcatacaaga aacgagtata ccatatgctc      840

gccattgtcg ggcagatccg ccagtgtgtg tttcatgata agtctggagc aaaacgattc      900

gacctgtata gttttatcaa caatatagac cccgagtata gggaaacttt ggactacctt      960

gtagatgagc ggtttgactc cataaacaag ggctttatac aaggaaataa agtcaatatc     1020

agtctgctca tagatatgat gaaagggtat gaagctgacg acattattcg cctgtactat     1080

gactttatcg ttcttaagtc tcagaaaaat cttggcttca gtataaaaaa gctccgcgag     1140

aagatgctgg atgagtatgg atttagattc aaggataagc agtacgacag tgtaagatct     1200

aaaatgtata aacttatgga ttttctgttg ttctgcaact actaccggaa cgacatcgcc     1260

gcgggtgaga gtttggtgag aaagcttcgg ttctccatga ccgacgacga aaaggaaggg     1320

atatatgcag atgaagcggc taaactctgg ggcaagtttc gaaatgactt cgaaaacatt     1380

gcggatcata tgaacggtga tgtgataaaa gaacttggaa aagccgatat ggactttgat     1440

gaaaagatac tggactcaga aaagaaaaac gccagtgacc tcctttactt cagcaagatg     1500

atctacatgc tcacctactt tctggatggg aaagaaatca atgatttgct tacaaccttg     1560

atctctaagt tcgataatat aaaggaattt ttgaagatca tgaaatctag tgctgtggac     1620

gtagagtgtg aactcacagc aggatataag ctctttaatg atagccaacg aataacaaac     1680

gagcttttca tagtgaaaaa cattgccagc atgcggaagc cggcggcgtc agcaaaattg     1740

accatgttcc gcgatgcact gactattctt gggatcgatg ataaaataac ggatgatcgc     1800

ataagcgaga ttctgaaatt gaaggaaaag ggtaagggta tacacggttt gcggaacttc     1860

attacgaaca acgtcattga atccagtcga tttgtgtatc tgataaagta cgcgaatgcg     1920

cagaaaataa gggaggttgc taaaaatgag aaggtcgtca tgttcgtact tggcggcatt     1980

cccgacacac aaatcgaaag gtattacaaa agttgtgtag agttcccaga tatgaacagt     2040

tccttgggag taaaacggtc tgaactggcg agaatgataa agaatatatc attcgacgac     2100

ttcaaaaatg taaagcaaca ggcgaaagga agagagaacg tggctaagga acgggccaaa     2160

gccgttattg gactttacct tacggttatg tacttgttgg ttaaaaacct tgttaatgta     2220

aacgcacgct atgttatagc aatacattgc ctggagagag acttcgggct ctacaaggaa     2280

ataattcccg aactcgcttc aaagaacctt aaaaacgatt accgcattct tagtcaaacg     2340

ctctgcgagc tctgcgacaa atcccctaac ctgttcctca aaaaaaatga gagactcagg     2400

aagtgcgtcg aggttgacat caataatgca gattctagta tgactcgaaa gtatcggaac     2460

tgtatcgcgc acttgacagt tgtgcgcgaa ctgaaagaat acataggcga tatctgtacc     2520

gtagactcat atttctcaat ttaccactat gtgatgcaaa gatgcataac caagagggag     2580

aacgacacga aacaggagga aaagattaag tacgaggatg acttgttgaa aaaccacggt     2640

tatacaaaag attttgtcaa ggcactgaat agtccttttg ggtataatat cccgaggttc     2700

aaaaaccttt caattgaaca actcttcgat aggaacgagt acctgacgga gaag           2754


<210>  13
<211>  2766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5608892, modified for expression in human cells

<400>  13
atggcaaaaa agaacaagat gaagccccga gagttgcggg aagcgcagaa aaaagcgagg       60

cagcttaagg ccgctgaaat caacaacaat gccgctcccg caatagctgc gatgcctgcc      120

gcggaggtga ttgcaccagt agcggagaag aagaaaagtt ctgtaaaagc tgcaggtatg      180

aaaagcatat tggtaagtga aaacaagatg tatataacta gtttcggcaa aggtaattct      240

gccgtgttgg aatatgaggt tgataataac gattacaata aaacccaact ctcctctaaa      300

gacaattcaa atatagagct cggcgacgta aatgaagtga acattacgtt ctccagcaaa      360

cacggtttcg gctcaggggt ggaaattaat acttctaacc cgacacaccg gagtggtgag      420

tcatctccag tgagaggaga tatgctcgga ttgaaatccg aactcgagaa acggttcttc      480

ggcaagacat tcgacgacaa catccatatc cagttgattt ataacatact cgacatcgag      540

aaaattttgg ccgtgtatgt gacaaacatt gtttatgcat tgaacaacat gctgggtata      600

aaagattcag agagctatga cgactttatg gggtacttga gtgcacgcaa tacctacgag      660

gtgtttacgc acccagacaa gagtaatttg tctgacaagg tgaagggtaa tattaagaag      720

tccctttcaa aatttaacga cttgctgaaa actaaacgct tggggtactt tggactcgaa      780

gaaccaaaaa ccaaggatac aagggcatca gaagcctaca agaagagggt gtaccatatg      840

ctggctatag taggtcagat tcggcagtgc gtattccacg acaagtcagg tgcaaagaga      900

tttgatcttt actcattcat aaacaacatt gatccggaat accgggatac gctggactat      960

ctggtagaag agcgattgaa gtcaatcaat aaagatttta ttgaaggaaa caaagtgaat     1020

attagcctgc tgatcgacat gatgaaaggg tatgaagctg atgacatcat acggctctac     1080

tacgacttca tagtactcaa gagtcagaag aacctgggtt tttccatcaa aaaactgcga     1140

gaaaagatgt tggaagaata cggctttcgc ttcaaagaca aacagtatga ttccgtccga     1200

agcaaaatgt ataagcttat ggatttcctg ctcttctgca attattacag aaatgacgta     1260

gccgcgggag aagccctggt acgaaagttg agattctcta tgacggatga cgagaaggaa     1320

ggcatctatg ctgacgaggc agcgaagctg tggggaaaat tccgcaacga cttcgaaaac     1380

atagcggatc atatgaatgg ggacgttata aaagaactcg gaaaagcgga tatggacttt     1440

gatgagaaga tcctggattc tgagaaaaaa aacgctagtg atcttctcta tttctctaag     1500

atgatttaca tgctcacgta ttttttggat ggcaaagaaa ttaatgatct cctcactacc     1560

ctcatttcta agttcgacaa tattaaggaa ttccttaaga tcatgaagag ttcagcggtc     1620

gacgtagaat gtgagcttac tgccggatac aaattgttta acgatagcca gcgaatcacg     1680

aatgagctgt tcattgtcaa gaatatcgcc agtatgagga agcccgctgc gtctgcaaaa     1740

ttgactatgt tccgcgatgc tcttaccatt ctgggcattg acgacaatat aactgacgac     1800

cgcatcagtg agatcctgaa gctcaaggag aaggggaagg ggatccacgg attgcggaat     1860

ttcatcacaa ataacgtaat tgagagttcc cggttcgtgt atcttattaa atatgccaat     1920

gctcaaaaga taagagaagt agcaaaaaac gagaaggtgg tcatgtttgt actgggcgga     1980

atacccgaca cccaaatcga acggtattat aaatcttgtg tagaattccc agacatgaac     2040

agttcactcg aagcgaagag atcagaactc gcgcggatga ttaaaaacat ttccttcgac     2100

gacttcaaaa acgtcaaaca gcaggcgaaa ggtagggaga atgttgcgaa agaaagagct     2160

aaagcggtaa ttggtctgta tctgaccgtc atgtacctgt tggtgaaaaa tcttgtcaac     2220

gtaaatgcgc gatacgtcat cgcgatccat tgtcttgagc gagacttcgg gctctataag     2280

gagattatcc ctgagttggc cagtaaaaat cttaaaaacg actacagaat ccttagccag     2340

acgctttgtg agctttgtga cgacaggaac gagtcttcca atctgtttct caagaaaaat     2400

aagaggctca gaaaatgtgt agaggttgat atcaataacg ctgatagctc tatgactcga     2460

aagtatcgga attgtattgc acaccttacg gtagttaggg agctgaaaga atatatcggc     2520

gatatacgaa cagtagacag ctatttcagt atataccatt atgtcatgca acgctgcatt     2580

accaagaggg gggacgatac caagcaggag gagaaaatca aatacgaaga tgacttgctc     2640

aagaatcacg gttatactaa ggattttgtt aaagcgctca atagtccttt tggctacaac     2700

atcccccgat tcaagaacct gagtattgaa caacttttcg atagaaacga gtaccttact     2760

gagaaa                                                                2766


<210>  14
<211>  2766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp. CAG:57, 
       modified for expression in human cells

<400>  14
atggccaaaa aaaataagat gaaaccacgc gaattgcggg aagctcagaa aaaggctaga       60

cagttgaagg ccgcggagat aaacaacaat gcagcacctg ctatcgccgc catgccagct      120

gccgaggtga ttgcccccgt agcggaaaag aagaaatcct ccgtaaaagc ggcggggatg      180

aagagcatcc ttgtgagcga gaacaaaatg tacattacaa gctttggtaa agggaactca      240

gctgtgttgg agtacgaagt cgacaataac gactacaaca agacccagct gtcctctaaa      300

gacaatagca acatagaact gggcgacgta aacgaggtaa atataacgtt ctcttctaag      360

catggctttg gcagtggtgt ggagataaat acttccaacc ccactcatcg aagcggggaa      420

agtagcccgg ttaggggaga catgctcggc ttgaaatcag agctggagaa gagatttttt      480

gggaaaacat tcgacgataa tatacacatc cagctgatat ataacattct ggatatagaa      540

aaaatacttg cagtgtacgt tacgaacatt gtctatgctt tgaacaatat gctcggaatt      600

aaggattccg agtcctacga tgatttcatg ggttacctga gcgcccgaaa cacgtacgag      660

gtgttcactc atccggacaa atccaatctc agtgataaag tgaagggcaa cataaagaaa      720

tccctttcta aatttaacga tctcctcaag acgaaaagac tcgggtactt tgggctggag      780

gaacctaaaa cgaaagacac tagagccagc gaggcttata aaaaaagagt ctaccacatg      840

ctcgctatag ttggacaaat taggcaatgt gtgtttcatg acaaaagtgg tgcaaaacgg      900

ttcgatctgt actcatttat caacaacatt gatccagagt accgagacac tctcgactat      960

ttggttgagg aacgattgaa atctataaac aaggatttca ttgaggggaa caaggtaaat     1020

ataagccttc tcattgatat gatgaagggg tacgaagccg acgatataat ccgcctctac     1080

tatgatttta ttgtgctgaa aagtcagaag aatctggggt ttagtattaa aaagcttagg     1140

gagaagatgc tggaagaata tggttttcgg tttaaagata aacaatatga ctccgtgagg     1200

agtaaaatgt acaaacttat ggatttcctc ctgttctgta actattatcg gaatgatgtt     1260

gcagcaggcg aagcactcgt ccgcaaactt agattcagta tgacagatga tgagaaggaa     1320

ggaatatacg ctgacgaagc ggcgaaactg tgggggaaat ttcgcaacga ctttgagaac     1380

atagctgacc atatgaatgg cgacgttatc aaagagctcg gtaaggcgga catggacttc     1440

gacgagaaaa ttctcgacag tgagaaaaag aacgccagtg atctgctgta ttttagcaaa     1500

atgatataca tgctcacata ctttctcgat ggtaaagaga tcaacgactt gttgaccacg     1560

cttattagca aatttgataa catcaaagag ttcttgaaaa taatgaagtc cagtgccgtg     1620

gatgtggagt gcgagctcac ggcaggttat aaacttttta acgatagtca acggatcact     1680

aatgagctgt tcattgtcaa gaatattgca agcatgcgca agcccgcggc aagtgcaaag     1740

cttacgatgt ttcgggacgc cctcacgata ttgggtatag atgacaatat aactgatgat     1800

agaatcagtg agatacttaa gctcaaggaa aaggggaaag ggatacacgg tctgcgcaac     1860

ttcataacga ataacgtgat tgagagctcc cgatttgtct atctgataaa gtacgccaat     1920

gcccaaaaga taagggaagt agctaaagat gaaaaagtgg tcatgttcgt ccttggcggg     1980

attcccgaca cgcagattga gaggtactac aagtcttgtg tggagtttcc ggatatgaac     2040

agctccctcg aggctaagcg cagtgagctg gctagaatga ttaagaatat ttcctttgat     2100

gattttaaaa atgtaaagca acaagctaag ggacgggaga acgtcgccaa agaacgggcg     2160

aaagcagtga ttgggcttta tctcacggtc atgtatctgc ttgttaagaa cttggtcaac     2220

gtcaatgcaa gatatgttat agcgatccac tgccttgaac gagatttcgg gttgtacaaa     2280

gaaatcatcc cggagttggc atctaaaaac cttaagaatg actatcgaat actgtcacaa     2340

accttgtgcg aactctgcga tgaccgaaac gaatcatcta acctcttcct taaaaaaaac     2400

aagagactca gaaagtgtgt ggaggtggat atcaataatg ccgattccag tatgactaga     2460

aaataccgca actgcatcgc acacctgact gtggtcagag aacttaagga gtacattgga     2520

gatattagaa cggtcgactc atattttagc atctatcatt atgtcatgca gaggtgtatc     2580

accaagagag gagatgatac aaagcaggaa gagaagataa agtacgagga cgatcttctt     2640

aagaaccatg gctacactaa ggacttcgta aaagcgttga actccccgtt cgggtataac     2700

atacctaggt ttaagaatct ttcaattgag caattgtttg accgcaatga gtaccttaca     2760

gagaag                                                                2766


<210>  15
<211>  2799
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus flavefaciens FD-1,
       modified for expression in human cells

<400>  15
atgaaaaaga aaatgtcctt gcgagaaaaa agggaagctg aaaaacaagc aaagaaggcc       60

gcgtactcag cagcttccaa gaataccgac tccaaaccag cggaaaagaa ggcagaaacc      120

ccgaagccgg cagagataat aagtgacaac agtcggaata aaacggctgt gaaagctgcg      180

ggccttaaat ctaccattat atctggagat aagctgtaca tgacatcatt tggtaagggg      240

aacgctgcgg ttattgaaca gaagatcgac atcaatgact atagcttctc tgctatgaaa      300

gatacaccat ccctggaagt ggacaaggct gaaagcaagg aaatttcatt tagcagccac      360

cacccgttcg tgaaaaatga taaactgacc acctacaacc cattgtatgg tgggaaagat      420

aatccggaaa aaccagtagg aagagacatg ctgggactga aggacaagct tgaagaacgg      480

tatttcggat gcaccttcaa tgataacttg catattcaga ttatatataa catactcgat      540

atcgaaaaga tacttgcagt gcactccgca aacatcacga ccgcgctgga tcacatggtg      600

gacgaagatg atgagaaata tcttaacagt gattacatcg ggtacatgaa cacaattaac      660

acatacgacg tatttatgga cccttctaaa aattccagcc tctcacctaa ggaccgcaag      720

aatatcgaca acagtcgagc caagtttgaa aaactgttga gcacgaaaag gcttggatat      780

ttcggattcg attatgacgc caatggtaag gacaaaaaaa agaatgaaga gataaaaaaa      840

cggctgtatc atttgactgc attcgctggc caactgagac agtggtcctt ccattctgct      900

gggaactacc ctcgcacgtg gctctacaaa ttggacagct tggacaagga ataccttgac      960

acgctggacc attactttga taaacggttc aatgatatta acgatgattt tgttaccaaa     1020

aacgccacta acttgtatat actcaaggaa gtatttccgg aggcaaattt caaagacata     1080

gccgaccttt actacgactt tattgttatc aagagccaca agaacatggg gttttccatt     1140

aaaaaactcc gcgagaagat gctcgaatgc gatggtgctg accgcatcaa ggagcaggat     1200

atggactcag taaggagtaa gctttacaaa ctgatcgact tttgtatttt taagtattac     1260

cacgaatttc ctgagttgtc agagaagaac gtcgacatac ttcgagcagc ggtttctgat     1320

acgaaaaagg ataaccttta ttcagacgag gctgctcggc tgtggagcat attcaaagaa     1380

aagttcctcg gcttttgtga caaaattgtg gtttgggtca ccggagagca cgaaaaggac     1440

atcacgtcag tgattgataa agacgcatat cgaaatcgca gtaacgtttc ttacttctcc     1500

aagcttatgt acgcaatgtg tttctttctt gatggtaagg agataaacga cctcctcacg     1560

acccttatca ataagttcga caatatagca aatcagatta agacggccaa agaactcgga     1620

ataaacactg catttgtaaa gaactacgac ttcttcaatc atagcgagaa atacgtagac     1680

gagctgaata tcgtgaaaaa tatcgctcgg atgaaaaaac ccagttcaaa cgcaaaaaag     1740

gcaatgtatc atgacgcatt gacgatattg ggaatcccag aggacatgga tgagaaggct     1800

ctcgacgaag aattggacct cattttggag aaaaagactg atccggtgac tggcaaacca     1860

ctgaaaggca aaaaccctct gcgaaatttc atagccaaca acgtaatcga aaacagtaga     1920

ttcatatacc ttattaagtt ctgcaacccc gagaatgtcc gcaagatagt caacaacaca     1980

aaggtcacgg aattcgttct gaagcgcatt cctgatgccc aaatcgagcg gtactacaag     2040

agttgtactg atagtgagat gaaccccccc acggaaaaaa agattacgga gctcgctggt     2100

aagctgaaag atatgaattt tgggaacttc aggaacgtaa ggcaatctgc aaaggaaaac     2160

atggaaaagg agcgcttcaa agcagtgatt ggcctgtatc tcaccgttgt gtaccgagtc     2220

gtcaagaatc ttgtagatgt gaacagtcga tacatcatgg cttttcacag tctggaacgg     2280

gatagtcagc tgtacaacgt ctccgtggat aacgattacc tcgcacttac ggacactctt     2340

gtcaaggaag gcgacaattc ccggtcacga tatctggccg gaaataaacg ccttcgagat     2400

tgtgtaaagc aggatattga taacgcaaag aagtggtttg tgagcgacaa gtacaatagc     2460

ataactaaat accgaaacaa tgtagctcac cttaccgctg taaggaattg cgcggaattt     2520

atcggtgata ttactaagat tgattcctat ttcgcactgt atcattatct gatacagagg     2580

caacttgcca agggcctgga ccatgaacgg agtggctttg atcgaaacta tccccaatac     2640

gcaccattgt ttaaatggca tacttacgtt aaggacgttg tgaaggctct taatgctcct     2700

ttcggttaca atatacctag attcaaaaat ctgagcatcg atgcactgtt cgaccgcaat     2760

gagattaaaa agaacgacgg agagaaaaag tccgacgat                            2799


<210>  16
<211>  2832
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus albus strain 
       KH2T6, modified for expression in human cells

<400>  16
atggctaaaa aatcaaaagg aatgagcctc cgcgaaaagc gggaactcga gaagcaaaag       60

aggattcaaa aggccgcggt taattcagtg aacgatacac ccgaaaaaac ggaggaagct      120

aatgtcgtca gcgtcaacgt tcgaacctca gctgagaata agcactccaa aaaatctgcg      180

gccaaagctc tgggccttaa gagtggtctg gttataggag acgaacttta cttgacgagc      240

ttcggtcgcg gtaatgaagc gaagttggag aaaaagatta gcggcgacac ggtcgagaag      300

ctggggatcg gcgccttcga ggttgctgaa agggacgaat ctactctcac gctcgagagc      360

ggtcgcatca aagacaagac agccagacca aaagatccac ggcatattac tgttgataca      420

caaggaaaat tcaaagaaga tatgttgggt atccggagcg tactcgagaa aaaaatattt      480

ggcaaaactt ttgatgataa catccacgta caactggcgt ataacattct tgacgttgaa      540

aaaatcatgg ctcagtacgt ctcagacata gtatacatgt tgcacaacac ggataagacc      600

gagcgcaatg ataacctgat gggatatatg tccattcgaa acacatataa gacattttgc      660

gacactagca atctgcctga cgacacaaag caaaaagttg aaaaccaaaa gagagagttc      720

gataagataa tcaagtccgg ccgactcgga tattttggag aagcatttat ggtaaattca      780

ggcaatagta cgaagctccg acctgagaaa gaaatctacc atattttcgc gcttatggca      840

tccctgcgcc aaagctactt tcatggttac gtcaaggata cagattacca gggtaccacg      900

tgggcgtata cgcttgaaga caaactcaag ggtccatctc atgagtttcg agaaacgatc      960

gataagattt ttgacgaggg gttttcaaaa atcagtaaag atttcggaaa gatgaacaag     1020

gttaatctcc agattttgga acaaatgata ggcgagctgt atggctccat cgagcgccaa     1080

aaccttacgt gtgactatta tgattttata cagcttaaaa aacacaaata tctgggtttc     1140

tccataaaac gcctcaggga aacgatgctt gagacaacac ctgcggaatg ttataaggca     1200

gaatgttata actctgagag gcaaaaactg tacaagctga tcgacttcct gatctacgat     1260

ctctactaca atcgcaagcc agcacgaatt gaagagatag tcgataagct gcgggagagc     1320

gtgaacgacg aggagaagga gtccatatac tcagttgagg caaagtatgt ctatgagtcc     1380

ttgtcaaaag tgctcgacaa gagtctcaaa aactctgtga gcggtgagac gatcaaagac     1440

cttcagaaac ggtatgacga tgagacggcc aaccggatct gggacatctc ccagcattcc     1500

atatccggta acgtgaactg tttctgtaag cttatctaca tcatgacact gatgctcgac     1560

ggcaaggaaa tcaatgatct cctgactaca cttgttaaca agttcgataa cattgcttct     1620

ttcatagacg ttatggatga gcttgggctg gagcacagtt ttaccgataa ctataagatg     1680

tttgcagatt ccaaggccat atgcttggat ctgcaattta taaattcctt cgctagaatg     1740

tctaagattg atgacgaaaa atctaaacga cagcttttca gggatgcgct cgtaattctt     1800

gacatcggaa ataaagatga gacctggata aacaactact tggattccga catattcaag     1860

ttggataagg aaggaaacaa actcaagggt gcccggcatg actttaggaa ctttattgcg     1920

aacaacgtca tcaagtcctc ccggtttaag tatctcgtta agtactctag cgctgacggg     1980

atgataaagc tgaaaacgaa cgagaaactc atcggattcg tcctggacaa gctgcctgag     2040

acgcagatag atcgatatta tgaatcatgc ggccttgaca atgcggtcgt cgacaagaaa     2100

gtgcgaatag agaagttgag cggacttatc agggacatga agtttgatga cttctccggc     2160

gtgaagactt ctaacaaggc cggagacaat gataaacaag ataaggcgaa gtaccaggct     2220

attattagtt tgtatctgat ggtactgtac cagatagtaa aaaacatgat ttacgtcaat     2280

tcccgctatg tcattgcttt ccactgcctt gaacgcgact ttgggatgta tggcaaagat     2340

tttggaaagt actaccaggg ctgtcggaag ttgaccgacc acttcataga agaaaagtac     2400

atgaaggaag gaaagttggg gtgcaacaaa aaggtcgggc ggtacctgaa aaacaatatt     2460

tcctgctgta cggacggatt gataaatact taccgaaatc aggtggacca ttttgcggta     2520

gtccgaaaga taggaaacta cgcagcctac attaagtcaa taggctcttg gtttgaactg     2580

taccactacg taattcagag gattgtcttc gacgaataca gattcgctct taacaacacc     2640

gagtcaaatt ataagaattc catcatcaaa catcacacgt attgtaagga tatggtgaag     2700

gcgctgaaca cgccgtttgg ttatgatttg ccacggtaca aaaatctctc cattggggat     2760

cttttcgacc gcaataacta tctcaacaaa actaaggaaa gcatcgacgc taatagttca     2820

atagattctc aa                                                         2832


<210>  17
<211>  2901
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus flavefaciens 
       strain XPD3002, modified for expression in human cells

<400>  17
atgatagaga aaaaaaaaag ctttgcgaaa gggatgggcg taaaaagtac actggtatca       60

ggctctaagg tctacatgac aacgttcgca gaaggaagcg atgcacgcct cgaaaagatt      120

gttgagggag atagcattag gtccgtcaat gaaggagaag cctttagtgc agaaatggca      180

gacaagaacg ctggatacaa gattggaaac gcaaaatttt cccatccaaa gggatacgca      240

gttgtagcta acaatcccct ctataccggg cccgtccagc aagacatgct tggcctcaaa      300

gagacgcttg agaagaggta ttttggagag agtgctgatg gtaatgacaa tatctgtatc      360

caagttattc ataacatcct cgacatagag aaaatccttg cagaatatat caccaacgcc      420

gcatatgcag tgaataatat atccggtctg gataaagaca taatcggatt cggcaagttt      480

agtacagtat atacctatga cgagttcaaa gacccggagc atcatcgagc cgctttcaat      540

aacaacgaca aacttatcaa tgccattaag gctcaatatg acgagttcga taattttttg      600

gacaatccca gacttgggta tttcggccag gccttctttt ctaaggaagg caggaattac      660

atcattaatt acggaaacga atgttacgat atcctcgctt tgctctctgg cctgcgccac      720

tgggttgtac acaacaacga ggaggaatct cgaatttcac gaacttggct gtacaatttg      780

gataaaaact tggataatga atacatcagt actctgaact atctctacga taggatcacc      840

aacgaactta cgaattcatt ttcaaaaaat tccgccgcaa acgttaatta catcgctgag      900

acgttgggca taaatccggc cgagttcgcc gagcaatatt ttaggttcag tatcatgaag      960

gagcaaaaga atttggggtt caacatcacg aaactccgag aagtcatgct cgaccgaaaa     1020

gatatgtccg aaattcggaa gaaccataag gtattcgaca gcatccgcac aaaagtgtac     1080

acaatgatgg atttcgttat atacaggtat tatatagagg aagatgcaaa agttgccgcc     1140

gcaaacaaaa gtcttccaga taatgaaaag agcttgagtg aaaaagatat ttttgttata     1200

aaccttcgcg gttccttcaa tgatgaccaa aaggatgctc tgtactacga cgaggcaaac     1260

cgaatctggc gaaaactgga aaacatcatg cataatataa aggaatttcg cgggaacaaa     1320

acgagggagt ataagaagaa ggatgctcct cgcctcccca ggatactccc tgcgggcaga     1380

gacgtctccg catttagcaa actgatgtat gctctcacta tgtttttgga tgggaaggag     1440

ataaacgatc ttctgactac gttgattaac aaatttgaca acattcagag ttttctcaag     1500

gtcatgccac ttatcggcgt aaatgcaaag tttgttgagg aatacgcctt ctttaaagac     1560

tccgctaaaa tagcggacga gctccgcctg attaaatcct tcgcccgaat gggtgaaccg     1620

atagcggatg cccggcgagc tatgtacatc gatgctatca ggatccttgg aactaacttg     1680

agctacgacg aacttaaggc tctggcggac actttcagtt tggacgagaa tgggaacaag     1740

ctgaaaaagg gaaagcacgg gatgagaaac ttcataataa ataatgtcat ttccaacaag     1800

aggttccatt atttgattcg gtatggtgat cctgcgcacc ttcatgaaat tgcgaagaat     1860

gaagctgtgg ttaaatttgt tcttggcaga attgccgaca tccaaaaaaa acaggggcaa     1920

aatggtaaga accaaattga tagatactac gaaacttgca taggtaaaga caaaggtaaa     1980

agtgtctctg aaaaggtgga tgccctgacg aaaatcatca caggtatgaa ctatgaccaa     2040

ttcgacaaaa agagaagtgt aattgaggat actggtcggg aaaacgctga aagagagaag     2100

tttaagaaga ttattagtct ctatcttacc gttatttatc acattctcaa aaacatagtc     2160

aacatcaatg ccagatatgt catcggattc cactgcgttg aacgagatgc tcagttgtac     2220

aaggagaaag gctacgacat caacctcaaa aaactggagg aaaaggggtt tagttccgtt     2280

acaaagttgt gcgccggaat tgacgagacg gccccagata aacgaaagga cgttgagaaa     2340

gaaatggcgg aacgagcgaa agagtccatc gactctcttg agtcagctaa tcctaaattg     2400

tatgcaaact atattaaata ctctgatgag aagaaagcgg aggaattcac acgacagatc     2460

aatcgggaga aagcaaaaac ggcactgaat gcatacttga ggaacacgaa gtggaacgtg     2520

attatcagag aggacctgtt gaggatcgac aataaaacgt gtaccctgtt tagaaataaa     2580

gccgttcatc tcgaggtggc ccggtacgtg cacgcctata ttaatgacat tgcggaagtt     2640

aattcttatt ttcaactgta ccattacatc atgcagagaa ttatcatgaa tgaacgatac     2700

gaaaagagca gcggcaaagt gtctgagtat tttgatgccg tcaatgatga gaaaaaatac     2760

aatgacaggc tgttgaagct gctgtgcgta ccatttggtt attgtattcc tcggtttaaa     2820

aatcttagta ttgaggctct ttttgatcgg aatgaagccg caaagtttga taaggagaag     2880

aaaaaggtat ccggtaacag c                                               2901


<210>  18
<211>  2388
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834894, modified for expression in human cells

<400>  18
atggaaatta atactagtaa tcccactcat aggtccggtg aatcttctag cgtacgagga       60

gacatgcttg gtctcaaatc agagctcgag aagagatttt tcgggaaaac atttgatgat      120

aatatccaca ttcaacttat atataatatc cttgatatcg agaagatcct tgcagtctat      180

gtgactaata ttgtctacgc acttaacaat atgctcggtg taaaaggctc agagtcctat      240

gacgacttta tgggctatct ttcagcacag aatacgtact acatatttac acatcccgac      300

aagagcaact tgagcgataa agtgaagggc aatattaaga aatctcttag taaattcaat      360

gaccttctga agacgaagcg acttggctat tttgggctgg aggagcccaa aaccaaagat      420

aagcgagtgt ctgaagctta taaaaaacga gtgtatcaca tgctggctat agtgggtcaa      480

attcgccagt cagtctttca cgacaagtcc aacgaattgg atgagtactt gtattccttt      540

atagacatca tcgatagcga gtatcgagac acattggact acctggttga tgaacgattt      600

gattccatta acaaaggatt cgttcagggg aataaggtaa acatctcctt gcttatcgac      660

atgatgaagg gctacgaggc tgatgatata ataagattgt actatgactt tattgtcctc      720

aagtctcaaa agaatctggg tttcagtata aaaaaattgc gggagaagat gctcgacgag      780

tatggattta ggtttaagga caagcagtat gatagcgttc gctctaagat gtataaactt      840

atggactttc ttctgttctg taactactat cggaacgacg tagtcgcagg ggaggcactg      900

gttaggaaac tgaggtttag catgaccgac gacgagaaag aaggtattta tgcggacgaa      960

gcggagaagc tttggggaaa gtttaggaat gactttgaga acatcgccga tcacatgaac     1020

ggtgatgtga taaaggagct cgggaaggcg gatatggact ttgacgagaa aatactggat     1080

tctgaaaaga agaatgcaag tgacctcctt tacttcagca aaatgatcta catgttgacg     1140

tattttttgg atggtaaaga gatcaacgat ctgcttacaa cgcttatttc taaatttgat     1200

aacataaagg agtttttgaa gatcatgaaa tcctccgccg tggatgtaga gtgtgagctg     1260

accgcgggct ataaactgtt taacgattct caacggataa cgaacgagct cttcatagtg     1320

aagaacatcg cttccatgcg caagccggcg gcttcagcca aattgactat gttccgcgat     1380

gcgctgacaa tactcgggat tgacgataaa attacggacg accgaatatc agaaattctt     1440

aaattgaagg aaaagggcaa gggcatccat ggcctgcgga acttcatcac gaacaacgtt     1500

atcgagtcta gtcggtttgt ttatcttata aaatacgcga atgcgcagaa aattcgggag     1560

gtcgcaaaaa atgaaaaggt ggtaatgttt gtgctcgggg ggattcctga cacacagatt     1620

gagcggtact ataaaagttg cgttgagttc cctgacatga attcttcact cgaagccaag     1680

tgcagtgagc tggcacggat gatcaagaat atctccttcg atgattttaa gaacgtaaaa     1740

caacaagcta aaggacgcga aaatgtggcg aaagagaggg ccaaggcagt catcggtctc     1800

taccttacag ttatgtacct ccttgtgaaa aaccttgtaa acgtcaatgc tcggtatgta     1860

atagcaatcc actgtttgga gagagatttc ggcctctata aggagatcat cccggagctc     1920

gcttcaaaaa acttgaaaaa tgattatcgc attctttctc aaactctttg tgaactttgt     1980

gatgacaggg acgagagtcc taacctgttc ttgaagaaga acaaaagact gcggaaatgt     2040

gtggaggtcg atataaacaa tgcggattct agcatgaccc ggaaataccg gaattgcatt     2100

gcacacctta cagtggtacg cgagctcaag gaatacatcg gtgatatacg caccgtcgac     2160

tcctactttt ctatctacca ctatgttatg caacggtgta tcaccaaaag ggaggatgat     2220

actaagcaag aagaaaaaat caagtatgaa gatgacctgc ttaagaacca tggatacacg     2280

aaagattttg tgaaagccct taatagtcca ttcgggtaca atattccgcg attcaaaaac     2340

ctttccatcg aacaactctt cgatcgaaat gagtacctta ccgagaaa                  2388


<210>  19
<211>  2862
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Eubacterium siraeum, modified 
       for expression in Zea mays cells

<400>  19
atgggtaaaa agatacatgc acgggacttg cgcgagcaga ggaagacgga ccggaccgaa       60

aaattcgcag accaaaacaa aaaaagagag gctgaacggg cagtccccaa gaaagatgca      120

gccgtgtcag tcaaatcggt ctcaagcgtc tcatccaaaa aggacaatgt taccaaatct      180

atggcgaaag ccgccggagt caagtctgtt ttcgctgttg gcaatacggt ctacatgaca      240

tccttcgggc gcgggaatga tgcggttctt gaacagaaaa ttgttgatac ttcacacgaa      300

ccactcaaca ttgatgaccc agcttatcaa ctcaatgtgg ttacgatgaa tgggtattca      360

gtgaccgggc ataggggaga aacggtctcg gcagtcacag acaatccctt gagaagattc      420

aacggcagaa aaaaggacga gccggaacaa tcagtgccga ctgacatgtt gtgtctcaaa      480

ccaaccctgg aaaagaaatt ttttggcaaa gagttcgacg acaatatcca cattcagttg      540

atatataaca tcctggatat tgagaaaatt ttggccgtct actcgaccaa cgccatatac      600

gctctcaaca acatgtcagc agatgagaac attgagaact cagacttttt tatgaaacgc      660

accacggatg agaccttcga tgacttcgag aaaaagaaag agtccacgaa cagcagagag      720

aaagctgatt tcgatgcgtt cgaaaagttc atcggcaact acaggctggc gtatttcgca      780

gatgcatttt atgtcaacaa gaaaaatccc aagggtaagg ccaaaaatgt cctccgcgaa      840

gacaaggaac tctactcagt gctcacattg atcggaaagt tgcggcattg gtgcgttcat      900

tccgaggagg gtcgggcaga gttctggctt tataaactgg acgaattgaa ggacgatttt      960

aagaacgtgc ttgatgtcgt ctacaataga ccagtcgaag aaattaataa ccgctttatt     1020

gaaaacaata aggtcaacat acaaatcttg ggatcggtct ataaaaacac cgacatcgca     1080

gagctggtca gaagctacta cgagtttctg ataactaaaa agtacaagaa catgggcttc     1140

tcaataaaaa aactgcgcga atcaatgctt gaaggtaagg gatatgcgga taaagaatac     1200

gattctgtta gaaacaagct ctaccagatg actgacttca ttctctatac cggttatata     1260

aacgaagata gcgacagggc tgatgacctg gtcaacacac tgcggagctc cctgaaagag     1320

gacgataaga ccacagtgta ctgtaaggag gccgattacc tgtggaagaa ataccgcgag     1380

tctattaggg aggtcgcgga cgccctggac ggtgacaata ttaaaaaact ctctaaaagc     1440

aatatcgaga tacaagaaga caaactgcgc aagtgtttta tatcttatgc ggattcagtc     1500

tcggagttca cgaaactgat atatctcctg acacgctttc tgagcgggaa ggagattaat     1560

gacttggtga caactttgat taacaagttc gacaacataa ggagctttct tgaaatcatg     1620

gatgagctgg gcctcgatag aacgttcacc gcggagtact cgttcttcga gggttcaaca     1680

aaatatcttg cggaactcgt tgaattgaat tcgttcgtga aaagctgttc ttttgatata     1740

aatgccaaaa gaacaatgta ccgggacgcg cttgatatcc tgggcataga atcggataaa     1800

accgaggaag atatcgaaaa gatgatagac aatatcctgc aaatcgacgc aaatggtgac     1860

aagaagctta aaaagaataa cggcttgcgc aattttatcg cttcgaatgt catcgattcg     1920

aacaggttca aatatctggt tcggtacggt aacccgaaga agattagaga aacagctaag     1980

tgtaagccag cggtcagatt tgtcttgaac gaaataccgg atgcgcagat cgaaagatat     2040

tacgaagcct gctgccctaa gaacaccgca ttgtgtagcg cgaataagcg gcgggagaaa     2100

ctcgctgata tgatagcgga gattaaattc gaaaatttct cggacgcggg caactaccaa     2160

aaagctaacg ttacttcccg cacttcggag gcggagatta aacggaagaa tcaagcgata     2220

attagacttt atctgaccgt catgtacatt atgcttaaga atctcgtcaa cgttaatgct     2280

agatatgtca tcgcctttca ctgcgtggaa cgcgatacta aactgtatgc cgaatcgggt     2340

cttgaagtcg ggaacataga aaaaaataag accaacctta ctatggccgt gatgggtgtc     2400

aaactggaga acggcattat caaaactgaa tttgataaaa gcttcgccga aaacgcagcg     2460

aatcgctatc tgcggaacgc aagatggtat aagcttatac tcgataatct taagaagtcg     2520

gaaagggccg tggtcaacga gttccggaat accgtttgcc acttgaacgc gatccggaat     2580

attaacatca atatcaaaga aattaaagaa gtcgaaaact actttgcgct ctatcattac     2640

ttgatacaga agcatctcga gaatcgcttc gccgataaaa aggtggagag ggacacaggt     2700

gactttattt ccaagctcga agagcataaa acctattgca aggattttgt taaagcatat     2760

tgtacgccat tcggttataa tcttgttagg tacaagaatc tgacaatcga cggcttgttc     2820

gataaaaatt atccgggcaa ggacgatagc gatgagcaga ag                        2862


<210>  20
<211>  2757
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834971, modified for expression in Zea mays cells

<400>  20
atggccaaga agaataaaat gaagccacgc gagctgaggg aggctcaaaa aaaagcccgg       60

cagcttaagg ctgcggagat caataataat gctgcccccg ctatcgcagc aatgcccgcc      120

gcagaggtca ttgcgccggc cgccgaaaag aaaaaaagct cagtgaaggc tgcaggaatg      180

aagtcaattt tggttagcga gaataagatg tatattacct cgtttggcaa gggaaacagc      240

gccgtgctgg aatacgaagt tgataacaat gactataacc agacacagct ttcatcgaag      300

gataattcca acatccaatt ggggggcgtg aacgaagtta atataacgtt ttcttcaaaa      360

catggtttcg aatctggagt cgaaataaat acgtctaatc cgactcatag gtccggtgag      420

tccagccctg tccgggggga catgctcggt ctcaagtccg aactcgaaaa acggtttttc      480

ggtaagactt tcgatgataa tattcatatt cagcttatat acaatatctt ggatatagag      540

aaaattctgg cggtgtatgt cacaaatata gtgtatgctc tgaataatat gctcggtgtg      600

aaaggttcgg agagccatga tgatttcatc ggatatcttt ctacaaataa catctacgat      660

gtgtttatag acccggataa ctcttctctg agcgatgaca aaaaagccaa tgtgagaaag      720

agcctttcga agtttaacgc cctgctcaaa acaaaacgct tgggctattt tggattggaa      780

gaaccgaaga caaaagacaa tcgggtttcg caggcctaca aaaagcgcgt gtatcacatg      840

cttgcaatcg tcgggcaaat caggcaatgt gtctttcacg acaaaagcgg ggcaaaacgc      900

ttcgacctgt actcttttat taataacata gatccggaat atagggatac acttgattac      960

ctggtcgaag aacgccttaa atccataaac aaagacttta tagaagacaa taaagtgaat     1020

atttctttgc tgatcgacat gatgaagggc tacgaagcgg acgacataat aaggttgtat     1080

tatgacttta tcgttcttaa gtcccagaaa aatctggggt tttcaattaa aaagcttagg     1140

gaaaaaatgt tggatgagta tggtttccgg ttcaaagata agcaatacga ttcagtcaga     1200

tccaaaatgt acaagctcat ggactttctt ctgttctgta attactaccg caatgacata     1260

gcagctggtg aaagcctcgt gaggaagttg agattttcca tgaccgacga tgagaaagag     1320

ggtatttatg cagatgaggc agccaagctc tggggaaagt ttagaaatga cttcgagaat     1380

atcgccgacc atatgaacgg ggatgtcatc aaagagctgg gaaaggcgga tatggacttc     1440

gacgagaaaa tactggattc tgaaaaaaaa aatgcgagcg acctccttta cttctccaag     1500

atgatctata tgcttactta tttcctcgat ggaaaggaga taaacgacct gctgactaca     1560

cttatatcga aattcgacaa tatcaaagaa ttcctcaaaa taatgaagtc ttcagcggtt     1620

gatgtggagt gcgaattgac cgctggttac aagctgttta acgattcgca gcggatcacc     1680

aatgaattgt ttattgtcaa aaatatcgcc tctatgagaa aacctgctgc atctgcgaag     1740

ctcaccatgt tcagggatgc actcaccata ttgggcattg acgataagat caccgatgac     1800

aggatttctg gtatattgaa gcttaaggaa aagggtaagg gaatacatgg tctcagaaac     1860

tttatcacta acaacgtcat cgaatcctcg cgctttgtct acctgataaa atatgctaac     1920

gctcagaaga tccgggaggt tgcgaagaat gaaaaagtcg tcatgttcgt tttggggggg     1980

attcccgata cgcaaattga gaggtattat aagtcgtgtg tcgaatttcc tgacatgaac     2040

tcatcacttg gcgtcaaacg ctccgaattg gcacggatga tcaaaaacat ttcattcgac     2100

gacttcaaaa acgtcaaaca gcaagctaag ggccgcgaga acgttgcaaa ggaaagggca     2160

aaggcagtca taggacttta ccttactgtt atgtacctgc tcgttaagaa cctggtcaat     2220

gtcaacgcgc ggtatgtcat tgccattcat tgcttggaac gggacttcgg actttacaaa     2280

gagattatcc ctgaactggc gtcgaagaac ttgaaaaacg actaccggat tctgagccag     2340

acgctctgtg aactttgcga caagagccct aacctttttc ttaaaaaaaa cgagcggctt     2400

aggaaatgtg tggaggtgga tattaacaac gctgatagct cgatgactcg gaagtaccgg     2460

aattgtattg cgcacctgac agtcgttcgg gaactgaagg aatacatagg tgatatatgc     2520

acggttgact catacttttc catatatcat tacgttatgc aaagatgcat aacgaaaaga     2580

gagaacgata ctaaacagga ggaaaagata aagtatgaag atgacttgct taaaaatcac     2640

ggctacacta aagactttgt taaagcactc aatagccctt ttggctacaa catacctaga     2700

ttcaaaaatc tgtcaattga gcagcttttt gacagaaacg aatatctgac agaaaag        2757


<210>  21
<211>  2754
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus bicirculans, 
       modified for expression in Zea mays cells

<400>  21
atggcaaaaa agaataagat gaagccgcgg gagcttcgcg aggcccagaa aaaggcgcgg       60

cagcttaaag cggctgaaat taataataat gctgtcccag cgatagccgc aatgcctgcg      120

gctgaagcgg cggctcccgc ggccgagaag aaaaaatcat ctgttaaagc cgccgggatg      180

aaaagcatcc tcgtgtcgga gaataagatg tacattacgt cgttcggtaa ggggaattcg      240

gcggtccttg aatacgaagt tgataacaat gattataaca aaactcagct ttccagcaaa      300

gacaattcga atattgagct ctgtgacgtc gggaaagtga atataacgtt ttcttcccgg      360

aggggtttcg agagcggtgt ggaaatcaat acaagcaatc caactcatcg gtcgggcgag      420

tcctcctctg tgcggggcga catgttgggg cttaagtcgg aacttgaaaa gcggtttttt      480

ggaaaaaatt tcgacgacaa tatacacatc caacttatct acaacatact ggacatagag      540

aagattttgg cagtgtatgt gaccaatata gtctacgccc tcaacaacat gctgggtgag      600

ggcgacgaat caaattacga ctttatgggt tatctgtcaa cttttaacac atataaggtc      660

tttacaaacc cgaatgggtc tacattgtcc gacgataaga aagaaaatat aaggaagtcc      720

ctttctaaat tcaacgcgct ccttaaaaca aagagattgg gctacttcgg ccttgaagag      780

cccaagacaa aggacactcg ggcctcagaa gcttataaga agagagtcta ccacatgctc      840

gccatagtgg gccaaattag gcagtgcgtc ttccacgaca agtctggtgc aaagagattt      900

gatctgtact cattcattaa taatatcgat ccagagtacc gcgagacatt ggattatctt      960

gtcgacgaaa ggttcgattc tatcaataag ggttttatcc aaggtaataa agtcaacatc     1020

tccctcctga ttgacatgat gaaaggctat gaagccgatg acatcattag gctgtactac     1080

gactttatag ttctcaaatc acagaaaaac ctggggttct ctattaagaa gcttagagag     1140

aaaatgttgg acgaatacgg tttccgcttc aaagataagc aatacgactc agtgaggtct     1200

aaaatgtaca aactcatgga ttttcttctg ttctgtaact actatcggaa tgatatcgca     1260

gccggtgaat ctctcgtcag aaaactcagg ttttcgatga cggacgacga gaaagaaggg     1320

atatacgcgg acgaagccgc taagttgtgg ggaaaatttc gcaacgattt tgaaaatata     1380

gctgatcaca tgaatgggga cgttataaaa gagcttggaa aagccgacat ggattttgac     1440

gagaagatat tggactctga gaagaagaat gcgtcagact tgctttattt ttcaaaaatg     1500

atatatatgc tcacgtactt cttggacggg aaggagataa acgatctgtt gacgacgctg     1560

attagcaaat tcgacaatat caaagagttc ctgaaaataa tgaagagctc agctgtcgat     1620

gtcgagtgtg aactgacggc tggctacaaa ttgtttaacg attcgcaacg cattacgaat     1680

gagctgttta tagtgaaaaa cattgcatct atgcgcaaac cagctgccag cgctaagctt     1740

acaatgtttc gggacgctct gacgattttg ggcatcgacg ataaaattac tgacgatagg     1800

atcagcgaga tactgaaatt gaaagagaaa gggaaaggga ttcacggcct cagaaacttt     1860

attactaata atgtcatcga atcgtcaagg tttgtgtact tgattaaata tgcaaatgca     1920

caaaagattc gggaagtcgc taaaaatgaa aaggttgtta tgtttgtcct cggggggata     1980

cccgataccc aaattgagcg gtattacaag agctgcgtgg agtttccaga catgaactcg     2040

tctctggggg tgaaacggtc cgaactcgct cgcatgatta aaaacatatc cttcgacgac     2100

tttaagaacg tgaagcaaca agctaagggg cgcgagaacg tcgcgaaaga aagggccaaa     2160

gcggttatcg gtctgtacct tacggtcatg tacttgttgg tgaaaaacct tgtgaatgtg     2220

aacgctcggt acgtgatcgc gatccactgt ctggagcgcg attttgggct gtataaagag     2280

atcatcccgg agctggcttc caaaaacctg aaaaatgact accgcatact gtcccagaca     2340

ctttgcgagt tgtgcgacaa gagcccgaat ctgtttctga aaaaaaacga gcgcctgcgg     2400

aagtgcgttg aggttgatat aaacaacgcc gactcctcaa tgacgagaaa gtacagaaat     2460

tgcatagctc atttgaccgt cgtcagggag ctcaaagaat acatagggga catttgcact     2520

gtggactcgt atttttccat ctaccactac gtgatgcaaa ggtgtatcac taagcgggaa     2580

aacgatacca aacaagagga gaagatcaag tacgaggatg accttttgaa aaatcacggt     2640

tatacgaagg acttcgtgaa ggcattgaac tctccgttcg gttataatat ccctaggttc     2700

aagaatttgt ccatagaaca gctcttcgat cgcaatgagt atcttacaga aaaa           2754


<210>  22
<211>  2766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5608892, modified for expression in Zea mays cells

<400>  22
atggcaaaga agaacaaaat gaagccacgc gaactgagag aagctcaaaa gaaggcgaga       60

cagcttaaag ctgcggagat caataataac gcagctccgg ccattgccgc aatgcccgcc      120

gctgaagtga tagctccagt tgcggagaag aagaaatctt cagttaaagc agctggaatg      180

aaatccattc tcgtctcgga gaataaaatg tatattacgt ccttcggaaa aggaaattcc      240

gcggttctcg agtatgaggt ggacaacaac gactacaaca agactcaact gtcgagcaaa      300

gacaactcaa atattgaact cggggacgtt aacgaagtca atataacatt ttcctcaaag      360

catggattcg gcagcggtgt cgaaattaat acttcaaatc cgacacatag gtctggagaa      420

tcgtcgcctg tcaggggcga tatgcttggt ttgaagtccg aactggagaa gcggttcttt      480

gggaagactt ttgacgataa cattcatata caactgatct acaacatact ggatatcgag      540

aaaatcctcg cagtgtatgt cactaatatt gtttacgcct tgaacaacat gctgggcatt      600

aaagactctg aatcatatga tgacttcatg gggtatctca gcgccaggaa cacatatgaa      660

gtgtttacgc acccggacaa gtctaatctg tctgataagg tcaagggtaa tattaagaag      720

tcactcagca agttcaacga cttgcttaag acgaagcgcc tcggctactt tgggcttgag      780

gaaccaaaaa cgaaggacac cagagcctct gaggcttata agaaaagagt gtatcatatg      840

ctcgcgatag tcggtcaaat tagacagtgt gttttccacg ataaatctgg agcaaagagg      900

ttcgaccttt actcatttat aaacaatatc gaccctgaat atagagacac gctggattac      960

cttgtggagg agcggctgaa gtcgattaat aaggacttta tagaaggcaa taaagtcaat     1020

atctctctcc tcatagacat gatgaaaggt tatgaagccg acgacataat aaggctttat     1080

tacgatttta tcgttcttaa gtcacagaaa aatttgggtt tttcgatcaa aaaacttcgg     1140

gaaaagatgt tggaagaata cgggttcaga ttcaaagaca agcagtacga tagcgtgagg     1200

tcaaaaatgt acaagctgat ggacttcctg ctgttttgca attactacag aaatgatgtc     1260

gccgccgggg aggcgttggt tcgcaagctt cgcttttcaa tgacagatga tgaaaaagag     1320

gggatttatg cggatgaggc cgccaagctc tggggcaaat ttaggaatga ttttgaaaac     1380

attgctgatc atatgaatgg cgatgtgatt aaggaactgg gcaaagcaga catggatttt     1440

gatgaaaaga tcctcgactc agaaaagaag aatgccagcg atttgttgta tttctcaaag     1500

atgatctaca tgctgacgta ttttttggac ggtaaagaga taaacgatct gctcacgacg     1560

ttgatttcta aattcgacaa tattaaggag tttcttaaga ttatgaagtc ttcggcagtt     1620

gacgttgaat gcgaactgac tgctggctac aaactcttca acgactcaca acgcatcacc     1680

aatgaacttt ttatcgttaa aaatatagcc agcatgcgga agccggcagc ttctgccaag     1740

ctcaccatgt ttcgcgatgc tttgaccatc ttgggcattg atgacaatat tacagatgat     1800

cggatatctg agatactcaa acttaaggag aaaggcaagg gcatacatgg ccttcggaat     1860

ttcattacta ataacgtgat agaaagcagc cgctttgttt acctcattaa atacgcaaat     1920

gcccaaaaaa taagggaagt tgctaaaaac gaaaaagtgg tgatgttcgt gcttggagga     1980

atacctgaca cacaaatcga gcgctattac aagtcgtgtg tcgaattccc cgatatgaat     2040

tcttccttgg aggctaaacg gtcagagctc gccagaatga tcaagaacat ttcctttgat     2100

gacttcaaaa atgtgaaaca gcaagctaag ggtcgcgaaa acgtcgctaa agagagggcc     2160

aaggctgtta tcggcctcta tcttacggtg atgtatttgt tggtgaagaa cctcgttaat     2220

gtcaacgcca ggtatgttat agcaatacat tgcctcgaac gggattttgg tctttacaaa     2280

gagattatcc cagaattggc gtccaagaac ctcaagaacg actatcgcat attgtctcag     2340

acgctttgtg aattgtgcga tgaccgcaat gagtcttcca acttgttctt gaaaaagaat     2400

aagcggttgc gcaagtgcgt tgaagtggac ataaataacg ccgactcttc aatgactcgc     2460

aagtacagaa attgtatagc gcacctcact gtcgtgcggg aattgaaaga atacatcgga     2520

gacataagga ccgtcgatag ctattttagc atttaccact atgtcatgca aaggtgtata     2580

actaaacgcg gtgatgatac caaacaggaa gaaaagatca aatacgaaga cgatctgctc     2640

aagaatcatg gctacaccaa agatttcgtt aaagcattga atagcccttt cgggtataat     2700

attcccagat ttaaaaacct cagcattgaa caactgttcg accgcaacga atacctcacg     2760

gaaaag                                                                2766


<210>  23
<211>  2766
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp. CAG:57, 
       modified for expression in Zea mays cells

<400>  23
atggcgaaga agaacaaaat gaaaccacgc gaactcagag aggcacaaaa gaaagcccgg       60

cagttgaagg ccgccgagat aaacaacaac gcggcaccgg caattgcggc aatgccagct      120

gcggaggtca tcgctcccgt cgccgagaag aagaagagct cggtcaaggc agccgggatg      180

aaatctattc tggtgtcaga gaataagatg tacattacgt ctttcggcaa gggaaatagc      240

gcagtcttgg agtatgaagt tgacaacaac gactataaca aaacacaact ttctagcaaa      300

gacaactcga atatagaatt gggagatgtc aatgaggtca acataacctt tagctccaag      360

catggctttg gctcgggtgt ggaaattaac acgtccaatc ctacccatcg gtcgggcgag      420

tcgtcgccag ttagggggga catgctgggt ctcaagagcg agttggagaa aagatttttc      480

ggtaagacct tcgatgataa cattcatatc caacttatct ataacatctt ggacatagaa      540

aaaatacttg cagtgtacgt cactaatatc gtttatgcct tgaataatat gttgggaatt      600

aaggactctg aatcctatga cgattttatg ggctatctga gcgctcggaa tacctacgaa      660

gtgtttactc atccagataa aagcaacctt agcgataagg tcaagggcaa cataaaaaag      720

tccctgtcaa agtttaacga tcttctcaaa accaaacggc tgggctactt tggactcgag      780

gagcctaaga cgaaagacac gcgggcatct gaggcataca agaaaagggt ttatcatatg      840

ctggcaatag tcggtcaaat caggcagtgc gtctttcacg acaagagcgg agcgaagcgg      900

tttgaccttt attctttcat caataacatc gatccggaat accgcgacac attggattac      960

ctggtcgagg aaaggttgaa gtccataaac aaggacttca tcgagggaaa caaggttaac     1020

atttcacttc tgattgacat gatgaaaggc tacgaggctg acgatatcat aagactttat     1080

tatgacttta tcgtgctgaa atcgcagaaa aatttgggat tttctatcaa aaagctcaga     1140

gagaagatgc ttgaggagta tggatttaga tttaaggaca agcagtacga ttctgtgcgc     1200

tctaaaatgt acaagctcat ggattttctc ctcttttgca attactacag gaacgatgtt     1260

gccgcaggcg aggctcttgt ccggaagctc cgcttctcca tgacggacga cgaaaaggaa     1320

ggcatatacg cggatgaggc agcgaaattg tggggtaagt tcaggaatga ttttgaaaat     1380

atagctgatc acatgaacgg tgacgtcatc aaggagctgg ggaaagccga tatggatttt     1440

gatgagaaaa tcctggattc ggaaaagaaa aatgcgagcg acttgctcta ctttagcaaa     1500

atgatttata tgttgaccta tttcctcgat ggcaaagaga tcaacgattt gcttacgact     1560

ctgataagca aattcgataa tataaaagag tttttgaaaa taatgaagtc ctcagcggtt     1620

gatgttgaat gcgaactgac agccggctat aagcttttca atgattcaca gaggattacc     1680

aacgaacttt ttatagtgaa aaacatcgcc tcaatgagga aacccgccgc gagcgcgaag     1740

ttgacaatgt ttagggacgc tctgacgatt ttgggaatcg acgataatat cactgacgac     1800

aggatttcgg agatcctcaa attgaaagag aagggcaaag ggatccacgg gttgagaaat     1860

tttataacca ataacgttat agaatcatcg aggtttgtgt atctgatcaa atacgcgaat     1920

gctcaaaaga tcagggaagt ggcaaaggac gagaaggttg tcatgttcgt cctgggtggg     1980

atccctgaca cccagataga aagatactat aagtcctgcg tggaattccc tgatatgaat     2040

tcttccctcg aggctaaaag atctgagttg gcacggatga tcaagaatat ttcgtttgac     2100

gatttcaaaa acgtgaagca acaagctaaa gggcgggaaa acgttgccaa ggaacgggct     2160

aaagctgtca ttggccttta cctcactgtg atgtatttgc tcgttaagaa tctcgtgaac     2220

gttaacgcaa gatacgtgat cgctatccac tgcttggagc gcgatttcgg actgtacaag     2280

gagattatac cagagcttgc ttccaagaat cttaagaatg actatcgcat attgtcccaa     2340

actctttgcg agttgtgcga cgatcggaac gagtcttcca atctgttcct taagaaaaat     2400

aaaaggctgc ggaaatgcgt cgaagtcgac attaacaatg cggattcttc tatgacgaga     2460

aagtaccgca actgcatcgc ccatctcacg gttgtcaggg agctcaagga atacatagga     2520

gacattagaa cggtggactc atatttttca atataccatt atgttatgca aaggtgtatt     2580

acaaaacggg gggatgacac aaaacaagag gaaaagatta aatatgaaga cgatttgctt     2640

aagaaccatg gttacacgaa agatttcgtt aaagcgctta attcgccatt tggttataat     2700

attccgagat tcaaaaattt gagcatagag cagcttttcg atagaaatga atacttgacc     2760

gagaag                                                                2766


<210>  24
<211>  2799
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus flavefaciens FD-1,
       modified for expression in Zea mays cells

<400>  24
atgaaaaaaa agatgagctt gcgggaaaaa agagaggcag aaaagcaggc caagaaagct       60

gcatacagcg ctgcgtctaa gaacactgat tccaaaccag cggagaaaaa agcggagact      120

ccaaaacctg ccgaaattat atctgataac tcacgcaata agacggcggt caaggcagcg      180

ggactcaagt cgacgatcat atcaggcgat aaattgtata tgaccagctt tggcaagggc      240

aatgcagctg tgatagaaca aaagatagac atcaatgact attcttttag cgcaatgaag      300

gacaccccaa gccttgaagt cgacaaggca gaatctaagg aaatatcctt ttcgtcccat      360

catccctttg tgaagaacga caagttgacg acatataatc ctctttacgg tgggaaggat      420

aacccagaga agccggttgg gcgcgatatg ttggggttga aagataaact tgaggaacgg      480

tactttggtt gtacattcaa tgacaacctc cacattcaga tcatttacaa tattttggat      540

attgagaaga tcctcgctgt tcattccgca aatattacga cagctcttga tcatatggtg      600

gatgaggacg atgagaaata ccttaactct gactatatcg gctacatgaa cacgatcaac      660

acctacgacg tcttcatgga tccctctaag aattcctctt tgtcgccaaa agacaggaaa      720

aacatcgaca attcgagggc gaagtttgag aagctcctct ctacaaaaag gttggggtac      780

tttgggttcg actatgacgc gaacgggaaa gacaaaaaga agaatgagga aattaaaaag      840

cggctttacc acttgacggc atttgcaggc cagctgaggc agtggtcctt ccactcagca      900

ggaaactatc ccagaacctg gttgtataaa ttggactccc tggataaaga gtatctggac      960

acgctcgacc actatttcga taagaggttt aatgatataa atgacgattt tgtcactaaa     1020

aacgcaacga acctgtatat actgaaggag gttttccctg aggctaactt taaagatatt     1080

gcggacttgt attatgactt tattgtcatc aagtcacaca agaacatggg attctcgatc     1140

aagaaacttc gggaaaaaat gctcgagtgc gatggagctg accgcatcaa agaacaggat     1200

atggattctg tccgctccaa gctctacaag ctcattgatt tttgcatatt caagtattac     1260

catgagttcc cagagctcag cgagaagaac gtcgacatcc tgagggctgc cgtgagcgat     1320

actaagaagg acaatctcta ctcagatgaa gctgctcggt tgtggtcaat tttcaaggaa     1380

aaatttctcg gattttgtga caaaattgtt gtctgggtga ccggagagca tgagaaagat     1440

atcacgtctg tcattgataa agacgcctac aggaacagaa gcaatgtctc gtatttttca     1500

aagctcatgt acgcaatgtg tttttttctt gatgggaagg agataaacga ccttctgact     1560

accttgatta acaagtttga caatatcgcc aaccagatta agacagcaaa ggaattgggg     1620

atcaacacgg cgttcgttaa aaactatgac ttcttcaacc attctgagaa atatgtcgac     1680

gaattgaaca tagtgaaaaa tatcgctcgg atgaaaaaac cctcttcaaa cgcgaaaaaa     1740

gctatgtacc atgacgccct tactattctt ggcattcctg aagatatgga cgaaaaggct     1800

ttggatgaag aactcgacct tatactcgaa aaaaagaccg atcccgtcac aggtaaaccg     1860

ctgaagggta agaatccttt gcgcaatttt atagctaaca acgttataga gaactctcgg     1920

ttcatctacc ttataaaatt ctgtaatccg gaaaacgtga gaaaaattgt gaataacact     1980

aaggtgacag agttcgtgct gaaacgcata ccagatgccc aaattgagag gtattacaaa     2040

tcttgtacgg atagcgagat gaaccctccg actgaaaaaa aaattaccga gttggctggt     2100

aaacttaaag acatgaactt cggcaacttc cggaatgtcc ggcagtctgc aaaagagaat     2160

atggagaaag agaggtttaa agccgtcatt ggactgtacc ttaccgttgt gtacagggtg     2220

gttaagaatc tcgtcgacgt gaactcaaga tacattatgg cattccattc actcgagaga     2280

gactcccaat tgtataacgt ctcagtcgac aacgattatc tggcactgac cgatacactg     2340

gtcaaagagg gtgacaactc acgctcacgg tacttggccg ggaataaaag attgcgggat     2400

tgtgtcaaac aggatattga taacgcaaaa aagtggtttg ttagcgataa atataattcc     2460

ataaccaagt ataggaacaa tgtggcgcac ctgaccgccg ttcggaactg tgccgaattt     2520

ataggcgaca taacgaagat tgactcctac ttcgccctct accactacct tatccagcgg     2580

caactcgcca aaggtctcga tcatgagagg tcaggttttg accgcaatta tccacagtac     2640

gcaccactgt tcaagtggca tacttatgtg aaagatgttg tgaaagcgct gaatgcacct     2700

ttcggttata atattccaag gttcaagaat ctttccattg acgcactctt cgaccggaat     2760

gagatcaaga agaatgatgg agaaaagaaa tctgacgac                            2799


<210>  25
<211>  2832
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus albus strain 
       KH2T6, modified for expression in Zea mays cells

<400>  25
atggccaaaa aatctaaagg catgtccctg agggaaaaac gcgagctgga gaagcaaaag       60

cggatccaga aagctgcagt gaactctgtc aacgacactc ccgaaaagac cgaggaagca      120

aacgttgttt ctgtcaatgt gagaacgtct gcggaaaaca agcacagcaa gaagagcgct      180

gctaaagctc ttggacttaa atcggggttg gttattgggg acgaattgta cctcacatca      240

tttggcagag gaaatgaggc gaaactcgaa aagaaaataa gcggggatac cgtggaaaaa      300

ttgggcattg gtgctttcga agtggcggaa agggatgagt ctacactcac acttgaatct      360

gggcgcatta aagataaaac tgccagaccg aaagatccca gacatattac agtggacaca      420

caagggaagt ttaaggaaga tatgctcgga atacgctctg tgcttgagaa aaagatattt      480

ggtaagacct tcgatgacaa catccatgtc caacttgcgt acaatatcct cgatgtcgag      540

aagatcatgg cacagtacgt ctctgacatt gtttacatgc tccacaacac cgataagacg      600

gaacgcaatg acaacctgat ggggtatatg tccatcagga atacttacaa aaccttttgt      660

gatacttcca accttccgga cgatacaaaa caaaaggtcg agaatcaaaa acgggaattc      720

gacaagataa ttaagtctgg gcgcttggga tactttggcg aggcatttat ggtcaactcc      780

ggcaactcta caaaattgcg gcctgagaaa gaaatctatc atattttcgc tctcatggcc      840

tcacttaggc agtcctactt ccacgggtat gtgaaggaca cggactacca aggaacaacg      900

tgggcgtaca cattggagga caagttgaag ggcccgtcac acgagttcag agaaacaatt      960

gataagatat ttgatgaagg attctctaag atatcaaaag acttcgggaa aatgaacaaa     1020

gttaatctgc aaattctgga gcagatgata ggcgagctgt acggttctat tgagcgccag     1080

aatctcacat gtgattacta cgacttcatc caattgaaga aacataagta cttggggttc     1140

tctataaagc ggttgagaga aacgatgttg gaaacgacac cggcggaatg ttacaaggca     1200

gaatgctaca atagcgagcg gcagaagctt tacaaactta tagattttct gatctatgat     1260

ttgtactata accgcaagcc ggcgcggatc gaggaaattg tcgataagct tagggagtct     1320

gtgaacgatg aggagaaaga atcgatttat agcgtcgaag ctaagtatgt ctatgagtcc     1380

ctctccaaag tgctggataa gtccctcaag aactccgttt ccggggagac catcaaagat     1440

ctccagaaaa ggtatgatga cgaaactgct aatagaatat gggacatctc gcaacactcg     1500

atttctggga acgtcaactg tttctgcaaa ttgatctaca taatgaccct catgctggac     1560

gggaaagaaa ttaacgacct ccttacaacg ctcgtgaaca aattcgataa tattgcttca     1620

ttcattgatg ttatggacga attgggtttg gaacactcat ttactgataa ttataaaatg     1680

tttgcagatt caaaggctat ctgccttgat cttcaattta ttaattcgtt tgcacggatg     1740

agcaaaatcg acgatgaaaa atctaagcgc caattgttta gggacgctct ggttatcctc     1800

gacataggca ataaggacga gacctggata aataactact tggactccga tattttcaaa     1860

ttggataaag agggaaataa gttgaagggc gcaaggcatg actttcggaa ctttattgct     1920

aacaacgtga ttaagtcgtc acggtttaaa taccttgtta aatactcgtc agcagatggt     1980

atgataaaac tgaaaactaa cgaaaagctt ataggctttg tcctggacaa gctccctgag     2040

acacagatag atagatacta cgaatcgtgt ggacttgata atgctgttgt cgacaaaaaa     2100

gtcaggatcg agaagctgtc agggcttata cgcgacatga aatttgatga tttctccggt     2160

gtcaaaacat caaataaggc gggcgataac gataagcaag acaaagcaaa gtatcaggca     2220

attatcagct tgtaccttat ggttctgtac caaattgtga aaaacatgat ctatgtcaat     2280

tcacggtacg tgatcgcgtt ccattgcctt gagagggatt tcggcatgta cggaaaagac     2340

ttcgggaaat attaccaggg atgtagaaaa ttgactgacc atttcataga agagaaatat     2400

atgaaggaag ggaaacttgg ttgcaataag aaggtgggaa ggtatctcaa aaataatatt     2460

tcatgctgta cggatggtct gatcaatacc tataggaacc aagtggacca tttcgctgtt     2520

gttcggaaga tagggaatta tgcagcatat atcaaatcta tcggctcatg gtttgaactg     2580

tatcactacg tcattcagag gatcgtgttt gatgagtaca gatttgcact gaataatacg     2640

gagagcaact acaagaattc aatcattaag caccatactt attgcaaaga catggtgaag     2700

gctctcaata cgccttttgg gtatgacctc cccagatata agaatctctc catcggggat     2760

cttttcgata gaaacaatta tcttaataag acgaaggaat cgatagatgc taattccagc     2820

attgactcac ag                                                         2832


<210>  26
<211>  2901
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus flavefaciens 
       strain XPD3002, modified for expression in Zea mays cells

<400>  26
atgatcgaga agaaaaagtc tttcgcaaaa ggaatgggag tcaagtctac attggtttct       60

ggttcgaagg tttatatgac gacgttcgcc gagggctctg acgcgcgctt ggagaagata      120

gtggaggggg attcaatacg gtctgtgaac gaaggcgaag ctttttcggc cgagatggcg      180

gacaagaatg cagggtataa aattgggaat gcaaagtttt cgcaccccaa aggttacgca      240

gtcgttgcga ataacccgct ctatactggt ccagtccagc aagatatgct cgggctgaaa      300

gagaccctcg agaaacgcta ttttggggag agcgcggatg ggaatgacaa tatatgtatc      360

caagttatac ataatattct ggatatcgaa aagatccttg ctgaatacat taccaacgct      420

gcttatgcgg tcaacaatat ttcgggactt gataaagata taatcggctt cggtaaattc      480

agcactgtct atacatacga tgagttcaag gatccagagc atcatagagc ggcgttcaat      540

aataacgaca aactgattaa cgcaattaaa gcgcaatatg acgagttcga caattttctc      600

gacaacccac ggcttggcta ctttggccag gcatttttct cgaaggaggg taggaactac      660

ataatcaatt atggcaatga atgctatgac atacttgctc tgctttcagg tctcagacat      720

tgggtcgttc acaataacga agaagaatct cggatctctc ggacttggct ctataacctt      780

gacaagaacc ttgataacga gtacatctct acgctgaact acctttacga cagaatcact      840

aacgagctca ccaattcatt ctccaaaaat tctgccgcaa acgtcaacta catcgcggaa      900

acccttggga tcaacccagc agagtttgct gaacagtatt ttcgcttctc aatcatgaaa      960

gaacagaaaa atctgggctt caatataacg aaactgcgcg aggtcatgtt ggatagaaaa     1020

gatatgtccg aaatcaggaa aaaccataaa gtcttcgact caataaggac caaagtgtat     1080

accatgatgg attttgtcat ctaccgctat tacatagagg aggatgcaaa agtcgctgcc     1140

gctaacaaga gccttccaga taatgaaaag tctctgtcgg aaaaggatat atttgtgatt     1200

aatctccggg gaagctttaa cgacgatcaa aaggatgccc tgtactacga tgaggcaaac     1260

agaatttgga ggaagctgga aaacattatg cataacatta aggagttccg cgggaataaa     1320

acgagggaat ataagaagaa agatgctccg aggttgcctc ggattcttcc tgctggtagg     1380

gatgtttcgg cattctcgaa gctgatgtac gcactcacca tgttccttga cggtaaagag     1440

atcaacgatc tcttgacaac gcttattaat aagtttgata atatacagtc tttccttaag     1500

gttatgcccc ttattggagt taatgctaaa ttcgtggaag agtatgcttt cttcaaggac     1560

agcgcgaaaa ttgctgacga actgcgcctt atcaagtcct tcgcgcggat gggagagcct     1620

atagctgacg ctcgcagggc aatgtatatc gacgccatcc gcatccttgg caccaatctg     1680

agctatgatg agcttaaagc cctcgccgac accttcagcc tggacgaaaa cggcaacaaa     1740

ctcaagaagg gcaagcacgg catgcgcaat ttcattatca ataacgtgat ctcgaataag     1800

agatttcact atctgatacg gtatggcgac ccggcccacc tccatgagat tgcgaaaaac     1860

gaagctgttg tgaaatttgt gcttggtaga attgcggaca tacaaaaaaa acaaggccaa     1920

aatggcaaaa atcaaattga cagatattac gaaacatgca ttggaaagga taagggaaag     1980

tctgtgagcg agaaggttga tgcgttgacc aaaataatca caggaatgaa ttacgatcag     2040

ttcgataaaa agaggtcagt gatagaagac acggggcggg aaaacgctga acgcgaaaaa     2100

tttaagaaaa taatttcgct ctatcttacg gtcatttatc acatcttgaa gaatatagtc     2160

aatatcaacg ctagatacgt gattggtttc cattgtgtgg aaagagacgc tcaactgtac     2220

aaggaaaagg gttatgatat aaacctcaag aagctggagg aaaagggttt tagctcggtg     2280

actaaattgt gcgctggaat cgatgaaacc gcgccagata aaaggaagga tgttgagaag     2340

gagatggccg agagagcgaa ggaatctatc gacagcctgg aaagcgcgaa tcccaaactt     2400

tatgccaact acatcaagta ctctgacgag aaaaaagcgg aagagtttac tagacaaatc     2460

aatcgggaga aagctaagac cgccctcaat gcttacttgc gcaataccaa atggaacgtt     2520

atcattcgcg aagacctctt gcgcatagat aataaaacat gtacattgtt tagaaataaa     2580

gcagtgcacc tcgaggtcgc cagatacgtt cacgcatata taaatgacat cgctgaggtg     2640

aactcgtact ttcagctgta ccattacatt atgcaaagga tcataatgaa cgaaaggtac     2700

gagaaatcgt caggtaaagt ttccgaatat tttgacgcag tcaatgatga aaagaagtac     2760

aacgaccggc ttttgaagtt gctttgtgtg cctttcgggt actgtatccc tcggttcaaa     2820

aacctgtcca tagaggcatt gtttgacagg aacgaggcag caaagttcga caaggaaaag     2880

aaaaaggtgt cgggtaactc g                                               2901


<210>  27
<211>  2388
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834894, modified for expression in Zea mays cells

<400>  27
atggaaatta atacgtccaa tccgacacac agatcaggcg aatcttcctc agttagaggt       60

gatatgttgg gacttaaatc cgaattggaa aagaggtttt ttggcaagac attcgatgat      120

aacattcaca tacaacttat atataacatc cttgatatag aaaagatact tgctgtgtat      180

gtgacaaaca tagtttatgc actgaacaac atgcttggcg tgaagggatc agaaagctac      240

gatgatttca tggggtacct ctccgctcag aacacctatt acatattcac gcacccagat      300

aaatctaacc tgtcggataa agttaagggg aatattaaga agtcgctttc taaatttaac      360

gaccttctta agacaaaaag actgggctac tttgggcttg aggagccaaa gacgaaagac      420

aaacgggtta gcgaggcata taaaaagagg gtttatcata tgcttgccat agtgggccag      480

atacgccagt ccgtctttca tgataaatct aacgagttgg acgagtatct ttactctttc      540

atcgacatca tcgactccga atatagagac acgctcgact atcttgtcga cgaacggttt      600

gattcgataa ataagggttt tgtccaaggc aacaaagtca atatatcact cctcatagat      660

atgatgaaag gatacgaagc agacgatata atcagacttt attacgactt tattgttctt      720

aagagccaga aaaatcttgg attctcaata aagaaactga gggagaaaat gttggacgag      780

tatgggtttc ggtttaaaga taaacaatat gactcggtca ggtccaagat gtacaagctt      840

atggactttc ttttgttctg taattactat aggaatgacg ttgttgccgg ggaggccttg      900

gttagaaaat tgagattcag catgaccgat gacgaaaaag aaggcatcta tgcggatgag      960

gcagagaagt tgtgggggaa atttaggaat gactttgaaa acatagccga tcatatgaat     1020

ggcgatgtca taaaggagtt ggggaaagct gacatggatt ttgacgaaaa aatcctggat     1080

agcgaaaaaa agaatgcttc cgatctgttg tatttctcta agatgatcta tatgctcact     1140

tactttctgg acggtaaaga gatcaacgac cttcttacta cccttatttc aaagttcgat     1200

aacattaagg aatttctgaa aataatgaaa tcctcggctg tcgacgttga atgcgaactt     1260

actgcagggt acaagctgtt taacgactcg caaaggatta ctaatgaact gttcattgtc     1320

aagaacatag cgtccatgag aaagcctgca gcaagcgcaa agctgacgat gttccgcgat     1380

gctctcacca ttctgggaat tgatgacaag attaccgatg accgcatttc ggagatcctt     1440

aagcttaagg aaaaggggaa ggggattcac ggactgagaa attttatcac caataacgtg     1500

atcgaatcgt ctaggtttgt ctatttgata aagtatgcca atgcgcaaaa aattcgcgaa     1560

gtcgccaaga atgagaaggt cgttatgttc gtgctcggag gaattcccga tacacagatt     1620

gaacggtact ataaatcctg tgtggaattc ccggatatga actcatccct cgaggccaaa     1680

tgctctgagc ttgcgaggat gatcaagaat atctcctttg atgattttaa aaacgtgaag     1740

cagcaggcga agggccggga gaatgtggcg aaggagcggg ctaaagctgt gatagggctt     1800

tatcttactg ttatgtacct tctcgtgaaa aacctggtga atgtgaacgc caggtacgtt     1860

atagcgatcc attgtcttga gcgcgacttc ggtttgtata aggagataat tccagagctg     1920

gcatcgaaga acctgaaaaa cgattacaga attctgtcac aaactctctg tgaactctgc     1980

gatgaccgcg atgagtcacc gaatctcttc ctcaaaaaaa acaagaggct gaggaaatgt     2040

gtggaagttg acatcaataa cgcggattcg agcatgacac gcaagtaccg gaattgtatt     2100

gctcatctca cagtcgtccg cgagctcaaa gagtatatag gtgatatccg gaccgttgat     2160

tcttattttt ctatctatca ttacgttatg cagcggtgca ttacaaaaag ggaagatgat     2220

accaaacaag aagaaaaaat aaagtatgag gatgacttgt tgaaaaatca tggatatact     2280

aaagactttg tcaaggctct caactcaccg ttcggttaca acatacccag atttaaaaac     2340

ttgtcaattg aacagttgtt tgaccggaac gaatacctga cagaaaaa                  2388


<210>  28
<211>  2865
<212>  DNA
<213>  Eubacterium siraeum


<220>
<221>  misc_feature
<222>  (1)..(2865)
<223>  native CasM DNA sequence from Eubacterium siraeum

<400>  28
atgggtaaga aaatacacgc acgagatctc agagaacaaa gaaagaccga tagaacggaa       60

aaatttgcag atcagaacaa aaaacgtgaa gcagagaggg cagttccgaa aaaagacgca      120

gccgtttctg taaaatcagt ttcttctgtt tcatcaaaaa aagacaatgt aacaaaatct      180

atggctaaag ccgcaggcgt gaagtcggtt tttgctgtag gaaatactgt ttatatgact      240

tcattcggca gaggaaacga tgctgtactt gagcagaaaa tagtcgatac atcgcacgaa      300

ccgctgaata ttgacgatcc tgcatatcag ttgaacgttg tcacaatgaa cggttattcg      360

gttaccggtc acagaggtga aacggtatct gccgtaacgg ataatccgct gcgccgtttt      420

aacggaagaa agaaagatga accggaacag tctgtgccta cggatatgct gtgcctgaaa      480

ccgactcttg aaaagaaatt cttcggcaaa gaattcgatg ataatataca tatccagctt      540

atttacaata ttcttgacat tgaaaaaata ctggcggttt attcgaccaa cgctatttac      600

gcattgaata atatgagtgc tgacgaaaat atcgaaaaca gcgatttctt catgaaacgt      660

accaccgatg aaacctttga cgattttgaa aagaaaaagg agagtacaaa cagtcgagag      720

aaagccgatt ttgacgcatt tgaaaaattc atcggcaatt acaggctggc ttattttgcc      780

gatgcatttt atgtaaataa aaagaatccc aaaggtaaag caaaaaatgt tctgcgtgag      840

gataaagaac tttactccgt gctcactctg atcggtaaac tgcgtcattg gtgtgttcac      900

agtgaggagg gcagagcaga attctggctg tataagctcg atgaacttaa agatgatttc      960

aaaaatgtac tcgacgttgt ttataaccgt cctgttgaag aaataaacaa ccgctttata     1020

gaaaacaata aggtaaacat acagatactg ggctcggtat acaagaacac cgatattgcc     1080

gaacttgtaa ggtcatatta cgaatttctt atcacaaaga agtataaaaa tatgggcttt     1140

tcaataaaga agctccgtga gagtatgctc gaaggtaaag gttacgccga taaagaatat     1200

gattctgtaa ggaataagct gtatcagatg acggatttca tcttatacac aggatatatc     1260

aacgaagaca gcgatagagc cgacgatctt gtgaacactt tgagaagttc gctcaaagag     1320

gatgataaga caaccgtata ttgcaaggaa gcggattatc tgtggaaaaa ataccgtgaa     1380

tccataagag aggttgccga tgcgcttgat ggcgataaca ttaaaaagct gagcaaatcg     1440

aatattgaaa ttcaggaaga caagctgaga aaatgtttta tcagctatgc cgacagcgta     1500

tcggaattta ccaagcttat ttatctgctg acaagatttt taagcggtaa ggagatcaac     1560

gatcttgtca caacgctgat aaacaagttt gacaatatca gaagcttcct tgaaataatg     1620

gacgagcttg ggcttgacag gaccttcacc gccgagtaca gcttctttga aggcagtaca     1680

aagtatcttg ccgagcttgt cgagcttaac agctttgtga aatcgtgttc gtttgatata     1740

aacgcaaaaa gaacaatgta tcgcgatgcg ctggatattc tcggcattga atcggataag     1800

accgaagaag atattgagaa gatgatcgat aatatccttc agatcgacgc aaacggtgat     1860

aaaaagctca agaaaaacaa cggtctgaga aatttcattg caagtaacgt tatagattca     1920

aaccgattca agtaccttgt gcggtacgga aatccaaaga agattcgtga aacggcaaaa     1980

tgcaagcccg ctgtaaggtt tgtgctgaat gagatcccgg acgcacagat cgaaagatat     2040

tatgaggctt gttgcccaaa aaatacagct ttatgctctg caaataagag acgtgagaaa     2100

ctggctgata tgatagctga aataaagttt gagaattttt cggatgccgg caattatcag     2160

aaagcaaatg tcacatcaag aacgtctgaa gctgaaatca agcggaagaa tcaggctata     2220

atccgtcttt atcttaccgt tatgtacatt atgctgaaga accttgtaaa tgtgaacgcc     2280

agatacgtta tcgctttcca ttgcgttgaa agggatacga agctgtatgc ggaaagcggt     2340

ctggaagtcg gtaatataga aaaaaacaag acaaatctta ctatggctgt aatgggagtc     2400

aagctcgaaa acggaatcat aaaaacggaa tttgacaaga gctttgcaga aaatgccgca     2460

aacagatatc tcaggaatgc acgctggtac aagctgatac tggataattt aaagaagtcg     2520

gaaagagcgg ttgtcaatga gttcagaaat actgtctgcc atctgaatgc gataaggaat     2580

atcaatatca atatcaagga aataaaagag gtcgagaact actttgctct gtaccactac     2640

ctcattcaga aacatctcga aaatcgtttt gccgataaaa aagtagaaag agacaccggc     2700

gattttataa gcaagctcga agaacacaag acttactgca aggactttgt aaaagcatat     2760

tgtacgcctt tcggatataa ccttgtgaga tataaaaacc ttacgataga cgggctgttt     2820

gataagaatt accccggaaa agacgattct gatgaacaga aataa                     2865


<210>  29
<211>  2760
<212>  DNA
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(2760)
<223>  native CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834971

<400>  29
atggcaaaaa agaataaaat gaagcctaga gagctgcgtg aggctcagaa aaaagccaga       60

cagctcaaag cggctgagat aaataataac gctgctcctg caatcgctgc catgcctgct      120

gcagaggtca ttgcacctgc ggcagagaag aaaaaatcct ccgtaaaggc ggcaggaatg      180

aagtctattc ttgtcagcga aaataaaatg tacataacct ctttcggcaa gggcaattct      240

gctgtgcttg aatatgaggt ggataataat gactacaacc aaactcagct ttcttcaaag      300

gacaacagca atatccagct tggtggtgta aacgaagtaa acatcacttt ttcaagcaag      360

catggctttg agagcggagt ggaaataaac acttcaaacc ctactcacag aagcggtgaa      420

agctcgcctg taagagggga tatgctgggg cttaaatcgg agcttgaaaa gcgctttttc      480

ggcaaaactt ttgatgataa tatacatatc cagcttattt acaacattct ggatatcgaa      540

aagatacttg cggtgtatgt aacgaatatc gtttatgcgc tgaacaatat gctcggtgta      600

aagggttcag aaagtcatga cgattttatt gggtatcttt ccacaaataa tatttatgat      660

gtttttattg accctgataa cagcagttta tctgatgata agaaagcgaa tgtcagaaaa      720

agccttagca agttcaatgc cctgctgaaa actaagcgcc ttggctattt cggtcttgaa      780

gagccaaaga cgaaagataa tagagtttcg caagcttaca aaaagcgtgt ttatcatatg      840

cttgcaattg tgggtcagat aagacagtgt gtttttcatg ataaatcggg tgcaaaaaga      900

tttgaccttt acagttttat taacaatatt gatcccgaat acagagacac tcttgactat      960

cttgttgagg aacgcttaaa gtccataaac aaggacttta tcgaggacaa caaggtcaat     1020

atcagcttgc ttattgatat gatgaaaggc tatgaggctg atgatatcat acgcctttat     1080

tacgatttca ttgtgcttaa atctcagaaa aatctcggtt tttctatcaa aaagcttcgt     1140

gagaaaatgc tggacgaata cggcttcaga tttaaggaca agcaatatga ctctgtgcgc     1200

tcaaagatgt acaagcttat ggattttctg cttttctgca actactacag aaatgacatt     1260

gccgcaggcg aatctcttgt gcgcaaactg cgtttttcaa tgaccgatga tgaaaaagag     1320

gggatatatg ctgatgaagc ggcaaagctt tggggcaaat tcaggaatga ttttgaaaat     1380

atcgccgacc acatgaacgg tgacgttatc aaggagcttg gcaaggctga catggatttt     1440

gatgagaaaa ttcttgacag cgaaaagaag aatgcgtctg accttttgta tttctccaaa     1500

atgatatata tgctcacata ttttcttgac ggcaaggaga taaacgacct tcttacaacg     1560

cttatcagca agtttgataa catcaaggag tttttgaaga taatgaaaag ctctgctgtt     1620

gatgttgagt gtgaacttac ggcgggctac aagctgttca atgacagcca gaggataacc     1680

aacgagcttt ttatcgtaaa gaacattgct tccatgagaa agcctgcggc ttcggcgaag     1740

cttacgatgt tccgtgacgc actgactata ctcggtatag acgacaagat cacggacgat     1800

aggataagcg ggattctaaa acttaaagaa aaaggcaagg gcatacatgg cctgagaaat     1860

ttcataacaa acaatgttat cgagtcctct cggtttgtat accttatcaa gtatgcgaac     1920

gctcagaaga taagagaagt ggctaagaat gagaaagttg tcatgtttgt tcttgggggt     1980

atccctgaca cgcagataga gcgttattac aagagttgtg tggaatttcc tgacatgaac     2040

agttctttgg gagtaaagcg cagtgagctt gcgagaatga taaagaacat cagctttgat     2100

gatttcaaaa atgtgaaaca gcaggcaaag ggcagagaaa acgtggctaa ggagagggca     2160

aaggctgtta tcgggcttta tcttacggtc atgtatctgc tggtgaaaaa tcttgtgaat     2220

gtcaatgcaa ggtatgttat tgcgatacac tgccttgaac gtgattttgg gctgtataag     2280

gagataattc ctgagttggc ttcaaagaac ttgaaaaatg actacaggat actttcacag     2340

acgctttgtg aactttgtga taagtcgccg aatttgttct tgaaaaagaa cgagcggctg     2400

cgcaagtgcg ttgaagttga tatcaataat gcagacagca gcatgacaag aaaataccgc     2460

aactgtattg ctcatcttac tgtagttcgt gaactgaaag aatacatagg agatatttgt     2520

acagtggatt cttacttctc catttatcat tatgttatgc agcgctgtat cacgaaaagg     2580

gaaaatgaca caaagcaaga agagaaaata aagtatgagg acgatctttt aaaaaatcac     2640

ggctatacga aagactttgt aaaggctctc aactcgccgt ttggatacaa cattccgagg     2700

tttaaaaatc tttcaattga gcagttgttt gacagaaatg aatatcttac tgaaaagtag     2760


<210>  30
<211>  2757
<212>  DNA
<213>  Ruminococcus bicirculans


<220>
<221>  misc_feature
<222>  (1)..(2757)
<223>  native CasM DNA sequence from Ruminococcus bicirculans

<400>  30
atggcaaaaa agaataaaat gaagcctaga gagctgcgtg aggctcagaa aaaagccaga       60

cagctcaaag cggctgagat aaataataac gctgttcctg caatcgctgc catgcctgct      120

gcagaggctg ctgcacctgc ggcagagaag aaaaaatcct ccgtaaaggc ggcaggaatg      180

aagtctattc ttgtcagtga aaataaaatg tacataacct ctttcggcaa gggcaattct      240

gcggtgcttg aatatgaggt ggataataat gactacaaca aaactcagct ttcctcaaag      300

gacaacagta atatcgagct ctgtgatgta ggcaaagtaa acatcacttt ttcgagcaga      360

cgtggctttg agagcggtgt ggagataaac acttcaaacc ctactcacag aagcggtgaa      420

agctcgtctg taagagggga tatgctgggg cttaaatcgg agcttgaaaa gcgctttttc      480

ggcaagaatt ttgatgataa tatacatatc cagcttattt acaacattct ggatatcgaa      540

aagatacttg cagtgtatgt gacgaatatc gtttatgcac tgaacaatat gcttggggaa      600

ggcgatgaga gcaattacga tttcatgggg tatctttcca catttaacac ttataaagtt      660

tttactaatc ctaatggcag cactttatcc gacgataaga aagagaatat cagaaaaagt      720

cttagcaaat tcaatgccct gctgaaaact aagcgtcttg gctatttcgg ccttgaagag      780

ccaaagacaa aggatacaag agcttcggaa gcatacaaaa agcgtgttta tcatatgctt      840

gcaattgtgg ggcagataag acagtgtgtt tttcatgata aatcgggtgc aaaaagattt      900

gacctttaca gttttattaa caatattgat cccgaataca gagaaaccct tgactatctt      960

gtagatgaga gatttgattc tataaataag ggctttatcc agggcaacaa ggtcaatatc     1020

agcttgctta ttgatatgat gaaaggctat gaggctgatg atatcatacg cctttattac     1080

gatttcattg tgcttaaatc tcagaaaaat ctcggttttt ctatcaaaaa gcttcgtgag     1140

aaaatgctgg acgaatacgg cttcagattt aaggacaagc aatatgactc tgtgcgctca     1200

aagatgtaca agcttatgga ttttctgctt ttctgcaact actacagaaa tgacattgcc     1260

gcaggcgaat ctcttgtgcg caaactgcgt ttttcaatga ccgatgatga aaaagagggg     1320

atatatgctg atgaagcggc aaagctttgg ggcaaattca ggaatgattt tgaaaatatc     1380

gccgaccaca tgaacggtga cgttatcaag gagcttggca aggctgacat ggattttgat     1440

gagaaaattc ttgacagcga aaagaagaat gcgtctgacc ttttgtattt ctccaaaatg     1500

atatatatgc tcacatattt tcttgacggc aaggagataa acgaccttct tacaacgctt     1560

atcagcaagt ttgataacat caaggagttt ttgaagataa tgaaaagctc tgctgttgat     1620

gttgagtgtg aacttacggc gggctacaag ctgttcaatg acagccagag gataaccaac     1680

gagcttttta tcgtaaagaa cattgcttcc atgagaaagc ctgcggcttc ggcgaagctt     1740

acgatgttcc gtgacgcact gactatactc ggtatagacg acaagatcac ggacgatagg     1800

ataagcgaga ttctaaaact taaagaaaaa ggcaagggca tacatggcct gagaaatttc     1860

ataacaaaca atgttatcga gtcctctcgg tttgtatacc ttatcaagta tgcgaacgct     1920

cagaagataa gagaagtggc taagaatgag aaagttgtca tgtttgttct tgggggtatc     1980

cctgacacgc agatagagcg ttattacaag agttgtgtgg aatttcctga catgaacagt     2040

tctttgggag taaagcgcag tgagcttgcg agaatgataa agaacatcag ctttgatgat     2100

ttcaaaaatg tgaaacagca ggcaaagggc agagaaaacg tggctaagga gagggcaaag     2160

gctgttatcg ggctttatct tacggtcatg tatctgctgg tgaaaaatct tgtgaatgtc     2220

aatgcaaggt atgttattgc gatacactgc cttgaacgtg attttgggct gtataaggag     2280

ataattcctg agttggcttc aaagaacttg aaaaatgact acaggatact ttcacagacg     2340

ctttgtgaac tttgtgataa gtcgccgaat ttgttcttga aaaagaacga gcggctgcgc     2400

aagtgcgttg aagttgatat caataatgca gacagcagca tgacaagaaa ataccgcaac     2460

tgtattgctc atcttactgt agttcgtgaa ctgaaagaat acataggaga tatttgtaca     2520

gtggattctt acttctccat ttatcattat gttatgcagc gctgtatcac gaaaagggaa     2580

aatgacacaa agcaagaaga gaaaataaag tatgaggacg atcttttaaa aaatcacggc     2640

tatacgaaag actttgtaaa ggctctcaac tcgccgtttg gatacaacat tccgaggttt     2700

aaaaatcttt caattgagca gttgtttgac agaaatgaat atcttactga aaagtag        2757


<210>  31
<211>  2769
<212>  DNA
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(2769)
<223>  native CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5608892

<400>  31
atggcaaaaa agaataaaat gaagcctaga gagctgcgtg aggctcagaa aaaagccaga       60

cagctcaaag cggctgagat aaataataac gctgctcctg cgatcgctgc catgcctgct      120

gcagaggtca ttgcacctgt ggcagagaag aaaaaatcct ccgtaaaggc ggcaggaatg      180

aagtctattc ttgtcagcga aaataaaatg tacataacct ctttcggcaa gggcaattct      240

gctgtgcttg aatatgaggt ggacaataat gactacaaca aaactcagct ttcttcaaag      300

gacaacagca atatcgagct tggtgatgta aacgaggtaa acatcacttt ttcaagcaag      360

catggctttg ggagcggagt ggagataaat acttcaaacc ctactcacag aagcggtgaa      420

agctcgcctg taagagggga tatgctgggg cttaaatcgg agcttgaaaa gcgctttttc      480

ggcaaaactt ttgatgataa tatacatatc cagcttattt acaacattct ggatatcgaa      540

aagatacttg cggtgtatgt aacgaatatc gtttatgcgc tgaacaatat gcttggtata      600

aaggattctg aaagttatga tgattttatg gggtatcttt ctgcaagaaa tacttatgaa      660

gtttttactc accctgacaa aagcaatctt tccgataagg taaagggtaa tatcaagaaa      720

agccttagca agtttaatga cttgctgaaa actaagcgcc ttggctattt cggccttgaa      780

gagccaaaga caaaagacac aagagcttcg gaagcataca aaaagcgtgt ttatcatatg      840

cttgcaattg tggggcagat aagacagtgt gtttttcatg ataaatcggg tgcaaaaaga      900

tttgaccttt acagttttat taacaatatt gatcccgaat acagagatac tcttgactat      960

cttgttgagg agcgtttaaa gtccataaac aaggacttta tcgagggtaa caaggtcaat     1020

atcagcctgc ttattgatat gatgaaaggc tatgaggctg atgatatcat acgcctttat     1080

tacgatttca ttgtgcttaa atctcagaaa aatctcggct tttctatcaa aaagcttcgt     1140

gagaaaatgc tggaggaata cggtttcaga tttaaggaca agcaatatga ctctgtgcgc     1200

tcaaagatgt acaagcttat ggatttcctg cttttctgca actactacag aaatgacgtt     1260

gccgcaggcg aagctcttgt gcgtaaactg cgtttttcaa tgaccgatga tgaaaaagag     1320

gggatatatg ctgatgaagc ggcaaagctt tggggcaaat tcaggaatga ttttgaaaat     1380

atcgccgacc acatgaacgg tgacgttatc aaggagcttg gcaaggctga catggatttt     1440

gatgagaaaa ttcttgacag tgaaaagaag aatgcgtctg accttttgta tttctccaaa     1500

atgatatata tgctcacata ttttcttgac ggcaaggaga taaacgatct tcttacaacg     1560

cttatcagca agtttgataa catcaaggag tttttgaaga taatgaaaag ctctgctgtt     1620

gatgttgagt gtgagcttac ggcgggctac aagctgttca atgacagcca gaggataacc     1680

aacgagcttt ttatcgtaaa gaacattgct tccatgagaa agcctgcggc ttcagcgaag     1740

cttacgatgt tccgtgacgc actgactata ctcggtatag acgacaatat cacggacgat     1800

aggataagcg agattctaaa acttaaagaa aaaggcaagg gcatacatgg tctgagaaat     1860

tttataacaa acaatgttat cgagtcctct cggtttgtat accttatcaa gtatgcgaac     1920

gctcagaaga taagagaagt ggctaagaat gagaaagttg tcatgtttgt tcttgggggt     1980

atccctgaca cgcagataga gcgttattac aagagttgtg tggagtttcc tgacatgaat     2040

agttctttgg aagcaaagcg cagtgagctt gcgagaatga taaagaacat cagctttgat     2100

gatttcaaaa atgtgaaaca gcaggcaaag ggcagagaaa acgtggctaa ggagagggca     2160

aaggctgtta tcgggcttta tcttacggtc atgtatctgc tggtgaaaaa tcttgtgaat     2220

gtcaatgcaa ggtatgttat tgcgatacac tgccttgaac gtgattttgg gctgtataag     2280

gagataattc ctgagttggc ttcaaagaac ttgaaaaatg actacaggat actttcacag     2340

acgctttgtg aactttgtga tgatcgtaat gagtcgtcga atttgttctt gaaaaagaac     2400

aagcggctgc gcaagtgcgt tgaagttgat atcaataatg cagacagcag catgacaaga     2460

aaataccgca actgtattgc tcatcttact gtagttcgtg aactgaaaga atacatagga     2520

gatattcgta cagtggattc ttacttctcc atttatcatt atgttatgca gcgttgtatc     2580

acgaaaaggg gagatgacac aaagcaagaa gagaaaataa agtatgagga cgatctttta     2640

aaaaatcacg gctatacgaa agactttgta aaggctctca actcgccgtt tggatacaac     2700

attccgaggt ttaaaaatct ttcaattgag cagttgtttg acagaaatga atatcttact     2760

gaaaagtag                                                             2769


<210>  32
<211>  2769
<212>  DNA
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(2769)
<223>  native CasM DNA sequence from Ruminococcus sp. CAG:57

<400>  32
atggcaaaaa agaataaaat gaagcctaga gagctgcgtg aggctcagaa aaaagccaga       60

cagctcaaag cggctgagat aaataataac gctgctcctg cgatcgctgc catgcctgct      120

gcagaggtca ttgcacctgt ggcagagaag aaaaaatcct ccgtaaaggc ggcaggaatg      180

aagtctattc ttgtcagcga aaataaaatg tacataacct ctttcggcaa gggcaattct      240

gctgtgcttg aatatgaggt ggacaataat gactacaaca aaactcagct ttcttcaaag      300

gacaacagca atatcgagct tggtgatgta aacgaggtaa acatcacttt ttcaagcaag      360

catggctttg ggagcggagt ggagataaat acttcaaacc ctactcacag aagcggtgaa      420

agctcgcctg taagagggga tatgctgggg cttaaatcgg agcttgaaaa gcgctttttc      480

ggcaaaactt ttgatgataa tatacatatc cagcttattt acaacattct ggatatcgaa      540

aagatacttg cggtgtatgt aacgaatatc gtttatgcgc tgaacaatat gcttggtata      600

aaggattctg aaagttatga tgattttatg gggtatcttt ctgcaagaaa tacttatgaa      660

gtttttactc accctgacaa aagcaatctt tccgataagg taaagggtaa tatcaagaaa      720

agccttagca agtttaatga cttgctgaaa actaagcgcc ttggctattt cggccttgaa      780

gagccaaaga caaaagacac aagagcttcg gaagcataca aaaagcgtgt ttatcatatg      840

cttgcaattg tggggcagat aagacagtgt gtttttcatg ataaatcggg tgcaaaaaga      900

tttgaccttt acagttttat taacaatatt gatcccgaat acagagatac tcttgactat      960

cttgttgagg agcgtttaaa gtccataaac aaggacttta tcgagggtaa caaggtcaat     1020

atcagcctgc ttattgatat gatgaaaggc tatgaggctg atgatatcat acgcctttat     1080

tacgatttca ttgtgcttaa atctcagaaa aatctcggct tttctatcaa aaagcttcgt     1140

gagaaaatgc tggaggaata cggtttcaga tttaaggaca agcaatatga ctctgtgcgc     1200

tcaaagatgt acaagcttat ggatttcctg cttttctgca actactacag aaatgacgtt     1260

gccgcaggcg aagctcttgt gcgtaaactg cgtttttcaa tgaccgatga tgaaaaagag     1320

gggatatatg ctgatgaagc ggcaaagctt tggggcaaat tcaggaatga ttttgaaaat     1380

atcgccgacc acatgaacgg tgacgttatc aaggagcttg gcaaggctga catggatttt     1440

gatgagaaaa ttcttgacag tgaaaagaag aatgcgtctg accttttgta tttctccaaa     1500

atgatatata tgctcacata ttttcttgac ggcaaggaga taaacgatct tcttacaacg     1560

cttatcagca agtttgataa catcaaggag tttttgaaga taatgaaaag ctctgctgtt     1620

gatgttgagt gtgagcttac ggcgggctac aagctgttca atgacagcca gaggataacc     1680

aacgagcttt ttatcgtaaa gaacattgct tccatgagaa agcctgcggc ttcagcgaag     1740

cttacgatgt tccgtgacgc actgactata ctcggtatag acgacaatat cacggacgat     1800

aggataagcg agattctaaa acttaaagaa aaaggcaagg gcatacatgg tctgagaaat     1860

tttataacaa acaatgttat cgagtcctct cggtttgtat accttatcaa gtatgcgaac     1920

gctcagaaga taagagaagt ggctaaggat gagaaagttg tcatgtttgt tcttgggggt     1980

atccctgaca cgcagataga gcgttattac aagagttgtg tggagtttcc tgacatgaat     2040

agttctttgg aagcaaagcg cagtgagctt gcgagaatga taaagaacat cagctttgat     2100

gatttcaaaa atgtgaaaca gcaggcaaag ggcagagaaa acgtggctaa ggagagggca     2160

aaggctgtta tcgggcttta tcttacggtc atgtatctgc tggtgaaaaa tcttgtgaat     2220

gtcaatgcaa ggtatgttat tgcgatacac tgccttgaac gtgattttgg gctgtataag     2280

gagataattc ctgagttggc ttcaaagaac ttgaaaaatg actacaggat actttcacag     2340

acgctttgtg aactttgtga tgatcgtaat gagtcgtcga atttgttctt gaaaaagaac     2400

aagcggctgc gcaagtgcgt tgaagttgat atcaataatg cagacagcag catgacaaga     2460

aaataccgca actgtattgc tcatcttact gtagttcgtg aactgaaaga atacatagga     2520

gatattcgta cagtggattc ttacttctcc atttatcatt atgttatgca gcgttgtatc     2580

acgaaaaggg gagatgacac aaagcaagaa gagaaaataa agtatgagga cgatctttta     2640

aaaaatcacg gctatacgaa agactttgta aaggctctca actcgccgtt tggatacaac     2700

attccgaggt ttaaaaatct ttcaattgag cagttgtttg acagaaatga atatcttact     2760

gaaaagtag                                                             2769


<210>  33
<211>  2802
<212>  DNA
<213>  Ruminococcus flavefaciens


<220>
<221>  misc_feature
<222>  (1)..(2802)
<223>  native CasM DNA sequence from Ruminococcus flavefaciens FD-1

<400>  33
atgaaaaaga aaatgtctct ccgtgaaaag cgtgaagccg agaaacaggc taaaaaagct       60

gcatattcag cagcttcaaa aaatacagat tctaagcctg cggaaaagaa agcagaaact      120

ccaaagcctg cggagattat ttccgataat tccagaaata agaccgctgt aaaggcggct      180

ggtctgaaat caacaattat cagcggcgat aagctgtata tgacatcttt cggcaagggt      240

aacgctgctg ttattgagca gaaaatagat atcaatgatt attctttttc agctatgaaa      300

gatactccgt cgcttgaagt tgataaagca gaatcaaaag agatctcttt ttcaagtcac      360

catccttttg taaagaatga taagctgaca acatataacc ctttatacgg cggcaaggat      420

aaccccgaaa agcctgtcgg cagggatatg ctcggcttaa aagataagct tgaagaacgc      480

tatttcggat gtacattcaa tgataatctt cacatccaga ttatctataa catacttgac      540

atcgagaaga ttttagctgt tcattctgca aatatcacaa ctgcgcttga ccacatggtt      600

gatgaagacg atgaaaaata tcttaacagc gattatatcg gctacatgaa taccataaat      660

acatatgacg tgtttatgga tccttcaaag aattcttcat taagccctaa agatagaaag      720

aatattgaca acagccgtgc aaaatttgag aaactgcttt caactaagcg ccttggctat      780

tttggatttg actatgatgc aaacggtaag gacaagaaaa agaacgagga aataaaaaag      840

cgtttatatc atctcacagc ttttgcaggt cagctccgtc agtggagttt tcatagtgct      900

ggcaattatc cgagaacatg gctttacaag ctcgattcac tggataagga atatcttgat      960

actcttgacc attacttcga taaacgtttt aacgatataa acgatgattt cgtaactaag     1020

aatgctacca atctctatat tctgaaagaa gtatttcccg aagcaaactt caaggatatt     1080

gccgatcttt attacgattt catagttata aagtcgcaca aaaatatggg attctccata     1140

aaaaagctga gggagaagat gcttgaatgt gatggtgcag acaggataaa agaacaggat     1200

atggactctg ttcgctcaaa gctgtataag ctcatagact tttgcatttt caagtattat     1260

cacgaatttc ctgaacttag tgaaaagaat gtggatatac tcagagcggc tgtatccgat     1320

acaaaaaaag ataaccttta ttctgatgag gctgcacgtt tatggagcat atttaaagaa     1380

aaattcctcg gcttctgtga taagatagtt gtatgggtaa caggagagca tgagaaagat     1440

atcacatccg ttattgataa ggatgcttac aggaacagga gcaatgtttc atatttctca     1500

aagctgatgt atgcaatgtg ctttttcctt gacggaaaag agataaatga ccttctcact     1560

actcttatca acaaattcga taatatcgct aaccagataa aaacagccaa agaacttggc     1620

attaatactg cttttgtaaa gaattacgat ttcttcaatc acagcgagaa atatgtcgat     1680

gaactgaaca tcgtcaagaa tattgcaaga atgaagaagc cttcaagtaa tgccaaaaaa     1740

gctatgtatc atgatgcgct tactattctc ggaatacctg aggatatgga tgaaaaagct     1800

cttgatgagg aactggattt aattcttgaa aaaaagacag acccagtaac tggcaagcca     1860

ctgaaaggta agaatccttt acgtaatttt atcgcaaaca atgtgataga gaattcaaga     1920

ttcatatatc ttatcaagtt ctgcaatcct gagaatgtac gtaaaatcgt gaataataca     1980

aaggtcactg agtttgtgtt aaagcgtatt cccgatgctc agatcgaacg ctattataag     2040

tcgtgtacag attctgaaat gaatccgcct actgaaaaga agatcaccga acttgctggt     2100

aagttaaagg atatgaactt tggcaacttc cgaaatgtga gacagtctgc taaagagaat     2160

atggagaagg agcgcttcaa agctgttata gggctttatc tcacggtagt atatcgtgtt     2220

gtcaagaatc ttgttgatgt aaactcacga tatatcatgg cttttcattc gcttgaacgt     2280

gattcacaac tgtataacgt atctgttgat aatgattatc ttgcacttac cgatactctt     2340

gttaaggagg gagataattc cagaagcaga tatcttgcag gcaacaagcg tctgagagat     2400

tgtgtgaagc aggatatcga taatgcaaaa aagtggtttg ttagtgataa gtacaatagc     2460

ataaccaagt acaggaataa cgttgcccat cttaccgctg tacgtaactg cgctgaattc     2520

atcggagata taacgaagat agactcctat tttgcattgt atcattatct cattcagaga     2580

cagcttgcga aaggtcttga ccatgagcga agtggctttg acagaaacta tccacagtat     2640

gcaccgctgt ttaagtggca tacgtatgta aaggatgttg tcaaggctct gaatgctcca     2700

tttggctaca atatccctcg tttcaagaat ctcagcatag atgcactttt tgaccgcaac     2760

gaaataaaga agaatgacgg cgagaaaaaa tccgatgatt ga                        2802


<210>  34
<211>  2835
<212>  DNA
<213>  Ruminococcus albus


<220>
<221>  misc_feature
<222>  (1)..(2835)
<223>  native CasM DNA sequence from Ruminococcus albus strain KH2T6

<400>  34
atggcaaaaa aatcgaaagg tatgagcctt agagaaaaac gtgaacttga aaagcagaaa       60

aggatacaaa aggcagctgt gaattcagtt aatgatacac ctgaaaaaac agaagaagca      120

aatgtcgtat ctgtaaatgt caggacatcg gctgagaata agcatagtaa aaaatctgct      180

gccaaagctt tgggactgaa atccgggctg gttatcggtg atgagctgta ccttacttca      240

ttcggcagag gtaacgaagc aaagcttgaa aagaagatat ccggtgacac tgtcgaaaaa      300

cttggcattg gtgcttttga agtcgccgaa cgtgacgaat caacgcttac cctcgaaagt      360

ggcaggataa aggacaagac cgccagaccc aaagacccca gacatataac cgtcgataca      420

caaggtaaat tcaaggaaga tatgcttggg atacgcagtg tactggagaa aaagatattt      480

ggcaaaacat ttgatgataa tatccatgtt cagcttgcgt acaatatcct ggatgtcgaa      540

aagataatgg cacagtatgt cagcgatatc gtatatatgc tgcataatac tgataaaaca      600

gaaagaaacg ataatcttat ggggtatatg agcatcagga atacctataa gacattttgt      660

gatacgtcaa atcttcccga tgatacaaaa caaaaagttg aaaatcagaa gagagagttt      720

gacaagatca taaaaagcgg cagacttggg tatttcggcg aagcttttat ggtaaacagc      780

ggcaatagta ccaagcttag acccgagaaa gagatatatc atatctttgc gcttatggcg      840

agcctgaggc agagttactt tcacggatat gtaaaagata ccgattatca gggaaccaca      900

tgggcatata ctcttgagga caagctgaaa ggtccgagcc atgagttcag ggaaaccatt      960

gataagatat ttgatgaggg attcagcaag atcagcaagg actttggcaa gatgaacaag     1020

gtcaaccttc agatacttga acagatgatt ggtgaactgt atggcagtat agaacgacaa     1080

aacctcactt gcgattacta tgacttcatt caactgaaaa agcataagta tcttggattt     1140

tctataaagc gtcttagaga gaccatgctt gaaacaacac cggctgaatg ttataaagct     1200

gaatgctata acagcgagcg tcaaaagctg tataagctga tagatttcct gatatatgat     1260

ctttactata accgtaagcc tgcacgcatc gaagaaatcg tggacaagct gagggaatct     1320

gtgaacgacg aagagaaaga atccatatat tcagttgagg cgaagtatgt ctatgaatca     1380

cttagcaaag ttctggataa atcgctgaaa aacagtgtgt ctggtgaaac gataaaggat     1440

ctccaaaaga gatatgatga cgaaacagca aacaggatct gggatatctc acagcacagt     1500

ataagtggaa atgtcaactg tttctgcaag ctaatttata ttatgaccct gatgcttgac     1560

ggcaaggaga taaatgatct gctgacaacg ctggtaaaca agttcgataa catagcatca     1620

tttatagatg ttatggacga acttggcttg gagcatagtt ttacagataa ctataaaatg     1680

tttgccgaca gcaaggctat atgccttgat ctgcagttca taaacagttt tgcacgtatg     1740

tcaaagatcg atgatgagaa gtcaaaaaga cagcttttcc gtgatgcgct tgtcatactg     1800

gatatcggta ataaagatga gacttggata aataattatc tggattctga tattttcaaa     1860

ctggacaaag aaggtaacaa gttaaagggc gcaaggcatg atttcaggaa ctttatagcc     1920

aataatgtta taaagtcatc acgtttcaaa tacctagtaa aatacagcag tgccgatggt     1980

atgataaagc tgaaaacgaa tgaaaagctg ataggctttg ttctggataa gcttccagaa     2040

acgcagatag accgctacta tgaatcatgc ggacttgaca atgcggtagt agataagaaa     2100

gtcaggatag aaaagctatc ggggcttatc agagatatga agttcgatga tttcagcggt     2160

gtcaaaacct caaacaaagc aggagataat gacaaacagg ataaggcgaa atatcaggcg     2220

ataataagcc tgtacctcat ggtgctgtat cagatagtca agaacatgat atatgtcaac     2280

tcacgttatg ttatcgcttt ccattgtctt gaacgtgact ttggtatgta tggaaaagat     2340

tttggaaagt attatcaagg ctgccgaaaa cttacagatc attttattga agaaaagtac     2400

atgaaagagg gtaaacttgg ctgcaataaa aaagtcggca gatatctgaa aaataatatt     2460

tcctgctgca ctgatggact gataaatacc taccgtaatc aggttgatca ctttgcagtg     2520

gtaaggaaga taggcaacta tgcggcatat atcaagagta tcggttcgtg gtttgaactt     2580

tatcactatg taatacagag gatagttttt gacgaataca gatttgcact taacaacact     2640

gaaagcaact ataagaacag catcatcaag caccatacct actgtaagga tatggtcaag     2700

gcactgaaca cacctttcgg ttatgacctg ccgagataca agaatctttc tatcggtgat     2760

ctgtttgatc gcaataatta tctgaataaa acaaaagagt caatagatgc aaatagctct     2820

attgacagtc agtga                                                      2835


<210>  35
<211>  2904
<212>  DNA
<213>  Ruminococcus flavefaciens


<220>
<221>  misc_feature
<222>  (1)..(2904)
<223>  native CasM DNA sequence from Ruminococcus flavefaciens strain 
       XPD3002

<400>  35
atgatcgaaa agaagaagtc atttgcaaag ggcatgggag taaaatcaac acttgtatcc       60

ggttcaaagg tatacatgac gacgttcgca gaaggaagcg atgccagact tgaaaagatc      120

gttgaaggcg attctatcag atctgtcaac gaaggagaag cgttctcagc tgaaatggct      180

gataagaatg caggctacaa gatcggtaac gcaaagttca gccacccaaa gggctatgct      240

gtagttgcaa acaacccctt atacaccgga ccggtacagc aggatatgct cggtctgaag      300

gaaacgcttg aaaagagata ttttggagag tctgccgacg gaaatgataa tatctgtatt      360

caggtcatcc ataatatcct cgatatcgaa aagatcctcg ctgaatatat aaccaatgct      420

gcttatgcgg taaacaatat ttccggtctt gataaggata tcatcggttt tggtaagttc      480

agtacggtct atacttatga tgagttcaag gatcctgaac atcacagagc agctttcaac      540

aataacgata agttaattaa tgccatcaag gcacagtatg atgaatttga caatttcctt      600

gataatcctc gtctcggcta ctttggacag gcttttttca gtaaggaagg cagaaattac      660

attatcaatt acggcaacga gtgttatgat attcttgctt tactcagcgg attgcgtcac      720

tgggtagtac ataataatga ggaagaatca aggatttccc gtacatggct ttataatctc      780

gacaagaatc ttgacaacga atatatctct actctcaatt atctgtatga tagaattaca      840

aacgaattaa caaattcctt ctcaaagaat agtgcagcca acgtaaacta tatcgctgaa      900

acccttggta ttaatcctgc tgaatttgca gagcagtatt tcagattcag tatcatgaag      960

gaacagaaga atctcggttt caatattact aagctgagag aagtaatgct tgacagaaag     1020

gatatgtctg agatccgtaa aaatcataag gtctttgatt caatccgtac taaggtctat     1080

actatgatgg atttcgttat ctacagatat tacattgaag aggatgcaaa ggttgctgct     1140

gccaacaagt ctctgccgga taacgaaaaa agcctcagtg aaaaggatat ctttgttata     1200

aatctcagag gaagctttaa cgatgatcag aaggatgccc tttattatga tgaggccaat     1260

cgtatttgga gaaagctcga aaacattatg cacaatatca aggaattcag aggcaataag     1320

acacgtgaat acaagaagaa ggatgctcca agactcccca gaattcttcc tgccggaagg     1380

gatgtttccg cgttctcaaa gttgatgtac gctcttacca tgttccttga tggtaaggag     1440

atcaatgatc ttctcaccac gctcatcaat aagttcgata acatccagag tttcctcaag     1500

gtaatgcctc ttatcggagt gaatgcaaag tttgttgagg aatatgcctt cttcaaggac     1560

agcgcaaaga ttgctgacga actcaggctg attaagagct ttgccagaat gggagaacct     1620

atcgcagatg caagacgtgc tatgtatatc gatgctatca ggattctcgg aacaaacctc     1680

agctatgatg agcttaaggc ccttgccgat actttttcgc ttgatgaaaa cggcaacaag     1740

cttaagaagg gcaagcacgg catgagaaac ttcatcatta ataatgtaat cagtaacaag     1800

cgcttccatt atctcattcg ttacggtgat cctgcacatc tccatgagat cgccaagaat     1860

gaagctgttg taaagttcgt cctcggcagg atagctgata tccagaagaa gcagggacag     1920

aacggaaaga atcagatcga caggtactat gagacctgta tcggcaagga caagggcaag     1980

tctgtctccg aaaaggttga tgccctcaca aagattatca ccggtatgaa ctacgatcag     2040

ttcgataaga agagaagcgt tattgaggat actggaagag aaaacgctga gagagaaaag     2100

ttcaagaaga tcatcagcct ctatcttact gtcatttatc acatccttaa gaatattgtt     2160

aatatcaatg cgcgttacgt tatcggcttc cattgcgttg agcgtgatgc acagctctat     2220

aaggaaaagg gctatgatat caacctcaag aagctcgaag aaaaggggtt ttcatcagtc     2280

acaaagctgt gtgcaggtat tgatgagact gctcctgaca agcgtaagga tgttgaaaag     2340

gaaatggctg agcgtgcaaa ggaatctatc gatagccttg aatctgcaaa tcctaagctt     2400

tacgcaaact atatcaagta ttctgacgag aagaaggctg aggaatttac tagacagatc     2460

aaccgtgaga aggcaaagac cgctctgaat gcatatctca gaaatactaa gtggaatgtg     2520

ataatcaggg aagatcttct tagaatcgat aataagacat gtacgctctt tagaaataag     2580

gccgttcatc ttgaagttgc aagatatgtt catgcatata tcaacgatat tgccgaagta     2640

aacagctatt tccagcttta tcattacatc atgcagagaa tcatcatgaa cgaaagatat     2700

gaaaagtctt ctggaaaggt aagcgaatac ttcgatgctg tgaacgatga aaagaagtac     2760

aacgacaggc ttctgaagct gttgtgcgtt ccatttggtt actgcatccc gagattcaag     2820

aatctctcca ttgaagcttt gttcgacagg aacgaagcag ctaagtttga caaggaaaag     2880

aagaaagtat caggtaattc atag                                            2904


<210>  36
<211>  2391
<212>  DNA
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(2391)
<223>  native CasM DNA sequence from Ruminococcus sp., isolate 
       2789STDY5834894

<400>  36
gtggagataa acacttcaaa ccctactcac agaagcggtg aaagctcgtc tgtaagaggg       60

gatatgctgg ggcttaaatc ggagcttgaa aagcgctttt tcggcaagac ttttgatgat      120

aatatacata tccagcttat ttacaacatt ctggatatcg aaaagatact tgcagtgtat      180

gtgacgaata tcgtttatgc actgaacaat atgcttggtg taaagggttc tgaaagttat      240

gatgatttta tggggtatct ttctgcccaa aatacttatt atatttttac tcaccctgac      300

aaaagtaatc tttccgataa ggtaaagggt aatatcaaga aaagccttag caagtttaat      360

gacctgctga aaactaagcg tcttggctat tttggtcttg aagagcctaa gacgaaagat      420

aaaagagttt cggaggcata caaaaagcgt gtttatcata tgcttgcaat tgtggggcag      480

ataaggcaga gtgttttcca tgataagtca aatgagcttg atgagtacct ttacagcttt      540

attgacatta ttgattccga atacagagac actcttgact atcttgtaga tgagagattt      600

gattctataa ataagggctt tgtccagggc aacaaggtca atatcagctt gcttattgat      660

atgatgaaag gctatgaggc tgatgatatc atacgccttt attatgattt cattgtgctt      720

aaatctcaga aaaatctcgg tttttctatc aaaaagcttc gtgagaaaat gctggacgaa      780

tacggcttca gatttaagga caagcaatat gactctgtgc gctcaaagat gtacaagctt      840

atggattttc tgcttttctg caactattac agaaatgacg ttgtcgcagg cgaagctctt      900

gtgcgcaaac tgcgtttttc aatgaccgat gatgaaaaag aggggatata tgctgatgaa      960

gcggaaaagc tttggggcaa attcaggaat gattttgaaa atatcgccga ccacatgaac     1020

ggtgacgtta tcaaggagct tggcaaggct gacatggatt ttgatgagaa aattcttgac     1080

agcgaaaaga agaatgcgtc tgaccttttg tatttctcca aaatgatata tatgctcaca     1140

tattttcttg acggcaagga gataaacgat cttcttacaa cgcttatcag caagtttgat     1200

aacatcaagg agtttttgaa gataatgaaa agctctgctg ttgatgttga gtgtgagctt     1260

acggcgggct acaagctgtt caatgacagc cagaggataa ccaacgagct ttttatcgta     1320

aagaacattg cttccatgag aaagcctgcg gcttcggcga agcttacgat gttccgtgac     1380

gcactgacta tactcggtat agacgacaag atcacggacg ataggataag cgagatttta     1440

aaacttaaag aaaaaggcaa gggcatacat ggtctgagaa attttataac aaacaatgtt     1500

atcgagtcct ctcggtttgt ataccttatc aagtatgcga acgctcagaa gataagagaa     1560

gtggctaaga atgagaaagt tgtcatgttt gttcttgggg gtatccctga cacgcagata     1620

gagcgttatt acaagagttg tgtggaattt cctgacatga acagttcttt ggaagcaaag     1680

tgcagtgagc ttgcgagaat gataaagaac atcagctttg atgatttcaa aaatgtgaaa     1740

cagcaggcaa agggcagaga aaacgtggct aaggagaggg caaaggctgt tatcgggctt     1800

tatcttacgg tcatgtatct gctggtgaaa aatcttgtga atgtcaatgc aaggtatgtt     1860

attgcgatac actgccttga acgtgatttt gggctgtata aggagataat tcctgagttg     1920

gcttcaaaga acttgaaaaa tgactacagg atactttcac agacgctttg tgaactttgt     1980

gatgatcgtg atgagtcgcc gaatttgttc ttgaaaaaga acaagcggct gcgcaagtgc     2040

gttgaagttg atatcaataa tgcagacagc agcatgacaa gaaaataccg caactgtatt     2100

gctcatctta ctgtagttcg tgaactgaaa gaatacatag gagatattcg tacagtggat     2160

tcttacttct ccatttatca ttatgttatg cagcgctgta tcacgaaaag ggaagatgac     2220

acaaagcaag aagagaaaat aaagtatgag gacgatcttt taaaaaatca cggctatacg     2280

aaagactttg taaaggctct caactcgccg tttggataca acattccgag gtttaaaaat     2340

ctttcaattg agcagttgtt tgacagaaat gaatatctta ctgaaaagta g              2391


<210>  37
<211>  954
<212>  PRT
<213>  Eubacterium siraeum


<220>
<221>  misc_feature
<222>  (1)..(954)
<223>  native CasM protein sequence from Eubacterium siraeum

<400>  37

Met Gly Lys Lys Ile His Ala Arg Asp Leu Arg Glu Gln Arg Lys Thr 
1               5                   10                  15      


Asp Arg Thr Glu Lys Phe Ala Asp Gln Asn Lys Lys Arg Glu Ala Glu 
            20                  25                  30          


Arg Ala Val Pro Lys Lys Asp Ala Ala Val Ser Val Lys Ser Val Ser 
        35                  40                  45              


Ser Val Ser Ser Lys Lys Asp Asn Val Thr Lys Ser Met Ala Lys Ala 
    50                  55                  60                  


Ala Gly Val Lys Ser Val Phe Ala Val Gly Asn Thr Val Tyr Met Thr 
65                  70                  75                  80  


Ser Phe Gly Arg Gly Asn Asp Ala Val Leu Glu Gln Lys Ile Val Asp 
                85                  90                  95      


Thr Ser His Glu Pro Leu Asn Ile Asp Asp Pro Ala Tyr Gln Leu Asn 
            100                 105                 110         


Val Val Thr Met Asn Gly Tyr Ser Val Thr Gly His Arg Gly Glu Thr 
        115                 120                 125             


Val Ser Ala Val Thr Asp Asn Pro Leu Arg Arg Phe Asn Gly Arg Lys 
    130                 135                 140                 


Lys Asp Glu Pro Glu Gln Ser Val Pro Thr Asp Met Leu Cys Leu Lys 
145                 150                 155                 160 


Pro Thr Leu Glu Lys Lys Phe Phe Gly Lys Glu Phe Asp Asp Asn Ile 
                165                 170                 175     


His Ile Gln Leu Ile Tyr Asn Ile Leu Asp Ile Glu Lys Ile Leu Ala 
            180                 185                 190         


Val Tyr Ser Thr Asn Ala Ile Tyr Ala Leu Asn Asn Met Ser Ala Asp 
        195                 200                 205             


Glu Asn Ile Glu Asn Ser Asp Phe Phe Met Lys Arg Thr Thr Asp Glu 
    210                 215                 220                 


Thr Phe Asp Asp Phe Glu Lys Lys Lys Glu Ser Thr Asn Ser Arg Glu 
225                 230                 235                 240 


Lys Ala Asp Phe Asp Ala Phe Glu Lys Phe Ile Gly Asn Tyr Arg Leu 
                245                 250                 255     


Ala Tyr Phe Ala Asp Ala Phe Tyr Val Asn Lys Lys Asn Pro Lys Gly 
            260                 265                 270         


Lys Ala Lys Asn Val Leu Arg Glu Asp Lys Glu Leu Tyr Ser Val Leu 
        275                 280                 285             


Thr Leu Ile Gly Lys Leu Arg His Trp Cys Val His Ser Glu Glu Gly 
    290                 295                 300                 


Arg Ala Glu Phe Trp Leu Tyr Lys Leu Asp Glu Leu Lys Asp Asp Phe 
305                 310                 315                 320 


Lys Asn Val Leu Asp Val Val Tyr Asn Arg Pro Val Glu Glu Ile Asn 
                325                 330                 335     


Asn Arg Phe Ile Glu Asn Asn Lys Val Asn Ile Gln Ile Leu Gly Ser 
            340                 345                 350         


Val Tyr Lys Asn Thr Asp Ile Ala Glu Leu Val Arg Ser Tyr Tyr Glu 
        355                 360                 365             


Phe Leu Ile Thr Lys Lys Tyr Lys Asn Met Gly Phe Ser Ile Lys Lys 
    370                 375                 380                 


Leu Arg Glu Ser Met Leu Glu Gly Lys Gly Tyr Ala Asp Lys Glu Tyr 
385                 390                 395                 400 


Asp Ser Val Arg Asn Lys Leu Tyr Gln Met Thr Asp Phe Ile Leu Tyr 
                405                 410                 415     


Thr Gly Tyr Ile Asn Glu Asp Ser Asp Arg Ala Asp Asp Leu Val Asn 
            420                 425                 430         


Thr Leu Arg Ser Ser Leu Lys Glu Asp Asp Lys Thr Thr Val Tyr Cys 
        435                 440                 445             


Lys Glu Ala Asp Tyr Leu Trp Lys Lys Tyr Arg Glu Ser Ile Arg Glu 
    450                 455                 460                 


Val Ala Asp Ala Leu Asp Gly Asp Asn Ile Lys Lys Leu Ser Lys Ser 
465                 470                 475                 480 


Asn Ile Glu Ile Gln Glu Asp Lys Leu Arg Lys Cys Phe Ile Ser Tyr 
                485                 490                 495     


Ala Asp Ser Val Ser Glu Phe Thr Lys Leu Ile Tyr Leu Leu Thr Arg 
            500                 505                 510         


Phe Leu Ser Gly Lys Glu Ile Asn Asp Leu Val Thr Thr Leu Ile Asn 
        515                 520                 525             


Lys Phe Asp Asn Ile Arg Ser Phe Leu Glu Ile Met Asp Glu Leu Gly 
    530                 535                 540                 


Leu Asp Arg Thr Phe Thr Ala Glu Tyr Ser Phe Phe Glu Gly Ser Thr 
545                 550                 555                 560 


Lys Tyr Leu Ala Glu Leu Val Glu Leu Asn Ser Phe Val Lys Ser Cys 
                565                 570                 575     


Ser Phe Asp Ile Asn Ala Lys Arg Thr Met Tyr Arg Asp Ala Leu Asp 
            580                 585                 590         


Ile Leu Gly Ile Glu Ser Asp Lys Thr Glu Glu Asp Ile Glu Lys Met 
        595                 600                 605             


Ile Asp Asn Ile Leu Gln Ile Asp Ala Asn Gly Asp Lys Lys Leu Lys 
    610                 615                 620                 


Lys Asn Asn Gly Leu Arg Asn Phe Ile Ala Ser Asn Val Ile Asp Ser 
625                 630                 635                 640 


Asn Arg Phe Lys Tyr Leu Val Arg Tyr Gly Asn Pro Lys Lys Ile Arg 
                645                 650                 655     


Glu Thr Ala Lys Cys Lys Pro Ala Val Arg Phe Val Leu Asn Glu Ile 
            660                 665                 670         


Pro Asp Ala Gln Ile Glu Arg Tyr Tyr Glu Ala Cys Cys Pro Lys Asn 
        675                 680                 685             


Thr Ala Leu Cys Ser Ala Asn Lys Arg Arg Glu Lys Leu Ala Asp Met 
    690                 695                 700                 


Ile Ala Glu Ile Lys Phe Glu Asn Phe Ser Asp Ala Gly Asn Tyr Gln 
705                 710                 715                 720 


Lys Ala Asn Val Thr Ser Arg Thr Ser Glu Ala Glu Ile Lys Arg Lys 
                725                 730                 735     


Asn Gln Ala Ile Ile Arg Leu Tyr Leu Thr Val Met Tyr Ile Met Leu 
            740                 745                 750         


Lys Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Phe His Cys 
        755                 760                 765             


Val Glu Arg Asp Thr Lys Leu Tyr Ala Glu Ser Gly Leu Glu Val Gly 
    770                 775                 780                 


Asn Ile Glu Lys Asn Lys Thr Asn Leu Thr Met Ala Val Met Gly Val 
785                 790                 795                 800 


Lys Leu Glu Asn Gly Ile Ile Lys Thr Glu Phe Asp Lys Ser Phe Ala 
                805                 810                 815     


Glu Asn Ala Ala Asn Arg Tyr Leu Arg Asn Ala Arg Trp Tyr Lys Leu 
            820                 825                 830         


Ile Leu Asp Asn Leu Lys Lys Ser Glu Arg Ala Val Val Asn Glu Phe 
        835                 840                 845             


Arg Asn Thr Val Cys His Leu Asn Ala Ile Arg Asn Ile Asn Ile Asn 
    850                 855                 860                 


Ile Lys Glu Ile Lys Glu Val Glu Asn Tyr Phe Ala Leu Tyr His Tyr 
865                 870                 875                 880 


Leu Ile Gln Lys His Leu Glu Asn Arg Phe Ala Asp Lys Lys Val Glu 
                885                 890                 895     


Arg Asp Thr Gly Asp Phe Ile Ser Lys Leu Glu Glu His Lys Thr Tyr 
            900                 905                 910         


Cys Lys Asp Phe Val Lys Ala Tyr Cys Thr Pro Phe Gly Tyr Asn Leu 
        915                 920                 925             


Val Arg Tyr Lys Asn Leu Thr Ile Asp Gly Leu Phe Asp Lys Asn Tyr 
    930                 935                 940                 


Pro Gly Lys Asp Asp Ser Asp Glu Gln Lys 
945                 950                 


<210>  38
<211>  919
<212>  PRT
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(919)
<223>  native CasM protein sequence from Ruminococcus sp., isolate 
       2789STDY5834971

<400>  38

Met Ala Lys Lys Asn Lys Met Lys Pro Arg Glu Leu Arg Glu Ala Gln 
1               5                   10                  15      


Lys Lys Ala Arg Gln Leu Lys Ala Ala Glu Ile Asn Asn Asn Ala Ala 
            20                  25                  30          


Pro Ala Ile Ala Ala Met Pro Ala Ala Glu Val Ile Ala Pro Ala Ala 
        35                  40                  45              


Glu Lys Lys Lys Ser Ser Val Lys Ala Ala Gly Met Lys Ser Ile Leu 
    50                  55                  60                  


Val Ser Glu Asn Lys Met Tyr Ile Thr Ser Phe Gly Lys Gly Asn Ser 
65                  70                  75                  80  


Ala Val Leu Glu Tyr Glu Val Asp Asn Asn Asp Tyr Asn Gln Thr Gln 
                85                  90                  95      


Leu Ser Ser Lys Asp Asn Ser Asn Ile Gln Leu Gly Gly Val Asn Glu 
            100                 105                 110         


Val Asn Ile Thr Phe Ser Ser Lys His Gly Phe Glu Ser Gly Val Glu 
        115                 120                 125             


Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser Pro Val 
    130                 135                 140                 


Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg Phe Phe 
145                 150                 155                 160 


Gly Lys Thr Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr Asn Ile 
                165                 170                 175     


Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile Val Tyr 
            180                 185                 190         


Ala Leu Asn Asn Met Leu Gly Val Lys Gly Ser Glu Ser His Asp Asp 
        195                 200                 205             


Phe Ile Gly Tyr Leu Ser Thr Asn Asn Ile Tyr Asp Val Phe Ile Asp 
    210                 215                 220                 


Pro Asp Asn Ser Ser Leu Ser Asp Asp Lys Lys Ala Asn Val Arg Lys 
225                 230                 235                 240 


Ser Leu Ser Lys Phe Asn Ala Leu Leu Lys Thr Lys Arg Leu Gly Tyr 
                245                 250                 255     


Phe Gly Leu Glu Glu Pro Lys Thr Lys Asp Asn Arg Val Ser Gln Ala 
            260                 265                 270         


Tyr Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln Ile Arg 
        275                 280                 285             


Gln Cys Val Phe His Asp Lys Ser Gly Ala Lys Arg Phe Asp Leu Tyr 
    290                 295                 300                 


Ser Phe Ile Asn Asn Ile Asp Pro Glu Tyr Arg Asp Thr Leu Asp Tyr 
305                 310                 315                 320 


Leu Val Glu Glu Arg Leu Lys Ser Ile Asn Lys Asp Phe Ile Glu Asp 
                325                 330                 335     


Asn Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly Tyr Glu 
            340                 345                 350         


Ala Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu Lys Ser 
        355                 360                 365             


Gln Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys Met Leu 
    370                 375                 380                 


Asp Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser Val Arg 
385                 390                 395                 400 


Ser Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn Tyr Tyr 
                405                 410                 415     


Arg Asn Asp Ile Ala Ala Gly Glu Ser Leu Val Arg Lys Leu Arg Phe 
            420                 425                 430         


Ser Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu Ala Ala 
        435                 440                 445             


Lys Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala Asp His 
    450                 455                 460                 


Met Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met Asp Phe 
465                 470                 475                 480 


Asp Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp Leu Leu 
                485                 490                 495     


Tyr Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp Gly Lys 
            500                 505                 510         


Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp Asn Ile 
        515                 520                 525             


Lys Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val Glu Cys 
    530                 535                 540                 


Glu Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg Ile Thr 
545                 550                 555                 560 


Asn Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys Pro Ala 
                565                 570                 575     


Ala Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile Leu Gly 
            580                 585                 590         


Ile Asp Asp Lys Ile Thr Asp Asp Arg Ile Ser Gly Ile Leu Lys Leu 
        595                 600                 605             


Lys Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile Thr Asn 
    610                 615                 620                 


Asn Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr Ala Asn 
625                 630                 635                 640 


Ala Gln Lys Ile Arg Glu Val Ala Lys Asn Glu Lys Val Val Met Phe 
                645                 650                 655     


Val Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr Lys Ser 
            660                 665                 670         


Cys Val Glu Phe Pro Asp Met Asn Ser Ser Leu Gly Val Lys Arg Ser 
        675                 680                 685             


Glu Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe Lys Asn 
    690                 695                 700                 


Val Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu Arg Ala 
705                 710                 715                 720 


Lys Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu Val Lys 
                725                 730                 735     


Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His Cys Leu 
            740                 745                 750         


Glu Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu Ala Ser 
        755                 760                 765             


Lys Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu Cys Glu 
    770                 775                 780                 


Leu Cys Asp Lys Ser Pro Asn Leu Phe Leu Lys Lys Asn Glu Arg Leu 
785                 790                 795                 800 


Arg Lys Cys Val Glu Val Asp Ile Asn Asn Ala Asp Ser Ser Met Thr 
                805                 810                 815     


Arg Lys Tyr Arg Asn Cys Ile Ala His Leu Thr Val Val Arg Glu Leu 
            820                 825                 830         


Lys Glu Tyr Ile Gly Asp Ile Cys Thr Val Asp Ser Tyr Phe Ser Ile 
        835                 840                 845             


Tyr His Tyr Val Met Gln Arg Cys Ile Thr Lys Arg Glu Asn Asp Thr 
    850                 855                 860                 


Lys Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp Leu Leu Lys Asn His 
865                 870                 875                 880 


Gly Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn Ser Pro Phe Gly Tyr 
                885                 890                 895     


Asn Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu Gln Leu Phe Asp Arg 
            900                 905                 910         


Asn Glu Tyr Leu Thr Glu Lys 
        915                 


<210>  39
<211>  918
<212>  PRT
<213>  Ruminococcus bicirculans


<220>
<221>  misc_feature
<222>  (1)..(918)
<223>  native CasM protein sequence from Ruminococcus bicirculans

<400>  39

Met Ala Lys Lys Asn Lys Met Lys Pro Arg Glu Leu Arg Glu Ala Gln 
1               5                   10                  15      


Lys Lys Ala Arg Gln Leu Lys Ala Ala Glu Ile Asn Asn Asn Ala Val 
            20                  25                  30          


Pro Ala Ile Ala Ala Met Pro Ala Ala Glu Ala Ala Ala Pro Ala Ala 
        35                  40                  45              


Glu Lys Lys Lys Ser Ser Val Lys Ala Ala Gly Met Lys Ser Ile Leu 
    50                  55                  60                  


Val Ser Glu Asn Lys Met Tyr Ile Thr Ser Phe Gly Lys Gly Asn Ser 
65                  70                  75                  80  


Ala Val Leu Glu Tyr Glu Val Asp Asn Asn Asp Tyr Asn Lys Thr Gln 
                85                  90                  95      


Leu Ser Ser Lys Asp Asn Ser Asn Ile Glu Leu Cys Asp Val Gly Lys 
            100                 105                 110         


Val Asn Ile Thr Phe Ser Ser Arg Arg Gly Phe Glu Ser Gly Val Glu 
        115                 120                 125             


Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser Ser Val 
    130                 135                 140                 


Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg Phe Phe 
145                 150                 155                 160 


Gly Lys Asn Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr Asn Ile 
                165                 170                 175     


Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile Val Tyr 
            180                 185                 190         


Ala Leu Asn Asn Met Leu Gly Glu Gly Asp Glu Ser Asn Tyr Asp Phe 
        195                 200                 205             


Met Gly Tyr Leu Ser Thr Phe Asn Thr Tyr Lys Val Phe Thr Asn Pro 
    210                 215                 220                 


Asn Gly Ser Thr Leu Ser Asp Asp Lys Lys Glu Asn Ile Arg Lys Ser 
225                 230                 235                 240 


Leu Ser Lys Phe Asn Ala Leu Leu Lys Thr Lys Arg Leu Gly Tyr Phe 
                245                 250                 255     


Gly Leu Glu Glu Pro Lys Thr Lys Asp Thr Arg Ala Ser Glu Ala Tyr 
            260                 265                 270         


Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln Ile Arg Gln 
        275                 280                 285             


Cys Val Phe His Asp Lys Ser Gly Ala Lys Arg Phe Asp Leu Tyr Ser 
    290                 295                 300                 


Phe Ile Asn Asn Ile Asp Pro Glu Tyr Arg Glu Thr Leu Asp Tyr Leu 
305                 310                 315                 320 


Val Asp Glu Arg Phe Asp Ser Ile Asn Lys Gly Phe Ile Gln Gly Asn 
                325                 330                 335     


Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly Tyr Glu Ala 
            340                 345                 350         


Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu Lys Ser Gln 
        355                 360                 365             


Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys Met Leu Asp 
    370                 375                 380                 


Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser Val Arg Ser 
385                 390                 395                 400 


Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn Tyr Tyr Arg 
                405                 410                 415     


Asn Asp Ile Ala Ala Gly Glu Ser Leu Val Arg Lys Leu Arg Phe Ser 
            420                 425                 430         


Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu Ala Ala Lys 
        435                 440                 445             


Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala Asp His Met 
    450                 455                 460                 


Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met Asp Phe Asp 
465                 470                 475                 480 


Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp Leu Leu Tyr 
                485                 490                 495     


Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp Gly Lys Glu 
            500                 505                 510         


Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp Asn Ile Lys 
        515                 520                 525             


Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val Glu Cys Glu 
    530                 535                 540                 


Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg Ile Thr Asn 
545                 550                 555                 560 


Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys Pro Ala Ala 
                565                 570                 575     


Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile Leu Gly Ile 
            580                 585                 590         


Asp Asp Lys Ile Thr Asp Asp Arg Ile Ser Glu Ile Leu Lys Leu Lys 
        595                 600                 605             


Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile Thr Asn Asn 
    610                 615                 620                 


Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr Ala Asn Ala 
625                 630                 635                 640 


Gln Lys Ile Arg Glu Val Ala Lys Asn Glu Lys Val Val Met Phe Val 
                645                 650                 655     


Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr Lys Ser Cys 
            660                 665                 670         


Val Glu Phe Pro Asp Met Asn Ser Ser Leu Gly Val Lys Arg Ser Glu 
        675                 680                 685             


Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe Lys Asn Val 
    690                 695                 700                 


Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu Arg Ala Lys 
705                 710                 715                 720 


Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu Val Lys Asn 
                725                 730                 735     


Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His Cys Leu Glu 
            740                 745                 750         


Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu Ala Ser Lys 
        755                 760                 765             


Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu Cys Glu Leu 
    770                 775                 780                 


Cys Asp Lys Ser Pro Asn Leu Phe Leu Lys Lys Asn Glu Arg Leu Arg 
785                 790                 795                 800 


Lys Cys Val Glu Val Asp Ile Asn Asn Ala Asp Ser Ser Met Thr Arg 
                805                 810                 815     


Lys Tyr Arg Asn Cys Ile Ala His Leu Thr Val Val Arg Glu Leu Lys 
            820                 825                 830         


Glu Tyr Ile Gly Asp Ile Cys Thr Val Asp Ser Tyr Phe Ser Ile Tyr 
        835                 840                 845             


His Tyr Val Met Gln Arg Cys Ile Thr Lys Arg Glu Asn Asp Thr Lys 
    850                 855                 860                 


Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp Leu Leu Lys Asn His Gly 
865                 870                 875                 880 


Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn Ser Pro Phe Gly Tyr Asn 
                885                 890                 895     


Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu Gln Leu Phe Asp Arg Asn 
            900                 905                 910         


Glu Tyr Leu Thr Glu Lys 
        915             


<210>  40
<211>  922
<212>  PRT
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(922)
<223>  native CasM protein sequence from Ruminococcus sp., isolate 
       2789STDY5608892

<400>  40

Met Ala Lys Lys Asn Lys Met Lys Pro Arg Glu Leu Arg Glu Ala Gln 
1               5                   10                  15      


Lys Lys Ala Arg Gln Leu Lys Ala Ala Glu Ile Asn Asn Asn Ala Ala 
            20                  25                  30          


Pro Ala Ile Ala Ala Met Pro Ala Ala Glu Val Ile Ala Pro Val Ala 
        35                  40                  45              


Glu Lys Lys Lys Ser Ser Val Lys Ala Ala Gly Met Lys Ser Ile Leu 
    50                  55                  60                  


Val Ser Glu Asn Lys Met Tyr Ile Thr Ser Phe Gly Lys Gly Asn Ser 
65                  70                  75                  80  


Ala Val Leu Glu Tyr Glu Val Asp Asn Asn Asp Tyr Asn Lys Thr Gln 
                85                  90                  95      


Leu Ser Ser Lys Asp Asn Ser Asn Ile Glu Leu Gly Asp Val Asn Glu 
            100                 105                 110         


Val Asn Ile Thr Phe Ser Ser Lys His Gly Phe Gly Ser Gly Val Glu 
        115                 120                 125             


Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser Pro Val 
    130                 135                 140                 


Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg Phe Phe 
145                 150                 155                 160 


Gly Lys Thr Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr Asn Ile 
                165                 170                 175     


Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile Val Tyr 
            180                 185                 190         


Ala Leu Asn Asn Met Leu Gly Ile Lys Asp Ser Glu Ser Tyr Asp Asp 
        195                 200                 205             


Phe Met Gly Tyr Leu Ser Ala Arg Asn Thr Tyr Glu Val Phe Thr His 
    210                 215                 220                 


Pro Asp Lys Ser Asn Leu Ser Asp Lys Val Lys Gly Asn Ile Lys Lys 
225                 230                 235                 240 


Ser Leu Ser Lys Phe Asn Asp Leu Leu Lys Thr Lys Arg Leu Gly Tyr 
                245                 250                 255     


Phe Gly Leu Glu Glu Pro Lys Thr Lys Asp Thr Arg Ala Ser Glu Ala 
            260                 265                 270         


Tyr Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln Ile Arg 
        275                 280                 285             


Gln Cys Val Phe His Asp Lys Ser Gly Ala Lys Arg Phe Asp Leu Tyr 
    290                 295                 300                 


Ser Phe Ile Asn Asn Ile Asp Pro Glu Tyr Arg Asp Thr Leu Asp Tyr 
305                 310                 315                 320 


Leu Val Glu Glu Arg Leu Lys Ser Ile Asn Lys Asp Phe Ile Glu Gly 
                325                 330                 335     


Asn Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly Tyr Glu 
            340                 345                 350         


Ala Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu Lys Ser 
        355                 360                 365             


Gln Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys Met Leu 
    370                 375                 380                 


Glu Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser Val Arg 
385                 390                 395                 400 


Ser Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn Tyr Tyr 
                405                 410                 415     


Arg Asn Asp Val Ala Ala Gly Glu Ala Leu Val Arg Lys Leu Arg Phe 
            420                 425                 430         


Ser Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu Ala Ala 
        435                 440                 445             


Lys Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala Asp His 
    450                 455                 460                 


Met Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met Asp Phe 
465                 470                 475                 480 


Asp Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp Leu Leu 
                485                 490                 495     


Tyr Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp Gly Lys 
            500                 505                 510         


Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp Asn Ile 
        515                 520                 525             


Lys Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val Glu Cys 
    530                 535                 540                 


Glu Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg Ile Thr 
545                 550                 555                 560 


Asn Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys Pro Ala 
                565                 570                 575     


Ala Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile Leu Gly 
            580                 585                 590         


Ile Asp Asp Asn Ile Thr Asp Asp Arg Ile Ser Glu Ile Leu Lys Leu 
        595                 600                 605             


Lys Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile Thr Asn 
    610                 615                 620                 


Asn Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr Ala Asn 
625                 630                 635                 640 


Ala Gln Lys Ile Arg Glu Val Ala Lys Asn Glu Lys Val Val Met Phe 
                645                 650                 655     


Val Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr Lys Ser 
            660                 665                 670         


Cys Val Glu Phe Pro Asp Met Asn Ser Ser Leu Glu Ala Lys Arg Ser 
        675                 680                 685             


Glu Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe Lys Asn 
    690                 695                 700                 


Val Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu Arg Ala 
705                 710                 715                 720 


Lys Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu Val Lys 
                725                 730                 735     


Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His Cys Leu 
            740                 745                 750         


Glu Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu Ala Ser 
        755                 760                 765             


Lys Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu Cys Glu 
    770                 775                 780                 


Leu Cys Asp Asp Arg Asn Glu Ser Ser Asn Leu Phe Leu Lys Lys Asn 
785                 790                 795                 800 


Lys Arg Leu Arg Lys Cys Val Glu Val Asp Ile Asn Asn Ala Asp Ser 
                805                 810                 815     


Ser Met Thr Arg Lys Tyr Arg Asn Cys Ile Ala His Leu Thr Val Val 
            820                 825                 830         


Arg Glu Leu Lys Glu Tyr Ile Gly Asp Ile Arg Thr Val Asp Ser Tyr 
        835                 840                 845             


Phe Ser Ile Tyr His Tyr Val Met Gln Arg Cys Ile Thr Lys Arg Gly 
    850                 855                 860                 


Asp Asp Thr Lys Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp Leu Leu 
865                 870                 875                 880 


Lys Asn His Gly Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn Ser Pro 
                885                 890                 895     


Phe Gly Tyr Asn Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu Gln Leu 
            900                 905                 910         


Phe Asp Arg Asn Glu Tyr Leu Thr Glu Lys 
        915                 920         


<210>  41
<211>  922
<212>  PRT
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(922)
<223>  native CasM protein sequence from Ruminococcus sp. CAG:57

<400>  41

Met Ala Lys Lys Asn Lys Met Lys Pro Arg Glu Leu Arg Glu Ala Gln 
1               5                   10                  15      


Lys Lys Ala Arg Gln Leu Lys Ala Ala Glu Ile Asn Asn Asn Ala Ala 
            20                  25                  30          


Pro Ala Ile Ala Ala Met Pro Ala Ala Glu Val Ile Ala Pro Val Ala 
        35                  40                  45              


Glu Lys Lys Lys Ser Ser Val Lys Ala Ala Gly Met Lys Ser Ile Leu 
    50                  55                  60                  


Val Ser Glu Asn Lys Met Tyr Ile Thr Ser Phe Gly Lys Gly Asn Ser 
65                  70                  75                  80  


Ala Val Leu Glu Tyr Glu Val Asp Asn Asn Asp Tyr Asn Lys Thr Gln 
                85                  90                  95      


Leu Ser Ser Lys Asp Asn Ser Asn Ile Glu Leu Gly Asp Val Asn Glu 
            100                 105                 110         


Val Asn Ile Thr Phe Ser Ser Lys His Gly Phe Gly Ser Gly Val Glu 
        115                 120                 125             


Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser Pro Val 
    130                 135                 140                 


Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg Phe Phe 
145                 150                 155                 160 


Gly Lys Thr Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr Asn Ile 
                165                 170                 175     


Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile Val Tyr 
            180                 185                 190         


Ala Leu Asn Asn Met Leu Gly Ile Lys Asp Ser Glu Ser Tyr Asp Asp 
        195                 200                 205             


Phe Met Gly Tyr Leu Ser Ala Arg Asn Thr Tyr Glu Val Phe Thr His 
    210                 215                 220                 


Pro Asp Lys Ser Asn Leu Ser Asp Lys Val Lys Gly Asn Ile Lys Lys 
225                 230                 235                 240 


Ser Leu Ser Lys Phe Asn Asp Leu Leu Lys Thr Lys Arg Leu Gly Tyr 
                245                 250                 255     


Phe Gly Leu Glu Glu Pro Lys Thr Lys Asp Thr Arg Ala Ser Glu Ala 
            260                 265                 270         


Tyr Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln Ile Arg 
        275                 280                 285             


Gln Cys Val Phe His Asp Lys Ser Gly Ala Lys Arg Phe Asp Leu Tyr 
    290                 295                 300                 


Ser Phe Ile Asn Asn Ile Asp Pro Glu Tyr Arg Asp Thr Leu Asp Tyr 
305                 310                 315                 320 


Leu Val Glu Glu Arg Leu Lys Ser Ile Asn Lys Asp Phe Ile Glu Gly 
                325                 330                 335     


Asn Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly Tyr Glu 
            340                 345                 350         


Ala Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu Lys Ser 
        355                 360                 365             


Gln Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys Met Leu 
    370                 375                 380                 


Glu Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser Val Arg 
385                 390                 395                 400 


Ser Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn Tyr Tyr 
                405                 410                 415     


Arg Asn Asp Val Ala Ala Gly Glu Ala Leu Val Arg Lys Leu Arg Phe 
            420                 425                 430         


Ser Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu Ala Ala 
        435                 440                 445             


Lys Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala Asp His 
    450                 455                 460                 


Met Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met Asp Phe 
465                 470                 475                 480 


Asp Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp Leu Leu 
                485                 490                 495     


Tyr Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp Gly Lys 
            500                 505                 510         


Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp Asn Ile 
        515                 520                 525             


Lys Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val Glu Cys 
    530                 535                 540                 


Glu Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg Ile Thr 
545                 550                 555                 560 


Asn Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys Pro Ala 
                565                 570                 575     


Ala Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile Leu Gly 
            580                 585                 590         


Ile Asp Asp Asn Ile Thr Asp Asp Arg Ile Ser Glu Ile Leu Lys Leu 
        595                 600                 605             


Lys Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile Thr Asn 
    610                 615                 620                 


Asn Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr Ala Asn 
625                 630                 635                 640 


Ala Gln Lys Ile Arg Glu Val Ala Lys Asp Glu Lys Val Val Met Phe 
                645                 650                 655     


Val Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr Lys Ser 
            660                 665                 670         


Cys Val Glu Phe Pro Asp Met Asn Ser Ser Leu Glu Ala Lys Arg Ser 
        675                 680                 685             


Glu Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe Lys Asn 
    690                 695                 700                 


Val Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu Arg Ala 
705                 710                 715                 720 


Lys Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu Val Lys 
                725                 730                 735     


Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His Cys Leu 
            740                 745                 750         


Glu Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu Ala Ser 
        755                 760                 765             


Lys Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu Cys Glu 
    770                 775                 780                 


Leu Cys Asp Asp Arg Asn Glu Ser Ser Asn Leu Phe Leu Lys Lys Asn 
785                 790                 795                 800 


Lys Arg Leu Arg Lys Cys Val Glu Val Asp Ile Asn Asn Ala Asp Ser 
                805                 810                 815     


Ser Met Thr Arg Lys Tyr Arg Asn Cys Ile Ala His Leu Thr Val Val 
            820                 825                 830         


Arg Glu Leu Lys Glu Tyr Ile Gly Asp Ile Arg Thr Val Asp Ser Tyr 
        835                 840                 845             


Phe Ser Ile Tyr His Tyr Val Met Gln Arg Cys Ile Thr Lys Arg Gly 
    850                 855                 860                 


Asp Asp Thr Lys Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp Leu Leu 
865                 870                 875                 880 


Lys Asn His Gly Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn Ser Pro 
                885                 890                 895     


Phe Gly Tyr Asn Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu Gln Leu 
            900                 905                 910         


Phe Asp Arg Asn Glu Tyr Leu Thr Glu Lys 
        915                 920         


<210>  42
<211>  933
<212>  PRT
<213>  Ruminococcus flavefaciens


<220>
<221>  misc_feature
<222>  (1)..(933)
<223>  native CasM protein sequence from Ruminococcus flavefaciens FD-1

<400>  42

Met Lys Lys Lys Met Ser Leu Arg Glu Lys Arg Glu Ala Glu Lys Gln 
1               5                   10                  15      


Ala Lys Lys Ala Ala Tyr Ser Ala Ala Ser Lys Asn Thr Asp Ser Lys 
            20                  25                  30          


Pro Ala Glu Lys Lys Ala Glu Thr Pro Lys Pro Ala Glu Ile Ile Ser 
        35                  40                  45              


Asp Asn Ser Arg Asn Lys Thr Ala Val Lys Ala Ala Gly Leu Lys Ser 
    50                  55                  60                  


Thr Ile Ile Ser Gly Asp Lys Leu Tyr Met Thr Ser Phe Gly Lys Gly 
65                  70                  75                  80  


Asn Ala Ala Val Ile Glu Gln Lys Ile Asp Ile Asn Asp Tyr Ser Phe 
                85                  90                  95      


Ser Ala Met Lys Asp Thr Pro Ser Leu Glu Val Asp Lys Ala Glu Ser 
            100                 105                 110         


Lys Glu Ile Ser Phe Ser Ser His His Pro Phe Val Lys Asn Asp Lys 
        115                 120                 125             


Leu Thr Thr Tyr Asn Pro Leu Tyr Gly Gly Lys Asp Asn Pro Glu Lys 
    130                 135                 140                 


Pro Val Gly Arg Asp Met Leu Gly Leu Lys Asp Lys Leu Glu Glu Arg 
145                 150                 155                 160 


Tyr Phe Gly Cys Thr Phe Asn Asp Asn Leu His Ile Gln Ile Ile Tyr 
                165                 170                 175     


Asn Ile Leu Asp Ile Glu Lys Ile Leu Ala Val His Ser Ala Asn Ile 
            180                 185                 190         


Thr Thr Ala Leu Asp His Met Val Asp Glu Asp Asp Glu Lys Tyr Leu 
        195                 200                 205             


Asn Ser Asp Tyr Ile Gly Tyr Met Asn Thr Ile Asn Thr Tyr Asp Val 
    210                 215                 220                 


Phe Met Asp Pro Ser Lys Asn Ser Ser Leu Ser Pro Lys Asp Arg Lys 
225                 230                 235                 240 


Asn Ile Asp Asn Ser Arg Ala Lys Phe Glu Lys Leu Leu Ser Thr Lys 
                245                 250                 255     


Arg Leu Gly Tyr Phe Gly Phe Asp Tyr Asp Ala Asn Gly Lys Asp Lys 
            260                 265                 270         


Lys Lys Asn Glu Glu Ile Lys Lys Arg Leu Tyr His Leu Thr Ala Phe 
        275                 280                 285             


Ala Gly Gln Leu Arg Gln Trp Ser Phe His Ser Ala Gly Asn Tyr Pro 
    290                 295                 300                 


Arg Thr Trp Leu Tyr Lys Leu Asp Ser Leu Asp Lys Glu Tyr Leu Asp 
305                 310                 315                 320 


Thr Leu Asp His Tyr Phe Asp Lys Arg Phe Asn Asp Ile Asn Asp Asp 
                325                 330                 335     


Phe Val Thr Lys Asn Ala Thr Asn Leu Tyr Ile Leu Lys Glu Val Phe 
            340                 345                 350         


Pro Glu Ala Asn Phe Lys Asp Ile Ala Asp Leu Tyr Tyr Asp Phe Ile 
        355                 360                 365             


Val Ile Lys Ser His Lys Asn Met Gly Phe Ser Ile Lys Lys Leu Arg 
    370                 375                 380                 


Glu Lys Met Leu Glu Cys Asp Gly Ala Asp Arg Ile Lys Glu Gln Asp 
385                 390                 395                 400 


Met Asp Ser Val Arg Ser Lys Leu Tyr Lys Leu Ile Asp Phe Cys Ile 
                405                 410                 415     


Phe Lys Tyr Tyr His Glu Phe Pro Glu Leu Ser Glu Lys Asn Val Asp 
            420                 425                 430         


Ile Leu Arg Ala Ala Val Ser Asp Thr Lys Lys Asp Asn Leu Tyr Ser 
        435                 440                 445             


Asp Glu Ala Ala Arg Leu Trp Ser Ile Phe Lys Glu Lys Phe Leu Gly 
    450                 455                 460                 


Phe Cys Asp Lys Ile Val Val Trp Val Thr Gly Glu His Glu Lys Asp 
465                 470                 475                 480 


Ile Thr Ser Val Ile Asp Lys Asp Ala Tyr Arg Asn Arg Ser Asn Val 
                485                 490                 495     


Ser Tyr Phe Ser Lys Leu Met Tyr Ala Met Cys Phe Phe Leu Asp Gly 
            500                 505                 510         


Lys Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Asn Lys Phe Asp Asn 
        515                 520                 525             


Ile Ala Asn Gln Ile Lys Thr Ala Lys Glu Leu Gly Ile Asn Thr Ala 
    530                 535                 540                 


Phe Val Lys Asn Tyr Asp Phe Phe Asn His Ser Glu Lys Tyr Val Asp 
545                 550                 555                 560 


Glu Leu Asn Ile Val Lys Asn Ile Ala Arg Met Lys Lys Pro Ser Ser 
                565                 570                 575     


Asn Ala Lys Lys Ala Met Tyr His Asp Ala Leu Thr Ile Leu Gly Ile 
            580                 585                 590         


Pro Glu Asp Met Asp Glu Lys Ala Leu Asp Glu Glu Leu Asp Leu Ile 
        595                 600                 605             


Leu Glu Lys Lys Thr Asp Pro Val Thr Gly Lys Pro Leu Lys Gly Lys 
    610                 615                 620                 


Asn Pro Leu Arg Asn Phe Ile Ala Asn Asn Val Ile Glu Asn Ser Arg 
625                 630                 635                 640 


Phe Ile Tyr Leu Ile Lys Phe Cys Asn Pro Glu Asn Val Arg Lys Ile 
                645                 650                 655     


Val Asn Asn Thr Lys Val Thr Glu Phe Val Leu Lys Arg Ile Pro Asp 
            660                 665                 670         


Ala Gln Ile Glu Arg Tyr Tyr Lys Ser Cys Thr Asp Ser Glu Met Asn 
        675                 680                 685             


Pro Pro Thr Glu Lys Lys Ile Thr Glu Leu Ala Gly Lys Leu Lys Asp 
    690                 695                 700                 


Met Asn Phe Gly Asn Phe Arg Asn Val Arg Gln Ser Ala Lys Glu Asn 
705                 710                 715                 720 


Met Glu Lys Glu Arg Phe Lys Ala Val Ile Gly Leu Tyr Leu Thr Val 
                725                 730                 735     


Val Tyr Arg Val Val Lys Asn Leu Val Asp Val Asn Ser Arg Tyr Ile 
            740                 745                 750         


Met Ala Phe His Ser Leu Glu Arg Asp Ser Gln Leu Tyr Asn Val Ser 
        755                 760                 765             


Val Asp Asn Asp Tyr Leu Ala Leu Thr Asp Thr Leu Val Lys Glu Gly 
    770                 775                 780                 


Asp Asn Ser Arg Ser Arg Tyr Leu Ala Gly Asn Lys Arg Leu Arg Asp 
785                 790                 795                 800 


Cys Val Lys Gln Asp Ile Asp Asn Ala Lys Lys Trp Phe Val Ser Asp 
                805                 810                 815     


Lys Tyr Asn Ser Ile Thr Lys Tyr Arg Asn Asn Val Ala His Leu Thr 
            820                 825                 830         


Ala Val Arg Asn Cys Ala Glu Phe Ile Gly Asp Ile Thr Lys Ile Asp 
        835                 840                 845             


Ser Tyr Phe Ala Leu Tyr His Tyr Leu Ile Gln Arg Gln Leu Ala Lys 
    850                 855                 860                 


Gly Leu Asp His Glu Arg Ser Gly Phe Asp Arg Asn Tyr Pro Gln Tyr 
865                 870                 875                 880 


Ala Pro Leu Phe Lys Trp His Thr Tyr Val Lys Asp Val Val Lys Ala 
                885                 890                 895     


Leu Asn Ala Pro Phe Gly Tyr Asn Ile Pro Arg Phe Lys Asn Leu Ser 
            900                 905                 910         


Ile Asp Ala Leu Phe Asp Arg Asn Glu Ile Lys Lys Asn Asp Gly Glu 
        915                 920                 925             


Lys Lys Ser Asp Asp 
    930             


<210>  43
<211>  944
<212>  PRT
<213>  Ruminococcus albus


<220>
<221>  misc_feature
<222>  (1)..(944)
<223>  native CasM protein sequence from Ruminococcus albus strain KH2T6

<400>  43

Met Ala Lys Lys Ser Lys Gly Met Ser Leu Arg Glu Lys Arg Glu Leu 
1               5                   10                  15      


Glu Lys Gln Lys Arg Ile Gln Lys Ala Ala Val Asn Ser Val Asn Asp 
            20                  25                  30          


Thr Pro Glu Lys Thr Glu Glu Ala Asn Val Val Ser Val Asn Val Arg 
        35                  40                  45              


Thr Ser Ala Glu Asn Lys His Ser Lys Lys Ser Ala Ala Lys Ala Leu 
    50                  55                  60                  


Gly Leu Lys Ser Gly Leu Val Ile Gly Asp Glu Leu Tyr Leu Thr Ser 
65                  70                  75                  80  


Phe Gly Arg Gly Asn Glu Ala Lys Leu Glu Lys Lys Ile Ser Gly Asp 
                85                  90                  95      


Thr Val Glu Lys Leu Gly Ile Gly Ala Phe Glu Val Ala Glu Arg Asp 
            100                 105                 110         


Glu Ser Thr Leu Thr Leu Glu Ser Gly Arg Ile Lys Asp Lys Thr Ala 
        115                 120                 125             


Arg Pro Lys Asp Pro Arg His Ile Thr Val Asp Thr Gln Gly Lys Phe 
    130                 135                 140                 


Lys Glu Asp Met Leu Gly Ile Arg Ser Val Leu Glu Lys Lys Ile Phe 
145                 150                 155                 160 


Gly Lys Thr Phe Asp Asp Asn Ile His Val Gln Leu Ala Tyr Asn Ile 
                165                 170                 175     


Leu Asp Val Glu Lys Ile Met Ala Gln Tyr Val Ser Asp Ile Val Tyr 
            180                 185                 190         


Met Leu His Asn Thr Asp Lys Thr Glu Arg Asn Asp Asn Leu Met Gly 
        195                 200                 205             


Tyr Met Ser Ile Arg Asn Thr Tyr Lys Thr Phe Cys Asp Thr Ser Asn 
    210                 215                 220                 


Leu Pro Asp Asp Thr Lys Gln Lys Val Glu Asn Gln Lys Arg Glu Phe 
225                 230                 235                 240 


Asp Lys Ile Ile Lys Ser Gly Arg Leu Gly Tyr Phe Gly Glu Ala Phe 
                245                 250                 255     


Met Val Asn Ser Gly Asn Ser Thr Lys Leu Arg Pro Glu Lys Glu Ile 
            260                 265                 270         


Tyr His Ile Phe Ala Leu Met Ala Ser Leu Arg Gln Ser Tyr Phe His 
        275                 280                 285             


Gly Tyr Val Lys Asp Thr Asp Tyr Gln Gly Thr Thr Trp Ala Tyr Thr 
    290                 295                 300                 


Leu Glu Asp Lys Leu Lys Gly Pro Ser His Glu Phe Arg Glu Thr Ile 
305                 310                 315                 320 


Asp Lys Ile Phe Asp Glu Gly Phe Ser Lys Ile Ser Lys Asp Phe Gly 
                325                 330                 335     


Lys Met Asn Lys Val Asn Leu Gln Ile Leu Glu Gln Met Ile Gly Glu 
            340                 345                 350         


Leu Tyr Gly Ser Ile Glu Arg Gln Asn Leu Thr Cys Asp Tyr Tyr Asp 
        355                 360                 365             


Phe Ile Gln Leu Lys Lys His Lys Tyr Leu Gly Phe Ser Ile Lys Arg 
    370                 375                 380                 


Leu Arg Glu Thr Met Leu Glu Thr Thr Pro Ala Glu Cys Tyr Lys Ala 
385                 390                 395                 400 


Glu Cys Tyr Asn Ser Glu Arg Gln Lys Leu Tyr Lys Leu Ile Asp Phe 
                405                 410                 415     


Leu Ile Tyr Asp Leu Tyr Tyr Asn Arg Lys Pro Ala Arg Ile Glu Glu 
            420                 425                 430         


Ile Val Asp Lys Leu Arg Glu Ser Val Asn Asp Glu Glu Lys Glu Ser 
        435                 440                 445             


Ile Tyr Ser Val Glu Ala Lys Tyr Val Tyr Glu Ser Leu Ser Lys Val 
    450                 455                 460                 


Leu Asp Lys Ser Leu Lys Asn Ser Val Ser Gly Glu Thr Ile Lys Asp 
465                 470                 475                 480 


Leu Gln Lys Arg Tyr Asp Asp Glu Thr Ala Asn Arg Ile Trp Asp Ile 
                485                 490                 495     


Ser Gln His Ser Ile Ser Gly Asn Val Asn Cys Phe Cys Lys Leu Ile 
            500                 505                 510         


Tyr Ile Met Thr Leu Met Leu Asp Gly Lys Glu Ile Asn Asp Leu Leu 
        515                 520                 525             


Thr Thr Leu Val Asn Lys Phe Asp Asn Ile Ala Ser Phe Ile Asp Val 
    530                 535                 540                 


Met Asp Glu Leu Gly Leu Glu His Ser Phe Thr Asp Asn Tyr Lys Met 
545                 550                 555                 560 


Phe Ala Asp Ser Lys Ala Ile Cys Leu Asp Leu Gln Phe Ile Asn Ser 
                565                 570                 575     


Phe Ala Arg Met Ser Lys Ile Asp Asp Glu Lys Ser Lys Arg Gln Leu 
            580                 585                 590         


Phe Arg Asp Ala Leu Val Ile Leu Asp Ile Gly Asn Lys Asp Glu Thr 
        595                 600                 605             


Trp Ile Asn Asn Tyr Leu Asp Ser Asp Ile Phe Lys Leu Asp Lys Glu 
    610                 615                 620                 


Gly Asn Lys Leu Lys Gly Ala Arg His Asp Phe Arg Asn Phe Ile Ala 
625                 630                 635                 640 


Asn Asn Val Ile Lys Ser Ser Arg Phe Lys Tyr Leu Val Lys Tyr Ser 
                645                 650                 655     


Ser Ala Asp Gly Met Ile Lys Leu Lys Thr Asn Glu Lys Leu Ile Gly 
            660                 665                 670         


Phe Val Leu Asp Lys Leu Pro Glu Thr Gln Ile Asp Arg Tyr Tyr Glu 
        675                 680                 685             


Ser Cys Gly Leu Asp Asn Ala Val Val Asp Lys Lys Val Arg Ile Glu 
    690                 695                 700                 


Lys Leu Ser Gly Leu Ile Arg Asp Met Lys Phe Asp Asp Phe Ser Gly 
705                 710                 715                 720 


Val Lys Thr Ser Asn Lys Ala Gly Asp Asn Asp Lys Gln Asp Lys Ala 
                725                 730                 735     


Lys Tyr Gln Ala Ile Ile Ser Leu Tyr Leu Met Val Leu Tyr Gln Ile 
            740                 745                 750         


Val Lys Asn Met Ile Tyr Val Asn Ser Arg Tyr Val Ile Ala Phe His 
        755                 760                 765             


Cys Leu Glu Arg Asp Phe Gly Met Tyr Gly Lys Asp Phe Gly Lys Tyr 
    770                 775                 780                 


Tyr Gln Gly Cys Arg Lys Leu Thr Asp His Phe Ile Glu Glu Lys Tyr 
785                 790                 795                 800 


Met Lys Glu Gly Lys Leu Gly Cys Asn Lys Lys Val Gly Arg Tyr Leu 
                805                 810                 815     


Lys Asn Asn Ile Ser Cys Cys Thr Asp Gly Leu Ile Asn Thr Tyr Arg 
            820                 825                 830         


Asn Gln Val Asp His Phe Ala Val Val Arg Lys Ile Gly Asn Tyr Ala 
        835                 840                 845             


Ala Tyr Ile Lys Ser Ile Gly Ser Trp Phe Glu Leu Tyr His Tyr Val 
    850                 855                 860                 


Ile Gln Arg Ile Val Phe Asp Glu Tyr Arg Phe Ala Leu Asn Asn Thr 
865                 870                 875                 880 


Glu Ser Asn Tyr Lys Asn Ser Ile Ile Lys His His Thr Tyr Cys Lys 
                885                 890                 895     


Asp Met Val Lys Ala Leu Asn Thr Pro Phe Gly Tyr Asp Leu Pro Arg 
            900                 905                 910         


Tyr Lys Asn Leu Ser Ile Gly Asp Leu Phe Asp Arg Asn Asn Tyr Leu 
        915                 920                 925             


Asn Lys Thr Lys Glu Ser Ile Asp Ala Asn Ser Ser Ile Asp Ser Gln 
    930                 935                 940                 


<210>  44
<211>  967
<212>  PRT
<213>  Ruminococcus flavefaciens


<220>
<221>  misc_feature
<222>  (1)..(967)
<223>  native CasM protein sequence from Ruminococcus flavefaciens 
       strain XPD3002

<400>  44

Met Ile Glu Lys Lys Lys Ser Phe Ala Lys Gly Met Gly Val Lys Ser 
1               5                   10                  15      


Thr Leu Val Ser Gly Ser Lys Val Tyr Met Thr Thr Phe Ala Glu Gly 
            20                  25                  30          


Ser Asp Ala Arg Leu Glu Lys Ile Val Glu Gly Asp Ser Ile Arg Ser 
        35                  40                  45              


Val Asn Glu Gly Glu Ala Phe Ser Ala Glu Met Ala Asp Lys Asn Ala 
    50                  55                  60                  


Gly Tyr Lys Ile Gly Asn Ala Lys Phe Ser His Pro Lys Gly Tyr Ala 
65                  70                  75                  80  


Val Val Ala Asn Asn Pro Leu Tyr Thr Gly Pro Val Gln Gln Asp Met 
                85                  90                  95      


Leu Gly Leu Lys Glu Thr Leu Glu Lys Arg Tyr Phe Gly Glu Ser Ala 
            100                 105                 110         


Asp Gly Asn Asp Asn Ile Cys Ile Gln Val Ile His Asn Ile Leu Asp 
        115                 120                 125             


Ile Glu Lys Ile Leu Ala Glu Tyr Ile Thr Asn Ala Ala Tyr Ala Val 
    130                 135                 140                 


Asn Asn Ile Ser Gly Leu Asp Lys Asp Ile Ile Gly Phe Gly Lys Phe 
145                 150                 155                 160 


Ser Thr Val Tyr Thr Tyr Asp Glu Phe Lys Asp Pro Glu His His Arg 
                165                 170                 175     


Ala Ala Phe Asn Asn Asn Asp Lys Leu Ile Asn Ala Ile Lys Ala Gln 
            180                 185                 190         


Tyr Asp Glu Phe Asp Asn Phe Leu Asp Asn Pro Arg Leu Gly Tyr Phe 
        195                 200                 205             


Gly Gln Ala Phe Phe Ser Lys Glu Gly Arg Asn Tyr Ile Ile Asn Tyr 
    210                 215                 220                 


Gly Asn Glu Cys Tyr Asp Ile Leu Ala Leu Leu Ser Gly Leu Arg His 
225                 230                 235                 240 


Trp Val Val His Asn Asn Glu Glu Glu Ser Arg Ile Ser Arg Thr Trp 
                245                 250                 255     


Leu Tyr Asn Leu Asp Lys Asn Leu Asp Asn Glu Tyr Ile Ser Thr Leu 
            260                 265                 270         


Asn Tyr Leu Tyr Asp Arg Ile Thr Asn Glu Leu Thr Asn Ser Phe Ser 
        275                 280                 285             


Lys Asn Ser Ala Ala Asn Val Asn Tyr Ile Ala Glu Thr Leu Gly Ile 
    290                 295                 300                 


Asn Pro Ala Glu Phe Ala Glu Gln Tyr Phe Arg Phe Ser Ile Met Lys 
305                 310                 315                 320 


Glu Gln Lys Asn Leu Gly Phe Asn Ile Thr Lys Leu Arg Glu Val Met 
                325                 330                 335     


Leu Asp Arg Lys Asp Met Ser Glu Ile Arg Lys Asn His Lys Val Phe 
            340                 345                 350         


Asp Ser Ile Arg Thr Lys Val Tyr Thr Met Met Asp Phe Val Ile Tyr 
        355                 360                 365             


Arg Tyr Tyr Ile Glu Glu Asp Ala Lys Val Ala Ala Ala Asn Lys Ser 
    370                 375                 380                 


Leu Pro Asp Asn Glu Lys Ser Leu Ser Glu Lys Asp Ile Phe Val Ile 
385                 390                 395                 400 


Asn Leu Arg Gly Ser Phe Asn Asp Asp Gln Lys Asp Ala Leu Tyr Tyr 
                405                 410                 415     


Asp Glu Ala Asn Arg Ile Trp Arg Lys Leu Glu Asn Ile Met His Asn 
            420                 425                 430         


Ile Lys Glu Phe Arg Gly Asn Lys Thr Arg Glu Tyr Lys Lys Lys Asp 
        435                 440                 445             


Ala Pro Arg Leu Pro Arg Ile Leu Pro Ala Gly Arg Asp Val Ser Ala 
    450                 455                 460                 


Phe Ser Lys Leu Met Tyr Ala Leu Thr Met Phe Leu Asp Gly Lys Glu 
465                 470                 475                 480 


Ile Asn Asp Leu Leu Thr Thr Leu Ile Asn Lys Phe Asp Asn Ile Gln 
                485                 490                 495     


Ser Phe Leu Lys Val Met Pro Leu Ile Gly Val Asn Ala Lys Phe Val 
            500                 505                 510         


Glu Glu Tyr Ala Phe Phe Lys Asp Ser Ala Lys Ile Ala Asp Glu Leu 
        515                 520                 525             


Arg Leu Ile Lys Ser Phe Ala Arg Met Gly Glu Pro Ile Ala Asp Ala 
    530                 535                 540                 


Arg Arg Ala Met Tyr Ile Asp Ala Ile Arg Ile Leu Gly Thr Asn Leu 
545                 550                 555                 560 


Ser Tyr Asp Glu Leu Lys Ala Leu Ala Asp Thr Phe Ser Leu Asp Glu 
                565                 570                 575     


Asn Gly Asn Lys Leu Lys Lys Gly Lys His Gly Met Arg Asn Phe Ile 
            580                 585                 590         


Ile Asn Asn Val Ile Ser Asn Lys Arg Phe His Tyr Leu Ile Arg Tyr 
        595                 600                 605             


Gly Asp Pro Ala His Leu His Glu Ile Ala Lys Asn Glu Ala Val Val 
    610                 615                 620                 


Lys Phe Val Leu Gly Arg Ile Ala Asp Ile Gln Lys Lys Gln Gly Gln 
625                 630                 635                 640 


Asn Gly Lys Asn Gln Ile Asp Arg Tyr Tyr Glu Thr Cys Ile Gly Lys 
                645                 650                 655     


Asp Lys Gly Lys Ser Val Ser Glu Lys Val Asp Ala Leu Thr Lys Ile 
            660                 665                 670         


Ile Thr Gly Met Asn Tyr Asp Gln Phe Asp Lys Lys Arg Ser Val Ile 
        675                 680                 685             


Glu Asp Thr Gly Arg Glu Asn Ala Glu Arg Glu Lys Phe Lys Lys Ile 
    690                 695                 700                 


Ile Ser Leu Tyr Leu Thr Val Ile Tyr His Ile Leu Lys Asn Ile Val 
705                 710                 715                 720 


Asn Ile Asn Ala Arg Tyr Val Ile Gly Phe His Cys Val Glu Arg Asp 
                725                 730                 735     


Ala Gln Leu Tyr Lys Glu Lys Gly Tyr Asp Ile Asn Leu Lys Lys Leu 
            740                 745                 750         


Glu Glu Lys Gly Phe Ser Ser Val Thr Lys Leu Cys Ala Gly Ile Asp 
        755                 760                 765             


Glu Thr Ala Pro Asp Lys Arg Lys Asp Val Glu Lys Glu Met Ala Glu 
    770                 775                 780                 


Arg Ala Lys Glu Ser Ile Asp Ser Leu Glu Ser Ala Asn Pro Lys Leu 
785                 790                 795                 800 


Tyr Ala Asn Tyr Ile Lys Tyr Ser Asp Glu Lys Lys Ala Glu Glu Phe 
                805                 810                 815     


Thr Arg Gln Ile Asn Arg Glu Lys Ala Lys Thr Ala Leu Asn Ala Tyr 
            820                 825                 830         


Leu Arg Asn Thr Lys Trp Asn Val Ile Ile Arg Glu Asp Leu Leu Arg 
        835                 840                 845             


Ile Asp Asn Lys Thr Cys Thr Leu Phe Arg Asn Lys Ala Val His Leu 
    850                 855                 860                 


Glu Val Ala Arg Tyr Val His Ala Tyr Ile Asn Asp Ile Ala Glu Val 
865                 870                 875                 880 


Asn Ser Tyr Phe Gln Leu Tyr His Tyr Ile Met Gln Arg Ile Ile Met 
                885                 890                 895     


Asn Glu Arg Tyr Glu Lys Ser Ser Gly Lys Val Ser Glu Tyr Phe Asp 
            900                 905                 910         


Ala Val Asn Asp Glu Lys Lys Tyr Asn Asp Arg Leu Leu Lys Leu Leu 
        915                 920                 925             


Cys Val Pro Phe Gly Tyr Cys Ile Pro Arg Phe Lys Asn Leu Ser Ile 
    930                 935                 940                 


Glu Ala Leu Phe Asp Arg Asn Glu Ala Ala Lys Phe Asp Lys Glu Lys 
945                 950                 955                 960 


Lys Lys Val Ser Gly Asn Ser 
                965         


<210>  45
<211>  796
<212>  PRT
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(796)
<223>  native CasM protein sequence from Ruminococcus sp., isolate 
       2789STDY5834894

<400>  45

Met Glu Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser 
1               5                   10                  15      


Ser Val Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg 
            20                  25                  30          


Phe Phe Gly Lys Thr Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr 
        35                  40                  45              


Asn Ile Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile 
    50                  55                  60                  


Val Tyr Ala Leu Asn Asn Met Leu Gly Val Lys Gly Ser Glu Ser Tyr 
65                  70                  75                  80  


Asp Asp Phe Met Gly Tyr Leu Ser Ala Gln Asn Thr Tyr Tyr Ile Phe 
                85                  90                  95      


Thr His Pro Asp Lys Ser Asn Leu Ser Asp Lys Val Lys Gly Asn Ile 
            100                 105                 110         


Lys Lys Ser Leu Ser Lys Phe Asn Asp Leu Leu Lys Thr Lys Arg Leu 
        115                 120                 125             


Gly Tyr Phe Gly Leu Glu Glu Pro Lys Thr Lys Asp Lys Arg Val Ser 
    130                 135                 140                 


Glu Ala Tyr Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln 
145                 150                 155                 160 


Ile Arg Gln Ser Val Phe His Asp Lys Ser Asn Glu Leu Asp Glu Tyr 
                165                 170                 175     


Leu Tyr Ser Phe Ile Asp Ile Ile Asp Ser Glu Tyr Arg Asp Thr Leu 
            180                 185                 190         


Asp Tyr Leu Val Asp Glu Arg Phe Asp Ser Ile Asn Lys Gly Phe Val 
        195                 200                 205             


Gln Gly Asn Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly 
    210                 215                 220                 


Tyr Glu Ala Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu 
225                 230                 235                 240 


Lys Ser Gln Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys 
                245                 250                 255     


Met Leu Asp Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser 
            260                 265                 270         


Val Arg Ser Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn 
        275                 280                 285             


Tyr Tyr Arg Asn Asp Val Val Ala Gly Glu Ala Leu Val Arg Lys Leu 
    290                 295                 300                 


Arg Phe Ser Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu 
305                 310                 315                 320 


Ala Glu Lys Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala 
                325                 330                 335     


Asp His Met Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met 
            340                 345                 350         


Asp Phe Asp Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp 
        355                 360                 365             


Leu Leu Tyr Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp 
    370                 375                 380                 


Gly Lys Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp 
385                 390                 395                 400 


Asn Ile Lys Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val 
                405                 410                 415     


Glu Cys Glu Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg 
            420                 425                 430         


Ile Thr Asn Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys 
        435                 440                 445             


Pro Ala Ala Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile 
    450                 455                 460                 


Leu Gly Ile Asp Asp Lys Ile Thr Asp Asp Arg Ile Ser Glu Ile Leu 
465                 470                 475                 480 


Lys Leu Lys Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile 
                485                 490                 495     


Thr Asn Asn Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr 
            500                 505                 510         


Ala Asn Ala Gln Lys Ile Arg Glu Val Ala Lys Asn Glu Lys Val Val 
        515                 520                 525             


Met Phe Val Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr 
    530                 535                 540                 


Lys Ser Cys Val Glu Phe Pro Asp Met Asn Ser Ser Leu Glu Ala Lys 
545                 550                 555                 560 


Cys Ser Glu Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe 
                565                 570                 575     


Lys Asn Val Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu 
            580                 585                 590         


Arg Ala Lys Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu 
        595                 600                 605             


Val Lys Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His 
    610                 615                 620                 


Cys Leu Glu Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu 
625                 630                 635                 640 


Ala Ser Lys Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu 
                645                 650                 655     


Cys Glu Leu Cys Asp Asp Arg Asp Glu Ser Pro Asn Leu Phe Leu Lys 
            660                 665                 670         


Lys Asn Lys Arg Leu Arg Lys Cys Val Glu Val Asp Ile Asn Asn Ala 
        675                 680                 685             


Asp Ser Ser Met Thr Arg Lys Tyr Arg Asn Cys Ile Ala His Leu Thr 
    690                 695                 700                 


Val Val Arg Glu Leu Lys Glu Tyr Ile Gly Asp Ile Arg Thr Val Asp 
705                 710                 715                 720 


Ser Tyr Phe Ser Ile Tyr His Tyr Val Met Gln Arg Cys Ile Thr Lys 
                725                 730                 735     


Arg Glu Asp Asp Thr Lys Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp 
            740                 745                 750         


Leu Leu Lys Asn His Gly Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn 
        755                 760                 765             


Ser Pro Phe Gly Tyr Asn Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu 
    770                 775                 780                 


Gln Leu Phe Asp Arg Asn Glu Tyr Leu Thr Glu Lys 
785                 790                 795     


<210>  46
<211>  96
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CRISPR Arrays Component

<400>  46
ugauacugcu uugaugucag cauugcauau cuacuauacu ggugcgaauu ugcacuaguc       60

uaaaaucuau aaccauaagu ucuucugcgu ucauau                                 96


<210>  47
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CRISPR Arrays Component

<400>  47
ugauacugcu uugaugucag cauugcauau cuacuauacu ggugcgaauu ugcacuaguc       60

uaaaau                                                                  66


<210>  48
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CRISPR Arrays Component

<400>  48
cuacuauacu ggugcgaauu ugcacuaguc uaaaauugau acugcuuuga ugucagcauu       60

gcauau                                                                  66


<210>  49
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: Forward primer

<400>  49
cgaaattaat acgactcact ataggtttcg attatgcggc cgtgt                       45


<210>  50
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: Reverse primer

<400>  50
aggagatata ccatgggcag ca                                                22


<210>  51
<211>  36
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM repeat sequence

<400>  51
cuacuauacu ggugcgaauu ugcacuaguc uaaaau                                 36


<210>  52
<211>  824
<212>  PRT
<213>  Eubacterium siraeum


<220>
<221>  misc_feature
<222>  (1)..(824)
<223>  WYL Eubacterium siraeum

<400>  52

Met Lys Lys Thr Glu Lys Phe Asp Asp Val Gln Ser Gly Tyr Glu Tyr 
1               5                   10                  15      


Lys Tyr Phe Leu Glu Ser Ile Asp Lys Tyr Arg Ala Ala Val Gln Asn 
            20                  25                  30          


Ile Tyr Thr Tyr Gly Cys Phe Asn Gln Lys Gln Leu Ser Glu Gln Cys 
        35                  40                  45              


Asn Cys Ser Asp Gln Thr Ile Lys Lys Ala Phe Asn Phe Tyr Asn Leu 
    50                  55                  60                  


Cys Leu Ala Asn Tyr Ile Lys Lys Lys Lys Gly Thr Leu Ser Lys Lys 
65                  70                  75                  80  


Ala Lys Gly Arg Pro Thr Glu Ala Lys Tyr Leu Glu Tyr Asp Arg Phe 
                85                  90                  95      


Thr Leu Asn Glu Asn Tyr Leu Tyr Asn Ile Tyr Leu Trp Ala Arg Ile 
            100                 105                 110         


Thr Lys Lys Gln Met Trp Ala Phe Ser Tyr Phe Arg Arg His Thr Ser 
        115                 120                 125             


Leu Leu Ile Asn Ala Ser Arg Thr Glu Ile Lys Asn Gln Leu Ser Asp 
    130                 135                 140                 


Phe Phe Leu Tyr Phe Ser Glu Tyr Met Asp Arg Ser Lys Lys Ala Glu 
145                 150                 155                 160 


Asn Ser Gln Asp Leu Gly Tyr Ile Ile Asp Met Thr Ala Pro Thr Glu 
                165                 170                 175     


Lys Asn Met Leu Ile Ser Ser Met Cys Asp Ala Leu Ala Val Phe Gly 
            180                 185                 190         


Arg Lys Ala Pro Tyr Ser Val Pro Ala Tyr Ser Ile Ser His Lys Leu 
        195                 200                 205             


Lys Lys Leu Cys Gly Asn Asp Ser Lys Ser Leu Trp Ser Phe Met Tyr 
    210                 215                 220                 


Asp Asn Tyr Asp Arg Ile Leu Tyr Asp Glu Ala Val Tyr Thr Ile Arg 
225                 230                 235                 240 


Gln Ala Ile Arg Asp Arg Lys Leu Ile Gly Tyr Gln Thr Val Gly Thr 
                245                 250                 255     


Glu Lys Gln Lys Ser Val Asn Tyr Val Val Pro Leu Lys Ile Met Tyr 
            260                 265                 270         


Glu Tyr Asn Leu Gly Arg Cys Tyr Leu Leu Tyr Ser Pro Leu Asn Ser 
        275                 280                 285             


Asp Ser Ile Ile Lys Ser Ile Arg Leu Asp Lys Leu Tyr Lys Val Ala 
    290                 295                 300                 


Ala Tyr Glu Pro Asp Ser Ile Ile Asn Tyr Glu Lys Leu Tyr Asp Val 
305                 310                 315                 320 


Leu Ala Val Ala Glu Asn Glu Ile Trp Leu Ser Gly Asp Tyr Thr Lys 
                325                 330                 335     


Lys Asp Cys Leu Ser Arg Ile Val Leu Lys Asn Val Lys Pro Gln Ala 
            340                 345                 350         


Phe Ser Leu Ile Glu Lys Tyr Gly Val Cys Tyr Thr Glu Asp Arg Glu 
        355                 360                 365             


Ala Lys Thr Val Thr Phe Asn Ile Arg Lys Ala Asp Asp Ile Lys Pro 
    370                 375                 380                 


Phe Ile Arg Thr Leu Gly Gly Asp Ala Val Ile Ser Glu Glu Asp Asn 
385                 390                 395                 400 


Pro Gly Leu Phe Arg Glu Phe Ala Tyr Asp Ala Arg Ile Gly Arg Gln 
                405                 410                 415     


Met Tyr Tyr Asp Asp Ser Phe Ala Asp Cys Pro Ala Glu Lys Asp Ser 
            420                 425                 430         


Gln Pro Ala Lys Asp Ser Lys Thr Ala Ser Gly Asn Asp Asn Ile Lys 
        435                 440                 445             


Lys Tyr Ala Ser Tyr Pro Thr Leu Arg Leu Phe Asn Lys Tyr Gly Ser 
    450                 455                 460                 


Phe Met Asn Ile Leu Ala Glu Glu Leu Ala Glu His Ile Phe Ser Glu 
465                 470                 475                 480 


Ile Ile Arg Met Pro Val Glu Lys Arg Ala Gly Gln Ile Glu Tyr Ser 
                485                 490                 495     


Ser Asn Arg Leu Glu Arg Val Leu Asn Ser Tyr Phe Lys Ile Tyr Gly 
            500                 505                 510         


Phe Asp Glu Leu Arg Thr Glu Ala Ser Asn Ile Thr Glu Trp Phe Thr 
        515                 520                 525             


Lys Ala Thr Glu Glu Leu Ser Asp Ser Asp Tyr Ser Ser Trp Phe Ser 
    530                 535                 540                 


Val Asn Gly Gly Lys Phe Glu Ala Val Ala Asp Leu Asn Glu Tyr Glu 
545                 550                 555                 560 


His Lys Gln Leu Leu Thr Asn Ile Glu Tyr Glu Tyr Leu Arg Leu Met 
                565                 570                 575     


Leu Gly Asp Pro Asp Ala Arg Ala Ile Ile Gly Asn Glu Tyr Cys Glu 
            580                 585                 590         


Lys Leu Ser Glu Tyr Val Gly Ser Ala Asp Thr Thr Leu Asp Glu Phe 
        595                 600                 605             


Phe Thr Val Arg Tyr Ala Asn Arg Asn Glu Lys Thr Ile Glu Asn Lys 
    610                 615                 620                 


His Ser Val Leu Arg Thr Ile Met Arg Ala Met Asn Asn Glu Lys Lys 
625                 630                 635                 640 


Ala Asp Ile Glu Tyr Lys Gly Lys His Tyr Ile Cys Ser Ala Tyr Arg 
                645                 650                 655     


Phe Thr Tyr Ser Leu Arg Glu Arg Lys His Arg Leu Met Val Phe Asp 
            660                 665                 670         


Gly Asn Tyr Ile Met Gln Ile Asn Leu Cys Asp Ile Lys Asp Ala Gln 
        675                 680                 685             


Met Thr Lys Glu Pro Ser Leu Ser Asp Glu Glu Met Asn Lys Leu Leu 
    690                 695                 700                 


Thr Glu Arg Lys Lys Tyr Ile Glu Ile Ala Ile Pro Gln Asn Ala Asp 
705                 710                 715                 720 


Ala Gln Gln Arg Asn Val Phe Glu Arg Ala Leu Arg Leu Phe Gly Gly 
                725                 730                 735     


Phe Glu Arg Tyr Ser Trp Asn Asp Ala Lys Asn Gly Glu Tyr Val Ile 
            740                 745                 750         


Ala Val Ala Tyr Tyr Glu Pro Asp Ile Ser Val Ser Ser Ser Ala Asp 
        755                 760                 765             


Arg Arg Ile Tyr Arg Arg Asp Thr Val Ala Ala Asp Ile Met Ser Leu 
    770                 775                 780                 


Gly Arg Tyr Ala Arg Val Met Lys Gln Pro Gly Phe Glu Leu Asp Gly 
785                 790                 795                 800 


Val Arg Tyr Asp Ser Ser Leu Tyr Asp Tyr Ile Ser Lys Asn Tyr Ser 
                805                 810                 815     


Gly Thr Ala Ala Arg Tyr Glu Lys 
            820                 


<210>  53
<211>  389
<212>  PRT
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(389)
<223>  WYL Ruminococcus sp. isolate 2789STDY5834971

<400>  53

Met Leu Ile Leu Pro Ser Thr Phe Leu Pro Lys Arg Asp Lys Asn Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Glu Lys Tyr Asn Gly Ile 
65                  70                  75                  80  


Pro Leu Leu Asn Ala Phe Val Lys Trp Gln Ile Glu Glu Ile Asn Asp 
                85                  90                  95      


Gly Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr Leu Ile Ser 
            100                 105                 110         


Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn Ala Val Arg 
        115                 120                 125             


Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu Ser Ser Asp 
    130                 135                 140                 


Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Ser Thr Leu Arg Arg Phe 
145                 150                 155                 160 


Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp Cys Glu Arg 
                165                 170                 175     


Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn Glu Val Ile 
            180                 185                 190         


Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala Tyr Ser Asn 
        195                 200                 205             


Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro Tyr Arg Ile 
    210                 215                 220                 


Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys Leu Ser Asp 
225                 230                 235                 240 


Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr Arg Ile Ser 
                245                 250                 255     


Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln Lys Glu Tyr 
            260                 265                 270         


Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His Val Lys Ser 
        275                 280                 285             


Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp Glu Ser Asp 
    290                 295                 300                 


Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met Phe Gly Lys 
305                 310                 315                 320 


Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys Pro Lys Pro 
                325                 330                 335     


Asn Ala Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val Lys Tyr Tyr 
            340                 345                 350         


Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro Ser Asp Ser 
        355                 360                 365             


Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu Ala Tyr Asn 
    370                 375                 380                 


Arg Glu Val Glu Met 
385                 


<210>  54
<211>  392
<212>  PRT
<213>  Ruminococcus bicirculans


<220>
<221>  misc_feature
<222>  (1)..(392)
<223>  WYL Ruminococcus bicirculans

<400>  54

Met Ser Met Thr Pro Ser Thr Phe Leu Pro Lys Arg Glu Asp Gly Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Asp Asp Asn Tyr Lys Tyr 
65                  70                  75                  80  


Ile Gly Ile Pro Leu Leu Asn Ala Phe Val Lys Trp Gln Ile Glu Glu 
                85                  90                  95      


Ile Asp Asp Ser Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr 
            100                 105                 110         


Leu Ile Ser Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn 
        115                 120                 125             


Ala Val Arg Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu 
    130                 135                 140                 


Ser Ser Asp Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Ser Thr Leu 
145                 150                 155                 160 


Arg Arg Phe Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp 
                165                 170                 175     


Cys Glu Arg Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn 
            180                 185                 190         


Glu Val Ile Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala 
        195                 200                 205             


Tyr Ser Asn Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro 
    210                 215                 220                 


Tyr Arg Ile Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys 
225                 230                 235                 240 


Leu Ser Asp Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr 
                245                 250                 255     


Arg Ile Ser Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln 
            260                 265                 270         


Lys Glu Tyr Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His 
        275                 280                 285             


Val Lys Ser Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp 
    290                 295                 300                 


Glu Ser Asp Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met 
305                 310                 315                 320 


Phe Gly Lys Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys 
                325                 330                 335     


Pro Lys Pro Asn Ala Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val 
            340                 345                 350         


Lys Tyr Tyr Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro 
        355                 360                 365             


Ser Asp Ser Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu 
    370                 375                 380                 


Ala Tyr Asn Arg Glu Val Glu Met 
385                 390         


<210>  55
<211>  392
<212>  PRT
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(392)
<223>  WYL Ruminococcus sp. isolate 2789STDY5608892

<400>  55

Met Leu Ile Pro Pro Ser Thr Phe Leu Pro Lys Arg Asp Lys Asn Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Asp Asp Asn Tyr Lys Tyr 
65                  70                  75                  80  


Ile Gly Ile Pro Leu Leu Asn Ala Phe Val Glu Trp Gln Ile Glu Glu 
                85                  90                  95      


Ile Asp Asp Ser Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr 
            100                 105                 110         


Leu Ile Ser Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn 
        115                 120                 125             


Ala Val Arg Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu 
    130                 135                 140                 


Ser Ser Asp Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Leu Thr Leu 
145                 150                 155                 160 


Arg Arg Phe Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp 
                165                 170                 175     


Cys Glu Arg Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn 
            180                 185                 190         


Glu Val Ile Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala 
        195                 200                 205             


Tyr Ser Asn Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro 
    210                 215                 220                 


Tyr Arg Ile Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys 
225                 230                 235                 240 


Leu Ser Asp Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr 
                245                 250                 255     


Arg Ile Ser Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln 
            260                 265                 270         


Lys Glu Tyr Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His 
        275                 280                 285             


Val Lys Ser Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp 
    290                 295                 300                 


Glu Ser Asp Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met 
305                 310                 315                 320 


Phe Gly Lys Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys 
                325                 330                 335     


Pro Lys Pro Asn Thr Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val 
            340                 345                 350         


Lys Tyr Tyr Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro 
        355                 360                 365             


Ser Asp Ser Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu 
    370                 375                 380                 


Ala Tyr Asn Arg Glu Val Glu Met 
385                 390         


<210>  56
<211>  392
<212>  PRT
<213>  Ruminococcus sp.


<220>
<221>  misc_feature
<222>  (1)..(392)
<223>  WYL Ruminococcus sp. CAG:57

<400>  56

Met Leu Ile Pro Pro Ser Thr Phe Leu Pro Lys Arg Asp Lys Asn Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Asp Asp Asn Tyr Lys Tyr 
65                  70                  75                  80  


Ile Gly Ile Pro Leu Leu Asn Ala Phe Val Glu Trp Gln Ile Glu Glu 
                85                  90                  95      


Ile Asp Asp Ser Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr 
            100                 105                 110         


Leu Ile Ser Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn 
        115                 120                 125             


Ala Val Arg Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu 
    130                 135                 140                 


Ser Ser Asp Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Leu Thr Leu 
145                 150                 155                 160 


Arg Arg Phe Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp 
                165                 170                 175     


Cys Glu Arg Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn 
            180                 185                 190         


Glu Val Ile Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala 
        195                 200                 205             


Tyr Ser Asn Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro 
    210                 215                 220                 


Tyr Arg Ile Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys 
225                 230                 235                 240 


Leu Ser Asp Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr 
                245                 250                 255     


Arg Ile Ser Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln 
            260                 265                 270         


Lys Glu Tyr Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His 
        275                 280                 285             


Val Lys Ser Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp 
    290                 295                 300                 


Glu Ser Asp Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met 
305                 310                 315                 320 


Phe Gly Lys Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys 
                325                 330                 335     


Pro Lys Pro Asn Thr Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val 
            340                 345                 350         


Lys Tyr Tyr Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro 
        355                 360                 365             


Ser Asp Ser Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu 
    370                 375                 380                 


Ala Tyr Asn Arg Glu Val Glu Met 
385                 390         


<210>  57
<211>  280
<212>  PRT
<213>  Ruminococcus flavefaciens


<220>
<221>  misc_feature
<222>  (1)..(280)
<223>  WYL Ruminococcus flavefaciens FD-1

<400>  57

Met Ile Ile Ala Ile Asn Gln Trp Lys Arg Arg Phe Ser Leu Val Ile 
1               5                   10                  15      


Tyr Gly Lys Ser Glu Gly Glu Thr Ile Val Lys Ile Lys Leu Leu Leu 
            20                  25                  30          


Ile Ser Leu Ala Tyr Leu Ile Ser Ile Tyr Leu Leu Cys Ser Pro Gly 
        35                  40                  45              


Cys Ile Gly Ile Phe Thr His Gly Met Leu Thr Thr Val Ile Gly Val 
    50                  55                  60                  


Val Thr Met Leu Ala Ala Thr Gly Thr Tyr Gly Met Tyr Leu Tyr Ser 
65                  70                  75                  80  


Ser Ala Ile Gly Glu Arg Ser Leu Pro Glu Ile Pro Met Asn Lys Glu 
                85                  90                  95      


Thr Glu Tyr Ser Arg Tyr Lys Glu Leu Glu Asn Trp Phe Arg Ala Phe 
            100                 105                 110         


Arg Tyr Leu Asp Arg Asn Asn Asn Phe Ala Met Leu Ser Ser Asp Leu 
        115                 120                 125             


Ala Thr Ser Tyr His Asp Gly Leu Ile Arg Asp Asn Pro Phe Arg Asn 
    130                 135                 140                 


Thr Glu Leu Gly Asp Arg Leu Gln Thr Thr Ser Ser Asp Ile Ser Ile 
145                 150                 155                 160 


Lys Tyr Asp Gln Thr Leu Lys Ile Leu Ser Glu Ser Phe Glu Lys Asn 
                165                 170                 175     


Asp Ile Thr Tyr Gln Asn Tyr Leu Ser Val Leu Asp Asn Val Leu Lys 
            180                 185                 190         


Leu Ser Ser Ser His Leu Lys Ala Ile Lys Lys Arg Val Cys Val Phe 
        195                 200                 205             


Asp Tyr Arg Thr Trp Ala Asp Asn Lys Asn Asp Glu Met Cys Arg Lys 
    210                 215                 220                 


Tyr Ile Glu Glu Val Lys Ser Ser Val Ile Arg Leu Glu Glu Ile Glu 
225                 230                 235                 240 


Gly Lys Phe Asp Asn Leu Leu His Glu Leu Ile Cys Leu Ser Glu Ile 
                245                 250                 255     


Ser Glu Asp Pro Leu Leu Glu Met Gln Asp Leu Ile Glu Thr Thr Ser 
            260                 265                 270         


Asp Tyr Lys Ser Ile Glu Asp Gln 
        275                 280 


<210>  58
<211>  226
<212>  PRT
<213>  Ruminococcus albus


<220>
<221>  misc_feature
<222>  (1)..(226)
<223>  WYL Ruminococcus albus strain KH2T6

<400>  58

Met Cys Thr Trp Tyr Tyr Ala Glu Ala Lys Ser Leu Ser Phe Phe Ile 
1               5                   10                  15      


Asp Lys Ala Ser Gln Leu Pro Leu Ser Asp Ile Ile Met Asn Thr Met 
            20                  25                  30          


Ser Lys Ser Lys Ala Met Ser Gly Asn Ile Arg Pro Thr Asp Met Ala 
        35                  40                  45              


Ala Val Leu Ala Pro Asn Lys Gln Gly Asn Val Ala Val Phe Pro Met 
    50                  55                  60                  


Ile Trp Gly Phe Thr His Glu Ser Thr Ser Lys Pro Val Ile Asn Cys 
65                  70                  75                  80  


Arg Ile Glu Ser Ala Asp Thr Lys Pro Leu Trp Lys Asp Ser Trp Tyr 
                85                  90                  95      


Arg Arg Arg Cys Val Ile Pro Ala Ser Trp Tyr Tyr Glu Trp Gly Val 
            100                 105                 110         


Pro Pro Ser Glu Gly Glu Leu Tyr His Lys Asn Glu Tyr Asn Lys Ile 
        115                 120                 125             


Gln Lys Glu Lys Tyr Ala Ile Gln Pro Glu Gly Ala Glu Ile Thr Tyr 
    130                 135                 140                 


Leu Ala Gly Leu Tyr Arg Phe Glu Glu His Arg Gly Val Gln Val Pro 
145                 150                 155                 160 


Met Phe Ala Val Ile Thr Arg Glu Ser Val Glu Pro Val Ser Ser Ile 
                165                 170                 175     


His Asp Arg Met Pro Leu Ile Leu Gly Lys Asp Ser Leu Ser Glu Trp 
            180                 185                 190         


Ile His Pro Asn Gly Asp Pro Asn Lys Ile Ala Lys Thr Ala Leu Thr 
        195                 200                 205             


Lys Met Val Met Glu Lys Ala Ile Asp Tyr Pro Glu Pro Glu Pro Ser 
    210                 215                 220                 


Phe Met 
225     


<210>  59
<211>  314
<212>  PRT
<213>  Ruminococcus flavefaciens


<220>
<221>  misc_feature
<222>  (1)..(314)
<223>  WYL Ruminococcus flavefaciens strain XPD3002

<400>  59

Met Glu Leu Phe Asn Glu Tyr Arg Asn Lys Ser Leu Arg Ala Phe Leu 
1               5                   10                  15      


Lys Leu Ala Glu Arg Ile Ser Tyr Gly Glu Glu Leu Ser Ile Asp Glu 
            20                  25                  30          


Phe Glu Ala Glu Tyr Tyr Arg Leu Ser Gly Asp Asn Lys Lys Ile Thr 
        35                  40                  45              


Ser Val Phe Tyr Lys Asn Thr Leu Tyr Asn Asp Lys Leu Pro Ile Phe 
    50                  55                  60                  


Asp Thr Arg Glu Gly Lys Val Arg Leu Phe Gly Glu Pro Asp Lys Cys 
65                  70                  75                  80  


Ser Asn Lys His Ile Ser Asp Thr Leu Leu Lys Ser Glu Ile Thr Trp 
                85                  90                  95      


Leu His Asn Ala Leu Asn Asp Lys Leu Ser Lys Leu Phe Leu Ser Asp 
            100                 105                 110         


Glu Glu Arg Ile Ser Ile Asp Ala Lys Leu Ser Asp Tyr Thr Glu Tyr 
        115                 120                 125             


Tyr Lys Asn Ile Asp Asp Met Trp Arg Ser Asn Glu Asp Ile Ser Glu 
    130                 135                 140                 


Glu Val Glu Lys Asn Phe Lys Ile Ile Leu Lys Ala Ile Asn Glu Lys 
145                 150                 155                 160 


Gln Ala Leu Ser Tyr Thr Phe Lys Asn Lys Asn Cys Glu Gly Phe Pro 
                165                 170                 175     


Val Arg Ile Glu Tyr Asp Glu Arg Thr Cys Arg Ile Tyr Met Ile Ile 
            180                 185                 190         


Tyr Asp Gly Asn Arg Phe Val Lys Ser Asp Ile Ser Lys Leu Ser Asp 
        195                 200                 205             


Ile Tyr Ile Thr Glu Asn Ser Ile Asp Thr Ile Pro Glu Ile Lys Asp 
    210                 215                 220                 


Asp Met Leu Asn Lys Lys Ala Tyr Leu Pro Val Val Phe Thr Val Thr 
225                 230                 235                 240 


Asp Asp Lys Asn Arg Lys Ala Ile Asp Arg Ala Leu Leu Ala Phe Ser 
                245                 250                 255     


Val Tyr Asp His Val Val Glu Pro Ile Asp Glu Lys Thr Ala Arg Phe 
            260                 265                 270         


Thr Ile Gln Tyr Tyr Thr Met Asp Leu Asp Leu Leu Ile Lys Asp Ile 
        275                 280                 285             


Leu Ala Phe Gly Ser Asp Ile Lys Val Glu Ser Pro Arg Tyr Val Val 
    290                 295                 300                 


Lys Arg Ile Thr Asp Ile Leu Arg Lys Val 
305                 310                 


<210>  60
<211>  412
<212>  PRT
<213>  Eubacterium siraeum


<220>
<221>  misc_feature
<222>  (1)..(412)
<223>  RtcB Eubacterium siraeum

<400>  60

Met Ile Val Leu Glu Ile Ile Gly Glu Arg Asn Thr Ala Val Val Tyr 
1               5                   10                  15      


Gly Glu Ile Ile Asp Glu Cys Ala Val Ser Gln Ile Glu Glu Ile Cys 
            20                  25                  30          


Asn His Pro Ala Phe Glu Asn Ser Arg Ile Arg Ile Met Pro Asp Cys 
        35                  40                  45              


His Ala Gly Lys Gly Cys Val Ile Gly Phe Thr Cys Val Thr Ser Asn 
    50                  55                  60                  


Arg Met Ile Val Pro Asn Ile Val Gly Val Asp Ile Gly Cys Gly Ile 
65                  70                  75                  80  


Leu Thr Thr Val Phe Thr Ala Asp Arg Glu Ile Asp Tyr Arg Ala Leu 
                85                  90                  95      


Asp Thr Phe Ile Arg Ser Asn Ile Pro Ser Gly Met Glu Ile His Asp 
            100                 105                 110         


Ser Val Ser Asp Thr Val Ala Glu Asn Thr Ala Leu Ile Ala Lys Val 
        115                 120                 125             


Asn Gly Ile Cys Asp Ala Ile Gly Glu Ser Ala Asp Val Asp Tyr His 
    130                 135                 140                 


Leu Arg Ser Ile Gly Thr Leu Gly Gly Gly Asn His Phe Ile Glu Ile 
145                 150                 155                 160 


Asp Arg Leu Asn Asn Gly Asn Tyr Ala Leu Thr Val His Thr Gly Ser 
                165                 170                 175     


Arg Asn Leu Gly Lys Arg Ile Cys Gly Tyr Phe Gln Ser Asn Ala Ser 
            180                 185                 190         


Val Ile Asp Thr Glu Leu Arg Arg Ser Ile Leu Leu Arg His Arg Ser 
        195                 200                 205             


Ala Thr Thr Ser Glu Glu His Glu Glu Ile Asp Arg Arg Ala Ala Gln 
    210                 215                 220                 


Ile Ala Pro Val Ser Lys Glu Leu Ala Phe Ile Thr Gly Glu Arg Tyr 
225                 230                 235                 240 


Asp Ser Tyr Ile Gly Cys Met Leu Asp Ala Lys Ala Leu Ala Ala Phe 
                245                 250                 255     


Asn Arg Thr Val Ile Ser Asp Arg Ile Met Ser Phe Leu Ala Asp Glu 
            260                 265                 270         


Tyr Gly Val Glu Ile Lys Asp Arg Phe Asp Thr Val His Asn Tyr Ile 
        275                 280                 285             


Asp Trp Tyr Asp Asp Thr His Thr Ser Val Val Ile Arg Lys Gly Ala 
    290                 295                 300                 


Ile Ser Ala Arg Lys Gly Glu Arg Ile Val Ile Pro Leu Asn Met Arg 
305                 310                 315                 320 


Asp Gly Ile Ile Ile Ala His Gly Arg Gly Asn Glu Glu Trp Asn Cys 
                325                 330                 335     


Ser Ala Pro His Gly Ser Gly Arg Ala Tyr Ser Arg Ser Asp Ala Arg 
            340                 345                 350         


Arg Thr Phe Thr Leu Glu Glu Tyr Val Glu Glu Met Asp Gly Val Asn 
        355                 360                 365             


Thr Trp Ser Val Ser Glu Ser Thr Ile Asp Glu Cys Pro Met Ala Tyr 
    370                 375                 380                 


Lys Pro Ser Glu Met Ile Ile Gly Ser Ile Gly Asp Thr Val Glu Ile 
385                 390                 395                 400 


Glu Ser Ile Ala His Thr Val Tyr Asn Phe Lys Ala 
                405                 410         


<210>  61
<211>  831
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Eubacterium siraeum + C-term NLS

<400>  61

Met Lys Lys Thr Glu Lys Phe Asp Asp Val Gln Ser Gly Tyr Glu Tyr 
1               5                   10                  15      


Lys Tyr Phe Leu Glu Ser Ile Asp Lys Tyr Arg Ala Ala Val Gln Asn 
            20                  25                  30          


Ile Tyr Thr Tyr Gly Cys Phe Asn Gln Lys Gln Leu Ser Glu Gln Cys 
        35                  40                  45              


Asn Cys Ser Asp Gln Thr Ile Lys Lys Ala Phe Asn Phe Tyr Asn Leu 
    50                  55                  60                  


Cys Leu Ala Asn Tyr Ile Lys Lys Lys Lys Gly Thr Leu Ser Lys Lys 
65                  70                  75                  80  


Ala Lys Gly Arg Pro Thr Glu Ala Lys Tyr Leu Glu Tyr Asp Arg Phe 
                85                  90                  95      


Thr Leu Asn Glu Asn Tyr Leu Tyr Asn Ile Tyr Leu Trp Ala Arg Ile 
            100                 105                 110         


Thr Lys Lys Gln Met Trp Ala Phe Ser Tyr Phe Arg Arg His Thr Ser 
        115                 120                 125             


Leu Leu Ile Asn Ala Ser Arg Thr Glu Ile Lys Asn Gln Leu Ser Asp 
    130                 135                 140                 


Phe Phe Leu Tyr Phe Ser Glu Tyr Met Asp Arg Ser Lys Lys Ala Glu 
145                 150                 155                 160 


Asn Ser Gln Asp Leu Gly Tyr Ile Ile Asp Met Thr Ala Pro Thr Glu 
                165                 170                 175     


Lys Asn Met Leu Ile Ser Ser Met Cys Asp Ala Leu Ala Val Phe Gly 
            180                 185                 190         


Arg Lys Ala Pro Tyr Ser Val Pro Ala Tyr Ser Ile Ser His Lys Leu 
        195                 200                 205             


Lys Lys Leu Cys Gly Asn Asp Ser Lys Ser Leu Trp Ser Phe Met Tyr 
    210                 215                 220                 


Asp Asn Tyr Asp Arg Ile Leu Tyr Asp Glu Ala Val Tyr Thr Ile Arg 
225                 230                 235                 240 


Gln Ala Ile Arg Asp Arg Lys Leu Ile Gly Tyr Gln Thr Val Gly Thr 
                245                 250                 255     


Glu Lys Gln Lys Ser Val Asn Tyr Val Val Pro Leu Lys Ile Met Tyr 
            260                 265                 270         


Glu Tyr Asn Leu Gly Arg Cys Tyr Leu Leu Tyr Ser Pro Leu Asn Ser 
        275                 280                 285             


Asp Ser Ile Ile Lys Ser Ile Arg Leu Asp Lys Leu Tyr Lys Val Ala 
    290                 295                 300                 


Ala Tyr Glu Pro Asp Ser Ile Ile Asn Tyr Glu Lys Leu Tyr Asp Val 
305                 310                 315                 320 


Leu Ala Val Ala Glu Asn Glu Ile Trp Leu Ser Gly Asp Tyr Thr Lys 
                325                 330                 335     


Lys Asp Cys Leu Ser Arg Ile Val Leu Lys Asn Val Lys Pro Gln Ala 
            340                 345                 350         


Phe Ser Leu Ile Glu Lys Tyr Gly Val Cys Tyr Thr Glu Asp Arg Glu 
        355                 360                 365             


Ala Lys Thr Val Thr Phe Asn Ile Arg Lys Ala Asp Asp Ile Lys Pro 
    370                 375                 380                 


Phe Ile Arg Thr Leu Gly Gly Asp Ala Val Ile Ser Glu Glu Asp Asn 
385                 390                 395                 400 


Pro Gly Leu Phe Arg Glu Phe Ala Tyr Asp Ala Arg Ile Gly Arg Gln 
                405                 410                 415     


Met Tyr Tyr Asp Asp Ser Phe Ala Asp Cys Pro Ala Glu Lys Asp Ser 
            420                 425                 430         


Gln Pro Ala Lys Asp Ser Lys Thr Ala Ser Gly Asn Asp Asn Ile Lys 
        435                 440                 445             


Lys Tyr Ala Ser Tyr Pro Thr Leu Arg Leu Phe Asn Lys Tyr Gly Ser 
    450                 455                 460                 


Phe Met Asn Ile Leu Ala Glu Glu Leu Ala Glu His Ile Phe Ser Glu 
465                 470                 475                 480 


Ile Ile Arg Met Pro Val Glu Lys Arg Ala Gly Gln Ile Glu Tyr Ser 
                485                 490                 495     


Ser Asn Arg Leu Glu Arg Val Leu Asn Ser Tyr Phe Lys Ile Tyr Gly 
            500                 505                 510         


Phe Asp Glu Leu Arg Thr Glu Ala Ser Asn Ile Thr Glu Trp Phe Thr 
        515                 520                 525             


Lys Ala Thr Glu Glu Leu Ser Asp Ser Asp Tyr Ser Ser Trp Phe Ser 
    530                 535                 540                 


Val Asn Gly Gly Lys Phe Glu Ala Val Ala Asp Leu Asn Glu Tyr Glu 
545                 550                 555                 560 


His Lys Gln Leu Leu Thr Asn Ile Glu Tyr Glu Tyr Leu Arg Leu Met 
                565                 570                 575     


Leu Gly Asp Pro Asp Ala Arg Ala Ile Ile Gly Asn Glu Tyr Cys Glu 
            580                 585                 590         


Lys Leu Ser Glu Tyr Val Gly Ser Ala Asp Thr Thr Leu Asp Glu Phe 
        595                 600                 605             


Phe Thr Val Arg Tyr Ala Asn Arg Asn Glu Lys Thr Ile Glu Asn Lys 
    610                 615                 620                 


His Ser Val Leu Arg Thr Ile Met Arg Ala Met Asn Asn Glu Lys Lys 
625                 630                 635                 640 


Ala Asp Ile Glu Tyr Lys Gly Lys His Tyr Ile Cys Ser Ala Tyr Arg 
                645                 650                 655     


Phe Thr Tyr Ser Leu Arg Glu Arg Lys His Arg Leu Met Val Phe Asp 
            660                 665                 670         


Gly Asn Tyr Ile Met Gln Ile Asn Leu Cys Asp Ile Lys Asp Ala Gln 
        675                 680                 685             


Met Thr Lys Glu Pro Ser Leu Ser Asp Glu Glu Met Asn Lys Leu Leu 
    690                 695                 700                 


Thr Glu Arg Lys Lys Tyr Ile Glu Ile Ala Ile Pro Gln Asn Ala Asp 
705                 710                 715                 720 


Ala Gln Gln Arg Asn Val Phe Glu Arg Ala Leu Arg Leu Phe Gly Gly 
                725                 730                 735     


Phe Glu Arg Tyr Ser Trp Asn Asp Ala Lys Asn Gly Glu Tyr Val Ile 
            740                 745                 750         


Ala Val Ala Tyr Tyr Glu Pro Asp Ile Ser Val Ser Ser Ser Ala Asp 
        755                 760                 765             


Arg Arg Ile Tyr Arg Arg Asp Thr Val Ala Ala Asp Ile Met Ser Leu 
    770                 775                 780                 


Gly Arg Tyr Ala Arg Val Met Lys Gln Pro Gly Phe Glu Leu Asp Gly 
785                 790                 795                 800 


Val Arg Tyr Asp Ser Ser Leu Tyr Asp Tyr Ile Ser Lys Asn Tyr Ser 
                805                 810                 815     


Gly Thr Ala Ala Arg Tyr Glu Lys Pro Lys Lys Lys Arg Lys Val 
            820                 825                 830     


<210>  62
<211>  396
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Ruminococcus sp. isolate 2789STDY5834971 + C-term 
       NLS

<400>  62

Met Leu Ile Leu Pro Ser Thr Phe Leu Pro Lys Arg Asp Lys Asn Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Glu Lys Tyr Asn Gly Ile 
65                  70                  75                  80  


Pro Leu Leu Asn Ala Phe Val Lys Trp Gln Ile Glu Glu Ile Asn Asp 
                85                  90                  95      


Gly Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr Leu Ile Ser 
            100                 105                 110         


Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn Ala Val Arg 
        115                 120                 125             


Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu Ser Ser Asp 
    130                 135                 140                 


Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Ser Thr Leu Arg Arg Phe 
145                 150                 155                 160 


Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp Cys Glu Arg 
                165                 170                 175     


Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn Glu Val Ile 
            180                 185                 190         


Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala Tyr Ser Asn 
        195                 200                 205             


Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro Tyr Arg Ile 
    210                 215                 220                 


Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys Leu Ser Asp 
225                 230                 235                 240 


Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr Arg Ile Ser 
                245                 250                 255     


Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln Lys Glu Tyr 
            260                 265                 270         


Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His Val Lys Ser 
        275                 280                 285             


Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp Glu Ser Asp 
    290                 295                 300                 


Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met Phe Gly Lys 
305                 310                 315                 320 


Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys Pro Lys Pro 
                325                 330                 335     


Asn Ala Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val Lys Tyr Tyr 
            340                 345                 350         


Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro Ser Asp Ser 
        355                 360                 365             


Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu Ala Tyr Asn 
    370                 375                 380                 


Arg Glu Val Glu Met Pro Lys Lys Lys Arg Lys Val 
385                 390                 395     


<210>  63
<211>  399
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Ruminococcus bicirculans + C-term NLS

<400>  63

Met Ser Met Thr Pro Ser Thr Phe Leu Pro Lys Arg Glu Asp Gly Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Asp Asp Asn Tyr Lys Tyr 
65                  70                  75                  80  


Ile Gly Ile Pro Leu Leu Asn Ala Phe Val Lys Trp Gln Ile Glu Glu 
                85                  90                  95      


Ile Asp Asp Ser Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr 
            100                 105                 110         


Leu Ile Ser Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn 
        115                 120                 125             


Ala Val Arg Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu 
    130                 135                 140                 


Ser Ser Asp Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Ser Thr Leu 
145                 150                 155                 160 


Arg Arg Phe Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp 
                165                 170                 175     


Cys Glu Arg Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn 
            180                 185                 190         


Glu Val Ile Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala 
        195                 200                 205             


Tyr Ser Asn Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro 
    210                 215                 220                 


Tyr Arg Ile Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys 
225                 230                 235                 240 


Leu Ser Asp Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr 
                245                 250                 255     


Arg Ile Ser Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln 
            260                 265                 270         


Lys Glu Tyr Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His 
        275                 280                 285             


Val Lys Ser Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp 
    290                 295                 300                 


Glu Ser Asp Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met 
305                 310                 315                 320 


Phe Gly Lys Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys 
                325                 330                 335     


Pro Lys Pro Asn Ala Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val 
            340                 345                 350         


Lys Tyr Tyr Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro 
        355                 360                 365             


Ser Asp Ser Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu 
    370                 375                 380                 


Ala Tyr Asn Arg Glu Val Glu Met Pro Lys Lys Lys Arg Lys Val 
385                 390                 395                 


<210>  64
<211>  399
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Ruminococcus sp. isolate 2789STDY5608892 + C-term 
       NLS

<400>  64

Met Leu Ile Pro Pro Ser Thr Phe Leu Pro Lys Arg Asp Lys Asn Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Asp Asp Asn Tyr Lys Tyr 
65                  70                  75                  80  


Ile Gly Ile Pro Leu Leu Asn Ala Phe Val Glu Trp Gln Ile Glu Glu 
                85                  90                  95      


Ile Asp Asp Ser Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr 
            100                 105                 110         


Leu Ile Ser Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn 
        115                 120                 125             


Ala Val Arg Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu 
    130                 135                 140                 


Ser Ser Asp Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Leu Thr Leu 
145                 150                 155                 160 


Arg Arg Phe Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp 
                165                 170                 175     


Cys Glu Arg Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn 
            180                 185                 190         


Glu Val Ile Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala 
        195                 200                 205             


Tyr Ser Asn Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro 
    210                 215                 220                 


Tyr Arg Ile Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys 
225                 230                 235                 240 


Leu Ser Asp Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr 
                245                 250                 255     


Arg Ile Ser Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln 
            260                 265                 270         


Lys Glu Tyr Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His 
        275                 280                 285             


Val Lys Ser Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp 
    290                 295                 300                 


Glu Ser Asp Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met 
305                 310                 315                 320 


Phe Gly Lys Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys 
                325                 330                 335     


Pro Lys Pro Asn Thr Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val 
            340                 345                 350         


Lys Tyr Tyr Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro 
        355                 360                 365             


Ser Asp Ser Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu 
    370                 375                 380                 


Ala Tyr Asn Arg Glu Val Glu Met Pro Lys Lys Lys Arg Lys Val 
385                 390                 395                 


<210>  65
<211>  399
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Ruminococcus sp. CAG:57 + C-term NLS

<400>  65

Met Leu Ile Pro Pro Ser Thr Phe Leu Pro Lys Arg Asp Lys Asn Val 
1               5                   10                  15      


Pro Tyr Ile Ala Glu Val Gln Ser Ile Pro Leu Ser Pro Ser Ala Tyr 
            20                  25                  30          


Ser Val Ile Ile Lys Asp Lys Ser Ile Phe Glu Thr Ser Leu Ser Pro 
        35                  40                  45              


Asn Gly Ser Val Ser Met Ser Ser Phe Leu Thr Ser Ile Phe Asp Ser 
    50                  55                  60                  


Ala Tyr Ile Ala Ser Leu Lys Tyr Lys Ser Asp Asp Asn Tyr Lys Tyr 
65                  70                  75                  80  


Ile Gly Ile Pro Leu Leu Asn Ala Phe Val Glu Trp Gln Ile Glu Glu 
                85                  90                  95      


Ile Asp Asp Ser Leu Asp Asp Lys Ser Lys Glu Ile Ile Lys Ser Tyr 
            100                 105                 110         


Leu Ile Ser Lys Leu Ser Ala Lys Tyr Glu Lys Thr Lys Thr Glu Asn 
        115                 120                 125             


Ala Val Arg Val Arg Leu Ser Ile Cys Arg Asp Leu Tyr Asp Thr Leu 
    130                 135                 140                 


Ser Ser Asp Asp Leu Tyr Tyr Glu Asn Lys Val Tyr Ser Leu Thr Leu 
145                 150                 155                 160 


Arg Arg Phe Leu Lys Ala Val Tyr Glu Asp Tyr Ala Leu Leu Ser Asp 
                165                 170                 175     


Cys Glu Arg Glu Arg Leu Ile Phe Ala Asp Asn Ile Ile Lys Ile Asn 
            180                 185                 190         


Glu Val Ile Lys Gln Asn Gly Ser Arg Tyr Tyr Ser Phe Ile Tyr Ala 
        195                 200                 205             


Tyr Ser Asn Met Tyr Ser Arg Glu Lys Arg Arg Ile Arg Leu Ile Pro 
    210                 215                 220                 


Tyr Arg Ile Val Ser Asp Glu Tyr Lys Met Tyr Asn Tyr Leu Val Cys 
225                 230                 235                 240 


Leu Ser Asp Glu Lys Ser Ala Gly Lys Glu Phe Lys Ala Asp Ser Tyr 
                245                 250                 255     


Arg Ile Ser Arg Leu Ser Gly Leu Ser Ile Ala Glu Lys Leu Ser Gln 
            260                 265                 270         


Lys Glu Tyr Ser Ser Val Thr Glu Tyr Glu Arg Leu Lys Glu Gly His 
        275                 280                 285             


Val Lys Ser Val Lys His Leu Leu Ser Asp Pro Arg Phe Gly Ser Asp 
    290                 295                 300                 


Glu Ser Asp Ile Ser Lys Val Tyr Leu Thr Glu Lys Gly Val Glu Met 
305                 310                 315                 320 


Phe Gly Lys Ile Leu Tyr Gln Arg Pro Ile Leu Lys Gly Asn Glu Lys 
                325                 330                 335     


Pro Lys Pro Asn Thr Val Asn Glu Phe Ile Ser Pro Pro Ile Gln Val 
            340                 345                 350         


Lys Tyr Tyr Phe Asn Lys Phe Gly Lys Asp Gly Val Ile Leu Ser Pro 
        355                 360                 365             


Ser Asp Ser Phe Glu Glu Met Arg Thr Leu Tyr Val Glu Gly Ala Glu 
    370                 375                 380                 


Ala Tyr Asn Arg Glu Val Glu Met Pro Lys Lys Lys Arg Lys Val 
385                 390                 395                 


<210>  66
<211>  287
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Ruminococcus flavefaciens FD-1 + C-term NLS

<400>  66

Met Ile Ile Ala Ile Asn Gln Trp Lys Arg Arg Phe Ser Leu Val Ile 
1               5                   10                  15      


Tyr Gly Lys Ser Glu Gly Glu Thr Ile Val Lys Ile Lys Leu Leu Leu 
            20                  25                  30          


Ile Ser Leu Ala Tyr Leu Ile Ser Ile Tyr Leu Leu Cys Ser Pro Gly 
        35                  40                  45              


Cys Ile Gly Ile Phe Thr His Gly Met Leu Thr Thr Val Ile Gly Val 
    50                  55                  60                  


Val Thr Met Leu Ala Ala Thr Gly Thr Tyr Gly Met Tyr Leu Tyr Ser 
65                  70                  75                  80  


Ser Ala Ile Gly Glu Arg Ser Leu Pro Glu Ile Pro Met Asn Lys Glu 
                85                  90                  95      


Thr Glu Tyr Ser Arg Tyr Lys Glu Leu Glu Asn Trp Phe Arg Ala Phe 
            100                 105                 110         


Arg Tyr Leu Asp Arg Asn Asn Asn Phe Ala Met Leu Ser Ser Asp Leu 
        115                 120                 125             


Ala Thr Ser Tyr His Asp Gly Leu Ile Arg Asp Asn Pro Phe Arg Asn 
    130                 135                 140                 


Thr Glu Leu Gly Asp Arg Leu Gln Thr Thr Ser Ser Asp Ile Ser Ile 
145                 150                 155                 160 


Lys Tyr Asp Gln Thr Leu Lys Ile Leu Ser Glu Ser Phe Glu Lys Asn 
                165                 170                 175     


Asp Ile Thr Tyr Gln Asn Tyr Leu Ser Val Leu Asp Asn Val Leu Lys 
            180                 185                 190         


Leu Ser Ser Ser His Leu Lys Ala Ile Lys Lys Arg Val Cys Val Phe 
        195                 200                 205             


Asp Tyr Arg Thr Trp Ala Asp Asn Lys Asn Asp Glu Met Cys Arg Lys 
    210                 215                 220                 


Tyr Ile Glu Glu Val Lys Ser Ser Val Ile Arg Leu Glu Glu Ile Glu 
225                 230                 235                 240 


Gly Lys Phe Asp Asn Leu Leu His Glu Leu Ile Cys Leu Ser Glu Ile 
                245                 250                 255     


Ser Glu Asp Pro Leu Leu Glu Met Gln Asp Leu Ile Glu Thr Thr Ser 
            260                 265                 270         


Asp Tyr Lys Ser Ile Glu Asp Gln Pro Lys Lys Lys Arg Lys Val 
        275                 280                 285         


<210>  67
<211>  233
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Ruminococcus albus strain KH2T6 + C-term NLS

<400>  67

Met Cys Thr Trp Tyr Tyr Ala Glu Ala Lys Ser Leu Ser Phe Phe Ile 
1               5                   10                  15      


Asp Lys Ala Ser Gln Leu Pro Leu Ser Asp Ile Ile Met Asn Thr Met 
            20                  25                  30          


Ser Lys Ser Lys Ala Met Ser Gly Asn Ile Arg Pro Thr Asp Met Ala 
        35                  40                  45              


Ala Val Leu Ala Pro Asn Lys Gln Gly Asn Val Ala Val Phe Pro Met 
    50                  55                  60                  


Ile Trp Gly Phe Thr His Glu Ser Thr Ser Lys Pro Val Ile Asn Cys 
65                  70                  75                  80  


Arg Ile Glu Ser Ala Asp Thr Lys Pro Leu Trp Lys Asp Ser Trp Tyr 
                85                  90                  95      


Arg Arg Arg Cys Val Ile Pro Ala Ser Trp Tyr Tyr Glu Trp Gly Val 
            100                 105                 110         


Pro Pro Ser Glu Gly Glu Leu Tyr His Lys Asn Glu Tyr Asn Lys Ile 
        115                 120                 125             


Gln Lys Glu Lys Tyr Ala Ile Gln Pro Glu Gly Ala Glu Ile Thr Tyr 
    130                 135                 140                 


Leu Ala Gly Leu Tyr Arg Phe Glu Glu His Arg Gly Val Gln Val Pro 
145                 150                 155                 160 


Met Phe Ala Val Ile Thr Arg Glu Ser Val Glu Pro Val Ser Ser Ile 
                165                 170                 175     


His Asp Arg Met Pro Leu Ile Leu Gly Lys Asp Ser Leu Ser Glu Trp 
            180                 185                 190         


Ile His Pro Asn Gly Asp Pro Asn Lys Ile Ala Lys Thr Ala Leu Thr 
        195                 200                 205             


Lys Met Val Met Glu Lys Ala Ile Asp Tyr Pro Glu Pro Glu Pro Ser 
    210                 215                 220                 


Phe Met Pro Lys Lys Lys Arg Lys Val 
225                 230             


<210>  68
<211>  321
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: WYL Ruminococcus flavefaciens strain XPD3002 + C-term 
       NLS

<400>  68

Met Glu Leu Phe Asn Glu Tyr Arg Asn Lys Ser Leu Arg Ala Phe Leu 
1               5                   10                  15      


Lys Leu Ala Glu Arg Ile Ser Tyr Gly Glu Glu Leu Ser Ile Asp Glu 
            20                  25                  30          


Phe Glu Ala Glu Tyr Tyr Arg Leu Ser Gly Asp Asn Lys Lys Ile Thr 
        35                  40                  45              


Ser Val Phe Tyr Lys Asn Thr Leu Tyr Asn Asp Lys Leu Pro Ile Phe 
    50                  55                  60                  


Asp Thr Arg Glu Gly Lys Val Arg Leu Phe Gly Glu Pro Asp Lys Cys 
65                  70                  75                  80  


Ser Asn Lys His Ile Ser Asp Thr Leu Leu Lys Ser Glu Ile Thr Trp 
                85                  90                  95      


Leu His Asn Ala Leu Asn Asp Lys Leu Ser Lys Leu Phe Leu Ser Asp 
            100                 105                 110         


Glu Glu Arg Ile Ser Ile Asp Ala Lys Leu Ser Asp Tyr Thr Glu Tyr 
        115                 120                 125             


Tyr Lys Asn Ile Asp Asp Met Trp Arg Ser Asn Glu Asp Ile Ser Glu 
    130                 135                 140                 


Glu Val Glu Lys Asn Phe Lys Ile Ile Leu Lys Ala Ile Asn Glu Lys 
145                 150                 155                 160 


Gln Ala Leu Ser Tyr Thr Phe Lys Asn Lys Asn Cys Glu Gly Phe Pro 
                165                 170                 175     


Val Arg Ile Glu Tyr Asp Glu Arg Thr Cys Arg Ile Tyr Met Ile Ile 
            180                 185                 190         


Tyr Asp Gly Asn Arg Phe Val Lys Ser Asp Ile Ser Lys Leu Ser Asp 
        195                 200                 205             


Ile Tyr Ile Thr Glu Asn Ser Ile Asp Thr Ile Pro Glu Ile Lys Asp 
    210                 215                 220                 


Asp Met Leu Asn Lys Lys Ala Tyr Leu Pro Val Val Phe Thr Val Thr 
225                 230                 235                 240 


Asp Asp Lys Asn Arg Lys Ala Ile Asp Arg Ala Leu Leu Ala Phe Ser 
                245                 250                 255     


Val Tyr Asp His Val Val Glu Pro Ile Asp Glu Lys Thr Ala Arg Phe 
            260                 265                 270         


Thr Ile Gln Tyr Tyr Thr Met Asp Leu Asp Leu Leu Ile Lys Asp Ile 
        275                 280                 285             


Leu Ala Phe Gly Ser Asp Ile Lys Val Glu Ser Pro Arg Tyr Val Val 
    290                 295                 300                 


Lys Arg Ile Thr Asp Ile Leu Arg Lys Val Pro Lys Lys Lys Arg Lys 
305                 310                 315                 320 


Val 
    


<210>  69
<211>  419
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic: RtcB Eubacterium siraeum + C-term NLS

<400>  69

Met Ile Val Leu Glu Ile Ile Gly Glu Arg Asn Thr Ala Val Val Tyr 
1               5                   10                  15      


Gly Glu Ile Ile Asp Glu Cys Ala Val Ser Gln Ile Glu Glu Ile Cys 
            20                  25                  30          


Asn His Pro Ala Phe Glu Asn Ser Arg Ile Arg Ile Met Pro Asp Cys 
        35                  40                  45              


His Ala Gly Lys Gly Cys Val Ile Gly Phe Thr Cys Val Thr Ser Asn 
    50                  55                  60                  


Arg Met Ile Val Pro Asn Ile Val Gly Val Asp Ile Gly Cys Gly Ile 
65                  70                  75                  80  


Leu Thr Thr Val Phe Thr Ala Asp Arg Glu Ile Asp Tyr Arg Ala Leu 
                85                  90                  95      


Asp Thr Phe Ile Arg Ser Asn Ile Pro Ser Gly Met Glu Ile His Asp 
            100                 105                 110         


Ser Val Ser Asp Thr Val Ala Glu Asn Thr Ala Leu Ile Ala Lys Val 
        115                 120                 125             


Asn Gly Ile Cys Asp Ala Ile Gly Glu Ser Ala Asp Val Asp Tyr His 
    130                 135                 140                 


Leu Arg Ser Ile Gly Thr Leu Gly Gly Gly Asn His Phe Ile Glu Ile 
145                 150                 155                 160 


Asp Arg Leu Asn Asn Gly Asn Tyr Ala Leu Thr Val His Thr Gly Ser 
                165                 170                 175     


Arg Asn Leu Gly Lys Arg Ile Cys Gly Tyr Phe Gln Ser Asn Ala Ser 
            180                 185                 190         


Val Ile Asp Thr Glu Leu Arg Arg Ser Ile Leu Leu Arg His Arg Ser 
        195                 200                 205             


Ala Thr Thr Ser Glu Glu His Glu Glu Ile Asp Arg Arg Ala Ala Gln 
    210                 215                 220                 


Ile Ala Pro Val Ser Lys Glu Leu Ala Phe Ile Thr Gly Glu Arg Tyr 
225                 230                 235                 240 


Asp Ser Tyr Ile Gly Cys Met Leu Asp Ala Lys Ala Leu Ala Ala Phe 
                245                 250                 255     


Asn Arg Thr Val Ile Ser Asp Arg Ile Met Ser Phe Leu Ala Asp Glu 
            260                 265                 270         


Tyr Gly Val Glu Ile Lys Asp Arg Phe Asp Thr Val His Asn Tyr Ile 
        275                 280                 285             


Asp Trp Tyr Asp Asp Thr His Thr Ser Val Val Ile Arg Lys Gly Ala 
    290                 295                 300                 


Ile Ser Ala Arg Lys Gly Glu Arg Ile Val Ile Pro Leu Asn Met Arg 
305                 310                 315                 320 


Asp Gly Ile Ile Ile Ala His Gly Arg Gly Asn Glu Glu Trp Asn Cys 
                325                 330                 335     


Ser Ala Pro His Gly Ser Gly Arg Ala Tyr Ser Arg Ser Asp Ala Arg 
            340                 345                 350         


Arg Thr Phe Thr Leu Glu Glu Tyr Val Glu Glu Met Asp Gly Val Asn 
        355                 360                 365             


Thr Trp Ser Val Ser Glu Ser Thr Ile Asp Glu Cys Pro Met Ala Tyr 
    370                 375                 380                 


Lys Pro Ser Glu Met Ile Ile Gly Ser Ile Gly Asp Thr Val Glu Ile 
385                 390                 395                 400 


Glu Ser Ile Ala His Thr Val Tyr Asn Phe Lys Ala Pro Lys Lys Lys 
                405                 410                 415     


Arg Lys Val 
            


<210>  70
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 1

<400>  70
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucuua caucuuuccu ccucauccag       60

caaaau                                                                  66


<210>  71
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 2

<400>  71
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaca auccugaagu aagugaagcu       60

acagac                                                                  66


<210>  72
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 3

<400>  72
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucugu caaaaaucac aauccugaag       60

uaagug                                                                  66


<210>  73
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 4

<400>  73
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucucu gucaaaaauc acaauccuga       60

aguaag                                                                  66


<210>  74
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 5

<400>  74
cuacuauacu ggugcgaauu ugcacuaguc uaaaauucuc uucacgagau ucacuaggac       60

cuucag                                                                  66


<210>  75
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 6

<400>  75
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucucc ucucuucacg agauucacua       60

ggaccu                                                                  66


<210>  76
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 7

<400>  76
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugucc ucuaggucca uguuacagcc       60

agaccc                                                                  66


<210>  77
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 8

<400>  77
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaugu ccucuagguc cauguuacag       60

ccagac                                                                  66


<210>  78
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 9

<400>  78
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucgcc aggagcgcug ccccggccgu       60

cccgga                                                                  66


<210>  79
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 10

<400>  79
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugcgc caggagcgcu gccccggccg       60

ucccgg                                                                  66


<210>  80
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 11

<400>  80
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugcag cgccaggagc gcugccccgg       60

ccgucc                                                                  66


<210>  81
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 12

<400>  81
cuacuauacu ggugcgaauu ugcacuaguc uaaaauagca gcgccaggag cgcugccccg       60

gccguc                                                                  66


<210>  82
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 13

<400>  82
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucagc agcgccagga gcgcugcccc       60

ggccgu                                                                  66


<210>  83
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 14

<400>  83
cuacuauacu ggugcgaauu ugcacuaguc uaaaauccag cagcgccagg agcgcugccc       60

cggccg                                                                  66


<210>  84
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 15

<400>  84
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugcca gcagcgccag gagcgcugcc       60

ccggcc                                                                  66


<210>  85
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 16

<400>  85
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucagc cagcagcgcc aggagcgcug       60

ccccgg                                                                  66


<210>  86
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 17

<400>  86
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugcag ccagcagcgc caggagcgcu       60

gccccg                                                                  66


<210>  87
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 18

<400>  87
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucgca gccagcagcg ccaggagcgc       60

ugcccc                                                                  66


<210>  88
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 19

<400>  88
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugcgc agccagcagc gccaggagcg       60

cugccc                                                                  66


<210>  89
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 20

<400>  89
cuacuauacu ggugcgaauu ugcacuaguc uaaaauagcg cagccagcag cgccaggagc       60

gcugcc                                                                  66


<210>  90
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 21

<400>  90
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugagc gcagccagca gcgccaggag       60

cgcugc                                                                  66


<210>  91
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 22

<400>  91
cuacuauacu ggugcgaauu ugcacuaguc uaaaauagag cgcagccagc agcgccagga       60

gcgcug                                                                  66


<210>  92
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 23

<400>  92
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaga gcgcagccag cagcgccagg       60

agcgcu                                                                  66


<210>  93
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 24

<400>  93
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugcag agcgcagcca gcagcgccag       60

gagcgc                                                                  66


<210>  94
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 25

<400>  94
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugggc agagcgcagc cagcagcgcc       60

aggagc                                                                  66


<210>  95
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 26

<400>  95
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucggg cagagcgcag ccagcagcgc       60

caggag                                                                  66


<210>  96
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 27

<400>  96
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugccg ggcagagcgc agccagcagc       60

gccagg                                                                  66


<210>  97
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 28

<400>  97
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucgcc gggcagagcg cagccagcag       60

cgccag                                                                  66


<210>  98
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 29

<400>  98
cuacuauacu ggugcgaauu ugcacuaguc uaaaauucgc cgggcagagc gcagccagca       60

gcgcca                                                                  66


<210>  99
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 30

<400>  99
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucucg ccgggcagag cgcagccagc       60

agcgcc                                                                  66


<210>  100
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 31

<400>  100
cuacuauacu ggugcgaauu ugcacuaguc uaaaauacuc gccgggcaga gcgcagccag       60

cagcgc                                                                  66


<210>  101
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 32

<400>  101
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugacu cgccgggcag agcgcagcca       60

gcagcg                                                                  66


<210>  102
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 33

<400>  102
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaaaa gugcccaacu gcgugagcuu       60

guuacu                                                                  66


<210>  103
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 34

<400>  103
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaucu ucaaaagugc ccaacugcgu       60

gagcuu                                                                  66


<210>  104
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 35

<400>  104
cuacuauacu ggugcgaauu ugcacuaguc uaaaauauga ucuucaaaag ugcccaacug       60

cgugag                                                                  66


<210>  105
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 36

<400>  105
cuacuauacu ggugcgaauu ugcacuaguc uaaaauccuc uggaggcuga gaaaaugauc       60

uucaaa                                                                  66


<210>  106
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 37

<400>  106
cuacuauacu ggugcgaauu ugcacuaguc uaaaauacau ccucuggagg cugagaaaau       60

gaucuu                                                                  66


<210>  107
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 38

<400>  107
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaguu auugaacauc cucuggaggc       60

ugagaa                                                                  66


<210>  108
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 39

<400>  108
cuacuauacu ggugcgaauu ugcacuaguc uaaaauacag uuauugaaca uccucuggag       60

gcugag                                                                  66


<210>  109
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 40

<400>  109
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaca guuauugaac auccucugga       60

ggcuga                                                                  66


<210>  110
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 41

<400>  110
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaccu cacaguuauu gaacauccuc       60

uggagg                                                                  66


<210>  111
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 42

<400>  111
cuacuauacu ggugcgaauu ugcacuaguc uaaaauagga ccaccucaca guuauugaac       60

auccuc                                                                  66


<210>  112
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 43

<400>  112
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaag gaccaccuca caguuauuga       60

acaucc                                                                  66


<210>  113
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 44

<400>  113
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuucc caaggaccac cucacaguua       60

uugaac                                                                  66


<210>  114
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 45

<400>  114
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaaau ucccaaggac caccucacag       60

uuauug                                                                  66


<210>  115
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 46

<400>  115
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaaa uucccaagga ccaccucaca       60

guuauu                                                                  66


<210>  116
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 47

<400>  116
cuacuauacu ggugcgaauu ugcacuaguc uaaaauccaa auucccaagg accaccucac       60

aguuau                                                                  66


<210>  117
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 48

<400>  117
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuuuc caaauuccca aggaccaccu       60

cacagu                                                                  66


<210>  118
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 49

<400>  118
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuaau uuccaaauuc ccaaggacca       60

ccucac                                                                  66


<210>  119
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 50

<400>  119
cuacuauacu ggugcgaauu ugcacuaguc uaaaauguaa uuuccaaauu cccaaggacc       60

accuca                                                                  66


<210>  120
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 51

<400>  120
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaggu aauuuccaaa uucccaagga       60

ccaccu                                                                  66


<210>  121
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 52

<400>  121
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaua gguaauuucc aaauucccaa       60

ggacca                                                                  66


<210>  122
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 53

<400>  122
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucugc acauagguaa uuuccaaauu       60

cccaag                                                                  66


<210>  123
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 54

<400>  123
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucucu gcacauaggu aauuuccaaa       60

uuccca                                                                  66


<210>  124
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 55

<400>  124
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaauu ccucugcaca uagguaauuu       60

ccaaau                                                                  66


<210>  125
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 56

<400>  125
cuacuauacu ggugcgaauu ugcacuaguc uaaaauagau cauaauuccu cugcacauag       60

guaauu                                                                  66


<210>  126
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 57

<400>  126
cuacuauacu ggugcgaauu ugcacuaguc uaaaauauga ggacauaacc agccaccucc       60

uggaug                                                                  66


<210>  127
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 58

<400>  127
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugcaa ugaggacaua accagccacc       60

uccugg                                                                  66


<210>  128
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 59

<400>  128
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaauu cgcuccacug uguugagggc       60

aaugag                                                                  66


<210>  129
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 60

<400>  129
cuacuauacu ggugcgaauu ugcacuaguc uaaaauguuu uccaaaggaa uucgcuccac       60

uguguu                                                                  66


<210>  130
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 61

<400>  130
cuacuauacu ggugcgaauu ugcacuaguc uaaaauucug cagguuuucc aaaggaauuc       60

gcucca                                                                  66


<210>  131
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 62

<400>  131
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugauc ugcagguuuu ccaaaggaau       60

ucgcuc                                                                  66


<210>  132
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 63

<400>  132
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugaug aucugcaggu uuuccaaagg       60

aauucg                                                                  66


<210>  133
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 64

<400>  133
cuacuauacu ggugcgaauu ugcacuaguc uaaaauugau gaucugcagg uuuuccaaag       60

gaauuc                                                                  66


<210>  134
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 65

<400>  134
cuacuauacu ggugcgaauu ugcacuaguc uaaaauucug augaucugca gguuuuccaa       60

aggaau                                                                  66


<210>  135
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 66

<400>  135
cuacuauacu ggugcgaauu ugcacuaguc uaaaauauuu ccucugauga ucugcagguu       60

uuccaa                                                                  66


<210>  136
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 67

<400>  136
cuacuauacu ggugcgaauu ugcacuaguc uaaaauauau uuccucugau gaucugcagg       60

uuuucc                                                                  66


<210>  137
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 68

<400>  137
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuuuc guaguacaua uuuccucuga       60

ugaucu                                                                  66


<210>  138
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 69

<400>  138
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuagg aauuuucgua guacauauuu       60

ccucug                                                                  66


<210>  139
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 70

<400>  139
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaua ggaauuuucg uaguacauau       60

uuccuc                                                                  66


<210>  140
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 71

<400>  140
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucugc uaaggcauag gaauuuucgu       60

aguaca                                                                  66


<210>  141
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 72

<400>  141
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugaua agacugcuaa ggcauaggaa       60

uuuucg                                                                  66


<210>  142
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 73

<400>  142
cuacuauacu ggugcgaauu ugcacuaguc uaaaauauag uuagauaaga cugcuaaggc       60

auagga                                                                  66


<210>  143
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 74

<400>  143
cuacuauacu ggugcgaauu ugcacuaguc uaaaauauca uaguuagaua agacugcuaa       60

ggcaua                                                                  66


<210>  144
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 75

<400>  144
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuauu ugcaucauag uuagauaaga       60

cugcua                                                                  66


<210>  145
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 76

<400>  145
cuacuauacu ggugcgaauu ugcacuaguc uaaaauaguc cgguuuuauu ugcaucauag       60

uuagau                                                                  66


<210>  146
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 77

<400>  146
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucuuc aguccgguuu uauuugcauc       60

auaguu                                                                  66


<210>  147
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 78

<400>  147
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuggg cagcuccuuc aguccgguuu       60

uauuug                                                                  66


<210>  148
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 79

<400>  148
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucaug ggcagcuccu ucaguccggu       60

uuuauu                                                                  66


<210>  149
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 80

<400>  149
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuaaa uuucucaugg gcagcuccuu       60

cagucc                                                                  66


<210>  150
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 81

<400>  150
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuagc ccccagcgcc acgaccuccg       60

agcuac                                                                  66


<210>  151
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 82

<400>  151
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugccu cccgacagag cgcuggugcu       60

agcccc                                                                  66


<210>  152
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 83

<400>  152
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuucc agcaccgagc gcccuggccg       60

gugagu                                                                  66


<210>  153
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 84

<400>  153
cuacuauacu ggugcgaauu ugcacuaguc uaaaauagaa aaaagaagag ggauaaaacc       60

cggauc                                                                  66


<210>  154
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 85

<400>  154
cuacuauacu ggugcgaauu ugcacuaguc uaaaauggga aguagagcaa ucuccccaag       60

ccgucg                                                                  66


<210>  155
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 86

<400>  155
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugggg aggagguggu agcuggggcu       60

gggggc                                                                  66


<210>  156
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 87

<400>  156
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucacc ccgccuccgg gcgcgggcuc       60

cggccc                                                                  66


<210>  157
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 88

<400>  157
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucacg gcuccuccga agcgagaaca       60

gcccag                                                                  66


<210>  158
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 89

<400>  158
cuacuauacu ggugcgaauu ugcacuaguc uaaaauuccg ggacggccgg ggcagcgcuc       60

cuggcg                                                                  66


<210>  159
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 90

<400>  159
cuacuauacu ggugcgaauu ugcacuaguc uaaaauccgg gacggccggg gcagcgcucc       60

uggcgc                                                                  66


<210>  160
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 91

<400>  160
cuacuauacu ggugcgaauu ugcacuaguc uaaaauggac ggccggggca gcgcuccugg       60

cgcugc                                                                  66


<210>  161
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 92

<400>  161
cuacuauacu ggugcgaauu ugcacuaguc uaaaaugacg gccggggcag cgcuccuggc       60

gcugcu                                                                  66


<210>  162
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 93

<400>  162
cuacuauacu ggugcgaauu ugcacuaguc uaaaauacgg ccggggcagc gcuccuggcg       60

cugcug                                                                  66


<210>  163
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 94

<400>  163
cuacuauacu ggugcgaauu ugcacuaguc uaaaaucggc cggggcagcg cuccuggcgc       60

ugcugg                                                                  66


<210>  164
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 95

<400>  164
cuacuauacu ggugcgaauu ugcacuaguc uaaaauggcc ggggcagcgc uccuggcgcu       60

gcuggc                                                                  66


<210>  165
<211>  66
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic: CasM EGFR mRNA knockdown experiment target 96

<400>  165
cuacuauacu ggugcgaauu ugcacuaguc uaaaauccgg ggcagcgcuc cuggcgcugc       60

uggcug                                                                  66


