                         SEQUENCE LISTING

<110>  AbSci, LLC
 
<120>  CYTOPLASMIC EXPRESSION SYSTEM

<130>  AbSci-002PCT

<160>  21

<170>  PatentIn version 3.5

<210>  1
<211>  486
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Humicola insolens protein disulfide isomerase (PDI) amino acid 
       sequence, without signal peptide

<400>  1

Met Ser Asp Val Val Gln Leu Lys Lys Asp Thr Phe Asp Asp Phe Ile 
1               5                   10                  15      


Lys Thr Asn Asp Leu Val Leu Ala Glu Phe Phe Ala Pro Trp Cys Gly 
            20                  25                  30          


His Cys Lys Ala Leu Ala Pro Glu Tyr Glu Glu Ala Ala Thr Thr Leu 
        35                  40                  45              


Lys Glu Lys Asn Ile Lys Leu Ala Lys Val Asp Cys Thr Glu Glu Thr 
    50                  55                  60                  


Asp Leu Cys Gln Gln His Gly Val Glu Gly Tyr Pro Thr Leu Lys Val 
65                  70                  75                  80  


Phe Arg Gly Leu Asp Asn Val Ser Pro Tyr Lys Gly Gln Arg Lys Ala 
                85                  90                  95      


Ala Ala Ile Thr Ser Tyr Met Ile Lys Gln Ser Leu Pro Ala Val Ser 
            100                 105                 110         


Glu Val Thr Lys Asp Asn Leu Glu Glu Phe Lys Lys Ala Asp Lys Ala 
        115                 120                 125             


Val Leu Val Ala Tyr Val Asp Ala Ser Asp Lys Ala Ser Ser Glu Val 
    130                 135                 140                 


Phe Thr Gln Val Ala Glu Lys Leu Arg Asp Asn Tyr Pro Phe Gly Ser 
145                 150                 155                 160 


Ser Ser Asp Ala Ala Leu Ala Glu Ala Glu Gly Val Lys Ala Pro Ala 
                165                 170                 175     


Ile Val Leu Tyr Lys Asp Phe Asp Glu Gly Lys Ala Val Phe Ser Glu 
            180                 185                 190         


Lys Phe Glu Val Glu Ala Ile Glu Lys Phe Ala Lys Thr Gly Ala Thr 
        195                 200                 205             


Pro Leu Ile Gly Glu Ile Gly Pro Glu Thr Tyr Ser Asp Tyr Met Ser 
    210                 215                 220                 


Ala Gly Ile Pro Leu Ala Tyr Ile Phe Ala Glu Thr Ala Glu Glu Arg 
225                 230                 235                 240 


Lys Glu Leu Ser Asp Lys Leu Lys Pro Ile Ala Glu Ala Gln Arg Gly 
                245                 250                 255     


Val Ile Asn Phe Gly Thr Ile Asp Ala Lys Ala Phe Gly Ala His Ala 
            260                 265                 270         


Gly Asn Leu Asn Leu Lys Thr Asp Lys Phe Pro Ala Phe Ala Ile Gln 
        275                 280                 285             


Glu Val Ala Lys Asn Gln Lys Phe Pro Phe Asp Gln Glu Lys Glu Ile 
    290                 295                 300                 


Thr Phe Glu Ala Ile Lys Ala Phe Val Asp Asp Phe Val Ala Gly Lys 
305                 310                 315                 320 


Ile Glu Pro Ser Ile Lys Ser Glu Pro Ile Pro Glu Lys Gln Glu Gly 
                325                 330                 335     


Pro Val Thr Val Val Val Ala Lys Asn Tyr Asn Glu Ile Val Leu Asp 
            340                 345                 350         


Asp Thr Lys Asp Val Leu Ile Glu Phe Tyr Ala Pro Trp Cys Gly His 
        355                 360                 365             


Cys Lys Ala Leu Ala Pro Lys Tyr Glu Glu Leu Gly Ala Leu Tyr Ala 
    370                 375                 380                 


Lys Ser Glu Phe Lys Asp Arg Val Val Ile Ala Lys Val Asp Ala Thr 
385                 390                 395                 400 


Ala Asn Asp Val Pro Asp Glu Ile Gln Gly Phe Pro Thr Ile Lys Leu 
                405                 410                 415     


Tyr Pro Ala Gly Ala Lys Gly Gln Pro Val Thr Tyr Ser Gly Ser Arg 
            420                 425                 430         


Thr Val Glu Asp Leu Ile Lys Phe Ile Ala Glu Asn Gly Lys Tyr Lys 
        435                 440                 445             


Ala Ala Ile Ser Glu Asp Ala Glu Glu Thr Ser Ser Ala Thr Glu Thr 
    450                 455                 460                 


Thr Thr Glu Thr Ala Thr Lys Ser Glu Glu Ala Ala Lys Glu Thr Ala 
465                 470                 475                 480 


Thr Glu His Asp Glu Leu 
                485     


<210>  2
<211>  1487
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Protein disulfide isomerase (PDI) expression construct

<400>  2
gctagcagga ggaattcacc atgtctgatg ttgtacaact gaagaaagat acgttcgatg       60

actttatcaa aactaatgac ttggtgctgg cagagttttt cgccccgtgg tgtggccact      120

gcaaagctct ggctccggag tacgaagagg ccgcgaccac cctgaaagaa aagaacatca      180

aactggcgaa agtggactgt acggaagaaa ccgacctgtg tcagcagcac ggcgtggaag      240

gttacccgac cctgaaggtg tttcgtggcc tggacaatgt tagcccgtac aaaggtcaac      300

gtaaggccgc agcgatcacc agctacatga tcaagcagtc gctgcctgca gtctctgagg      360

tgaccaaaga taatctggaa gagttcaaaa aggcagataa ggcggtgctg gttgcctatg      420

ttgatgcaag cgacaaggcg agcagcgagg tctttaccca ggtcgcggag aaattgcgcg      480

ataactaccc gttcggcagc agctccgatg cagctttggc cgaggcggaa ggtgtcaagg      540

ctccggcgat cgttctgtac aaagatttcg acgagggtaa agcggtgttc agcgaaaagt      600

ttgaggtgga agcaattgaa aagttcgcaa aaaccggtgc cacgcctttg attggcgaaa      660

tcggtccgga aacctattct gactatatga gcgccggtat cccgctggcc tacattttcg      720

cagaaacggc agaagagcgc aaagaactga gcgacaagtt gaagccaatt gcagaggcac      780

agcgtggcgt catcaacttt ggtaccattg acgcgaaagc atttggtgcg catgccggta      840

acctgaatct gaaaacggac aaatttccgg cgtttgcgat tcaagaggtg gcgaagaacc      900

aaaagtttcc gttcgatcaa gaaaaagaga ttaccttcga ggcgatcaaa gcgttcgttg      960

acgactttgt tgccggtaaa atcgagccga gcattaagag cgagccgatc ccggagaagc     1020

aggaaggccc ggtgaccgtc gtcgtcgcga agaattacaa cgagattgtt ctggatgaca     1080

cgaaagacgt cctgattgag ttctatgcgc cgtggtgcgg tcattgcaaa gcgctggccc     1140

cgaaatatga agagctgggt gcgctgtacg cgaagagcga gtttaaggac cgtgtggtta     1200

tcgcgaaagt agatgcgacc gccaatgacg ttcctgacga gatccaaggc ttcccgacca     1260

ttaaactgta tccggctggt gctaaaggcc agccagttac ctatagcggt agccgcacgg     1320

ttgaggatct gattaagttc attgccgaga acggcaagta caaggcggca atcagcgagg     1380

atgcagaaga aacgagctcc gcaaccgaaa ccacgacgga aaccgctact aagtccgaag     1440

aggcggcgaa agaaaccgcg acggagcacg atgagctgta agtcgac                   1487


<210>  3
<211>  5304
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Dual-promoter vector, pSOL

<400>  3
ggcctttctt cggtagaagt cttcccccag aggcaggtat caaaggatct tcttgagatc       60

ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg      120

tttgtttgcc ggatcaagag ctaccaactc tttttccgag gtaactggct tcagcagagc      180

gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc      240

tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg      300

cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg      360

gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga      420

actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc      480

ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg      540

gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcatcg      600

atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcagaaag      660

gcccacccga aggtgagcca ggtgattaca tttgggccct catcagaggt tttcaccgtc      720

atcaccgaaa cgcgcgaggc agctgcggta aagctcatca gcgtggtcgt gaagcgattc      780

acagatgtct gcctgttcat ccgcgtccag ctcgttgagt ttctccagaa gcgttaatgt      840

ctggcttctg ataaagcggg ccatgttaag ggcggttttt tcctgtttgg tcatacctgc      900

ttagaaaaac tcatcgagca tcaaatgaaa ttgcaattta ttcatatcag gattatcaat      960

accatatttt tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca     1020

taggatggca agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc     1080

tattaatttc ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac     1140

tgaatccggt gagaatggca aaagtttatg catttctttc cagacttgtt caacaggcca     1200

gccattacgc tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg     1260

cgcctgagcg aggcgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga     1320

gtgcaaccgg cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata     1380

ttcttctaat acctggaacg ctgtttttcc ggggatcgca gtggtgagta accatgcatc     1440

atcaggagta cggataaaat gcttgatggt cggaagtggc ataaattccg tcagccagtt     1500

tagtctgacc atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa     1560

caactctggc gcatcgggct tcccatacaa gcgatagatt gtcgcacctg attgcccgac     1620

attatcgcga gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg     1680

cctcgacgtt tcccgttgaa tatggctcat agctcctgaa aatctcgata actcaaaaaa     1740

tacgcccggt agtgatctta tttcattatg gtgaaagttg gaacctctta cgtgccgatc     1800

aagaagacgg tcaaaagcct ccggtcggag gccgggagag tgttcaccga caaacaacag     1860

ataaaacaaa aggcccagtc ttccgactga gccttttgtt ttatttgatg tctggcagtt     1920

cccgagacgt tatgacaact tgacggctac atcattcact ttttcttcac aaccggcacg     1980

gaactcgctc gggctggccc cggtgcattt tttaaatacc cgcgagaaat agagttgatc     2040

gtcaaaacca acattgcgac cgacggtggc gataggcatc cgggtggtgc tcaaaagcag     2100

cttcgcctgg ctgatacgtt ggtcctcgcg ccagcttaag acgctaatcc ctaactgctg     2160

gcggaaaaga tgtgacagac gcgacggcga caagcaaaca tgctgtgcga cgctggcgat     2220

atcaaaattg ctgtctgcca ggtgatcgct gatgtactga caagcctcgc gtacccgatt     2280

atccatcggt ggatggagcg actcgttaat cgcttccatg cgccgcagta acaattgctc     2340

aagcagattt atcgccagca gctccgaata gcgcccttcc ccttgcccgg cgttaatgat     2400

ttgcccaaac aggtcgctga aatgcggctg gtgcgcttca tccgggcgaa agaaccccgt     2460

attggcaaat attgacggcc agttaagcca ttcatgccag taggcgcgcg gacgaaagta     2520

aacccactgg tgataccatt cgcgagcctc cggatgacga ccgtagtgat gaatctctcc     2580

tggcgggaac agcaaaatat cacccggtcg gcaaacaaat tctcgtccct gatttttcac     2640

caccccctga ccgcgaatgg tgagattgag aatataacct ttcattccca gcggtcggtc     2700

gataaaaaaa tcgagataac cgttggcctc aatcggcgtt aaacccgcca ccagatgggc     2760

attaaacgag tatcccggca gcaggggatc attttgcgct tcagccatac ttttcatact     2820

cccgccattc agagaagaaa ccaattgtcc atattgcatc agacattgcc gtctctgcgt     2880

cttttactgg ctcttctcgc taaccaaacc ggtaaccccg cttattaaaa gcattctgta     2940

acaaagcggg accaaagcca tgacaaaaac gcgtaacaaa agtgtctata atcacggcag     3000

aaaagtccac attgattatt tgcacggcgt cacactttgc tatgccatag catttttatc     3060

cataagatta gcggatccta cctgacgctt tttatcgcaa ctctctactg tttctccata     3120

cccgtttttt tgggctagca ggaggtaaaa aaaatgtgag accggtctcg gtctagatcg     3180

gtcagtttca cctgatttac gtaaaaaccc gcttcggcgg gtttttgctt ttggaggggc     3240

agaaagatga atgactgtct ctcctgttag tgagggttaa tgcccggaac gaagaaaggc     3300

ccacccgtga aggtgagcca gtgagttggt tacattttct cttgagggtt tagcttttca     3360

gacgacgcca aaaggtcgta cgtgaaatac ccaaatagtt ggccgcagcc gtcttgtcac     3420

cattaaactt ctcaagcgct tgctgcgggg tcagcaaacg cggagccggc gtctttgcgc     3480

tctcacgcgc cagctccggc agcagcaact gcatgaattg cggagtcaga tccggggtcg     3540

gctcaacgga caggaacagc gccaggcgtt ccatcatatt acgcagctcg cggatgttac     3600

ccggccagtc ataatgcagc agcaccgttt cgctcgcctg cagaccctgg cgcagtgccg     3660

cagagaacgg tgcgctcagg gctgccagcg agactttcag gaaagactcc gccagcggta     3720

aaatgtcggc gacacgttca cgcaacggcg ggagctgcag acgcagaatg ctcaggcggt     3780

agaacaggtc gcgacgaaaa cggccctgtt gcatatcctc ttccagattg cagtgggtcg     3840

cgctaatcac gcgcacgtct accggaaccg gttgatgacc accgacgcgg gtcacttctt     3900

tctcttccag cacacgcagc agacgggttt gcaatggcag cggcatctca ccgatctcgt     3960

ccaggaacaa ggtgccgccg tgggcaattt caaacaaacc agcacggcca ccgcgacggc     4020

tacccgtgaa tgcgccctct tcgtagccaa acagctcagc ttccagcagg ctttccgcga     4080

ttgcaccgca attaactgcc acaaacggat gagatttctt accctggcgg gcatcgtgac     4140

gggcgaaata ctcacgatgg attgcttgcg cagccagttc cttacccgta ccagtctcgc     4200

cttcgatcag aacagccgcg ctgctacgtg catacagcag aatggtctgg cgaacttgct     4260

ccatttgagg gctttggccc agcatatcac ccaggacata acgggtacgc agcgcattac     4320

gcgtcgcatc gtgggtgttg tggcgcaggc tcattctggt catgtccagg gcgtcgctga     4380

acgcctgacg caccgttgcc gcgctgtaga taaagatgcc cgtcatgccc gcttcttcgg     4440

ccaagtccgt gatcagaccc gcaccaacca cagcctcggt accgttcgct ttcagttcgt     4500

tgatctggcc acgtgcatct tcctcggtaa tgtagctgcg ttggtccagg cgcagattaa     4560

aggtcttttg aaacgcgacc agcgcaggga tcgtttcctg gtaggtgaca acgccaatcg     4620

aggaggtcag tttgcctgcc ttcgccagcg cctgcaagac atcgtaaccg ctcggcttaa     4680

tcaggatcac cggcacggac agacgggatt tcaggtaggc accattgcta cccgctgcga     4740

taatggcgtc acaacgctcg ttggccagct ttttgcgaat gtaggtaacg gctttctcga     4800

aacccagctg aatcggagtg atgttcgcca ggtgatcaaa ctccaggcta atgtcgcgga     4860

acaactcgaa cagacgggtg acgctaacgg tccaaataac tggtttatca tcgttcaaac     4920

gcggtgggtg tgccatggtg aatacctcct gttaagaaac cgaatattgg gtttaaactt     4980

gtttcataat tgttgcaatg aaacgcggtg aaacattgcc tgaaacgtta actgaaacgc     5040

atatttgcgg attagttcat gactttatct ctaacaaatt gaaattaaac atttaatttt     5100

attaaggcaa ttgtggcaca ccccttgctt tgtctttatc aacgcaaata acaagttgat     5160

aacaaaagct taggaggaaa acatagagac cggtctctct cgagtaacta gttgatagag     5220

atcaagcctt aacgaactaa gacccccgca ccgaaaggtc cgggggtttt ttttgacctt     5280

aaaaacataa ccgaggagca gaca                                            5304


<210>  4
<211>  21
<212>  PRT
<213>  Homo sapiens

<400>  4

Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu 
1               5                   10                  15      


Glu Asn Tyr Cys Asn 
            20      


<210>  5
<211>  30
<212>  PRT
<213>  Homo sapiens

<400>  5

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1               5                   10                  15      


Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr 
            20                  25                  30  


<210>  6
<211>  30
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Insulin lispro, mature B chain

<400>  6

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1               5                   10                  15      


Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Lys Pro Thr 
            20                  25                  30  


<210>  7
<211>  30
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Insulin aspart, mature B chain

<400>  7

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1               5                   10                  15      


Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Asp Lys Thr 
            20                  25                  30  


<210>  8
<211>  30
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Insulin glulisine, mature B chain

<400>  8

Phe Val Lys Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1               5                   10                  15      


Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Glu Thr 
            20                  25                  30  


<210>  9
<211>  21
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Insulin glargine, mature A chain

<400>  9

Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu 
1               5                   10                  15      


Glu Asn Tyr Cys Gly 
            20      


<210>  10
<211>  32
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Insulin glargine, mature B chain

<400>  10

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1               5                   10                  15      


Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg 
            20                  25                  30          


<210>  11
<211>  29
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Insulin degludec, mature B chain; deletion of B30 threonine and 
       modification of lysine at B29 with a hexadecanedioic acid 
       molecule bound to B29 through an L-gamma-Glu linker

<400>  11

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1               5                   10                  15      


Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys 
            20                  25                  


<210>  12
<211>  29
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Insulin detemir, mature B chain; deletion of B30 threonine and 
       modification of lysine at B29 with a myristic acid molecule

<400>  12

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
1               5                   10                  15      


Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys 
            20                  25                  


<210>  13
<211>  168
<212>  PRT
<213>  Hog cholera virus (strain Alfort)

<400>  13

Met Glu Leu Asn His Phe Glu Leu Leu Tyr Lys Thr Ser Lys Gln Lys 
1               5                   10                  15      


Pro Val Gly Val Glu Glu Pro Val Tyr Asp Thr Ala Gly Arg Pro Leu 
            20                  25                  30          


Phe Gly Asn Pro Ser Glu Val His Pro Gln Ser Thr Leu Lys Leu Pro 
        35                  40                  45              


His Asp Arg Gly Arg Gly Asp Ile Arg Thr Thr Leu Arg Asp Leu Pro 
    50                  55                  60                  


Arg Lys Gly Asp Cys Arg Ser Gly Asn His Leu Gly Pro Val Ser Gly 
65                  70                  75                  80  


Ile Tyr Ile Lys Pro Gly Pro Val Tyr Tyr Gln Asp Tyr Thr Gly Pro 
                85                  90                  95      


Val Tyr His Arg Ala Pro Leu Glu Phe Phe Asp Glu Ala Gln Phe Cys 
            100                 105                 110         


Glu Val Thr Lys Arg Ile Gly Arg Val Thr Gly Ser Asp Gly Lys Leu 
        115                 120                 125             


Tyr His Ile Tyr Val Cys Val Asp Gly Cys Ile Leu Leu Lys Leu Ala 
    130                 135                 140                 


Lys Arg Gly Thr Pro Arg Thr Leu Lys Trp Ile Arg Asn Phe Thr Asn 
145                 150                 155                 160 


Cys Pro Leu Trp Val Thr Ser Cys 
                165             


<210>  14
<211>  90
<212>  PRT
<213>  Sus scrofa

<400>  14

His Phe Glu Gly Glu Lys Val Phe Arg Val Asn Val Glu Asp Glu Asn 
1               5                   10                  15      


Asp Ile Ser Leu Leu His Glu Leu Ala Ser Thr Arg Gln Ile Asp Phe 
            20                  25                  30          


Trp Lys Pro Asp Ser Val Thr Gln Ile Lys Pro His Ser Thr Val Asp 
        35                  40                  45              


Phe Arg Val Lys Ala Glu Asp Ile Leu Ala Val Glu Asp Phe Leu Glu 
    50                  55                  60                  


Gln Asn Glu Leu Gln Tyr Glu Val Leu Ile Asn Asn Leu Arg Ser Val 
65                  70                  75                  80  


Leu Glu Ala Gln Phe Asp Ser Arg Val Arg 
                85                  90  


<210>  15
<211>  91
<212>  PRT
<213>  Caenorhabditis elegans

<400>  15

Met Ala Asp Asp Ala Ala Gln Ala Gly Asp Asn Ala Glu Tyr Ile Lys 
1               5                   10                  15      


Ile Lys Val Val Gly Gln Asp Ser Asn Glu Val His Phe Arg Val Lys 
            20                  25                  30          


Tyr Gly Thr Ser Met Ala Lys Leu Lys Lys Ser Tyr Ala Asp Arg Thr 
        35                  40                  45              


Gly Val Ala Val Asn Ser Leu Arg Phe Leu Phe Asp Gly Arg Arg Ile 
    50                  55                  60                  


Asn Asp Asp Asp Thr Pro Lys Thr Leu Glu Met Glu Asp Asp Asp Val 
65                  70                  75                  80  


Ile Glu Val Tyr Gln Glu Gln Leu Gly Gly Phe 
                85                  90      


<210>  16
<211>  189
<212>  PRT
<213>  Saccharomyces cerevisiae

<400>  16

Met Lys Ala Ile Asp Lys Met Thr Asp Asn Pro Pro Gln Glu Gly Leu 
1               5                   10                  15      


Ser Gly Arg Lys Ile Ile Tyr Asp Glu Asp Gly Lys Pro Cys Arg Ser 
            20                  25                  30          


Cys Asn Thr Leu Leu Asp Phe Gln Tyr Val Thr Gly Lys Ile Ser Asn 
        35                  40                  45              


Gly Leu Lys Asn Leu Ser Ser Asn Gly Lys Leu Ala Gly Thr Gly Ala 
    50                  55                  60                  


Leu Thr Gly Glu Ala Ser Glu Leu Met Pro Gly Ser Arg Thr Tyr Arg 
65                  70                  75                  80  


Lys Val Asp Pro Pro Asp Val Glu Gln Leu Gly Arg Ser Ser Trp Thr 
                85                  90                  95      


Leu Leu His Ser Val Ala Ala Ser Tyr Pro Ala Gln Pro Thr Asp Gln 
            100                 105                 110         


Gln Lys Gly Glu Met Lys Gln Phe Leu Asn Ile Phe Ser His Ile Tyr 
        115                 120                 125             


Pro Cys Asn Trp Cys Ala Lys Asp Phe Glu Lys Tyr Ile Arg Glu Asn 
    130                 135                 140                 


Ala Pro Gln Val Glu Ser Arg Glu Glu Leu Gly Arg Trp Met Cys Glu 
145                 150                 155                 160 


Ala His Asn Lys Val Asn Lys Lys Leu Arg Lys Pro Lys Phe Asp Cys 
                165                 170                 175     


Asn Phe Trp Glu Lys Arg Trp Lys Asp Gly Trp Asp Glu 
            180                 185                 


<210>  17
<211>  570
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Polynucleotide encoding Erv1p, optimized for expression in 
       prokaryotes such as E. coli

<400>  17
atgaaagcga ttgataagat gaccgataat ccaccgcaag aaggtctgag cggccgtaaa       60

atcatctacg acgaagatgg caaaccgtgt cgtagctgca acaccctgct ggactttcaa      120

tatgtgacgg gtaagatttc caatggcctg aaaaacctga gcagcaatgg caagctggcc      180

ggtacgggtg ctttgaccgg tgaggcgtct gaactgatgc ctggtagccg tacgtaccgc      240

aaggttgatc cgccggacgt tgagcagctg ggtcgctcca gctggacttt gctgcatagc      300

gtcgcggcga gctacccggc acagccgacc gaccagcaaa agggtgagat gaaacagttt      360

ctgaacattt tctcgcacat ctatccgtgc aattggtgtg ccaaagactt tgaaaagtat      420

atccgtgaga atgcgccgca agtggagagc cgcgaagaac tgggccgttg gatgtgtgag      480

gcacacaaca aagtcaacaa aaagctgcgt aaaccgaagt tcgattgcaa cttctgggag      540

aagcgctgga aagacggctg ggatgagtaa                                       570


<210>  18
<211>  13
<212>  PRT
<213>  Homo sapiens

<400>  18

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu 
1               5                   10              


<210>  19
<211>  8
<212>  PRT
<213>  Homo sapiens

<400>  19

Ala Leu Tyr Leu Val Cys Gly Glu 
1               5               


<210>  20
<211>  13
<212>  PRT
<213>  Homo sapiens

<400>  20

Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu 
1               5                   10              


<210>  21
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Fragment of human insulin glargine A chain produced by Glu-C 
       digest; amino acids 18-21 of SEQ ID NO:9

<400>  21

Asn Tyr Cys Gly 
1               


