                               SEQUENCE LISTING

<110> THE SCRIPPS RESEARCH INSTITUTE
 
<120> COMPOSITIONS AND METHODS FOR IN VIVO SYNTHESIS OF UNNATURAL 
      POLYPEPTIDES

<130> 36271-809.601

<140>
<141>

<150> 62/988,882
<151> 2020-03-12

<150> 62/913,664
<151> 2019-10-10

<160> 28    

<170> PatentIn version 3.5

<210> 1
<211> 575
<212> PRT
<213> Phaeodactylum tricornutum

<400> 1
Met Arg Pro Tyr Pro Thr Ile Ala Leu Ile Ser Val Phe Leu Ser Ala 
1               5                   10                  15      


Ala Thr Arg Ile Ser Ala Thr Ser Ser His Gln Ala Ser Ala Leu Pro 
            20                  25                  30          


Val Lys Lys Gly Thr His Val Pro Asp Ser Pro Lys Leu Ser Lys Leu 
        35                  40                  45              


Tyr Ile Met Ala Lys Thr Lys Ser Val Ser Ser Ser Phe Asp Pro Pro 
    50                  55                  60                  


Arg Gly Gly Ser Thr Val Ala Pro Thr Thr Pro Leu Ala Thr Gly Gly 
65                  70                  75                  80  


Ala Leu Arg Lys Val Arg Gln Ala Val Phe Pro Ile Tyr Gly Asn Gln 
                85                  90                  95      


Glu Val Thr Lys Phe Leu Leu Ile Gly Ser Ile Lys Phe Phe Ile Ile 
            100                 105                 110         


Leu Ala Leu Thr Leu Thr Arg Asp Thr Lys Asp Thr Leu Ile Val Thr 
        115                 120                 125             


Gln Cys Gly Ala Glu Ala Ile Ala Phe Leu Lys Ile Tyr Gly Val Leu 
    130                 135                 140                 


Pro Ala Ala Thr Ala Phe Ile Ala Leu Tyr Ser Lys Met Ser Asn Ala 
145                 150                 155                 160 


Met Gly Lys Lys Met Leu Phe Tyr Ser Thr Cys Ile Pro Phe Phe Thr 
                165                 170                 175     


Phe Phe Gly Leu Phe Asp Val Phe Ile Tyr Pro Asn Ala Glu Arg Leu 
            180                 185                 190         


His Pro Ser Leu Glu Ala Val Gln Ala Ile Leu Pro Gly Gly Ala Ala 
        195                 200                 205             


Ser Gly Gly Met Ala Val Leu Ala Lys Ile Ala Thr His Trp Thr Ser 
    210                 215                 220                 


Ala Leu Phe Tyr Val Met Ala Glu Ile Tyr Ser Ser Val Ser Val Gly 
225                 230                 235                 240 


Leu Leu Phe Trp Gln Phe Ala Asn Asp Val Val Asn Val Asp Gln Ala 
                245                 250                 255     


Lys Arg Phe Tyr Pro Leu Phe Ala Gln Met Ser Gly Leu Ala Pro Val 
            260                 265                 270         


Leu Ala Gly Gln Tyr Val Val Arg Phe Ala Ser Lys Ala Val Asn Phe 
        275                 280                 285             


Glu Ala Ser Met His Arg Leu Thr Ala Ala Val Thr Phe Ala Gly Ile 
    290                 295                 300                 


Met Ile Cys Ile Phe Tyr Gln Leu Ser Ser Ser Tyr Val Glu Arg Thr 
305                 310                 315                 320 


Glu Ser Ala Lys Pro Ala Ala Asp Asn Glu Gln Ser Ile Lys Pro Lys 
                325                 330                 335     


Lys Lys Lys Pro Lys Met Ser Met Val Glu Ser Gly Lys Phe Leu Ala 
            340                 345                 350         


Ser Ser Gln Tyr Leu Arg Leu Ile Ala Met Leu Val Leu Gly Tyr Gly 
        355                 360                 365             


Leu Ser Ile Asn Phe Thr Glu Ile Met Trp Lys Ser Leu Val Lys Lys 
    370                 375                 380                 


Gln Tyr Pro Asp Pro Leu Asp Tyr Gln Arg Phe Met Gly Asn Phe Ser 
385                 390                 395                 400 


Ser Ala Val Gly Leu Ser Thr Cys Ile Val Ile Phe Phe Gly Val His 
                405                 410                 415     


Val Ile Arg Leu Leu Gly Trp Lys Val Gly Ala Leu Ala Thr Pro Gly 
            420                 425                 430         


Ile Met Ala Ile Leu Ala Leu Pro Phe Phe Ala Cys Ile Leu Leu Gly 
        435                 440                 445             


Leu Asp Ser Pro Ala Arg Leu Glu Ile Ala Val Ile Phe Gly Thr Ile 
    450                 455                 460                 


Gln Ser Leu Leu Ser Lys Thr Ser Lys Tyr Ala Leu Phe Asp Pro Thr 
465                 470                 475                 480 


Thr Gln Met Ala Tyr Ile Pro Leu Asp Asp Glu Ser Lys Val Lys Gly 
                485                 490                 495     


Lys Ala Ala Ile Asp Val Leu Gly Ser Arg Ile Gly Lys Ser Gly Gly 
            500                 505                 510         


Ser Leu Ile Gln Gln Gly Leu Val Phe Val Phe Gly Asn Ile Ile Asn 
        515                 520                 525             


Ala Ala Pro Val Val Gly Val Val Tyr Tyr Ser Val Leu Val Ala Trp 
    530                 535                 540                 


Met Ser Ala Ala Gly Arg Leu Ser Gly Leu Phe Gln Ala Gln Thr Glu 
545                 550                 555                 560 


Met Asp Lys Ala Asp Lys Met Glu Ala Lys Thr Asn Lys Glu Lys 
                565                 570                 575 


<210> 2
<211> 40
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 2
atgggtctca cacaaactcg agtacaactt taactcacac                             40


<210> 3
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 3
atgggtctcg attccattct tttgtttgtc tgc                                    33


<210> 4
<211> 31
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 4
cataatggtc tcgctgctgc ccgataacca c                                      31


<210> 5
<211> 42
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 5
tgatattggt ctcggtcttt cgataaaaca ctctgagtag ag                          42


<210> 6
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 6
atgggtctcg aaacctgatc atgtagatcg aacgg                                  35


<210> 7
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 7
atgggtctca tctaacccgg ctgaacgg                                          28


<210> 8
<211> 32
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 8
atgggtctcc ggtagttcag cagggcagaa cg                                     32


<210> 9
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 9
atgggtctcg gaggggattt gaacccctgc catg                                   34


<210> 10
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 10
atattcggtc tcgtcagcag aatacgccga ttgg                                   34


<210> 11
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 11
acgcgttggt ctcggttatc gggcagcagc acc                                    33


<210> 12
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 12
attggtctcg gccgagcggt tgaaggcac                                         29


<210> 13
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 13
attggtctct ctggaaccct ttcgggtcg                                         29


<210> 14
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 14
ctcgagtaca actttaactc acac                                              24


<210> 15
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 15
gattccattc ttttgtttgt ctgc                                              24


<210> 16
<211> 19
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 16
gctgctgccc gataaccac                                                    19


<210> 17
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 17
ggtctttcga taaaacactc tgagtagag                                         29


<210> 18
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 18
gaaacctgat catgtagatc gaacgg                                            26


<210> 19
<211> 19
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 19
atctaacccg gctgaacgg                                                    19


<210> 20
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 20
ccgctgccac taggaagctt atg                                               23


<210> 21
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide

<400> 21
cctctagaaa atcattccgg aagtgtg                                           27


<210> 22
<211> 56
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (30)..(30)
<223> An unnatural nucleotide

<220> 
<223> See specification as filed for detailed description of
      substitutions and preferred embodiments

<400> 22
ctctggaacc ctttcgggtc gccggtttgn tagaccggtg ccttcaaccg ctcggc           56


<210> 23
<211> 63
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (31)..(33)
<223> a, c, t, or g

<220> 
<223> See specification as filed for detailed description of
      substitutions and preferred embodiments

<400> 23
ctcgagtaca actttaactc acacaatgta nnnatcacgg cagacaaaca aaagaatgga       60

atc                                                                     63


<210> 24
<211> 49
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (23)..(23)
<223> An unnatural nucleotide 

<220> 
<223> See specification as filed for detailed description of
      substitutions and preferred embodiments

<400> 24
cagcagaata cgccgattgg cgntggcccg gtgctgctgc ccgataacc                   49


<210> 25
<211> 53
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (21)..(21)
<223> An unnatural nucleotide 

<220> 
<223> See specification as filed for detailed description of
      substitutions and preferred embodiments

<400> 25
gctgctgccc gataaccaca ncctctctac tcagagtgtt ttatcgaaag acc              53


<210> 26
<211> 43
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (19)..(19)
<223> An unnatural nucleotide 

<220> 
<223> See specification as filed for detailed description of
      substitutions and preferred embodiments

<400> 26
gctgcccgat aaccacagnt tgtctactca gagtgtttta tcg                         43


<210> 27
<211> 52
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (25)..(27)
<223> a, c, t, or g

<220> 
<223> See specification as filed for detailed description of
      substitutions and preferred embodiments

<400> 27
gaatctaacc cggctgaacg gattnnnagt ccgttcgatc tacatgatca gg               52


<210> 28
<211> 52
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      oligonucleotide


<220>
<221> modified_base
<222> (27)..(27)
<223> An unnatural nucleotide 

<220> 
<223> See specification as filed for detailed description of
      substitutions and preferred embodiments

<400> 28
gatttgaacc cctgccatgc ggattancag tccgccgttc tgccctgctg aa               52



