                         SEQUENCE LISTING

<110>  R.P. Scherer Technologies
       Rabuka, David
       Drake, Penelope M.
       Kim, Yun Cheol
       Barfield, Robyn M.
       Bauzon, Maxine
       Ogunkoya, Ayodele
 
<120>  ANTIBODY SPECIFIC FOR MUCIN-1 AND METHODS OF USE THEREOF

<130>  RDWD-035WO

<150>  US 63/059,497
<151>  2020-07-31

<160>  46    

<170>  PatentIn version 3.5

<210>  1
<211>  117
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  1

Glu Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
1               5                   10                  15      


Thr Val Lys Ile Ser Cys Lys Val Ser Gly Tyr Thr Phe Thr Asp His 
            20                  25                  30          


Thr Met His Trp Ile Lys Gln Arg Pro Gly Lys Gly Leu Glu Trp Met 
        35                  40                  45              


Gly Tyr Phe Tyr Pro Arg Asp Asp Ser Thr Asn Tyr Asn Glu Lys Phe 
    50                  55                  60                  


Lys Gly Arg Val Thr Leu Thr Ala Asp Lys Ser Thr Asp Thr Ala Tyr 
65                  70                  75                  80  


Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
                85                  90                  95      


Ala Arg Gly Leu Arg Tyr Ala Leu Asp Tyr Trp Gly Gln Gly Thr Leu 
            100                 105                 110         


Val Thr Val Ser Ser 
        115         


<210>  2
<211>  108
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  2

Glu Ile Val Leu Thr Gln Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
1               5                   10                  15      


Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Ser Ser Val Ser Ser Ser 
            20                  25                  30          


Tyr Leu Tyr Trp Tyr Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Trp 
        35                  40                  45              


Ile Tyr Gly Thr Ser Asn Leu Ala Ser Gly Val Pro Ala Arg Phe Ser 
    50                  55                  60                  


Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr Ile Ser Ser Leu Glu 
65                  70                  75                  80  


Pro Glu Asp Ala Ala Val Tyr Tyr Cys His Gln Tyr Ala Trp Ser Pro 
                85                  90                  95      


Pro Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Lys 
            100                 105             


<210>  3
<211>  108
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  3

Glu Ile Val Leu Thr Gln Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
1               5                   10                  15      


Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Ser Ser Val Gly Ser Ser 
            20                  25                  30          


Asn Leu Tyr Trp Tyr Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Trp 
        35                  40                  45              


Ile Tyr Arg Ser Thr Lys Leu Ala Ser Gly Val Pro Ala Arg Phe Ser 
    50                  55                  60                  


Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr Ile Ser Ser Leu Glu 
65                  70                  75                  80  


Pro Glu Asp Ala Ala Val Tyr Tyr Cys His Gln Tyr Arg Trp Ser Pro 
                85                  90                  95      


Pro Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Lys 
            100                 105             


<210>  4
<211>  108
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  4

Glu Ile Val Leu Thr Gln Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
1               5                   10                  15      


Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Ser Ser Val Ser Ser Ser 
            20                  25                  30          


Tyr Leu Tyr Trp Tyr Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Trp 
        35                  40                  45              


Ile Ile Gly Thr Ser Asn Leu Ala Ser Gly Val Pro Ala Arg Phe Ser 
    50                  55                  60                  


Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr Ile Ser Ser Leu Glu 
65                  70                  75                  80  


Pro Glu Asp Ala Ala Val Tyr Tyr Cys His Gln Tyr Ser Trp Ser Pro 
                85                  90                  95      


Pro Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Lys 
            100                 105             


<210>  5
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence


<220>
<221>  SITE
<222>  (2)..(2)
<223>  The amino acid at position 2 is Cys or Ser

<400>  5

Leu Xaa Thr Pro Ser Arg 
1               5       


<210>  6
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  6

Leu Cys Thr Pro Ser Arg 
1               5       


<210>  7
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  7

Gly Tyr Thr Phe Thr Asp His 
1               5           


<210>  8
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  8

Tyr Pro Arg Asp Asp Ser 
1               5       


<210>  9
<211>  8
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  9

Gly Leu Arg Tyr Ala Leu Asp Tyr 
1               5               


<210>  10
<211>  12
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  10

Arg Ala Ser Ser Ser Val Ser Ser Ser Tyr Leu Tyr 
1               5                   10          


<210>  11
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  11

Gly Thr Ser Asn Leu Ala Ser 
1               5           


<210>  12
<211>  9
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  12

His Gln Tyr Ala Trp Ser Pro Pro Thr 
1               5                   


<210>  13
<211>  12
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  13

Arg Ala Ser Ser Ser Val Gly Ser Ser Asn Leu Tyr 
1               5                   10          


<210>  14
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  14

Arg Ser Thr Lys Leu Ala Ser 
1               5           


<210>  15
<211>  9
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  15

His Gln Tyr Arg Trp Ser Pro Pro Thr 
1               5                   


<210>  16
<211>  9
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  16

His Gln Tyr Ser Trp Ser Pro Pro Thr 
1               5                   


<210>  17
<211>  5
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  17

Asp His Thr Met His 
1               5   


<210>  18
<211>  17
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  18

Tyr Phe Tyr Pro Arg Asp Asp Ser Thr Asn Tyr Asn Glu Lys Phe Lys 
1               5                   10                  15      


Gly 
    


<210>  19
<211>  351
<212>  DNA
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  19
gaggtccagc tggtacagtc tggggctgag gtgaagaagc ctggggctac agtgaaaatc       60

tcctgcaagg tttctggata caccttcacc gaccatacca tgcactggat caaacagcga      120

cctggaaaag ggcttgagtg gatgggatac ttctacccta gagatgattc cacaaattac      180

aacgagaagt tcaagggcag agtcaccctt accgcggaca aatctacaga cacagcctac      240

atggagctga gcagcctgag atctgaggac acggccgtgt attactgtgc gcgtggtctt      300

cgatacgctc ttgactactg gggccaagga accctggtca ccgtctcctc a               351


<210>  20
<211>  1255
<212>  PRT
<213>  Homo sapiens

<400>  20

Met Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
1               5                   10                  15      


Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 
            20                  25                  30          


Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser 
        35                  40                  45              


Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His 
    50                  55                  60                  


Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu 
65                  70                  75                  80  


Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln 
                85                  90                  95      


Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr 
            100                 105                 110         


Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro 
        115                 120                 125             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    130                 135                 140                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
145                 150                 155                 160 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                165                 170                 175     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            180                 185                 190         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        195                 200                 205             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    210                 215                 220                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
225                 230                 235                 240 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                245                 250                 255     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            260                 265                 270         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        275                 280                 285             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    290                 295                 300                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
305                 310                 315                 320 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                325                 330                 335     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            340                 345                 350         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        355                 360                 365             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    370                 375                 380                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
385                 390                 395                 400 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                405                 410                 415     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            420                 425                 430         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        435                 440                 445             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    450                 455                 460                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
465                 470                 475                 480 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                485                 490                 495     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            500                 505                 510         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        515                 520                 525             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    530                 535                 540                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
545                 550                 555                 560 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                565                 570                 575     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            580                 585                 590         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        595                 600                 605             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    610                 615                 620                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
625                 630                 635                 640 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                645                 650                 655     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            660                 665                 670         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        675                 680                 685             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    690                 695                 700                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
705                 710                 715                 720 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                725                 730                 735     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            740                 745                 750         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        755                 760                 765             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    770                 775                 780                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
785                 790                 795                 800 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                805                 810                 815     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            820                 825                 830         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        835                 840                 845             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
    850                 855                 860                 


Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
865                 870                 875                 880 


Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His 
                885                 890                 895     


Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
            900                 905                 910         


Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro 
        915                 920                 925             


Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Asn 
    930                 935                 940                 


Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser 
945                 950                 955                 960 


Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly 
                965                 970                 975     


Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe 
            980                 985                 990         


Ser Ile Pro Ser His His Ser Asp  Thr Pro Thr Thr Leu  Ala Ser His 
        995                 1000                 1005             


Ser Thr  Lys Thr Asp Ala Ser  Ser Thr His His Ser  Ser Val Pro 
    1010                 1015                 1020             


Pro Leu  Thr Ser Ser Asn His  Ser Thr Ser Pro Gln  Leu Ser Thr 
    1025                 1030                 1035             


Gly Val  Ser Phe Phe Phe Leu  Ser Phe His Ile Ser  Asn Leu Gln 
    1040                 1045                 1050             


Phe Asn  Ser Ser Leu Glu Asp  Pro Ser Thr Asp Tyr  Tyr Gln Glu 
    1055                 1060                 1065             


Leu Gln  Arg Asp Ile Ser Glu  Met Phe Leu Gln Ile  Tyr Lys Gln 
    1070                 1075                 1080             


Gly Gly  Phe Leu Gly Leu Ser  Asn Ile Lys Phe Arg  Pro Gly Ser 
    1085                 1090                 1095             


Val Val  Val Gln Leu Thr Leu  Ala Phe Arg Glu Gly  Thr Ile Asn 
    1100                 1105                 1110             


Val His  Asp Val Glu Thr Gln  Phe Asn Gln Tyr Lys  Thr Glu Ala 
    1115                 1120                 1125             


Ala Ser  Arg Tyr Asn Leu Thr  Ile Ser Asp Val Ser  Val Ser Asp 
    1130                 1135                 1140             


Val Pro  Phe Pro Phe Ser Ala  Gln Ser Gly Ala Gly  Val Pro Gly 
    1145                 1150                 1155             


Trp Gly  Ile Ala Leu Leu Val  Leu Val Cys Val Leu  Val Ala Leu 
    1160                 1165                 1170             


Ala Ile  Val Tyr Leu Ile Ala  Leu Ala Val Cys Gln  Cys Arg Arg 
    1175                 1180                 1185             


Lys Asn  Tyr Gly Gln Leu Asp  Ile Phe Pro Ala Arg  Asp Thr Tyr 
    1190                 1195                 1200             


His Pro  Met Ser Glu Tyr Pro  Thr Tyr His Thr His  Gly Arg Tyr 
    1205                 1210                 1215             


Val Pro  Pro Ser Ser Thr Asp  Arg Ser Pro Tyr Glu  Lys Val Ser 
    1220                 1225                 1230             


Ala Gly  Asn Gly Gly Ser Ser  Leu Ser Tyr Thr Asn  Pro Ala Val 
    1235                 1240                 1245             


Ala Ala  Thr Ser Ala Asn Leu  
    1250                 1255 


<210>  21
<211>  5
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  21

Gly Ser Gly Gly Ser 
1               5   


<210>  22
<211>  4
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  22

Gly Gly Gly Ser 
1               


<210>  23
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  23

Leu Ser Thr Pro Ser Arg 
1               5       


<210>  24
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence


<220>
<221>  SITE
<222>  (2)..(2)
<223>  The amino acid at position 2 is 2-formylglycine

<400>  24

Leu Xaa Thr Pro Ser Arg 
1               5       


<210>  25
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence


<220>
<221>  SITE
<222>  (2)..(2)
<223>  The amino acid at position 2 is a 2-formylglycine residue having 
       a covalently attached moiety

<400>  25

Leu Xaa Thr Pro Ser Arg 
1               5       


<210>  26
<211>  20
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  26

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro 
1               5                   10                  15      


Pro Ala His Gly 
            20  


<210>  27
<211>  60
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  27

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro 
1               5                   10                  15      


Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly 
            20                  25                  30          


Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg 
        35                  40                  45              


Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly 
    50                  55                  60  


<210>  28
<211>  20
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  28

Pro Leu Pro Val Thr Ser Thr Ser Ser Ala Ser Thr Gly His Ala Thr 
1               5                   10                  15      


Pro Leu Ala Val 
            20  


<210>  29
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence


<220>
<221>  SITE
<222>  (1)..(1)
<223>  The amino acid at position 1 is either present or absent and, 
       when present, can be any amino acid though usually an aliphatic 
       amino acid, a sulfur-containing amino acid, or a polar, uncharged
       amino acid

<220>
<221>  SITE
<222>  (2)..(2)
<223>  The amino acid at position 2 is cysteine or serine

<220>
<221>  SITE
<222>  (3)..(3)
<223>  The amino acid at position 3 can be any amino acid though usually
       an aliphatic amino acid, a polar, uncharged amino acid, or a 
       sulfur containing amino acid

<220>
<221>  SITE
<222>  (4)..(4)
<223>  The amino acid at position 4 is proline or alanine

<220>
<221>  SITE
<222>  (5)..(5)
<223>  The amino acid at position 5 is any amino acid though usually an 
       aliphatic amino acid, a polar, uncharged amino acid, or a sulfur 
       containing amino acid

<220>
<221>  SITE
<222>  (6)..(6)
<223>  The amino acid at position 6 is a basic amino acid (e.g., Arg, 
       Lys, His) or an aliphatic amino acid (e.g., Ala, Gly, Leu, Val, 
       Ile, or Pro)

<400>  29

Xaa Xaa Xaa Xaa Xaa Xaa 
1               5       


<210>  30
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence


<220>
<221>  SITE
<222>  (1)..(1)
<223>  The amino acid at position 1 is either present or absent and, 
       when present, is any amino acid

<220>
<221>  SITE
<222>  (2)..(2)
<223>  The amino acid at position 2 is 2-formylglycine

<220>
<221>  SITE
<222>  (3)..(3)
<223>  The amino acid at position 3 is any amino acid

<220>
<221>  SITE
<222>  (4)..(4)
<223>  The amino acid at position 4 is proline or alanine

<220>
<221>  SITE
<222>  (5)..(5)
<223>  The amino acid at position 5 is any amino acid

<220>
<221>  SITE
<222>  (6)..(6)
<223>  The amino acid at position 6 is a basic amino acid

<400>  30

Xaa Xaa Xaa Xaa Xaa Xaa 
1               5       


<210>  31
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence


<220>
<221>  SITE
<222>  (1)..(1)
<223>  The amino acid at position 1 is either present or absent and, 
       when present, can be any amino acid, though usually an aliphatic 
       amino acid, a sulfur-containing amino acid, or a polar, uncharged
       amino acid

<220>
<221>  SITE
<222>  (2)..(2)
<223>  The amino acid at position 2 is a 2-formylglycine residue having 
       a covalently attached moiety

<220>
<221>  SITE
<222>  (3)..(3)
<223>  The amino acid at position 3 is any amino acid, though usually an
       aliphatic amino acid, a sulfur-containing amino acid, or a polar,
       uncharged amino acid

<220>
<221>  SITE
<222>  (4)..(4)
<223>  The amino acid at position 4 is proline or alanine

<220>
<221>  SITE
<222>  (5)..(5)
<223>  The amino acid at position 5 is any amino acid though usually an 
       aliphatic amino acid, a sulfur-containing amino acid, or a polar,
       uncharged amino acid

<220>
<221>  SITE
<222>  (6)..(6)
<223>  The amino acid at position 6 is a basic amino acid (e.g., Arg, 
       Lys, His), or an aliphatic amino acid (e.g., Ala, Gly, Leu, Val, 
       Ile, Pro)

<400>  31

Xaa Xaa Xaa Xaa Xaa Xaa 
1               5       


<210>  32
<211>  10
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence


<220>
<221>  SITE
<222>  (3)..(10)
<223>  Each amino acid at positions 3 to 10 may either be present or 
       absent

<400>  32

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
1               5                   10  


<210>  33
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  33

Ser Ser Val Ser Ser Ser Tyr 
1               5           


<210>  34
<211>  8
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  34

Gly Tyr Thr Phe Thr Asp His Thr 
1               5               


<210>  35
<211>  10
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  35

Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1               5                   10  


<210>  36
<211>  8
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  36

Asp Tyr Lys Asp Asp Asp Asp Lys 
1               5               


<210>  37
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  37

Cys Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu 
1               5                   10      


<210>  38
<211>  324
<212>  DNA
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  38
gaaattgtgt tgacacagtc tccagccacc ctgtctttgt ctccagggga aagagccacc       60

ctctcctgca gggccagttc aagtgttagc agcagctact tatactggta ccagcagaaa      120

cctggccagg ctcccaggct ctggatctat ggtacctcca accttgcctc cggcgtccca      180

gcaaggttca gtggcagtgg gtctgggaca gactacactc tcaccatcag ctccctggag      240

cctgaagatg cggcagttta ttactgtcac caatacgcct ggtccccgcc gacgttcggc      300

caagggacca agttggaaat caaa                                             324


<210>  39
<211>  324
<212>  DNA
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  39
gaaattgtgt tgacacagtc tccagccacc ctgtctttgt ctccagggga aagagccacc       60

ctctcctgca gggccagttc aagtgttggc agcagcaact tatactggta ccagcagaaa      120

cctggccagg ctcccaggct ctggatctat aggtccacca aacttgcctc cggcgtccca      180

gcaaggttca gtggcagtgg gtctgggaca gactacactc tcaccatcag ctccctggag      240

cctgaagatg cggcagttta ttactgtcac caatacagat ggtccccgcc gacgttcggc      300

caagggacca agttggaaat caaa                                             324


<210>  40
<211>  324
<212>  DNA
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  40
gaaattgtgt tgacacagtc tccagccacc ctgtctttgt ctccagggga aagagccacc       60

ctctcctgca gggccagttc aagtgttagc agcagctact tatactggta ccagcagaaa      120

cctggccagg ctcccaggct ctggatcatt ggtacctcca accttgcctc cggcgtccca      180

gcaaggttca gtggcagtgg gtctgggaca gactacactc tcaccatcag ctccctggag      240

cctgaagatg cggcagttta ttactgtcac caatactcct ggtccccgcc gacgttcggc      300

caagggacca agttggaaat caaa                                             324


<210>  41
<211>  13
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  41

Arg Ala Ser Ser Ser Val Gly Ser Ser Ser Tyr Leu Tyr 
1               5                   10              


<210>  42
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  42

Gly Arg Thr Ser Ser Thr Asn Lys Leu Ala Ser 
1               5                   10      


<210>  43
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  43

His Gln Tyr Ala Arg Ser Trp Ser Pro Pro Thr 
1               5                   10      


<210>  44
<211>  8
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  44

Phe Tyr Pro Arg Asp Asp Ser Thr 
1               5               


<210>  45
<211>  10
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  45

Ala Arg Gly Leu Arg Tyr Ala Leu Asp Tyr 
1               5                   10  


<210>  46
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  synthetic sequence

<400>  46

Ser Ser Val Gly Ser Ser Asn 
1               5           


