                         SEQUENCE LISTING

<110>  Gyroscope Therapeutics Limited
 
<120>  Combinations of Complement Factors, and vectors encoding therefor

<130>  P116472PCT

<140>  PCT/GB2019/053698
<141>  2019-12-23

<150>  GB 1821082.3
<151>  2018-12-21

<160>  27    

<170>  PatentIn version 3.5

<210>  1
<211>  583
<212>  PRT
<213>  Homo sapiens

<400>  1

Met Lys Leu Leu His Val Phe Leu Leu Phe Leu Cys Phe His Leu Arg 
1               5                   10                  15      


Phe Cys Lys Val Thr Tyr Thr Ser Gln Glu Asp Leu Val Glu Lys Lys 
            20                  25                  30          


Cys Leu Ala Lys Lys Tyr Thr His Leu Ser Cys Asp Lys Val Phe Cys 
        35                  40                  45              


Gln Pro Trp Gln Arg Cys Ile Glu Gly Thr Cys Val Cys Lys Leu Pro 
    50                  55                  60                  


Tyr Gln Cys Pro Lys Asn Gly Thr Ala Val Cys Ala Thr Asn Arg Arg 
65                  70                  75                  80  


Ser Phe Pro Thr Tyr Cys Gln Gln Lys Ser Leu Glu Cys Leu His Pro 
                85                  90                  95      


Gly Thr Lys Phe Leu Asn Asn Gly Thr Cys Thr Ala Glu Gly Lys Phe 
            100                 105                 110         


Ser Val Ser Leu Lys His Gly Asn Thr Asp Ser Glu Gly Ile Val Glu 
        115                 120                 125             


Val Lys Leu Val Asp Gln Asp Lys Thr Met Phe Ile Cys Lys Ser Ser 
    130                 135                 140                 


Trp Ser Met Arg Glu Ala Asn Val Ala Cys Leu Asp Leu Gly Phe Gln 
145                 150                 155                 160 


Gln Gly Ala Asp Thr Gln Arg Arg Phe Lys Leu Ser Asp Leu Ser Ile 
                165                 170                 175     


Asn Ser Thr Glu Cys Leu His Val His Cys Arg Gly Leu Glu Thr Ser 
            180                 185                 190         


Leu Ala Glu Cys Thr Phe Thr Lys Arg Arg Thr Met Gly Tyr Gln Asp 
        195                 200                 205             


Phe Ala Asp Val Val Cys Tyr Thr Gln Lys Ala Asp Ser Pro Met Asp 
    210                 215                 220                 


Asp Phe Phe Gln Cys Val Asn Gly Lys Tyr Ile Ser Gln Met Lys Ala 
225                 230                 235                 240 


Cys Asp Gly Ile Asn Asp Cys Gly Asp Gln Ser Asp Glu Leu Cys Cys 
                245                 250                 255     


Lys Ala Cys Gln Gly Lys Gly Phe His Cys Lys Ser Gly Val Cys Ile 
            260                 265                 270         


Pro Ser Gln Tyr Gln Cys Asn Gly Glu Val Asp Cys Ile Thr Gly Glu 
        275                 280                 285             


Asp Glu Val Gly Cys Ala Gly Phe Ala Ser Val Thr Gln Glu Glu Thr 
    290                 295                 300                 


Glu Ile Leu Thr Ala Asp Met Asp Ala Glu Arg Arg Arg Ile Lys Ser 
305                 310                 315                 320 


Leu Leu Pro Lys Leu Ser Cys Gly Val Lys Asn Arg Met His Ile Arg 
                325                 330                 335     


Arg Lys Arg Ile Val Gly Gly Lys Arg Ala Gln Leu Gly Asp Leu Pro 
            340                 345                 350         


Trp Gln Val Ala Ile Lys Asp Ala Ser Gly Ile Thr Cys Gly Gly Ile 
        355                 360                 365             


Tyr Ile Gly Gly Cys Trp Ile Leu Thr Ala Ala His Cys Leu Arg Ala 
    370                 375                 380                 


Ser Lys Thr His Arg Tyr Gln Ile Trp Thr Thr Val Val Asp Trp Ile 
385                 390                 395                 400 


His Pro Asp Leu Lys Arg Ile Val Ile Glu Tyr Val Asp Arg Ile Ile 
                405                 410                 415     


Phe His Glu Asn Tyr Asn Ala Gly Thr Tyr Gln Asn Asp Ile Ala Leu 
            420                 425                 430         


Ile Glu Met Lys Lys Asp Gly Asn Lys Lys Asp Cys Glu Leu Pro Arg 
        435                 440                 445             


Ser Ile Pro Ala Cys Val Pro Trp Ser Pro Tyr Leu Phe Gln Pro Asn 
    450                 455                 460                 


Asp Thr Cys Ile Val Ser Gly Trp Gly Arg Glu Lys Asp Asn Glu Arg 
465                 470                 475                 480 


Val Phe Ser Leu Gln Trp Gly Glu Val Lys Leu Ile Ser Asn Cys Ser 
                485                 490                 495     


Lys Phe Tyr Gly Asn Arg Phe Tyr Glu Lys Glu Met Glu Cys Ala Gly 
            500                 505                 510         


Thr Tyr Asp Gly Ser Ile Asp Ala Cys Lys Gly Asp Ser Gly Gly Pro 
        515                 520                 525             


Leu Val Cys Met Asp Ala Asn Asn Val Thr Tyr Val Trp Gly Val Val 
    530                 535                 540                 


Ser Trp Gly Glu Asn Cys Gly Lys Pro Glu Phe Pro Gly Val Tyr Thr 
545                 550                 555                 560 


Lys Val Ala Asn Tyr Phe Asp Trp Ile Ser Tyr His Val Gly Arg Pro 
                565                 570                 575     


Phe Ile Ser Gln Tyr Asn Val 
            580             


<210>  2
<211>  1752
<212>  DNA
<213>  Homo sapiens

<400>  2
atgaagcttc ttcatgtttt cctgttattt ctgtgcttcc acttaaggtt ttgcaaggtc       60

acttatacat ctcaagagga tctggtggag aaaaagtgct tagcaaaaaa atatactcac      120

ctctcctgcg ataaagtctt ctgccagcca tggcagagat gcattgaggg cacctgtgtt      180

tgtaaactac cgtatcagtg cccaaagaat ggcactgcag tgtgtgcaac taacaggaga      240

agcttcccaa catactgtca acaaaagagt ttggaatgtc ttcatccagg gacaaagttt      300

ttaaataacg gaacatgcac agccgaagga aagtttagtg tttccttgaa gcatggaaat      360

acagattcag agggaatagt tgaagtaaaa cttgtggacc aagataagac aatgttcata      420

tgcaaaagca gctggagcat gagggaagcc aacgtggcct gccttgacct tgggtttcaa      480

caaggtgctg atactcaaag aaggtttaag ttgtctgatc tctctataaa ttccactgaa      540

tgtctacatg tgcattgccg aggattagag accagtttgg ctgaatgtac ttttactaag      600

agaagaacta tgggttacca ggatttcgct gatgtggttt gttatacaca gaaagcagat      660

tctccaatgg atgacttctt tcagtgtgtg aatgggaaat acatttctca gatgaaagcc      720

tgtgatggta tcaatgattg tggagaccaa agtgatgaac tgtgttgtaa agcatgccaa      780

ggcaaaggct tccattgcaa atcgggtgtt tgcattccaa gccagtatca atgcaatggt      840

gaggtggact gcattacagg ggaagatgaa gttggctgtg caggctttgc atctgtggct      900

caagaagaaa cagaaatttt gactgctgac atggatgcag aaagaagacg gataaaatca      960

ttattaccta aactatcttg tggagttaaa aacagaatgc acattcgaag gaaacgaatt     1020

gtgggaggaa agcgagcaca actgggagac ctcccatggc aggtggcaat taaggatgcc     1080

agtggaatca cctgtggggg aatttatatt ggtggctgtt ggattctgac tgctgcacat     1140

tgtctcagag ccagtaaaac tcatcgttac caaatatgga caacagtagt agactggata     1200

caccccgacc ttaaacgtat agtaattgaa tacgtggata gaattatttt ccatgaaaac     1260

tacaatgcag gcacttacca aaatgacatc gctttgattg aaatgaaaaa agacggaaac     1320

aaaaaagatt gtgagctgcc tcgttccatc cctgcctgtg tcccctggtc tccttaccta     1380

ttccaaccta atgatacatg catcgtttct ggctggggac gagaaaaaga taacgaaaga     1440

gtcttttcac ttcagtgggg tgaagttaaa ctaataagca actgctctaa gttttacgga     1500

aatcgtttct atgaaaaaga aatggaatgt gcaggtacat atgatggttc catcgatgcc     1560

tgtaaagggg actctggagg ccccttagtc tgtatggatg ccaacaatgt gacttatgtc     1620

tggggtgttg tgagttgggg ggaaaactgt ggaaaaccag agttcccagg tgtttacacc     1680

aaagtggcca attattttga ctggattagc taccatgtag gaaggccttt tatttctcag     1740

tacaatgtat aa                                                         1752


<210>  3
<211>  1231
<212>  PRT
<213>  Homo sapiens

<400>  3

Met Arg Leu Leu Ala Lys Ile Ile Cys Leu Met Leu Trp Ala Ile Cys 
1               5                   10                  15      


Val Ala Glu Asp Cys Asn Glu Leu Pro Pro Arg Arg Asn Thr Glu Ile 
            20                  25                  30          


Leu Thr Gly Ser Trp Ser Asp Gln Thr Tyr Pro Glu Gly Thr Gln Ala 
        35                  40                  45              


Ile Tyr Lys Cys Arg Pro Gly Tyr Arg Ser Leu Gly Asn Val Ile Met 
    50                  55                  60                  


Val Cys Arg Lys Gly Glu Trp Val Ala Leu Asn Pro Leu Arg Lys Cys 
65                  70                  75                  80  


Gln Lys Arg Pro Cys Gly His Pro Gly Asp Thr Pro Phe Gly Thr Phe 
                85                  90                  95      


Thr Leu Thr Gly Gly Asn Val Phe Glu Tyr Gly Val Lys Ala Val Tyr 
            100                 105                 110         


Thr Cys Asn Glu Gly Tyr Gln Leu Leu Gly Glu Ile Asn Tyr Arg Glu 
        115                 120                 125             


Cys Asp Thr Asp Gly Trp Thr Asn Asp Ile Pro Ile Cys Glu Val Val 
    130                 135                 140                 


Lys Cys Leu Pro Val Thr Ala Pro Glu Asn Gly Lys Ile Val Ser Ser 
145                 150                 155                 160 


Ala Met Glu Pro Asp Arg Glu Tyr His Phe Gly Gln Ala Val Arg Phe 
                165                 170                 175     


Val Cys Asn Ser Gly Tyr Lys Ile Glu Gly Asp Glu Glu Met His Cys 
            180                 185                 190         


Ser Asp Asp Gly Phe Trp Ser Lys Glu Lys Pro Lys Cys Val Glu Ile 
        195                 200                 205             


Ser Cys Lys Ser Pro Asp Val Ile Asn Gly Ser Pro Ile Ser Gln Lys 
    210                 215                 220                 


Ile Ile Tyr Lys Glu Asn Glu Arg Phe Gln Tyr Lys Cys Asn Met Gly 
225                 230                 235                 240 


Tyr Glu Tyr Ser Glu Arg Gly Asp Ala Val Cys Thr Glu Ser Gly Trp 
                245                 250                 255     


Arg Pro Leu Pro Ser Cys Glu Glu Lys Ser Cys Asp Asn Pro Tyr Ile 
            260                 265                 270         


Pro Asn Gly Asp Tyr Ser Pro Leu Arg Ile Lys His Arg Thr Gly Asp 
        275                 280                 285             


Glu Ile Thr Tyr Gln Cys Arg Asn Gly Phe Tyr Pro Ala Thr Arg Gly 
    290                 295                 300                 


Asn Thr Ala Lys Cys Thr Ser Thr Gly Trp Ile Pro Ala Pro Arg Cys 
305                 310                 315                 320 


Thr Leu Lys Pro Cys Asp Tyr Pro Asp Ile Lys His Gly Gly Leu Tyr 
                325                 330                 335     


His Glu Asn Met Arg Arg Pro Tyr Phe Pro Val Ala Val Gly Lys Tyr 
            340                 345                 350         


Tyr Ser Tyr Tyr Cys Asp Glu His Phe Glu Thr Pro Ser Gly Ser Tyr 
        355                 360                 365             


Trp Asp His Ile His Cys Thr Gln Asp Gly Trp Ser Pro Ala Val Pro 
    370                 375                 380                 


Cys Leu Arg Lys Cys Tyr Phe Pro Tyr Leu Glu Asn Gly Tyr Asn Gln 
385                 390                 395                 400 


Asn Tyr Gly Arg Lys Phe Val Gln Gly Lys Ser Ile Asp Val Ala Cys 
                405                 410                 415     


His Pro Gly Tyr Ala Leu Pro Lys Ala Gln Thr Thr Val Thr Cys Met 
            420                 425                 430         


Glu Asn Gly Trp Ser Pro Thr Pro Arg Cys Ile Arg Val Lys Thr Cys 
        435                 440                 445             


Ser Lys Ser Ser Ile Asp Ile Glu Asn Gly Phe Ile Ser Glu Ser Gln 
    450                 455                 460                 


Tyr Thr Tyr Ala Leu Lys Glu Lys Ala Lys Tyr Gln Cys Lys Leu Gly 
465                 470                 475                 480 


Tyr Val Thr Ala Asp Gly Glu Thr Ser Gly Ser Ile Thr Cys Gly Lys 
                485                 490                 495     


Asp Gly Trp Ser Ala Gln Pro Thr Cys Ile Lys Ser Cys Asp Ile Pro 
            500                 505                 510         


Val Phe Met Asn Ala Arg Thr Lys Asn Asp Phe Thr Trp Phe Lys Leu 
        515                 520                 525             


Asn Asp Thr Leu Asp Tyr Glu Cys His Asp Gly Tyr Glu Ser Asn Thr 
    530                 535                 540                 


Gly Ser Thr Thr Gly Ser Ile Val Cys Gly Tyr Asn Gly Trp Ser Asp 
545                 550                 555                 560 


Leu Pro Ile Cys Tyr Glu Arg Glu Cys Glu Leu Pro Lys Ile Asp Val 
                565                 570                 575     


His Leu Val Pro Asp Arg Lys Lys Asp Gln Tyr Lys Val Gly Glu Val 
            580                 585                 590         


Leu Lys Phe Ser Cys Lys Pro Gly Phe Thr Ile Val Gly Pro Asn Ser 
        595                 600                 605             


Val Gln Cys Tyr His Phe Gly Leu Ser Pro Asp Leu Pro Ile Cys Lys 
    610                 615                 620                 


Glu Gln Val Gln Ser Cys Gly Pro Pro Pro Glu Leu Leu Asn Gly Asn 
625                 630                 635                 640 


Val Lys Glu Lys Thr Lys Glu Glu Tyr Gly His Ser Glu Val Val Glu 
                645                 650                 655     


Tyr Tyr Cys Asn Pro Arg Phe Leu Met Lys Gly Pro Asn Lys Ile Gln 
            660                 665                 670         


Cys Val Asp Gly Glu Trp Thr Thr Leu Pro Val Cys Ile Val Glu Glu 
        675                 680                 685             


Ser Thr Cys Gly Asp Ile Pro Glu Leu Glu His Gly Trp Ala Gln Leu 
    690                 695                 700                 


Ser Ser Pro Pro Tyr Tyr Tyr Gly Asp Ser Val Glu Phe Asn Cys Ser 
705                 710                 715                 720 


Glu Ser Phe Thr Met Ile Gly His Arg Ser Ile Thr Cys Ile His Gly 
                725                 730                 735     


Val Trp Thr Gln Leu Pro Gln Cys Val Ala Ile Asp Lys Leu Lys Lys 
            740                 745                 750         


Cys Lys Ser Ser Asn Leu Ile Ile Leu Glu Glu His Leu Lys Asn Lys 
        755                 760                 765             


Lys Glu Phe Asp His Asn Ser Asn Ile Arg Tyr Arg Cys Arg Gly Lys 
    770                 775                 780                 


Glu Gly Trp Ile His Thr Val Cys Ile Asn Gly Arg Trp Asp Pro Glu 
785                 790                 795                 800 


Val Asn Cys Ser Met Ala Gln Ile Gln Leu Cys Pro Pro Pro Pro Gln 
                805                 810                 815     


Ile Pro Asn Ser His Asn Met Thr Thr Thr Leu Asn Tyr Arg Asp Gly 
            820                 825                 830         


Glu Lys Val Ser Val Leu Cys Gln Glu Asn Tyr Leu Ile Gln Glu Gly 
        835                 840                 845             


Glu Glu Ile Thr Cys Lys Asp Gly Arg Trp Gln Ser Ile Pro Leu Cys 
    850                 855                 860                 


Val Glu Lys Ile Pro Cys Ser Gln Pro Pro Gln Ile Glu His Gly Thr 
865                 870                 875                 880 


Ile Asn Ser Ser Arg Ser Ser Gln Glu Ser Tyr Ala His Gly Thr Lys 
                885                 890                 895     


Leu Ser Tyr Thr Cys Glu Gly Gly Phe Arg Ile Ser Glu Glu Asn Glu 
            900                 905                 910         


Thr Thr Cys Tyr Met Gly Lys Trp Ser Ser Pro Pro Gln Cys Glu Gly 
        915                 920                 925             


Leu Pro Cys Lys Ser Pro Pro Glu Ile Ser His Gly Val Val Ala His 
    930                 935                 940                 


Met Ser Asp Ser Tyr Gln Tyr Gly Glu Glu Val Thr Tyr Lys Cys Phe 
945                 950                 955                 960 


Glu Gly Phe Gly Ile Asp Gly Pro Ala Ile Ala Lys Cys Leu Gly Glu 
                965                 970                 975     


Lys Trp Ser His Pro Pro Ser Cys Ile Lys Thr Asp Cys Leu Ser Leu 
            980                 985                 990         


Pro Ser Phe Glu Asn Ala Ile Pro  Met Gly Glu Lys Lys  Asp Val Tyr 
        995                 1000                 1005             


Lys Ala  Gly Glu Gln Val Thr  Tyr Thr Cys Ala Thr  Tyr Tyr Lys 
    1010                 1015                 1020             


Met Asp  Gly Ala Ser Asn Val  Thr Cys Ile Asn Ser  Arg Trp Thr 
    1025                 1030                 1035             


Gly Arg  Pro Thr Cys Arg Asp  Thr Ser Cys Val Asn  Pro Pro Thr 
    1040                 1045                 1050             


Val Gln  Asn Ala Tyr Ile Val  Ser Arg Gln Met Ser  Lys Tyr Pro 
    1055                 1060                 1065             


Ser Gly  Glu Arg Val Arg Tyr  Gln Cys Arg Ser Pro  Tyr Glu Met 
    1070                 1075                 1080             


Phe Gly  Asp Glu Glu Val Met  Cys Leu Asn Gly Asn  Trp Thr Glu 
    1085                 1090                 1095             


Pro Pro  Gln Cys Lys Asp Ser  Thr Gly Lys Cys Gly  Pro Pro Pro 
    1100                 1105                 1110             


Pro Ile  Asp Asn Gly Asp Ile  Thr Ser Phe Pro Leu  Ser Val Tyr 
    1115                 1120                 1125             


Ala Pro  Ala Ser Ser Val Glu  Tyr Gln Cys Gln Asn  Leu Tyr Gln 
    1130                 1135                 1140             


Leu Glu  Gly Asn Lys Arg Ile  Thr Cys Arg Asn Gly  Gln Trp Ser 
    1145                 1150                 1155             


Glu Pro  Pro Lys Cys Leu His  Pro Cys Val Ile Ser  Arg Glu Ile 
    1160                 1165                 1170             


Met Glu  Asn Tyr Asn Ile Ala  Leu Arg Trp Thr Ala  Lys Gln Lys 
    1175                 1180                 1185             


Leu Tyr  Ser Arg Thr Gly Glu  Ser Val Glu Phe Val  Cys Lys Arg 
    1190                 1195                 1200             


Gly Tyr  Arg Leu Ser Ser Arg  Ser His Thr Leu Arg  Thr Thr Cys 
    1205                 1210                 1215             


Trp Asp  Gly Lys Leu Glu Tyr  Pro Thr Cys Ala Lys  Arg 
    1220                 1225                 1230     


<210>  4
<211>  3696
<212>  DNA
<213>  Homo sapiens

<400>  4
atgagacttc tagcaaagat tatttgcctt atgttatggg ctatttgtgt agcagaagat       60

tgcaatgaac ttcctccaag aagaaataca gaaattctga caggttcctg gtctgaccaa      120

acatatccag aaggcaccca ggctatctat aaatgccgcc ctggatatag atctcttgga      180

aatgtaataa tggtatgcag gaagggagaa tgggttgctc ttaatccatt aaggaaatgt      240

cagaaaaggc cctgtggaca tcctggagat actccttttg gtacttttac ccttacagga      300

ggaaatgtgt ttgaatatgg tgtaaaagct gtgtatacat gtaatgaggg gtatcaattg      360

ctaggtgaga ttaattaccg tgaatgtgac acagatggat ggaccaatga tattcctata      420

tgtgaagttg tgaagtgttt accagtgaca gcaccagaga atggaaaaat tgtcagtagt      480

gcaatggaac cagatcggga ataccatttt ggacaagcag tacggtttgt atgtaactca      540

ggctacaaga ttgaaggaga tgaagaaatg cattgttcag acgatggttt ttggagtaaa      600

gagaaaccaa agtgtgtgga aatttcatgc aaatccccag atgttataaa tggatctcct      660

atatctcaga agattattta taaggagaat gaacgatttc aatataaatg taacatgggt      720

tatgaataca gtgaaagagg agatgctgta tgcactgaat ctggatggcg tccgttgcct      780

tcatgtgaag aaaaatcatg tgataatcct tatattccaa atggtgacta ctcaccttta      840

aggattaaac acagaactgg agatgaaatc acgtaccagt gtagaaatgg tttttatcct      900

gcaacccggg gaaatacagc aaaatgcaca agtactggct ggatacctgc tccgagatgt      960

accttgaaac cttgtgatta tccagacatt aaacatggag gtctatatca tgagaatatg     1020

cgtagaccat actttccagt agctgtagga aaatattact cctattactg tgatgaacat     1080

tttgagactc cgtcaggaag ttactgggat cacattcatt gcacacaaga tggatggtcg     1140

ccagcagtac catgcctcag aaaatgttat tttccttatt tggaaaatgg atataatcaa     1200

aatcatggaa gaaagtttgt acagggtaaa tctatagacg ttgcctgcca tcctggctac     1260

gctcttccaa aagcgcagac cacagttaca tgtatggaga atggctggtc tcctactccc     1320

agatgcatcc gtgtcaaaac atgttccaaa tcaagtatag atattgagaa tgggtttatt     1380

tctgaatctc agtatacata tgccttaaaa gaaaaagcga aatatcaatg caaactagga     1440

tatgtaacag cagatggtga aacatcagga tcaattacat gtgggaaaga tggatggtca     1500

gctcaaccca cgtgcattaa atcttgtgat atcccagtat ttatgaatgc cagaactaaa     1560

aatgacttca catggtttaa gctgaatgac acattggact atgaatgcca tgatggttat     1620

gaaagcaata ctggaagcac cactggttcc atagtgtgtg gttacaatgg ttggtctgat     1680

ttacccatat gttatgaaag agaatgcgaa cttcctaaaa tagatgtaca cttagttcct     1740

gatcgcaaga aagaccagta taaagttgga gaggtgttga aattctcctg caaaccagga     1800

tttacaatag ttggacctaa ttccgttcag tgctaccact ttggattgtc tcctgacctc     1860

ccaatatgta aagagcaagt acaatcatgt ggtccacctc ctgaactcct caatgggaat     1920

gttaaggaaa aaacgaaaga agaatatgga cacagtgaag tggtggaata ttattgcaat     1980

cctagatttc taatgaaggg acctaataaa attcaatgtg ttgatggaga gtggacaact     2040

ttaccagtgt gtattgtgga ggagagtacc tgtggagata tacctgaact tgaacatggc     2100

tgggcccagc tttcttcccc tccttattac tatggagatt cagtggaatt caattgctca     2160

gaatcattta caatgattgg acacagatca attacgtgta ttcatggagt atggacccaa     2220

cttccccagt gtgtggcaat agataaactt aagaagtgca aatcatcaaa tttaattata     2280

cttgaggaac atttaaaaaa caagaaggaa ttcgatcata attctaacat aaggtacaga     2340

tgtagaggaa aagaaggatg gatacacaca gtctgcataa atggaagatg ggatccagaa     2400

gtgaactgct caatggcaca aatacaatta tgcccacctc cacctcagat tcccaattct     2460

cacaatatga caaccacact gaattatcgg gatggagaaa aagtatctgt tctttgccaa     2520

gaaaattatc taattcagga aggagaagaa attacatgca aagatggaag atggcagtca     2580

ataccactct gtgttgaaaa aattccatgt tcacaaccac ctcagataga acacggaacc     2640

attaattcat ccaggtcttc acaagaaagt tatgcacatg ggactaaatt gagttatact     2700

tgtgagggtg gtttcaggat atctgaagaa aatgaaacaa catgctacat gggaaaatgg     2760

agttctccac ctcagtgtga aggccttcct tgtaaatctc cacctgagat ttctcatggt     2820

gttgtagctc acatgtcaga cagttatcag tatggagaag aagttacgta caaatgtttt     2880

gaaggttttg gaattgatgg gcctgcaatt gcaaaatgct taggagaaaa atggtctcac     2940

cctccatcat gcataaaaac agattgtctc agtttaccta gctttgaaaa tgccataccc     3000

atgggagaga agaaggatgt gtataaggcg ggtgagcaag tgacttacac ttgtgcaaca     3060

tattacaaaa tggatggagc cagtaatgta acatgcatta atagcagatg gacaggaagg     3120

ccaacatgca gagacacctc ctgtgtgaat ccgcccacag tacaaaatgc ttatatagtg     3180

tcgagacaga tgagtaaata tccatctggt gagagagtac gttatcaatg taggagccct     3240

tatgaaatgt ttggggatga agaagtgatg tgtttaaatg gaaactggac ggaaccacct     3300

caatgcaaag attctacagg aaaatgtggg ccccctccac ctattgacaa tggggacatt     3360

acttcattcc cgttgtcagt atatgctcca gcttcatcag ttgagtacca atgccagaac     3420

ttgtatcaac ttgagggtaa caagcgaata acatgtagaa atggacaatg gtcagaacca     3480

ccaaaatgct tacatccgtg tgtaatatcc cgagaaatta tggaaaatta taacatagca     3540

ttaaggtgga cagccaaaca gaagctttat tcgagaacag gtgaatcagt tgaatttgtg     3600

tgtaaacggg gatatcgtct ttcatcacgt tctcacacat tgcgaacaac atgttgggat     3660

gggaaactgg agtatccaac ttgtgcaaaa agatag                               3696


<210>  5
<211>  934
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  example promoter sequence

<400>  5
attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt tccattgacg       60

tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag tgtatcatat      120

gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc attatgccca      180

gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag tcatcgctat      240

taccatggtc gaggtgagcc ccacgttctg cttcactctc cccatctccc ccccctcccc      300

acccccaatt ttgtatttat ttatttttta attattttgt gcagcgatgg gggcgggggg      360

gggggggggg cgcgcgccag gcggggcggg gcggggcgag gggcggggcg gggcgaggcg      420

gagaggtgcg gcggcagcca atcagagcgg cgcgctccga aagtttcctt ttatggcgag      480

gcggcggcgg cggcggccct ataaaaagcg aagcgcgcgg cgggcgggag tcgctgcgcg      540

ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact      600

gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta      660

gcgcttggtt taatgacggc ttgtttcttt tctgtggctg cgtgaaagcc ttgaggggct      720

ccgggagggc cctttgtgcg gggggagcgg ctcggggctg tccgcggggg gacggctgcc      780

ttcggggggg acggggcagg gcggggttcg gcttctggcg tgtgaccggc ggctctagag      840

cctctgctaa ccatgttcat gccttcttct ttttcctaca gctcctgggc aacgtgctgg      900

ttattgtgct gtctcatcat tttggcaaag aatt                                  934


<210>  6
<211>  270
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Bovine Growth Hormone poly-A (bGH poly-A) signal sequence

<400>  6
tcgctgatca gcctcgactg tgccttctag ttgccagcca tctgttgttt gcccctcccc       60

cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga      120

aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga      180

cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg tgggctctat      240

ggcttctgag gcggaaagaa ccagctgggg                                       270


<210>  7
<211>  588
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  example woodchuck hepatitis post-transcriptional regulatory 
       element (WPRE) sequence

<400>  7
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc       60

cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta      120

tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt      180

ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg      240

gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta      300

ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt      360

tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg      420

cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca      480

atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc      540

gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgc                   588


<210>  8
<211>  1752
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon-optimised nucleotide sequence encoding Complement Factor I

<400>  8
atgaagctgc tgcatgtctt tctgctgttt ctgtgcttcc atctgcggtt ctgtaaagtg       60

acctatacta gccaggagga tctggtggag aagaagtgtc tggccaagaa gtacacacac      120

ctgagctgcg acaaggtgtt ctgtcagcct tggcagcggt gcatcgaggg cacctgcgtg      180

tgcaagctgc cttaccagtg cccaaagaac ggcaccgccg tgtgcgccac aaatcggaga      240

tcttttccaa catattgcca gcagaagagc ctggagtgtc tgcaccccgg caccaagttc      300

ctgaacaatg gcacctgcac agccgagggc aagttttctg tgagcctgaa gcacggcaac      360

acagatagcg agggcatcgt ggaggtgaag ctggtggacc aggataagac catgttcatc      420

tgtaagagct cctggtccat gagggaggca aacgtggcat gcctggatct gggattccag      480

cagggagcag acacacagag gcgctttaag ctgtccgacc tgtctatcaa tagcaccgag      540

tgcctgcacg tgcactgtag gggcctggag acatccctgg cagagtgcac cttcacaaag      600

cggagaacca tgggctacca ggactttgcc gacgtggtgt gctataccca gaaggccgat      660

agccccatgg acgatttctt tcagtgcgtg aacggcaagt atatctccca gatgaaggcc      720

tgcgacggca tcaatgactg tggcgatcag tctgacgagc tgtgctgtaa ggcctgtcag      780

ggcaagggct tccactgcaa gagcggcgtg tgcatccctt cccagtacca gtgcaacggc      840

gaggtggatt gtatcacagg agaggacgaa gtgggatgcg caggatttgc atctgtggca      900

caggaggaga cagagatcct gacagccgac atggatgccg agaggcgccg gatcaagtct      960

ctgctgccta agctgagctg tggcgtgaag aatcggatgc acatcagaag gaagcgcatc     1020

gtgggaggca agagggcaca gctgggcgat ctgccatggc aggtggccat caaggacgcc     1080

tctggcatca cctgcggcgg catctacatc ggaggatgtt ggatcctgac cgcagcacac     1140

tgcctgagag caagcaagac acacaggtat cagatctgga ccacagtggt ggattggatc     1200

cacccagacc tgaagagaat cgtgatcgag tacgtggata ggatcatctt tcacgagaac     1260

tacaatgccg gcacatatca gaacgacatc gccctgatcg agatgaagaa ggatggcaat     1320

aagaaggact gtgagctgcc cagatccatc cctgcatgcg tgccatggag cccctatctg     1380

ttccagccca acgatacctg catcgtgtcc ggatggggaa gggagaagga caatgagcgg     1440

gtgttttctc tgcagtgggg cgaggtgaag ctgatctcca actgttctaa gttctacggc     1500

aataggtttt atgagaagga gatggagtgc gccggcacct acgatggcag catcgacgcc     1560

tgtaagggcg attccggagg accactggtg tgcatggacg caaacaatgt gacatacgtg     1620

tggggagtgg tgtcctgggg agagaactgc ggcaagccag agttccccgg cgtatatacc     1680

aaggtggcca attattttga ttggatttcc taccacgtcg gcaggccctt tatttcccag     1740

tataatgtct aa                                                         1752


<210>  9
<211>  583
<212>  PRT
<213>  Homo sapiens

<400>  9

Met Lys Leu Leu His Val Phe Leu Leu Phe Leu Cys Phe His Leu Arg 
1               5                   10                  15      


Phe Cys Lys Val Thr Tyr Thr Ser Gln Glu Asp Leu Val Glu Lys Lys 
            20                  25                  30          


Cys Leu Ala Lys Lys Tyr Thr His Leu Ser Cys Asp Lys Val Phe Cys 
        35                  40                  45              


Gln Pro Trp Gln Arg Cys Ile Glu Gly Thr Cys Val Cys Lys Leu Pro 
    50                  55                  60                  


Tyr Gln Cys Pro Lys Asn Gly Thr Ala Val Cys Ala Thr Asn Arg Arg 
65                  70                  75                  80  


Ser Phe Pro Thr Tyr Cys Gln Gln Lys Ser Leu Glu Cys Leu His Pro 
                85                  90                  95      


Gly Thr Lys Phe Leu Asn Asn Gly Thr Cys Thr Ala Glu Gly Lys Phe 
            100                 105                 110         


Ser Val Ser Leu Lys His Gly Asn Thr Asp Ser Glu Gly Ile Val Glu 
        115                 120                 125             


Val Lys Leu Val Asp Gln Asp Lys Thr Met Phe Ile Cys Lys Ser Ser 
    130                 135                 140                 


Trp Ser Met Arg Glu Ala Asn Val Ala Cys Leu Asp Leu Gly Phe Gln 
145                 150                 155                 160 


Gln Gly Ala Asp Thr Gln Arg Arg Phe Lys Leu Ser Asp Leu Ser Ile 
                165                 170                 175     


Asn Ser Thr Glu Cys Leu His Val His Cys Arg Gly Leu Glu Thr Ser 
            180                 185                 190         


Leu Ala Glu Cys Thr Phe Thr Lys Arg Arg Thr Met Gly Tyr Gln Asp 
        195                 200                 205             


Phe Ala Asp Val Val Cys Tyr Thr Gln Lys Ala Asp Ser Pro Met Asp 
    210                 215                 220                 


Asp Phe Phe Gln Cys Val Asn Gly Lys Tyr Ile Ser Gln Met Lys Ala 
225                 230                 235                 240 


Cys Asp Gly Ile Asn Asp Cys Gly Asp Gln Ser Asp Glu Leu Cys Cys 
                245                 250                 255     


Lys Ala Cys Gln Gly Lys Gly Phe His Cys Lys Ser Gly Val Cys Ile 
            260                 265                 270         


Pro Ser Gln Tyr Gln Cys Asn Gly Glu Val Asp Cys Ile Thr Gly Glu 
        275                 280                 285             


Asp Glu Val Gly Cys Ala Gly Phe Ala Ser Val Ala Gln Glu Glu Thr 
    290                 295                 300                 


Glu Ile Leu Thr Ala Asp Met Asp Ala Glu Arg Arg Arg Ile Lys Ser 
305                 310                 315                 320 


Leu Leu Pro Lys Leu Ser Cys Gly Val Lys Asn Arg Met His Ile Arg 
                325                 330                 335     


Arg Lys Arg Ile Val Gly Gly Lys Arg Ala Gln Leu Gly Asp Leu Pro 
            340                 345                 350         


Trp Gln Val Ala Ile Lys Asp Ala Ser Gly Ile Thr Cys Gly Gly Ile 
        355                 360                 365             


Tyr Ile Gly Gly Cys Trp Ile Leu Thr Ala Ala His Cys Leu Arg Ala 
    370                 375                 380                 


Ser Lys Thr His Arg Tyr Gln Ile Trp Thr Thr Val Val Asp Trp Ile 
385                 390                 395                 400 


His Pro Asp Leu Lys Arg Ile Val Ile Glu Tyr Val Asp Arg Ile Ile 
                405                 410                 415     


Phe His Glu Asn Tyr Asn Ala Gly Thr Tyr Gln Asn Asp Ile Ala Leu 
            420                 425                 430         


Ile Glu Met Lys Lys Asp Gly Asn Lys Lys Asp Cys Glu Leu Pro Arg 
        435                 440                 445             


Ser Ile Pro Ala Cys Val Pro Trp Ser Pro Tyr Leu Phe Gln Pro Asn 
    450                 455                 460                 


Asp Thr Cys Ile Val Ser Gly Trp Gly Arg Glu Lys Asp Asn Glu Arg 
465                 470                 475                 480 


Val Phe Ser Leu Gln Trp Gly Glu Val Lys Leu Ile Ser Asn Cys Ser 
                485                 490                 495     


Lys Phe Tyr Gly Asn Arg Phe Tyr Glu Lys Glu Met Glu Cys Ala Gly 
            500                 505                 510         


Thr Tyr Asp Gly Ser Ile Asp Ala Cys Lys Gly Asp Ser Gly Gly Pro 
        515                 520                 525             


Leu Val Cys Met Asp Ala Asn Asn Val Thr Tyr Val Trp Gly Val Val 
    530                 535                 540                 


Ser Trp Gly Glu Asn Cys Gly Lys Pro Glu Phe Pro Gly Val Tyr Thr 
545                 550                 555                 560 


Lys Val Ala Asn Tyr Phe Asp Trp Ile Ser Tyr His Val Gly Arg Pro 
                565                 570                 575     


Phe Ile Ser Gln Tyr Asn Val 
            580             


<210>  10
<211>  1752
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence encoding Complement Factor I

<400>  10
atgaaactgc tgcatgtctt cctcctcttc ctgtgcttcc acctccgttt ctgtaaagtc       60

acctacacta gccaggagga tctggtggag aagaaatgcc tggccaagaa gtatacccac      120

ctgagctgcg acaaagtgtt ctgccagccc tggcaacgct gcattgaagg tacttgtgtg      180

tgcaagctgc cctaccagtg ccccaagaac ggcacggccg tgtgtgccac caacaggagg      240

agcttcccca cctactgcca gcagaagagc ctggaatgcc tccaccctgg caccaagttt      300

ctgaacaacg ggacctgcac agccgagggg aaattcagcg tctccctcaa gcacggcaat      360

acagactccg agggcattgt ggaagtgaag ctggtggacc aggacaagac catgttcatc      420

tgcaaaagca gctggtccat gcgggaggcc aatgtcgcct gcctggacct gggcttccag      480

cagggcgctg atacacagcg ccgctttaaa ctcagtgacc tcagcatcaa cagcactgag      540

tgtctgcacg tgcactgccg gggcctggag accagcctgg ctgagtgcac cttcaccaag      600

cgcaggacca tgggctacca ggattttgca gatgtggtct gctacaccca gaaggcagac      660

agccccatgg atgacttctt ccagtgtgtc aatggcaagt acatttccca gatgaaggct      720

tgtgacggga tcaatgattg cggggatcag agcgatgagc tctgctgcaa ggcctgccaa      780

gggaagggct ttcactgtaa gtctggggtg tgcatccctt ctcagtatca gtgcaacgga      840

gaggtggact gcatcactgg ggaggacgag gtgggctgtg ctggcttcgc ctctgtggcc      900

caggaggaga cagagatcct cacagctgac atggatgcag agcggcggcg catcaagagt      960

ctgctcccaa agctctcctg cggcgttaag aatcgcatgc acatccggag gaagcggatc     1020

gttggaggca aacgggctca gctgggggac ttgccgtggc aggtggccat caaagatgcc     1080

tccggaatca cctgtggtgg catctacatc ggcggctgct ggatcctgac cgccgcccac     1140

tgccttcggg ccagcaagac tcaccgctac cagatctgga ccaccgtggt ggattggatt     1200

caccccgacc tgaagaggat tgtcattgag tatgtcgacc gcatcatctt ccatgaaaac     1260

tacaatgccg ggacgtatca gaacgacatc gccctcatcg agatgaagaa ggatgggaac     1320

aagaaggact gtgagctgcc tcgctccatc cccgcctgtg taccatggtc tccgtacctg     1380

ttccagccaa atgacacatg catcgtgagc ggctggggcc gcgagaaaga caacgagagg     1440

gtcttctccc tgcagtgggg tgaagtcaag ctgatcagca actgctccaa gttctacggc     1500

aaccgcttct atgagaagga gatggagtgc gccggcacct atgacggcag cattgacgcg     1560

tgcaagggag acagtggggg ccccctggtc tgcatggacg ccaacaatgt gacctacgtg     1620

tggggagttg tgtcctgggg cgagaactgt ggcaagcctg agttcccggg cgtgtacaca     1680

aaggtggcaa actattttga ctggatctcc tatcacgttg gcaggccctt catttcacag     1740

tacaacgtat aa                                                         1752


<210>  11
<211>  449
<212>  PRT
<213>  Homo sapiens

<400>  11

Met Arg Leu Leu Ala Lys Ile Ile Cys Leu Met Leu Trp Ala Ile Cys 
1               5                   10                  15      


Val Ala Glu Asp Cys Asn Glu Leu Pro Pro Arg Arg Asn Thr Glu Ile 
            20                  25                  30          


Leu Thr Gly Ser Trp Ser Asp Gln Thr Tyr Pro Glu Gly Thr Gln Ala 
        35                  40                  45              


Ile Tyr Lys Cys Arg Pro Gly Tyr Arg Ser Leu Gly Asn Ile Ile Met 
    50                  55                  60                  


Val Cys Arg Lys Gly Glu Trp Val Ala Leu Asn Pro Leu Arg Lys Cys 
65                  70                  75                  80  


Gln Lys Arg Pro Cys Gly His Pro Gly Asp Thr Pro Phe Gly Thr Phe 
                85                  90                  95      


Thr Leu Thr Gly Gly Asn Val Phe Glu Tyr Gly Val Lys Ala Val Tyr 
            100                 105                 110         


Thr Cys Asn Glu Gly Tyr Gln Leu Leu Gly Glu Ile Asn Tyr Arg Glu 
        115                 120                 125             


Cys Asp Thr Asp Gly Trp Thr Asn Asp Ile Pro Ile Cys Glu Val Val 
    130                 135                 140                 


Lys Cys Leu Pro Val Thr Ala Pro Glu Asn Gly Lys Ile Val Ser Ser 
145                 150                 155                 160 


Ala Met Glu Pro Asp Arg Glu Tyr His Phe Gly Gln Ala Val Arg Phe 
                165                 170                 175     


Val Cys Asn Ser Gly Tyr Lys Ile Glu Gly Asp Glu Glu Met His Cys 
            180                 185                 190         


Ser Asp Asp Gly Phe Trp Ser Lys Glu Lys Pro Lys Cys Val Glu Ile 
        195                 200                 205             


Ser Cys Lys Ser Pro Asp Val Ile Asn Gly Ser Pro Ile Ser Gln Lys 
    210                 215                 220                 


Ile Ile Tyr Lys Glu Asn Glu Arg Phe Gln Tyr Lys Cys Asn Met Gly 
225                 230                 235                 240 


Tyr Glu Tyr Ser Glu Arg Gly Asp Ala Val Cys Thr Glu Ser Gly Trp 
                245                 250                 255     


Arg Pro Leu Pro Ser Cys Glu Glu Lys Ser Cys Asp Asn Pro Tyr Ile 
            260                 265                 270         


Pro Asn Gly Asp Tyr Ser Pro Leu Arg Ile Lys His Arg Thr Gly Asp 
        275                 280                 285             


Glu Ile Thr Tyr Gln Cys Arg Asn Gly Phe Tyr Pro Ala Thr Arg Gly 
    290                 295                 300                 


Asn Thr Ala Lys Cys Thr Ser Thr Gly Trp Ile Pro Ala Pro Arg Cys 
305                 310                 315                 320 


Thr Leu Lys Pro Cys Asp Tyr Pro Asp Ile Lys His Gly Gly Leu Tyr 
                325                 330                 335     


His Glu Asn Met Arg Arg Pro Tyr Phe Pro Val Ala Val Gly Lys Tyr 
            340                 345                 350         


Tyr Ser Tyr Tyr Cys Asp Glu His Phe Glu Thr Pro Ser Gly Ser Tyr 
        355                 360                 365             


Trp Asp His Ile His Cys Thr Gln Asp Gly Trp Ser Pro Ala Val Pro 
    370                 375                 380                 


Cys Leu Arg Lys Cys Tyr Phe Pro Tyr Leu Glu Asn Gly Tyr Asn Gln 
385                 390                 395                 400 


Asn Tyr Gly Arg Lys Phe Val Gln Gly Lys Ser Ile Asp Val Ala Cys 
                405                 410                 415     


His Pro Gly Tyr Ala Leu Pro Lys Ala Gln Thr Thr Val Thr Cys Met 
            420                 425                 430         


Glu Asn Gly Trp Ser Pro Thr Pro Arg Cys Ile Arg Val Ser Phe Thr 
        435                 440                 445             


Leu 
    


<210>  12
<211>  1350
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence encoding FHL1

<400>  12
atgcgcctcc tggccaagat catctgcctc atgctgtggg ccatctgcgt ggctgaggac       60

tgcaatgagc tgccgcccag gaggaacaca gagatcctga cagggagctg gtctgaccag      120

acctaccctg agggcaccca ggcgatctac aagtgccggc cgggctacag gagcctgggg      180

aacatcatca tggtgtgtag aaagggcgaa tgggtggccc tcaaccccct gaggaagtgc      240

cagaagcggc cctgtggcca ccccggggac acacccttcg ggaccttcac cctgaccggc      300

ggcaatgtgt ttgagtacgg cgtgaaggct gtctacacat gcaacgaggg gtaccagctg      360

ctgggcgaga ttaactaccg ggagtgtgac accgatgggt ggaccaacga cattcccatc      420

tgtgaggtgg tcaagtgtct ccccgtgaca gccccagaaa atggcaaaat cgtgagcagc      480

gccatggagc ctgaccgcga atatcacttt gggcaggccg tgaggtttgt gtgcaactcg      540

ggctacaaaa ttgaaggtga tgaggagatg cactgcagcg atgatggctt ctggtccaag      600

gagaagccca aatgtgtgga gatctcctgc aagtctcccg acgtgatcaa cggcagccca      660

atcagccaga agattattta caaagagaac gagcgcttcc agtacaagtg taacatgggc      720

tatgagtatt cagagagggg agatgccgtc tgcactgaga gcggctggag accactgcct      780

agctgcgagg aaaagagttg tgacaaccct tacatcccaa atggcgacta ctcccctctg      840

cggatcaaac accggaccgg ggatgaaatc acctatcagt gccgcaatgg attctacccg      900

gccacccgcg gcaacaccgc caaatgcacc agcacaggct ggatccccgc cccccgctgt      960

acgctgaagc cttgcgacta tccagacatc aagcacggag gcctgtacca cgaaaacatg     1020

cggcggcctt atttccctgt ggcagtgggg aagtactaca gctactactg cgacgagcac     1080

ttcgagaccc cctctggctc ctactgggac cacatccact gcacacagga cggctggtct     1140

ccagctgtgc cctgcctgag gaaatgctac ttcccctacc tggagaacgg atacaaccag     1200

aactatggcc gcaagttcgt gcagggcaag agcatcgatg tggcctgcca ccctggctac     1260

gccctgccca aggcccagac aactgtgacc tgcatggaga atggttggag ccccaccccg     1320

cgctgcatcc gggtgtcctt cacgctctga                                      1350


<210>  13
<211>  585
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  cytomegalovirus (CMV) promoter sequence

<400>  13
ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc       60

ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca      120

ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac atcaagtgta      180

tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta      240

tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat      300

cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat agcggtttga      360

ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca      420

aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg      480

taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc gtcagatcgc      540

ctggagacgc catccacgct gttttgacct ccatagaaga caccg                      585


<210>  14
<211>  223
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Bovine Growth Hormone poly-A (bGH poly-A) signal sequence

<400>  14
gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg       60

gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg      120

agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg      180

gaagacaata gcaggcatgc tggggatgcg gtgggctcta tgg                        223


<210>  15
<211>  245
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  WPRE3 sequence (shortened version of WPRE, contains only minimal 
       gamma and alpha elements)

<400>  15
aatcaacctc tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct       60

ccttttacgc tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt      120

atggctttca ttttctcctc cttgtataaa tcctggttag ttcttgccac ggcggaactc      180

atcgccgcct gccttgcccg ctgctggaca ggggctcggc tgttgggcac tgacaattcc      240

gtggt                                                                  245


<210>  16
<211>  1350
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence encoding Complement Factor H-like Protein 1 
       (FHL1)

<400>  16
atgagacttc tagcaaagat tatttgcctt atgttatggg ctatttgtgt agcagaagat       60

tgcaatgaac ttcctccaag aagaaataca gaaattctga caggttcctg gtctgaccaa      120

acatatccag aaggcaccca ggctatctat aaatgccgcc ctggatatag atctcttgga      180

aatataataa tggtatgcag gaagggagaa tgggttgctc ttaatccatt aaggaaatgt      240

cagaaaaggc cctgtggaca tcctggagat actccttttg gtacttttac ccttacagga      300

ggaaatgtgt ttgaatatgg tgtaaaagct gtgtatacat gtaatgaggg gtatcaattg      360

ctaggtgaga ttaattaccg tgaatgtgac acagatggat ggaccaatga tattcctata      420

tgtgaagttg tgaagtgttt accagtgaca gcaccagaga atggaaaaat tgtcagtagt      480

gcaatggaac cagatcggga ataccatttt ggacaagcag tacggtttgt atgtaactca      540

ggctacaaga ttgaaggaga tgaagaaatg cattgttcag acgatggttt ttggagtaaa      600

gagaaaccaa agtgtgtgga aatttcatgc aaatccccag atgttataaa tggatctcct      660

atatctcaga agattattta taaggagaat gaacgatttc aatataaatg taacatgggt      720

tatgaataca gtgaaagagg agatgctgta tgcactgaat ctggatggcg tccgttgcct      780

tcatgtgaag aaaaatcatg tgataatcct tatattccaa atggtgacta ctcaccttta      840

aggattaaac acagaactgg agatgaaatc acgtaccagt gtagaaatgg tttttatcct      900

gcaacccggg gaaatacagc aaaatgcaca agtactggct ggatacctgc tccgagatgt      960

accttgaaac cttgtgatta tccagacatt aaacatggag gtctatatca tgagaatatg     1020

cgtagaccat actttccagt agctgtagga aaatattact cctattactg tgatgaacat     1080

tttgagactc cgtcaggaag ttactgggat cacattcatt gcacacaaga tggatggtcg     1140

ccagcagtac catgcctcag aaaatgttat tttccttatt tggaaaatgg atataatcaa     1200

aattatggaa gaaagtttgt acagggtaaa tctatagacg ttgcctgcca tcctggctac     1260

gctcttccaa aagcgcagac cacagttaca tgtatggaga atggctggtc tcctactccc     1320

agatgcatcc gtgtcagctt taccctctga                                      1350


<210>  17
<211>  120
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  linker sequence

<400>  17
cgaaggaaac gaggaagcgg agaagccaga cacaaacaga aaattgtggc accggtgaaa       60

cagactttga attttgacct tctcaagttg gcgggagacg tcgagtccaa ccctgggccc      120


<210>  18
<211>  121
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5' adeno-associated virus (AAV) inverted terminal repeat (ITR) 
       sequence

<400>  18
cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt tggtcgcccg       60

gcctcagtga gcgagcgagc gcgcagagag ggagtggcca actccatcac taggggttcc      120

t                                                                      121


<210>  19
<211>  121
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' AAV ITR sequence

<400>  19
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg       60

ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc      120

g                                                                      121


<210>  20
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5' ITR adjacent sequence

<400>  20
tgtagttaat gattaacccg ccatgctact tatctacgta gccatgctct aggtacc          57


<210>  21
<211>  85
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3' ITR adjacent sequence

<400>  21
cttctgaggc ggaaagaacc agctggggct cgactagagc atggctacgt agataagtag       60

catggcgggt taatcattaa ctaca                                             85


<210>  22
<211>  4674
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  polynucleotide sequence of the invention

<400>  22
cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt tggtcgcccg       60

gcctcagtga gcgagcgagc gcgcagagag ggagtggcca actccatcac taggggttcc      120

ttgtagttaa tgattaaccc gccatgctac ttatctacgt agccatgctc taggtaccgg      180

agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc aacgaccccc      240

gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg actttccatt      300

gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt ggcagtacat caagtgtatc      360

atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg      420

cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg      480

ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact      540

cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa      600

atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta      660

ggcgtgtacg gtgggaggtc tatataagca gagctcgttt agtgaaccgt cagatcgcct      720

ggagacgcca tccacgctgt tttgacctcc atagaagaca ccgactagtg ccaccatgcg      780

cctcctggcc aagatcatct gcctcatgct gtgggccatc tgcgtggctg aggactgcaa      840

tgagctgccg cccaggagga acacagagat cctgacaggg agctggtctg accagaccta      900

ccctgagggc acccaggcga tctacaagtg ccggccgggc tacaggagcc tggggaacat      960

catcatggtg tgtagaaagg gcgaatgggt ggccctcaac cccctgagga agtgccagaa     1020

gcggccctgt ggccaccccg gggacacacc cttcgggacc ttcaccctga ccggcggcaa     1080

tgtgtttgag tacggcgtga aggctgtcta cacatgcaac gaggggtacc agctgctggg     1140

cgagattaac taccgggagt gtgacaccga tgggtggacc aacgacattc ccatctgtga     1200

ggtggtcaag tgtctccccg tgacagcccc agaaaatggc aaaatcgtga gcagcgccat     1260

ggagcctgac cgcgaatatc actttgggca ggccgtgagg tttgtgtgca actcgggcta     1320

caaaattgaa ggtgatgagg agatgcactg cagcgatgat ggcttctggt ccaaggagaa     1380

gcccaaatgt gtggagatct cctgcaagtc tcccgacgtg atcaacggca gcccaatcag     1440

ccagaagatt atttacaaag agaacgagcg cttccagtac aagtgtaaca tgggctatga     1500

gtattcagag aggggagatg ccgtctgcac tgagagcggc tggagaccac tgcctagctg     1560

cgaggaaaag agttgtgaca acccttacat cccaaatggc gactactccc ctctgcggat     1620

caaacaccgg accggggatg aaatcaccta tcagtgccgc aatggattct acccggccac     1680

ccgcggcaac accgccaaat gcaccagcac aggctggatc cccgcccccc gctgtacgct     1740

gaagccttgc gactatccag acatcaagca cggaggcctg taccacgaaa acatgcggcg     1800

gccttatttc cctgtggcag tggggaagta ctacagctac tactgcgacg agcacttcga     1860

gaccccctct ggctcctact gggaccacat ccactgcaca caggacggct ggtctccagc     1920

tgtgccctgc ctgaggaaat gctacttccc ctacctggag aacggataca accagaacta     1980

tggccgcaag ttcgtgcagg gcaagagcat cgatgtggcc tgccaccctg gctacgccct     2040

gcccaaggcc cagacaactg tgacctgcat ggagaatggt tggagcccca ccccgcgctg     2100

catccgggtg tccttcacgc tccgaaggaa acgaggaagc ggagaagcca gacacaaaca     2160

gaaaattgtg gcaccggtga aacagacttt gaattttgac cttctcaagt tggcgggaga     2220

cgtcgagtcc aaccctgggc ccatgaaact gctgcatgtc ttcctcctct tcctgtgctt     2280

ccacctccgt ttctgtaaag tcacctacac tagccaggag gatctggtgg agaagaaatg     2340

cctggccaag aagtataccc acctgagctg cgacaaagtg ttctgccagc cctggcaacg     2400

ctgcattgaa ggtacttgtg tgtgcaagct gccctaccag tgccccaaga acggcacggc     2460

cgtgtgtgcc accaacagga ggagcttccc cacctactgc cagcagaaga gcctggaatg     2520

cctccaccct ggcaccaagt ttctgaacaa cgggacctgc acagccgagg ggaaattcag     2580

cgtctccctc aagcacggca atacagactc cgagggcatt gtggaagtga agctggtgga     2640

ccaggacaag accatgttca tctgcaaaag cagctggtcc atgcgggagg ccaatgtcgc     2700

ctgcctggac ctgggcttcc agcagggcgc tgatacacag cgccgcttta aactcagtga     2760

cctcagcatc aacagcactg agtgtctgca cgtgcactgc cggggcctgg agaccagcct     2820

ggctgagtgc accttcacca agcgcaggac catgggctac caggattttg cagatgtggt     2880

ctgctacacc cagaaggcag acagccccat ggatgacttc ttccagtgtg tcaatggcaa     2940

gtacatttcc cagatgaagg cttgtgacgg gatcaatgat tgcggggatc agagcgatga     3000

gctctgctgc aaggcctgcc aagggaaggg ctttcactgt aagtctgggg tgtgcatccc     3060

ttctcagtat cagtgcaacg gagaggtgga ctgcatcact ggggaggacg aggtgggctg     3120

tgctggcttc gcctctgtgg cccaggagga gacagagatc ctcacagctg acatggatgc     3180

agagcggcgg cgcatcaaga gtctgctccc aaagctctcc tgcggcgtta agaatcgcat     3240

gcacatccgg aggaagcgga tcgttggagg caaacgggct cagctggggg acttgccgtg     3300

gcaggtggcc atcaaagatg cctccggaat cacctgtggt ggcatctaca tcggcggctg     3360

ctggatcctg accgccgccc actgccttcg ggccagcaag actcaccgct accagatctg     3420

gaccaccgtg gtggattgga ttcaccccga cctgaagagg attgtcattg agtatgtcga     3480

ccgcatcatc ttccatgaaa actacaatgc cgggacgtat cagaacgaca tcgccctcat     3540

cgagatgaag aaggatggga acaagaagga ctgtgagctg cctcgctcca tccccgcctg     3600

tgtaccatgg tctccgtacc tgttccagcc aaatgacaca tgcatcgtga gcggctgggg     3660

ccgcgagaaa gacaacgaga gggtcttctc cctgcagtgg ggtgaagtca agctgatcag     3720

caactgctcc aagttctacg gcaaccgctt ctatgagaag gagatggagt gcgccggcac     3780

ctatgacggc agcattgacg cgtgcaaggg agacagtggg ggccccctgg tctgcatgga     3840

cgccaacaat gtgacctacg tgtggggagt tgtgtcctgg ggcgagaact gtggcaagcc     3900

tgagttcccg ggcgtgtaca caaaggtggc aaactatttt gactggatct cctatcacgt     3960

tggcaggccc ttcatttcac agtacaacgt ataactcgag aatcaacctc tggattacaa     4020

aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc tatgtggata     4080

cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca ttttctcctc     4140

cttgtataaa tcctggttag ttcttgccac ggcggaactc atcgccgcct gccttgcccg     4200

ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgtgcc ttctagttgc     4260

cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc     4320

actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct     4380

attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg     4440

catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctcg     4500

actagagcat ggctacgtag ataagtagca tggcgggtta atcattaact acaaggaacc     4560

cctagtgatg gagttggcca ctccctctct gcgcgctcgc tcgctcactg aggccgggcg     4620

accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc tcagtgagcg agcg           4674


<210>  23
<211>  4548
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  polynucleotide sequence of the invention

<400>  23
cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt tggtcgcccg       60

gcctcagtga gcgagcgagc gcgcagagag ggagtggcca actccatcac taggggttcc      120

tggcgcgccg gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc      180

caacgacccc cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg      240

gactttccat tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca      300

tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc      360

ctggcattat gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt      420

attagtcatc gctattacca tggtgatgcg gttttggcag tacatcaatg ggcgtggata      480

gcggtttgac tcacggggat ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt      540

ttggcaccaa aatcaacggg actttccaaa atgtcgtaac aactccgccc cattgacgca      600

aatgggcggt aggcgtgtac ggtgggaggt ctatataagc agagctcgtt tagtgaaccg      660

tcagatcgcc tggagacgcc atccacgctg ttttgacctc catagaagac accgactagt      720

gccaccatgc gcctcctggc caagatcatc tgcctcatgc tgtgggccat ctgcgtggct      780

gaggactgca atgagctgcc gcccaggagg aacacagaga tcctgacagg gagctggtct      840

gaccagacct accctgaggg cacccaggcg atctacaagt gccggccggg ctacaggagc      900

ctggggaaca tcatcatggt gtgtagaaag ggcgaatggg tggccctcaa ccccctgagg      960

aagtgccaga agcggccctg tggccacccc ggggacacac ccttcgggac cttcaccctg     1020

accggcggca atgtgtttga gtacggcgtg aaggctgtct acacatgcaa cgaggggtac     1080

cagctgctgg gcgagattaa ctaccgggag tgtgacaccg atgggtggac caacgacatt     1140

cccatctgtg aggtggtcaa gtgtctcccc gtgacagccc cagaaaatgg caaaatcgtg     1200

agcagcgcca tggagcctga ccgcgaatat cactttgggc aggccgtgag gtttgtgtgc     1260

aactcgggct acaaaattga aggtgatgag gagatgcact gcagcgatga tggcttctgg     1320

tccaaggaga agcccaaatg tgtggagatc tcctgcaagt ctcccgacgt gatcaacggc     1380

agcccaatca gccagaagat tatttacaaa gagaacgagc gcttccagta caagtgtaac     1440

atgggctatg agtattcaga gaggggagat gccgtctgca ctgagagcgg ctggagacca     1500

ctgcctagct gcgaggaaaa gagttgtgac aacccttaca tcccaaatgg cgactactcc     1560

cctctgcgga tcaaacaccg gaccggggat gaaatcacct atcagtgccg caatggattc     1620

tacccggcca cccgcggcaa caccgccaaa tgcaccagca caggctggat ccccgccccc     1680

cgctgtacgc tgaagccttg cgactatcca gacatcaagc acggaggcct gtaccacgaa     1740

aacatgcggc ggccttattt ccctgtggca gtggggaagt actacagcta ctactgcgac     1800

gagcacttcg agaccccctc tggctcctac tgggaccaca tccactgcac acaggacggc     1860

tggtctccag ctgtgccctg cctgaggaaa tgctacttcc cctacctgga gaacggatac     1920

aaccagaact atggccgcaa gttcgtgcag ggcaagagca tcgatgtggc ctgccaccct     1980

ggctacgccc tgcccaaggc ccagacaact gtgacctgca tggagaatgg ttggagcccc     2040

accccgcgct gcatccgggt gtccttcacg ctccgaagga aacgaggaag cggagaagcc     2100

agacacaaac agaaaattgt ggcaccggtg aaacagactt tgaattttga ccttctcaag     2160

ttggcgggag acgtcgagtc caaccctggg cccatgaaac tgctgcatgt cttcctcctc     2220

ttcctgtgct tccacctccg tttctgtaaa gtcacctaca ctagccagga ggatctggtg     2280

gagaagaaat gcctggccaa gaagtatacc cacctgagct gcgacaaagt gttctgccag     2340

ccctggcaac gctgcattga aggtacttgt gtgtgcaagc tgccctacca gtgccccaag     2400

aacggcacgg ccgtgtgtgc caccaacagg aggagcttcc ccacctactg ccagcagaag     2460

agcctggaat gcctccaccc tggcaccaag tttctgaaca acgggacctg cacagccgag     2520

gggaaattca gcgtctccct caagcacggc aatacagact ccgagggcat tgtggaagtg     2580

aagctggtgg accaggacaa gaccatgttc atctgcaaaa gcagctggtc catgcgggag     2640

gccaatgtcg cctgcctgga cctgggcttc cagcagggcg ctgatacaca gcgccgcttt     2700

aaactcagtg acctcagcat caacagcact gagtgtctgc acgtgcactg ccggggcctg     2760

gagaccagcc tggctgagtg caccttcacc aagcgcagga ccatgggcta ccaggatttt     2820

gcagatgtgg tctgctacac ccagaaggca gacagcccca tggatgactt cttccagtgt     2880

gtcaatggca agtacatttc ccagatgaag gcttgtgacg ggatcaatga ttgcggggat     2940

cagagcgatg agctctgctg caaggcctgc caagggaagg gctttcactg taagtctggg     3000

gtgtgcatcc cttctcagta tcagtgcaac ggagaggtgg actgcatcac tggggaggac     3060

gaggtgggct gtgctggctt cgcctctgtg gcccaggagg agacagagat cctcacagct     3120

gacatggatg cagagcggcg gcgcatcaag agtctgctcc caaagctctc ctgcggcgtt     3180

aagaatcgca tgcacatccg gaggaagcgg atcgttggag gcaaacgggc tcagctgggg     3240

gacttgccgt ggcaggtggc catcaaagat gcctccggaa tcacctgtgg tggcatctac     3300

atcggcggct gctggatcct gaccgccgcc cactgccttc gggccagcaa gactcaccgc     3360

taccagatct ggaccaccgt ggtggattgg attcaccccg acctgaagag gattgtcatt     3420

gagtatgtcg accgcatcat cttccatgaa aactacaatg ccgggacgta tcagaacgac     3480

atcgccctca tcgagatgaa gaaggatggg aacaagaagg actgtgagct gcctcgctcc     3540

atccccgcct gtgtaccatg gtctccgtac ctgttccagc caaatgacac atgcatcgtg     3600

agcggctggg gccgcgagaa agacaacgag agggtcttct ccctgcagtg gggtgaagtc     3660

aagctgatca gcaactgctc caagttctac ggcaaccgct tctatgagaa ggagatggag     3720

tgcgccggca cctatgacgg cagcattgac gcgtgcaagg gagacagtgg gggccccctg     3780

gtctgcatgg acgccaacaa tgtgacctac gtgtggggag ttgtgtcctg gggcgagaac     3840

tgtggcaagc ctgagttccc gggcgtgtac acaaaggtgg caaactattt tgactggatc     3900

tcctatcacg ttggcaggcc cttcatttca cagtacaacg tataactcga gaatcaacct     3960

ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc tccttttacg     4020

ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg tatggctttc     4080

attttctcct ccttgtataa atcctggtta gttcttgcca cggcggaact catcgccgcc     4140

tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgtgc     4200

cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag     4260

gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta     4320

ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag     4380

acaatagcag gcatgctggg gatgcggtgg gctctatggg cggccgcagg aacccctagt     4440

gatggagttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa     4500

ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg agcgagcg                  4548


<210>  24
<211>  2039
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example Complement Receptor 1 (CR1) sequence

<400>  24

Met Gly Ala Ser Ser Pro Arg Ser Pro Glu Pro Val Gly Pro Pro Ala 
1               5                   10                  15      


Pro Gly Leu Pro Phe Cys Cys Gly Gly Ser Leu Leu Ala Val Val Val 
            20                  25                  30          


Leu Leu Ala Leu Pro Val Ala Trp Gly Gln Cys Asn Ala Pro Glu Trp 
        35                  40                  45              


Leu Pro Phe Ala Arg Pro Thr Asn Leu Thr Asp Glu Phe Glu Phe Pro 
    50                  55                  60                  


Ile Gly Thr Tyr Leu Asn Tyr Glu Cys Arg Pro Gly Tyr Ser Gly Arg 
65                  70                  75                  80  


Pro Phe Ser Ile Ile Cys Leu Lys Asn Ser Val Trp Thr Gly Ala Lys 
                85                  90                  95      


Asp Arg Cys Arg Arg Lys Ser Cys Arg Asn Pro Pro Asp Pro Val Asn 
            100                 105                 110         


Gly Met Val His Val Ile Lys Gly Ile Gln Phe Gly Ser Gln Ile Lys 
        115                 120                 125             


Tyr Ser Cys Thr Lys Gly Tyr Arg Leu Ile Gly Ser Ser Ser Ala Thr 
    130                 135                 140                 


Cys Ile Ile Ser Gly Asp Thr Val Ile Trp Asp Asn Glu Thr Pro Ile 
145                 150                 155                 160 


Cys Asp Arg Ile Pro Cys Gly Leu Pro Pro Thr Ile Thr Asn Gly Asp 
                165                 170                 175     


Phe Ile Ser Thr Asn Arg Glu Asn Phe His Tyr Gly Ser Val Val Thr 
            180                 185                 190         


Tyr Arg Cys Asn Pro Gly Ser Gly Gly Arg Lys Val Phe Glu Leu Val 
        195                 200                 205             


Gly Glu Pro Ser Ile Tyr Cys Thr Ser Asn Asp Asp Gln Val Gly Ile 
    210                 215                 220                 


Trp Ser Gly Pro Ala Pro Gln Cys Ile Ile Pro Asn Lys Cys Thr Pro 
225                 230                 235                 240 


Pro Asn Val Glu Asn Gly Ile Leu Val Ser Asp Asn Arg Ser Leu Phe 
                245                 250                 255     


Ser Leu Asn Glu Val Val Glu Phe Arg Cys Gln Pro Gly Phe Val Met 
            260                 265                 270         


Lys Gly Pro Arg Arg Val Lys Cys Gln Ala Leu Asn Lys Trp Glu Pro 
        275                 280                 285             


Glu Leu Pro Ser Cys Ser Arg Val Cys Gln Pro Pro Pro Asp Val Leu 
    290                 295                 300                 


His Ala Glu Arg Thr Gln Arg Asp Lys Asp Asn Phe Ser Pro Gly Gln 
305                 310                 315                 320 


Glu Val Phe Tyr Ser Cys Glu Pro Gly Tyr Asp Leu Arg Gly Ala Ala 
                325                 330                 335     


Ser Met Arg Cys Thr Pro Gln Gly Asp Trp Ser Pro Ala Ala Pro Thr 
            340                 345                 350         


Cys Glu Val Lys Ser Cys Asp Asp Phe Met Gly Gln Leu Leu Asn Gly 
        355                 360                 365             


Arg Val Leu Phe Pro Val Asn Leu Gln Leu Gly Ala Lys Val Asp Phe 
    370                 375                 380                 


Val Cys Asp Glu Gly Phe Gln Leu Lys Gly Ser Ser Ala Ser Tyr Cys 
385                 390                 395                 400 


Val Leu Ala Gly Met Glu Ser Leu Trp Asn Ser Ser Val Pro Val Cys 
                405                 410                 415     


Glu Gln Ile Phe Cys Pro Ser Pro Pro Val Ile Pro Asn Gly Arg His 
            420                 425                 430         


Thr Gly Lys Pro Leu Glu Val Phe Pro Phe Gly Lys Thr Val Asn Tyr 
        435                 440                 445             


Thr Cys Asp Pro His Pro Asp Arg Gly Thr Ser Phe Asp Leu Ile Gly 
    450                 455                 460                 


Glu Ser Thr Ile Arg Cys Thr Ser Asp Pro Gln Gly Asn Gly Val Trp 
465                 470                 475                 480 


Ser Ser Pro Ala Pro Arg Cys Gly Ile Leu Gly His Cys Gln Ala Pro 
                485                 490                 495     


Asp His Phe Leu Phe Ala Lys Leu Lys Thr Gln Thr Asn Ala Ser Asp 
            500                 505                 510         


Phe Pro Ile Gly Thr Ser Leu Lys Tyr Glu Cys Arg Pro Glu Tyr Tyr 
        515                 520                 525             


Gly Arg Pro Phe Ser Ile Thr Cys Leu Asp Asn Leu Val Trp Ser Ser 
    530                 535                 540                 


Pro Lys Asp Val Cys Lys Arg Lys Ser Cys Lys Thr Pro Pro Asp Pro 
545                 550                 555                 560 


Val Asn Gly Met Val His Val Ile Thr Asp Ile Gln Val Gly Ser Arg 
                565                 570                 575     


Ile Asn Tyr Ser Cys Thr Thr Gly His Arg Leu Ile Gly His Ser Ser 
            580                 585                 590         


Ala Glu Cys Ile Leu Ser Gly Asn Ala Ala His Trp Ser Thr Lys Pro 
        595                 600                 605             


Pro Ile Cys Gln Arg Ile Pro Cys Gly Leu Pro Pro Thr Ile Ala Asn 
    610                 615                 620                 


Gly Asp Phe Ile Ser Thr Asn Arg Glu Asn Phe His Tyr Gly Ser Val 
625                 630                 635                 640 


Val Thr Tyr Arg Cys Asn Pro Gly Ser Gly Gly Arg Lys Val Phe Glu 
                645                 650                 655     


Leu Val Gly Glu Pro Ser Ile Tyr Cys Thr Ser Asn Asp Asp Gln Val 
            660                 665                 670         


Gly Ile Trp Ser Gly Pro Ala Pro Gln Cys Ile Ile Pro Asn Lys Cys 
        675                 680                 685             


Thr Pro Pro Asn Val Glu Asn Gly Ile Leu Val Ser Asp Asn Arg Ser 
    690                 695                 700                 


Leu Phe Ser Leu Asn Glu Val Val Glu Phe Arg Cys Gln Pro Gly Phe 
705                 710                 715                 720 


Val Met Lys Gly Pro Arg Arg Val Lys Cys Gln Ala Leu Asn Lys Trp 
                725                 730                 735     


Glu Pro Glu Leu Pro Ser Cys Ser Arg Val Cys Gln Pro Pro Pro Asp 
            740                 745                 750         


Val Leu His Ala Glu Arg Thr Gln Arg Asp Lys Asp Asn Phe Ser Pro 
        755                 760                 765             


Gly Gln Glu Val Phe Tyr Ser Cys Glu Pro Gly Tyr Asp Leu Arg Gly 
    770                 775                 780                 


Ala Ala Ser Met Arg Cys Thr Pro Gln Gly Asp Trp Ser Pro Ala Ala 
785                 790                 795                 800 


Pro Thr Cys Glu Val Lys Ser Cys Asp Asp Phe Met Gly Gln Leu Leu 
                805                 810                 815     


Asn Gly Arg Val Leu Phe Pro Val Asn Leu Gln Leu Gly Ala Lys Val 
            820                 825                 830         


Asp Phe Val Cys Asp Glu Gly Phe Gln Leu Lys Gly Ser Ser Ala Ser 
        835                 840                 845             


Tyr Cys Val Leu Ala Gly Met Glu Ser Leu Trp Asn Ser Ser Val Pro 
    850                 855                 860                 


Val Cys Glu Gln Ile Phe Cys Pro Ser Pro Pro Val Ile Pro Asn Gly 
865                 870                 875                 880 


Arg His Thr Gly Lys Pro Leu Glu Val Phe Pro Phe Gly Lys Ala Val 
                885                 890                 895     


Asn Tyr Thr Cys Asp Pro His Pro Asp Arg Gly Thr Ser Phe Asp Leu 
            900                 905                 910         


Ile Gly Glu Ser Thr Ile Arg Cys Thr Ser Asp Pro Gln Gly Asn Gly 
        915                 920                 925             


Val Trp Ser Ser Pro Ala Pro Arg Cys Gly Ile Leu Gly His Cys Gln 
    930                 935                 940                 


Ala Pro Asp His Phe Leu Phe Ala Lys Leu Lys Thr Gln Thr Asn Ala 
945                 950                 955                 960 


Ser Asp Phe Pro Ile Gly Thr Ser Leu Lys Tyr Glu Cys Arg Pro Glu 
                965                 970                 975     


Tyr Tyr Gly Arg Pro Phe Ser Ile Thr Cys Leu Asp Asn Leu Val Trp 
            980                 985                 990         


Ser Ser Pro Lys Asp Val Cys Lys  Arg Lys Ser Cys Lys  Thr Pro Pro 
        995                 1000                 1005             


Asp Pro  Val Asn Gly Met Val  His Val Ile Thr Asp  Ile Gln Val 
    1010                 1015                 1020             


Gly Ser  Arg Ile Asn Tyr Ser  Cys Thr Thr Gly His  Arg Leu Ile 
    1025                 1030                 1035             


Gly His  Ser Ser Ala Glu Cys  Ile Leu Ser Gly Asn  Thr Ala His 
    1040                 1045                 1050             


Trp Ser  Thr Lys Pro Pro Ile  Cys Gln Arg Ile Pro  Cys Gly Leu 
    1055                 1060                 1065             


Pro Pro  Thr Ile Ala Asn Gly  Asp Phe Ile Ser Thr  Asn Arg Glu 
    1070                 1075                 1080             


Asn Phe  His Tyr Gly Ser Val  Val Thr Tyr Arg Cys  Asn Leu Gly 
    1085                 1090                 1095             


Ser Arg  Gly Arg Lys Val Phe  Glu Leu Val Gly Glu  Pro Ser Ile 
    1100                 1105                 1110             


Tyr Cys  Thr Ser Asn Asp Asp  Gln Val Gly Ile Trp  Ser Gly Pro 
    1115                 1120                 1125             


Ala Pro  Gln Cys Ile Ile Pro  Asn Lys Cys Thr Pro  Pro Asn Val 
    1130                 1135                 1140             


Glu Asn  Gly Ile Leu Val Ser  Asp Asn Arg Ser Leu  Phe Ser Leu 
    1145                 1150                 1155             


Asn Glu  Val Val Glu Phe Arg  Cys Gln Pro Gly Phe  Val Met Lys 
    1160                 1165                 1170             


Gly Pro  Arg Arg Val Lys Cys  Gln Ala Leu Asn Lys  Trp Glu Pro 
    1175                 1180                 1185             


Glu Leu  Pro Ser Cys Ser Arg  Val Cys Gln Pro Pro  Pro Glu Ile 
    1190                 1195                 1200             


Leu His  Gly Glu His Thr Pro  Ser His Gln Asp Asn  Phe Ser Pro 
    1205                 1210                 1215             


Gly Gln  Glu Val Phe Tyr Ser  Cys Glu Pro Gly Tyr  Asp Leu Arg 
    1220                 1225                 1230             


Gly Ala  Ala Ser Leu His Cys  Thr Pro Gln Gly Asp  Trp Ser Pro 
    1235                 1240                 1245             


Glu Ala  Pro Arg Cys Ala Val  Lys Ser Cys Asp Asp  Phe Leu Gly 
    1250                 1255                 1260             


Gln Leu  Pro His Gly Arg Val  Leu Phe Pro Leu Asn  Leu Gln Leu 
    1265                 1270                 1275             


Gly Ala  Lys Val Ser Phe Val  Cys Asp Glu Gly Phe  Arg Leu Lys 
    1280                 1285                 1290             


Gly Ser  Ser Val Ser His Cys  Val Leu Val Gly Met  Arg Ser Leu 
    1295                 1300                 1305             


Trp Asn  Asn Ser Val Pro Val  Cys Glu His Ile Phe  Cys Pro Asn 
    1310                 1315                 1320             


Pro Pro  Ala Ile Leu Asn Gly  Arg His Thr Gly Thr  Pro Ser Gly 
    1325                 1330                 1335             


Asp Ile  Pro Tyr Gly Lys Glu  Ile Ser Tyr Thr Cys  Asp Pro His 
    1340                 1345                 1350             


Pro Asp  Arg Gly Met Thr Phe  Asn Leu Ile Gly Glu  Ser Thr Ile 
    1355                 1360                 1365             


Arg Cys  Thr Ser Asp Pro His  Gly Asn Gly Val Trp  Ser Ser Pro 
    1370                 1375                 1380             


Ala Pro  Arg Cys Glu Leu Ser  Val Arg Ala Gly His  Cys Lys Thr 
    1385                 1390                 1395             


Pro Glu  Gln Phe Pro Phe Ala  Ser Pro Thr Ile Pro  Ile Asn Asp 
    1400                 1405                 1410             


Phe Glu  Phe Pro Val Gly Thr  Ser Leu Asn Tyr Glu  Cys Arg Pro 
    1415                 1420                 1425             


Gly Tyr  Phe Gly Lys Met Phe  Ser Ile Ser Cys Leu  Glu Asn Leu 
    1430                 1435                 1440             


Val Trp  Ser Ser Val Glu Asp  Asn Cys Arg Arg Lys  Ser Cys Gly 
    1445                 1450                 1455             


Pro Pro  Pro Glu Pro Phe Asn  Gly Met Val His Ile  Asn Thr Asp 
    1460                 1465                 1470             


Thr Gln  Phe Gly Ser Thr Val  Asn Tyr Ser Cys Asn  Glu Gly Phe 
    1475                 1480                 1485             


Arg Leu  Ile Gly Ser Pro Ser  Thr Thr Cys Leu Val  Ser Gly Asn 
    1490                 1495                 1500             


Asn Val  Thr Trp Asp Lys Lys  Ala Pro Ile Cys Glu  Ile Ile Ser 
    1505                 1510                 1515             


Cys Glu  Pro Pro Pro Thr Ile  Ser Asn Gly Asp Phe  Tyr Ser Asn 
    1520                 1525                 1530             


Asn Arg  Thr Ser Phe His Asn  Gly Thr Val Val Thr  Tyr Gln Cys 
    1535                 1540                 1545             


His Thr  Gly Pro Asp Gly Glu  Gln Leu Phe Glu Leu  Val Gly Glu 
    1550                 1555                 1560             


Arg Ser  Ile Tyr Cys Thr Ser  Lys Asp Asp Gln Val  Gly Val Trp 
    1565                 1570                 1575             


Ser Ser  Pro Pro Pro Arg Cys  Ile Ser Thr Asn Lys  Cys Thr Ala 
    1580                 1585                 1590             


Pro Glu  Val Glu Asn Ala Ile  Arg Val Pro Gly Asn  Arg Ser Phe 
    1595                 1600                 1605             


Phe Thr  Leu Thr Glu Ile Ile  Arg Phe Arg Cys Gln  Pro Gly Phe 
    1610                 1615                 1620             


Val Met  Val Gly Ser His Thr  Val Gln Cys Gln Thr  Asn Gly Arg 
    1625                 1630                 1635             


Trp Gly  Pro Lys Leu Pro His  Cys Ser Arg Val Cys  Gln Pro Pro 
    1640                 1645                 1650             


Pro Glu  Ile Leu His Gly Glu  His Thr Leu Ser His  Gln Asp Asn 
    1655                 1660                 1665             


Phe Ser  Pro Gly Gln Glu Val  Phe Tyr Ser Cys Glu  Pro Ser Tyr 
    1670                 1675                 1680             


Asp Leu  Arg Gly Ala Ala Ser  Leu His Cys Thr Pro  Gln Gly Asp 
    1685                 1690                 1695             


Trp Ser  Pro Glu Ala Pro Arg  Cys Thr Val Lys Ser  Cys Asp Asp 
    1700                 1705                 1710             


Phe Leu  Gly Gln Leu Pro His  Gly Arg Val Leu Leu  Pro Leu Asn 
    1715                 1720                 1725             


Leu Gln  Leu Gly Ala Lys Val  Ser Phe Val Cys Asp  Glu Gly Phe 
    1730                 1735                 1740             


Arg Leu  Lys Gly Arg Ser Ala  Ser His Cys Val Leu  Ala Gly Met 
    1745                 1750                 1755             


Lys Ala  Leu Trp Asn Ser Ser  Val Pro Val Cys Glu  Gln Ile Phe 
    1760                 1765                 1770             


Cys Pro  Asn Pro Pro Ala Ile  Leu Asn Gly Arg His  Thr Gly Thr 
    1775                 1780                 1785             


Pro Phe  Gly Asp Ile Pro Tyr  Gly Lys Glu Ile Ser  Tyr Ala Cys 
    1790                 1795                 1800             


Asp Thr  His Pro Asp Arg Gly  Met Thr Phe Asn Leu  Ile Gly Glu 
    1805                 1810                 1815             


Ser Ser  Ile Arg Cys Thr Ser  Asp Pro Gln Gly Asn  Gly Val Trp 
    1820                 1825                 1830             


Ser Ser  Pro Ala Pro Arg Cys  Glu Leu Ser Val Pro  Ala Ala Cys 
    1835                 1840                 1845             


Pro His  Pro Pro Lys Ile Gln  Asn Gly His Tyr Ile  Gly Gly His 
    1850                 1855                 1860             


Val Ser  Leu Tyr Leu Pro Gly  Met Thr Ile Ser Tyr  Ile Cys Asp 
    1865                 1870                 1875             


Pro Gly  Tyr Leu Leu Val Gly  Lys Gly Phe Ile Phe  Cys Thr Asp 
    1880                 1885                 1890             


Gln Gly  Ile Trp Ser Gln Leu  Asp His Tyr Cys Lys  Glu Val Asn 
    1895                 1900                 1905             


Cys Ser  Phe Pro Leu Phe Met  Asn Gly Ile Ser Lys  Glu Leu Glu 
    1910                 1915                 1920             


Met Lys  Lys Val Tyr His Tyr  Gly Asp Tyr Val Thr  Leu Lys Cys 
    1925                 1930                 1935             


Glu Asp  Gly Tyr Thr Leu Glu  Gly Ser Pro Trp Ser  Gln Cys Gln 
    1940                 1945                 1950             


Ala Asp  Asp Arg Trp Asp Pro  Pro Leu Ala Lys Cys  Thr Ser Arg 
    1955                 1960                 1965             


Thr His  Asp Ala Leu Ile Val  Gly Thr Leu Ser Gly  Thr Ile Phe 
    1970                 1975                 1980             


Phe Ile  Leu Leu Ile Ile Phe  Leu Ser Trp Ile Ile  Leu Lys His 
    1985                 1990                 1995             


Arg Lys  Gly Asn Asn Ala His  Glu Asn Pro Lys Glu  Val Ala Ile 
    2000                 2005                 2010             


His Leu  His Ser Gln Gly Gly  Ser Ser Val His Pro  Arg Thr Leu 
    2015                 2020                 2025             


Gln Thr  Asn Glu Glu Asn Ser  Arg Val Leu Pro 
    2030                 2035                 


<210>  25
<211>  6120
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence encoding CR1

<400>  25
atgggggcct cttctccaag aagcccggag cctgtcgggc cgccggcgcc cggtctcccc       60

ttctgctgcg gaggatccct gctggcggtt gtggtgctgc ttgcgctgcc ggtggcctgg      120

ggtcaatgca atgccccaga atggcttcca tttgccaggc ctaccaacct aactgatgaa      180

tttgagtttc ccattgggac atatctgaac tatgaatgcc gccctggtta ttccggaaga      240

ccgttttcta tcatctgcct aaaaaactca gtctggactg gtgctaagga caggtgcaga      300

cgtaaatcat gtcgtaatcc tccagatcct gtgaatggca tggtgcatgt gatcaaaggc      360

atccagttcg gatcccaaat taaatattct tgtactaaag gataccgact cattggttcc      420

tcgtctgcca catgcatcat ctcaggtgat actgtcattt gggataatga aacacctatt      480

tgtgacagaa ttccttgtgg gctacccccc accatcacca atggagattt cattagcacc      540

aacagagaga attttcacta tggatcagtg gtgacctacc gctgcaatcc tggaagcgga      600

gggagaaagg tgtttgagct tgtgggtgag ccctccatat actgcaccag caatgacgat      660

caagtgggca tctggagcgg ccccgcccct cagtgcatta tacctaacaa atgcacgcct      720

ccaaatgtgg aaaatggaat attggtatct gacaacagaa gcttattttc cttaaatgaa      780

gttgtggagt ttaggtgtca gcctggcttt gtcatgaaag gaccccgccg tgtgaagtgc      840

caggccctga acaaatggga gccggagcta ccaagctgct ccagggtatg tcagccacct      900

ccagatgtcc tgcatgctga gcgtacccaa agggacaagg acaacttttc acctgggcag      960

gaagtgttct acagctgtga gcccggctac gacctcagag gggctgcgtc tatgcgctgc     1020

acaccccagg gagactggag ccctgcagcc cccacatgtg aagtgaaatc ctgtgatgac     1080

ttcatgggcc aacttcttaa tggccgtgtg ctatttccag taaatctcca gcttggagca     1140

aaagtggatt ttgtttgtga tgaaggattt caattaaaag gcagctctgc tagttactgt     1200

gtcttggctg gaatggaaag cctttggaat agcagtgttc cagtgtgtga acaaatcttt     1260

tgtccaagtc ctccagttat tcctaatggg agacacacag gaaaacctct ggaagtcttt     1320

ccctttggga aaacagtaaa ttacacatgc gacccccacc cagacagagg gacgagcttc     1380

gacctcattg gagagagcac catccgctgc acaagtgacc ctcaagggaa tggggtttgg     1440

agcagccctg cccctcgctg tggaattctg ggtcactgtc aagccccaga tcattttctg     1500

tttgccaagt tgaaaaccca aaccaatgca tctgactttc ccattgggac atctttaaag     1560

tacgaatgcc gtcctgagta ctacgggagg ccattctcta tcacatgtct agataacctg     1620

gtctggtcaa gtcccaaaga tgtctgtaaa cgtaaatcat gtaaaactcc tccagatcca     1680

gtgaatggca tggtgcatgt gatcacagac atccaggttg gatccagaat caactattct     1740

tgtactacag ggcaccgact cattggtcac tcatctgctg aatgtatcct ctcgggcaat     1800

gctgcccatt ggagcacgaa gccgccaatt tgtcaacgaa ttccttgtgg gctacccccc     1860

accatcgcca atggagattt cattagcacc aacagagaga attttcacta tggatcagtg     1920

gtgacctacc gctgcaatcc tggaagcgga gggagaaagg tgtttgagct tgtgggtgag     1980

ccctccatat actgcaccag caatgacgat caagtgggca tctggagcgg cccggcccct     2040

cagtgcatta tacctaacaa atgcacgcct ccaaatgtgg aaaatggaat attggtatct     2100

gacaacagaa gcttattttc cttaaatgaa gttgtggagt ttaggtgtca gcctggcttt     2160

gtcatgaaag gaccccgccg tgtgaagtgc caggccctga acaaatggga gccggagcta     2220

ccaagctgct ccagggtatg tcagccacct ccagatgtcc tgcatgctga gcgtacccaa     2280

agggacaagg acaacttttc acccgggcag gaagtgttct acagctgtga gcccggctat     2340

gacctcagag gggctgcgtc tatgcgctgc acaccccagg gagactggag ccctgcagcc     2400

cccacatgtg aagtgaaatc ctgtgatgac ttcatgggcc aacttcttaa tggccgtgtg     2460

ctatttccag taaatctcca gcttggagca aaagtggatt ttgtttgtga tgaaggattt     2520

caattaaaag gcagctctgc tagttattgt gtcttggctg gaatggaaag cctttggaat     2580

agcagtgttc cagtgtgtga acaaatcttt tgtccaagtc ctccagttat tcctaatggg     2640

agacacacag gaaaacctct ggaagtcttt ccctttggaa aagcagtaaa ttacacatgc     2700

gacccccacc cagacagagg gacgagcttc gacctcattg gagagagcac catccgctgc     2760

acaagtgacc ctcaagggaa tggggtttgg agcagccctg cccctcgctg tggaattctg     2820

ggtcactgtc aagccccaga tcattttctg tttgccaagt tgaaaaccca aaccaatgca     2880

tctgactttc ccattgggac atctttaaag tacgaatgcc gtcctgagta ctacgggagg     2940

ccattctcta tcacatgtct agataacctg gtctggtcaa gtcccaaaga tgtctgtaaa     3000

cgtaaatcat gtaaaactcc tccagatcca gtgaatggca tggtgcatgt gatcacagac     3060

atccaggttg gatccagaat caactattct tgtactacag ggcaccgact cattggtcac     3120

tcatctgctg aatgtatcct ctcaggcaat actgcccatt ggagcacgaa gccgccaatt     3180

tgtcaacgaa ttccttgtgg gctaccccca accatcgcca atggagattt cattagcacc     3240

aacagagaga attttcacta tggatcagtg gtgacctacc gctgcaatct tggaagcaga     3300

gggagaaagg tgtttgagct tgtgggtgag ccctccatat actgcaccag caatgacgat     3360

caagtgggca tctggagcgg ccccgcccct cagtgcatta tacctaacaa atgcacgcct     3420

ccaaatgtgg aaaatggaat attggtatct gacaacagaa gcttattttc cttaaatgaa     3480

gttgtggagt ttaggtgtca gcctggcttt gtcatgaaag gaccccgccg tgtgaagtgc     3540

caggccctga acaaatggga gccagagtta ccaagctgct ccagggtgtg tcagccgcct     3600

ccagaaatcc tgcatggtga gcatacccca agccatcagg acaacttttc acctgggcag     3660

gaagtgttct acagctgtga gcctggctat gacctcagag gggctgcgtc tctgcactgc     3720

acaccccagg gagactggag ccctgaagcc ccgagatgtg cagtgaaatc ctgtgatgac     3780

ttcttgggtc aactccctca tggccgtgtg ctatttccac ttaatctcca gcttggggca     3840

aaggtgtcct ttgtctgtga tgaagggttt cgcttaaagg gcagttccgt tagtcattgt     3900

gtcttggttg gaatgagaag cctttggaat aacagtgttc ctgtgtgtga acatatcttt     3960

tgtccaaatc ctccagctat ccttaatggg agacacacag gaactccctc tggagatatt     4020

ccctatggaa aagaaatatc ttacacatgt gacccccacc cagacagagg gatgaccttc     4080

aacctcattg gggagagcac catccgctgc acaagtgacc ctcatgggaa tggggtttgg     4140

agcagccctg cccctcgctg tgaactttct gttcgtgctg gtcactgtaa aaccccagag     4200

cagtttccat ttgccagtcc tacgatccca attaatgact ttgagtttcc agtcgggaca     4260

tctttgaatt atgaatgccg tcctgggtat tttgggaaaa tgttctctat ctcctgccta     4320

gaaaacttgg tctggtcaag tgttgaagac aactgtagac gaaaatcatg tggacctcca     4380

ccagaaccct tcaatggaat ggtgcatata aacacagata cacagtttgg atcaacagtt     4440

aattattctt gtaatgaagg gtttcgactc attggttccc catctactac ttgtctcgtc     4500

tcaggcaata atgtcacatg ggataagaag gcacctattt gtgagatcat atcttgtgag     4560

ccacctccaa ccatatccaa tggagacttc tacagcaaca atagaacatc ttttcacaat     4620

ggaacggtgg taacttacca gtgccacact ggaccagatg gagaacagct gtttgagctt     4680

gtgggagaac ggtcaatata ttgcaccagc aaagatgatc aagttggtgt ttggagcagc     4740

cctccccctc ggtgtatttc tactaataaa tgcacagctc cagaagttga aaatgcaatt     4800

agagtaccag gaaacaggag tttctttacc ctcactgaga tcatcagatt tagatgtcag     4860

cccgggtttg tcatggtagg gtcccacact gtgcagtgcc agaccaatgg cagatggggg     4920

cccaagctgc cacactgctc cagggtgtgt cagccgcctc cagaaatcct gcatggtgag     4980

cataccctaa gccatcagga caacttttca cctgggcagg aagtgttcta cagctgtgag     5040

cccagctatg acctcagagg ggctgcgtct ctgcactgca cgccccaggg agactggagc     5100

cctgaagccc ctagatgtac agtgaaatcc tgtgatgact tcctgggcca actccctcat     5160

ggccgtgtgc tacttccact taatctccag cttggggcaa aggtgtcctt tgtttgcgat     5220

gaagggttcc gattaaaagg caggtctgct agtcattgtg tcttggctgg aatgaaagcc     5280

ctttggaata gcagtgttcc agtgtgtgaa caaatctttt gtccaaatcc tccagctatc     5340

cttaatggga gacacacagg aactcccttt ggagatattc cctatggaaa agaaatatct     5400

tacgcatgcg acacccaccc agacagaggg atgaccttca acctcattgg ggagagctcc     5460

atccgctgca caagtgaccc tcaagggaat ggggtttgga gcagccctgc ccctcgctgt     5520

gaactttctg ttcctgctgc ctgcccacat ccacccaaga tccaaaacgg gcattacatt     5580

ggaggacacg tatctctata tcttcctggg atgacaatca gctacatttg tgaccccggc     5640

tacctgttag tgggaaaggg cttcattttc tgtacagacc agggaatctg gagccaattg     5700

gatcattatt gcaaagaagt aaattgtagc ttcccactgt ttatgaatgg aatctcgaag     5760

gagttagaaa tgaaaaaagt atatcactat ggagattatg tgactttgaa gtgtgaagat     5820

gggtatactc tggaaggcag tccctggagc cagtgccagg cggatgacag atgggaccct     5880

cctctggcca aatgtacctc tcgtacacat gatgctctca tagttggcac tttatctggt     5940

acgatcttct ttattttact catcattttc ctctcttgga taattctaaa gcacagaaaa     6000

ggcaataatg cacatgaaaa ccctaaagaa gtggctatcc atttacattc tcaaggaggc     6060

agcagcgttc atccccgaac tctgcaaaca aatgaagaaa atagcagggt ccttccttga     6120


<210>  26
<211>  392
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  example Membrane Cofactor Protein (MCP) sequence

<400>  26

Met Glu Pro Pro Gly Arg Arg Glu Cys Pro Phe Pro Ser Trp Arg Phe 
1               5                   10                  15      


Pro Gly Leu Leu Leu Ala Ala Met Val Leu Leu Leu Tyr Ser Phe Ser 
            20                  25                  30          


Asp Ala Cys Glu Glu Pro Pro Thr Phe Glu Ala Met Glu Leu Ile Gly 
        35                  40                  45              


Lys Pro Lys Pro Tyr Tyr Glu Ile Gly Glu Arg Val Asp Tyr Lys Cys 
    50                  55                  60                  


Lys Lys Gly Tyr Phe Tyr Ile Pro Pro Leu Ala Thr His Thr Ile Cys 
65                  70                  75                  80  


Asp Arg Asn His Thr Trp Leu Pro Val Ser Asp Asp Ala Cys Tyr Arg 
                85                  90                  95      


Glu Thr Cys Pro Tyr Ile Arg Asp Pro Leu Asn Gly Gln Ala Val Pro 
            100                 105                 110         


Ala Asn Gly Thr Tyr Glu Phe Gly Tyr Gln Met His Phe Ile Cys Asn 
        115                 120                 125             


Glu Gly Tyr Tyr Leu Ile Gly Glu Glu Ile Leu Tyr Cys Glu Leu Lys 
    130                 135                 140                 


Gly Ser Val Ala Ile Trp Ser Gly Lys Pro Pro Ile Cys Glu Lys Val 
145                 150                 155                 160 


Leu Cys Thr Pro Pro Pro Lys Ile Lys Asn Gly Lys His Thr Phe Ser 
                165                 170                 175     


Glu Val Glu Val Phe Glu Tyr Leu Asp Ala Val Thr Tyr Ser Cys Asp 
            180                 185                 190         


Pro Ala Pro Gly Pro Asp Pro Phe Ser Leu Ile Gly Glu Ser Thr Ile 
        195                 200                 205             


Tyr Cys Gly Asp Asn Ser Val Trp Ser Arg Ala Ala Pro Glu Cys Lys 
    210                 215                 220                 


Val Val Lys Cys Arg Phe Pro Val Val Glu Asn Gly Lys Gln Ile Ser 
225                 230                 235                 240 


Gly Phe Gly Lys Lys Phe Tyr Tyr Lys Ala Thr Val Met Phe Glu Cys 
                245                 250                 255     


Asp Lys Gly Phe Tyr Leu Asp Gly Ser Asp Thr Ile Val Cys Asp Ser 
            260                 265                 270         


Asn Ser Thr Trp Asp Pro Pro Val Pro Lys Cys Leu Lys Val Leu Pro 
        275                 280                 285             


Pro Ser Ser Thr Lys Pro Pro Ala Leu Ser His Ser Val Ser Thr Ser 
    290                 295                 300                 


Ser Thr Thr Lys Ser Pro Ala Ser Ser Ala Ser Gly Pro Arg Pro Thr 
305                 310                 315                 320 


Tyr Lys Pro Pro Val Ser Asn Tyr Pro Gly Tyr Pro Lys Pro Glu Glu 
                325                 330                 335     


Gly Ile Leu Asp Ser Leu Asp Val Trp Val Ile Ala Val Ile Val Ile 
            340                 345                 350         


Ala Ile Val Val Gly Val Ala Val Ile Cys Val Val Pro Tyr Arg Tyr 
        355                 360                 365             


Leu Gln Arg Arg Lys Lys Lys Gly Thr Tyr Leu Thr Asp Glu Thr His 
    370                 375                 380                 


Arg Glu Val Lys Phe Thr Ser Leu 
385                 390         


<210>  27
<211>  1179
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  nucleotide sequence encoding MCP

<400>  27
atggagcctc ccggccgccg cgagtgtccc tttccttcct ggcgctttcc tgggttgctt       60

ctggcggcca tggtgttgct gctgtactcc ttctccgatg cctgtgagga gccaccaaca      120

tttgaagcta tggagctcat tggtaaacca aaaccctact atgagattgg tgaacgagta      180

gattataagt gtaaaaaagg atacttctat atacctcctc ttgccaccca tactatttgt      240

gatcggaatc atacatggct acctgtctca gatgacgcct gttatagaga aacatgtcca      300

tatatacggg atcctttaaa tggccaagca gtccctgcaa atgggactta cgagtttggt      360

tatcagatgc actttatttg taatgagggt tattacttaa ttggtgaaga aattctatat      420

tgtgaactta aaggatcagt agcaatttgg agcggtaagc ccccaatatg tgaaaaggtt      480

ttgtgtacac cacctccaaa aataaaaaat ggaaaacaca cctttagtga agtagaagta      540

tttgagtatc ttgatgcagt aacttatagt tgtgatcctg cacctggacc agatccattt      600

tcacttattg gagagagcac gatttattgt ggtgacaatt cagtgtggag tcgtgctgct      660

ccagagtgta aagtggtcaa atgtcgattt ccagtagtcg aaaatggaaa acagatatca      720

ggatttggaa aaaaatttta ctacaaagca acagttatgt ttgaatgcga taagggtttt      780

tacctcgatg gcagcgacac aattgtctgt gacagtaaca gtacttggga tcccccagtt      840

ccaaagtgtc ttaaagtgct gcctccatct agtacaaaac ctccagcttt gagtcattca      900

gtgtcgactt cttccactac aaaatctcca gcgtccagtg cctcaggtcc taggcctact      960

tacaagcctc cagtctcaaa ttatccagga tatcctaaac ctgaggaagg aatacttgac     1020

agtttggatg tttgggtcat tgctgtgatt gttattgcca tagttgttgg agttgcagta     1080

atttgtgttg tcccgtacag atatcttcaa aggaggaaga agaaaggcac atacctaact     1140

gatgagaccc acagagaagt aaaatttact tctctctga                            1179


