<110>  Nanjing GenScript Biotech Co., Ltd.
<120>  A codon optimization method based on an immune algorithm
<160>  6


<210>  1
<211>  430
<212>  PRT
<213>  Artificial Sequence
<220>
<223>  JNK3 protein sequence
<400>  1
Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val 
1               5                   10                  15      
Lys Ile Ala Phe Cys Gln Gly Phe Asp Lys Gln Val Asp Val Ser Tyr 
            20                  25                  30          
Ile Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gln Phe 
        35                  40                  45              
Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr 
    50                  55                  60                  
Gln Asn Leu Lys Pro Ile Gly Ser Gly Ala Gln Gly Ile Val Cys Ala 
65                  70                  75                  80  
Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala Ile Lys Lys Leu Ser 
                85                  90                  95      
Arg Pro Phe Gln Asn Gln Thr His Ala Lys Arg Ala Tyr Arg Glu Leu 
            100                 105                 110         
Val Leu Met Lys Cys Val Asn His Lys Asn Ile Ile Ser Leu Leu Asn 
        115                 120                 125             
Val Phe Thr Pro Gln Lys Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu 
    130                 135                 140                 
Val Met Glu Leu Met Asp Ala Asn Leu Cys Gln Val Ile Gln Met Glu 
145                 150                 155                 160 
Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly 
                165                 170                 175     
Ile Lys His Leu His Ser Ala Gly Ile Ile His Arg Asp Leu Lys Pro 
            180                 185                 190         
Ser Asn Ile Val Val Lys Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe 
        195                 200                 205             
Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val 
    210                 215                 220                 
Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val Ile Leu Gly Met Gly Tyr 
225                 230                 235                 240 
Lys Glu Asn Val Asp Ile Trp Ser Val Gly Cys Ile Met Gly Glu Met 
                245                 250                 255     
Val Arg His Lys Ile Leu Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp 
            260                 265                 270         
Asn Lys Val Ile Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe Met Lys 
        275                 280                 285             
Lys Leu Gln Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr 
    290                 295                 300                 
Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala 
305                 310                 315                 320 
Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu 
                325                 330                 335     
Ser Lys Met Leu Val Ile Asp Pro Ala Lys Arg Ile Ser Val Asp Asp 
            340                 345                 350         
Ala Leu Gln His Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala Glu Val 
        355                 360                 365             
Glu Ala Pro Pro Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu 
    370                 375                 380                 
His Thr Ile Glu Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val Met Asn 
385                 390                 395                 400 
Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Gly Gln Pro Ser Pro 
                405                 410                 415     
Ser Ala Gln Val Gln Gln Asp Tyr Lys Asp Asp Asp Asp Lys 
            420                 425                 430 


<210>  2
<211>  246
<212>  PRT
<213>  Artificial Sequence
<220>
<223>  GFP protein sequence
<400>  2
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1               5                   10                  15      
Glu Leu Asp Gly Asp Val Asn Gly Gln Lys Phe Ser Val Ser Gly Glu 
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
    50                  55                  60                  
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
                85                  90                  95      
Thr Ile Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Met Glu Tyr Asn 
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Pro Lys Asn Gly 
145                 150                 155                 160 
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Lys Asp Gly Ser Val 
                165                 170                 175     
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
            180                 185                 190         
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
        195                 200                 205             
Lys Asp Pro Asn Glu Lys Arg Asp His Met Ile Leu Leu Glu Phe Val 
    210                 215                 220                 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys Asp Tyr 
225                 230                 235                 240 
Lys Asp Asp Asp Asp Lys 
                245     


<210>  3
<211>  1290
<212>  DNA
<213>  Artificial Sequence
<220>
<223>  JNK3 protein coding sequence before optimization
<400>  3
atgagcctcc atttcttata ctactgcagt gaaccaacat tggatgtgaa aattgccttt     60

tgtcagggat tcgataaaca agtggatgtg tcatatattg ccaaacatta caacatgagc    120

aaaagcaaag ttgacaacca gttctacagt gtggaagtgg gagactcaac cttcacagtt    180

ctcaagcgct accagaatct aaagcctatt ggctctgggg ctcagggcat agtttgtgcc    240

gcgtatgatg ctgtccttga cagaaatgtg gccattaaga agctcagcag accctttcag    300

aaccaaacac atgccaagag agcgtaccgg gagctggtcc tcatgaagtg tgtgaaccat    360

aaaaacatta ttagtttatt aaatgtcttc acaccccaga aaacgctgga ggagttccaa    420

gatgtttact tagtaatgga actgatggat gccaacttat gtcaagtgat tcagatggaa    480

ttagaccatg agcgaatgtc ttacctgctg taccaaatgt tgtgtggcat taagcacctc    540

cattctgctg gaattattca cagggattta aaaccaagta acattgtagt caagtctgat    600

tgcacattga aaatcctgga ctttggactg gccaggacag caggcacaag cttcatgatg    660

actccatatg tggtgacacg ttattacaga gcccctgagg tcatcctggg gatgggctac    720

aaggagaacg tggatatatg gtctgtggga tgcattatgg gagaaatggt tcgccacaaa    780

atcctctttc caggaaggga ctatattgac cagtggaata aggtaattga acaactagga    840

acaccatgtc cagaattcat gaagaaattg caacccacag taagaaacta tgtggagaat    900

cggcccaagt atgcgggact caccttcccc aaactcttcc cagattccct cttcccagcg    960

gactccgagc acaataaact caaagccagc caagccaggg acttgttgtc aaagatgcta   1020

gtgattgacc cagcaaaaag aatatcagtg gacgacgcct tacagcatcc ctacatcaac   1080

gtctggtatg acccagccga agtggaggcg cctccacctc agatatatga caagcagttg   1140

gatgaaagag aacacacaat tgaagaatgg aaagaactta tctacaagga agtaatgaat   1200

tcagaagaaa agactaaaaa tggtgtagta aaaggacagc cttctccttc agcacaggtg   1260

cagcaggact acaaggatga tgatgacaaa                                    1290


<210>  4
<211>  738
<212>  DNA
<213>  Artificial Sequence
<220>
<223>  GFP protein coding sequence before optimization
<400>  4
atgagtaaag gagaagaact tttcactgga gttgtcccaa ttcttgttga attagatggc     60

gatgttaatg ggcaaaaatt ctctgtcagt ggagagggtg aaggtgatgc aacatacgga    120

aaacttaccc ttaaatttat ttgcactact gggaagctac ctgttccatg gccaacactt    180

gtcactactt tctcttatgg tgttcaatgc ttttcaagat acccagatca tatgaaacag    240

catgactttt tcaagagtgc catgcccgaa ggttatgtac aggaaagaac tatattttac    300

aaagatgacg ggaactacaa gacacgtgct gaagtcaagt ttgaaggtga tacccttgtt    360

aatagaatcg agttaaaagg tattgatttt aaagaagatg gaaacattct tggacacaaa    420

atggaataca actataactc acataatgta tacatcatgg cagacaaacc aaagaatgga    480

atcaaagtta acttcaaaat tagacacaac attaaagatg gaagcgttca attagcagac    540

cattatcaac aaaatactcc aattggcgat ggccctgtcc ttttaccaga caaccattac    600

ctgtccacac aatctgccct ttccaaagat cccaacgaaa agagagatca catgatcctt    660

cttgagtttg taacagctgc tgggattaca catggcatgg atgaactata caaagactac    720

aaagatgatg atgacaag                                                  738


<210>  5
<211>  1290
<212>  DNA
<213>  Artificial Sequence
<220>
<223>  JNK3 protein coding sequence after optimization
<400>  5
atgtctctgc acttcctgta ctactgttct gagcccaccc tggacgtgaa gattgccttc     60

tgccagggct ttgacaagca ggtggatgtg agctacatcg ccaagcacta caacatgtcc    120

aagagcaagg tggacaacca gttctacagc gtggaggtgg gagacagcac cttcacagtg    180

ctgaagagat accagaacct gaagccaatt ggctctggag cccagggcat tgtgtgtgct    240

gcctatgatg ctgtgctgga cagaaatgtg gccatcaaga agctgagcag acccttccag    300

aaccagacac atgccaagag agcctacaga gagctggtgc tgatgaagtg tgtgaaccac    360

aagaacatca tcagcctgct gaatgtgttc acccctcaga agacactgga ggagttccag    420

gatgtgtacc tggtgatgga gctcatggat gccaacctgt gccaggtgat ccagatggag    480

ctggaccatg agaggatgag ctacctgctg taccagatgc tgtgtggcat caagcacctg    540

cacagtgctg gaatcatcca cagagacctg aagccaagca acattgtggt gaagtctgac    600

tgtacactga agatcctgga ctttggactg gccagaacag ccggcacatc ttttatgatg    660

acaccatacg tggtgacaag atactacaga gcccctgagg tgatcctggg catgggctac    720

aaggagaacg tggacatctg gtctgtgggc tgcatcatgg gagagatggt gagacacaag    780

atcctgtttc ctggaagaga ctacattgac cagtggaaca aggtgattga gcagctgggc    840

accccttgtc ctgagttcat gaagaagctg cagccaactg tgaggaacta tgtggagaac    900

agaccaaagt atgctggcct gaccttcccc aagctcttcc ctgacagcct gtttcctgct    960

gattctgagc acaacaagct gaaggccagc caggccagag acctgctgag caagatgctg   1020

gtgattgatc ctgccaagag aatctctgtg gatgatgccc tgcagcaccc ctacatcaat   1080

gtgtggtacg acccagctga ggtggaggcc ccacctccac agatctatga caagcagctg   1140

gatgagagag agcacacaat tgaagagtgg aaggagctga tctacaaaga agtgatgaac   1200

tctgaggaga agaccaagaa tggagtggtg aagggccagc cctctccaag cgcccaggtg   1260

cagcaggact acaaggatga tgatgacaaa                                    1290


<210>  6
<211>  738
<212>  DNA
<213>  Artificial Sequence
<220>
<223>  GFP protein coding sequence after optimization
<400>  6
atgagcaagg gagaggaact gttcacagga gtggtgccca tcctggtgga gctggatgga     60

gatgtgaatg gccagaagtt ttctgtgtct ggggaaggag aaggcgatgc cacctatggc    120

aagctgacac tgaagttcat ctgcaccaca gggaagctgc ctgtgccctg gccaacactg    180

gtgaccacct tctcctatgg agtccagtgc ttcagcagat acccagacca catgaagcag    240

catgacttct tcaagagtgc catgcctgag ggctatgtgc aggagagaac catcttctat    300

aaggatgatg gaaactacaa gacaagagct gaggtgaagt ttgagggaga caccctggtg    360

aacagaattg agctgaaggg cattgacttc aaggaggatg gcaacatcct gggccacaag    420

atggagtaca attacaacag ccacaatgtg tacatcatgg ctgataagcc aaagaatgga    480

atcaaggtga acttcaagat tagacacaac atcaaagacg gatctgtgca gctggctgac    540

cattaccagc agaacacacc cattggagat ggcccagtgc tgctgcccga caaccactac    600

ctgagcacac agtctgccct gagtaaggac cctaatgaga agagggacca catgattctg    660

ctggagtttg tgacagctgc tggcatcacc catggcatgg atgagctgta caaggactac    720

aaagatgatg atgacaag                                                  738



