               SEQUENCE LISTING

<110> Technische Unversität Dresden

<120> induced photoreceptor cells

<130> 2077/19WO

<150> EP19170478.2
<151> 2019-04-30
<150> EP19160600.3
<151> 2019-03-04

<160> 13

<170> BiSSAP 1.3.6

<210> 1
<211> 1098
<212> PRT
<213> Homo sapiens


<220> 
<223> human GON4L protein GenBank: AAI 17558.1

<400> 1
Met Tyr Pro Glu Leu Leu Pro Val Cys Ser Leu Lys Ala Lys Asn Pro 
1               5                   10                  15      
Gln Asp Lys Ile Val Phe Thr Lys Ala Glu Asp Asn Leu Leu Ala Leu 
            20                  25                  30          
Gly Leu Lys His Phe Glu Gly Thr Glu Phe Pro Asn Pro Leu Ile Ser 
        35                  40                  45              
Lys Tyr Leu Leu Thr Cys Lys Thr Ala His Gln Leu Thr Val Arg Ile 
    50                  55                  60                  
Lys Asn Leu Asn Met Asn Arg Ala Pro Asp Asn Ile Ile Lys Phe Tyr 
65                  70                  75                  80  
Lys Lys Thr Lys Gln Leu Pro Val Leu Gly Lys Cys Cys Glu Glu Ile 
                85                  90                  95      
Gln Pro His Gln Trp Lys Pro Pro Ile Glu Arg Glu Glu His Arg Leu 
            100                 105                 110         
Pro Phe Trp Leu Lys Ala Ser Leu Pro Ser Ile Gln Glu Glu Leu Arg 
        115                 120                 125             
His Met Ala Asp Gly Ala Arg Glu Val Gly Asn Met Thr Gly Thr Thr 
    130                 135                 140                 
Glu Ile Asn Ser Asp Arg Ser Leu Glu Lys Asp Asn Leu Glu Leu Gly 
145                 150                 155                 160 
Ser Glu Ser Arg Tyr Pro Leu Leu Leu Pro Lys Gly Val Val Leu Lys 
                165                 170                 175     
Leu Lys Pro Val Ala Thr Arg Phe Pro Arg Lys Ala Trp Arg Gln Lys 
            180                 185                 190         
Arg Ser Ser Val Leu Lys Pro Leu Leu Ile Gln Pro Ser Pro Ser Leu 
        195                 200                 205             
Gln Pro Ser Phe Asn Pro Gly Lys Thr Pro Ala Arg Ser Thr His Ser 
    210                 215                 220                 
Glu Ala Pro Pro Ser Lys Met Val Leu Arg Ile Pro His Pro Ile Gln 
225                 230                 235                 240 
Pro Ala Thr Val Leu Gln Thr Val Pro Gly Val Pro Pro Leu Gly Val 
                245                 250                 255     
Ser Gly Gly Glu Ser Phe Glu Ser Pro Ala Ala Leu Pro Ala Val Pro 
            260                 265                 270         
Pro Glu Ala Arg Thr Ser Phe Pro Leu Ser Glu Ser Gln Thr Leu Leu 
        275                 280                 285             
Ser Ser Ala Pro Val Pro Lys Val Met Leu Pro Ser Leu Ala Pro Ser 
    290                 295                 300                 
Lys Phe Arg Lys Pro Tyr Val Arg Arg Arg Pro Ser Lys Arg Arg Gly 
305                 310                 315                 320 
Val Lys Ala Ser Pro Cys Met Lys Pro Ala Pro Val Ile His His Pro 
                325                 330                 335     
Ala Ser Val Ile Phe Thr Val Pro Ala Thr Thr Val Lys Ile Val Ser 
            340                 345                 350         
Leu Gly Gly Gly Cys Asn Met Ile Gln Pro Val Asn Ala Ala Val Ala 
        355                 360                 365             
Gln Ser Pro Gln Thr Ile Pro Ile Thr Thr Leu Leu Val Asn Pro Thr 
    370                 375                 380                 
Ser Phe Pro Cys Pro Leu Asn Gln Ser Leu Val Ala Ser Ser Val Ser 
385                 390                 395                 400 
Pro Leu Ile Val Ser Gly Asn Ser Val Asn Leu Pro Ile Pro Ser Thr 
                405                 410                 415     
Pro Glu Asp Lys Ala His Val Asn Val Asp Ile Ala Cys Ala Val Ala 
            420                 425                 430         
Asp Gly Glu Asn Ala Phe Gln Gly Leu Glu Pro Lys Leu Glu Pro Gln 
        435                 440                 445             
Glu Leu Ser Pro Leu Ser Ala Thr Val Phe Pro Lys Val Glu His Ser 
    450                 455                 460                 
Pro Gly Pro Pro Leu Ala Asp Ala Glu Cys Gln Glu Gly Leu Ser Glu 
465                 470                 475                 480 
Asn Ser Ala Cys Arg Trp Thr Val Val Lys Thr Glu Glu Gly Arg Gln 
                485                 490                 495     
Ala Leu Glu Pro Leu Pro Gln Gly Ile Gln Glu Ser Leu Asn Asn Pro 
            500                 505                 510         
Thr Pro Gly Asp Leu Glu Glu Ile Val Lys Met Glu Pro Glu Glu Ala 
        515                 520                 525             
Arg Glu Glu Ile Ser Gly Ser Pro Glu Arg Asp Ile Cys Asp Asp Ile 
    530                 535                 540                 
Lys Val Glu His Ala Val Glu Leu Asp Thr Gly Ala Pro Ser Glu Glu 
545                 550                 555                 560 
Leu Ser Ser Ala Gly Glu Val Thr Lys Gln Thr Val Leu Gln Lys Glu 
                565                 570                 575     
Glu Glu Arg Ser Gln Pro Thr Lys Thr Pro Ser Ser Ser Gln Glu Pro 
            580                 585                 590         
Pro Asp Glu Gly Thr Ser Gly Thr Asp Val Asn Lys Gly Ser Ser Lys 
        595                 600                 605             
Asn Ala Leu Ser Ser Val Asp Pro Glu Val Arg Leu Ser Ser Pro Pro 
    610                 615                 620                 
Gly Lys Pro Glu Asp Ser Ser Ser Val Asp Gly Gln Ser Val Gly Thr 
625                 630                 635                 640 
Pro Val Gly Pro Glu Thr Gly Gly Glu Lys Asn Gly Pro Glu Glu Glu 
                645                 650                 655     
Glu Glu Glu Asp Phe Asp Asp Leu Thr Gln Asp Glu Glu Asp Glu Met 
            660                 665                 670         
Ser Ser Ala Ser Glu Glu Ser Val Leu Ser Val Pro Glu Leu Gln Glu 
        675                 680                 685             
Thr Met Glu Lys Leu Thr Trp Leu Ala Ser Glu Arg Arg Met Ser Gln 
    690                 695                 700                 
Glu Gly Glu Ser Glu Glu Glu Asn Ser Gln Glu Glu Asn Ser Glu Pro 
705                 710                 715                 720 
Glu Glu Glu Glu Glu Glu Glu Ala Glu Gly Met Glu Ser Leu Gln Lys 
                725                 730                 735     
Glu Asp Glu Met Thr Asp Glu Ala Val Gly Asp Ser Ala Glu Lys Pro 
            740                 745                 750         
Pro Thr Phe Ala Ser Pro Glu Thr Ala Pro Glu Val Glu Thr Ser Arg 
        755                 760                 765             
Thr Pro Pro Gly Glu Ser Ile Lys Ala Ala Gly Lys Gly Arg Asn Asn 
    770                 775                 780                 
His Arg Ala Arg Asn Lys Arg Gly Ser Arg Ala Arg Ala Ser Lys Asp 
785                 790                 795                 800 
Thr Ser Lys Leu Leu Leu Leu Tyr Asp Glu Asp Ile Leu Glu Arg Asp 
                805                 810                 815     
Pro Leu Arg Glu Gln Lys Asp Leu Ala Phe Ala Gln Ala Tyr Leu Thr 
            820                 825                 830         
Arg Val Arg Glu Ala Leu Gln His Ile Pro Gly Lys Tyr Glu Asp Phe 
        835                 840                 845             
Leu Gln Val Ile Tyr Glu Phe Glu Ser Ser Thr Gln Arg Arg Thr Ala 
    850                 855                 860                 
Val Asp Leu Tyr Lys Ser Leu Gln Ile Leu Leu Gln Asp Trp Pro Gln 
865                 870                 875                 880 
Leu Leu Lys Asp Phe Ala Ala Phe Leu Leu Pro Glu Gln Ala Leu Ala 
                885                 890                 895     
Cys Gly Leu Phe Glu Glu Gln Gln Ala Phe Glu Lys Ser Arg Lys Phe 
            900                 905                 910         
Leu Arg Gln Leu Glu Ile Cys Phe Ala Glu Asn Pro Ser His His Gln 
        915                 920                 925             
Lys Ile Ile Lys Val Leu Gln Gly Cys Ala Asp Cys Leu Pro Gln Glu 
    930                 935                 940                 
Ile Thr Glu Leu Lys Thr Gln Met Trp Gln Leu Leu Lys Gly His Asp 
945                 950                 955                 960 
His Leu Gln Asp Glu Phe Ser Ile Phe Phe Asp His Leu Arg Pro Ala 
                965                 970                 975     
Ala Ser Arg Met Gly Asp Phe Glu Glu Ile Asn Trp Thr Glu Glu Lys 
            980                 985                 990         
Glu Tyr Glu Phe Asp Gly Phe Glu Glu Val Ala Leu Pro Asp Val Glu 
        995                 1000                1005            
Glu Glu Glu Glu Pro Pro Lys Ile Pro Thr Ala Ser Lys Asn Lys Arg 
    1010                1015                1020                
Lys Lys Glu Ile Gly Val Gln Asn His Asp Lys Glu Thr Glu Trp Pro 
1025                1030                1035                1040
Asp Gly Ala Lys Asp Cys Ala Cys Ser Cys His Glu Gly Gly Pro Asp 
                1045                1050                1055    
Ser Lys Leu Lys Lys Ser Lys Arg Arg Ser Cys Ser His Cys Ser Ser 
            1060                1065                1070        
Lys Val Arg Lys Val Ser Arg Val Pro Arg Val Ser Glu Leu Leu Gly 
        1075                1080                1085            
Asp Cys Leu Leu Pro Arg Ile Val Pro Tyr 
    1090                1095            

<210> 2
<211> 2241
<212> PRT
<213> Homo sapiens


<220> 
<223> human GON4L isoform A, GenBank: AAR01260.1

<400> 2
Met Leu Pro Cys Lys Lys Arg Arg Thr Thr Val Thr Glu Ser Leu Gln 
1               5                   10                  15      
His Lys Gly Asn Gln Glu Glu Asn Asn Val Asp Leu Glu Ser Ala Val 
            20                  25                  30          
Lys Pro Glu Ser Asp Gln Val Lys Asp Leu Ser Ser Val Ser Leu Ser 
        35                  40                  45              
Trp Asp Pro Ser His Gly Arg Val Ala Gly Phe Glu Val Gln Ser Leu 
    50                  55                  60                  
Gln Asp Ala Gly Asn Gln Leu Gly Met Glu Asp Thr Ser Leu Ser Ser 
65                  70                  75                  80  
Gly Met Leu Thr Gln Asn Thr Asn Val Pro Ile Leu Glu Gly Val Asp 
                85                  90                  95      
Val Ala Ile Ser Gln Gly Ile Thr Leu Pro Ser Leu Glu Ser Phe His 
            100                 105                 110         
Pro Leu Asn Ile His Ile Gly Lys Gly Lys Leu His Ala Thr Gly Ser 
        115                 120                 125             
Lys Arg Gly Lys Lys Met Thr Leu Arg Pro Gly Pro Val Thr Gln Glu 
    130                 135                 140                 
Asp Arg Cys Asp His Leu Thr Leu Lys Glu Pro Phe Ser Gly Glu Pro 
145                 150                 155                 160 
Ser Glu Glu Val Lys Glu Glu Gly Gly Lys Pro Gln Met Asn Ser Glu 
                165                 170                 175     
Gly Glu Ile Pro Ser Leu Pro Ser Gly Ser Gln Ser Ala Lys Pro Val 
            180                 185                 190         
Ser Gln Pro Arg Lys Ser Thr Gln Pro Asp Val Cys Ala Ser Pro Gln 
        195                 200                 205             
Glu Lys Pro Leu Arg Thr Leu Phe His Gln Pro Glu Glu Glu Ile Glu 
    210                 215                 220                 
Asp Gly Gly Leu Phe Ile Pro Met Glu Glu Gln Asp Asn Glu Glu Ser 
225                 230                 235                 240 
Glu Lys Arg Arg Lys Lys Lys Lys Gly Thr Lys Arg Lys Arg Asp Gly 
                245                 250                 255     
Arg Gly Gln Glu Gly Thr Leu Ala Tyr Asp Leu Lys Leu Asp Asp Met 
            260                 265                 270         
Leu Asp Arg Thr Leu Glu Asp Gly Ala Lys Gln His Asn Leu Thr Ala 
        275                 280                 285             
Val Asn Val Arg Asn Ile Leu His Glu Val Ile Thr Asn Glu His Val 
    290                 295                 300                 
Val Ala Met Met Lys Ala Ala Ile Ser Glu Thr Glu Asp Met Pro Met 
305                 310                 315                 320 
Phe Glu Pro Lys Met Thr Arg Ser Lys Leu Lys Glu Val Val Glu Lys 
                325                 330                 335     
Gly Val Val Ile Pro Thr Trp Asn Ile Ser Pro Ile Lys Lys Ala Asn 
            340                 345                 350         
Glu Ile Lys Pro Pro Gln Phe Val Asp Ile His Leu Glu Glu Asp Asp 
        355                 360                 365             
Ser Ser Asp Glu Glu Tyr Gln Pro Asp Asp Glu Glu Glu Asp Glu Thr 
    370                 375                 380                 
Ala Glu Glu Ser Leu Leu Glu Ser Asp Val Glu Ser Thr Ala Ser Ser 
385                 390                 395                 400 
Pro Arg Gly Ala Lys Lys Ser Arg Leu Arg Gln Ser Ser Glu Met Thr 
                405                 410                 415     
Glu Thr Asp Glu Glu Ser Gly Ile Leu Ser Glu Ala Glu Lys Val Thr 
            420                 425                 430         
Thr Pro Ala Ile Arg His Ile Ser Ala Glu Val Val Pro Met Gly Pro 
        435                 440                 445             
Pro Pro Pro Pro Lys Pro Lys Gln Thr Arg Asp Ser Thr Phe Met Glu 
    450                 455                 460                 
Lys Leu His Ala Val Asp Glu Glu Leu Ala Ser Ser Pro Val Cys Met 
465                 470                 475                 480 
Asp Ser Phe Gln Pro Met Asp Asp Ser Leu Ile Ala Phe Arg Thr Arg 
                485                 490                 495     
Ser Lys Met Pro Leu Lys Asp Val Pro Leu Gly Gln Leu Glu Ala Glu 
            500                 505                 510         
Leu Gln Ala Pro Asp Ile Thr Pro Asp Met Tyr Asp Pro Asn Thr Ala 
        515                 520                 525             
Asp Asp Glu Asp Trp Lys Met Trp Leu Gly Gly Leu Met Asn Asp Asp 
    530                 535                 540                 
Val Gly Asn Glu Asp Glu Ala Asp Asp Asp Asp Asp Pro Glu Tyr Asn 
545                 550                 555                 560 
Phe Leu Glu Asp Leu Asp Glu Pro Asp Thr Glu Asp Phe Arg Thr Asp 
                565                 570                 575     
Arg Ala Val Arg Ile Thr Lys Lys Glu Val Asn Glu Leu Met Glu Glu 
            580                 585                 590         
Leu Phe Glu Thr Phe Gln Asp Glu Met Gly Phe Ser Asn Met Glu Asp 
        595                 600                 605             
Asp Gly Pro Glu Glu Glu Glu Cys Val Ala Glu Pro Arg Pro Asn Phe 
    610                 615                 620                 
Asn Thr Pro Gln Ala Leu Arg Phe Glu Glu Pro Leu Ala Asn Leu Leu 
625                 630                 635                 640 
Asn Glu Gln His Arg Thr Val Lys Glu Leu Phe Glu Gln Leu Lys Met 
                645                 650                 655     
Lys Lys Ser Ser Ala Lys Gln Leu Gln Glu Val Glu Lys Val Lys Pro 
            660                 665                 670         
Gln Ser Glu Lys Val His Gln Thr Leu Ile Leu Asp Pro Ala Gln Arg 
        675                 680                 685             
Lys Arg Leu Gln Gln Gln Met Gln Gln His Val Gln Leu Leu Thr Gln 
    690                 695                 700                 
Ile His Leu Leu Ala Thr Cys Asn Pro Asn Leu Asn Pro Glu Ala Thr 
705                 710                 715                 720 
Thr Thr Arg Ile Phe Leu Lys Glu Leu Gly Thr Phe Ala Gln Ser Ser 
                725                 730                 735     
Ile Ala Leu His His Gln Tyr Asn Pro Lys Phe Gln Thr Leu Phe Gln 
            740                 745                 750         
Pro Cys Asn Leu Met Gly Ala Met Gln Leu Ile Glu Asp Phe Ser Thr 
        755                 760                 765             
His Val Ser Ile Asp Cys Ser Pro His Lys Thr Val Lys Lys Thr Ala 
    770                 775                 780                 
Asn Glu Phe Pro Cys Leu Pro Lys Gln Val Ala Trp Ile Leu Ala Thr 
785                 790                 795                 800 
Ser Lys Val Phe Met Tyr Pro Glu Leu Leu Pro Val Cys Ser Leu Lys 
                805                 810                 815     
Ala Lys Asn Pro Gln Asp Lys Ile Val Phe Thr Lys Ala Glu Asp Asn 
            820                 825                 830         
Leu Leu Ala Leu Gly Leu Lys His Phe Glu Gly Thr Glu Phe Pro Asn 
        835                 840                 845             
Pro Leu Ile Ser Lys Tyr Leu Leu Thr Cys Lys Thr Ala His Gln Leu 
    850                 855                 860                 
Thr Val Arg Ile Lys Asn Leu Asn Met Asn Arg Ala Pro Asp Asn Ile 
865                 870                 875                 880 
Ile Lys Phe Tyr Lys Lys Thr Lys Gln Leu Pro Val Leu Gly Lys Cys 
                885                 890                 895     
Cys Glu Glu Ile Gln Pro His Gln Trp Lys Pro Pro Ile Glu Arg Glu 
            900                 905                 910         
Glu His Arg Leu Pro Phe Trp Leu Lys Ala Ser Leu Pro Ser Ile Gln 
        915                 920                 925             
Glu Glu Leu Arg His Met Ala Asp Gly Ala Arg Glu Val Gly Asn Met 
    930                 935                 940                 
Thr Gly Thr Thr Glu Ile Asn Ser Asp Arg Ser Leu Glu Lys Asp Asn 
945                 950                 955                 960 
Leu Glu Leu Gly Ser Glu Ser Arg Tyr Pro Leu Leu Leu Pro Lys Gly 
                965                 970                 975     
Val Val Leu Lys Leu Lys Pro Val Ala Thr Arg Phe Pro Arg Lys Ala 
            980                 985                 990         
Trp Arg Gln Lys Arg Ser Ser Val Leu Lys Pro Leu Leu Ile Gln Pro 
        995                 1000                1005            
Ser Pro Ser Leu Gln Pro Ser Phe Asn Pro Gly Lys Thr Pro Ala Arg 
    1010                1015                1020                
Ser Thr His Ser Glu Ala Pro Pro Ser Lys Met Val Leu Arg Ile Pro 
1025                1030                1035                1040
His Pro Ile Gln Pro Ala Thr Val Leu Gln Thr Val Pro Gly Val Pro 
                1045                1050                1055    
Pro Leu Gly Val Ser Gly Gly Glu Ser Phe Glu Ser Pro Ala Ala Leu 
            1060                1065                1070        
Pro Ala Val Pro Pro Glu Ala Arg Thr Ser Phe Pro Leu Ser Glu Ser 
        1075                1080                1085            
Gln Thr Leu Leu Ser Ser Ala Pro Val Pro Lys Val Met Leu Pro Ser 
    1090                1095                1100                
Leu Ala Pro Ser Lys Phe Arg Lys Pro Tyr Val Arg Arg Arg Pro Ser 
1105                1110                1115                1120
Lys Arg Arg Gly Val Lys Ala Ser Pro Cys Met Lys Pro Ala Pro Val 
                1125                1130                1135    
Ile His His Pro Ala Ser Val Ile Phe Thr Val Pro Ala Thr Thr Val 
            1140                1145                1150        
Lys Ile Val Ser Leu Gly Gly Gly Cys Asn Met Ile Gln Pro Val Asn 
        1155                1160                1165            
Ala Ala Val Ala Gln Ser Pro Gln Thr Ile Pro Ile Thr Thr Leu Leu 
    1170                1175                1180                
Val Asn Pro Thr Ser Phe Pro Cys Pro Leu Asn Gln Ser Leu Val Ala 
1185                1190                1195                1200
Ser Ser Val Ser Pro Leu Ile Val Ser Gly Asn Ser Val Asn Leu Pro 
                1205                1210                1215    
Ile Pro Ser Thr Pro Glu Asp Lys Ala His Val Asn Val Asp Ile Ala 
            1220                1225                1230        
Cys Ala Val Ala Asp Gly Glu Asn Ala Phe Gln Gly Leu Glu Pro Lys 
        1235                1240                1245            
Leu Glu Pro Gln Glu Leu Ser Pro Leu Ser Ala Thr Val Phe Pro Lys 
    1250                1255                1260                
Val Glu His Ser Pro Gly Pro Pro Leu Ala Asp Ala Glu Cys Gln Glu 
1265                1270                1275                1280
Gly Leu Ser Glu Asn Ser Ala Cys Arg Trp Thr Val Val Lys Thr Glu 
                1285                1290                1295    
Glu Gly Arg Gln Ala Leu Glu Pro Leu Pro Gln Gly Ile Gln Glu Ser 
            1300                1305                1310        
Leu Asn Asn Pro Thr Pro Gly Asp Leu Glu Glu Ile Val Lys Met Glu 
        1315                1320                1325            
Pro Glu Glu Ala Arg Glu Glu Ile Ser Gly Ser Pro Glu Arg Asp Ile 
    1330                1335                1340                
Cys Asp Asp Ile Lys Val Glu His Ala Val Glu Leu Asp Thr Gly Ala 
1345                1350                1355                1360
Pro Ser Glu Glu Leu Ser Ser Ala Gly Glu Val Thr Lys Gln Thr Val 
                1365                1370                1375    
Leu Gln Lys Glu Glu Glu Arg Ser Gln Pro Thr Lys Thr Pro Ser Ser 
            1380                1385                1390        
Ser Gln Glu Pro Pro Asp Glu Gly Thr Ser Gly Thr Asp Val Asn Lys 
        1395                1400                1405            
Gly Ser Ser Lys Asn Ala Leu Ser Ser Met Asp Pro Glu Val Arg Leu 
    1410                1415                1420                
Ser Ser Pro Pro Gly Lys Pro Glu Asp Ser Ser Ser Val Asp Gly Gln 
1425                1430                1435                1440
Ser Val Gly Thr Pro Val Gly Pro Glu Thr Gly Gly Glu Lys Asn Gly 
                1445                1450                1455    
Pro Glu Glu Glu Glu Glu Glu Asp Phe Asp Asp Leu Thr Gln Asp Glu 
            1460                1465                1470        
Glu Asp Glu Met Ser Ser Ala Ser Glu Glu Ser Val Leu Ser Val Pro 
        1475                1480                1485            
Glu Leu Gln Glu Thr Met Glu Lys Leu Thr Trp Leu Ala Ser Glu Arg 
    1490                1495                1500                
Arg Met Ser Gln Glu Gly Glu Ser Glu Glu Glu Asn Ser Gln Glu Glu 
1505                1510                1515                1520
Asn Ser Glu Pro Glu Glu Glu Glu Glu Glu Glu Ala Glu Gly Met Glu 
                1525                1530                1535    
Ser Leu Gln Lys Glu Asp Glu Met Thr Asp Glu Ala Val Gly Asp Ser 
            1540                1545                1550        
Ala Glu Lys Pro Pro Thr Phe Ala Ser Pro Glu Thr Ala Pro Glu Val 
        1555                1560                1565            
Glu Thr Ser Arg Thr Pro Pro Gly Glu Ser Ile Lys Ala Ala Gly Lys 
    1570                1575                1580                
Gly Arg Asn Asn His Arg Ala Arg Asn Lys Arg Gly Ser Arg Ala Arg 
1585                1590                1595                1600
Ala Ser Lys Asp Thr Ser Lys Leu Leu Leu Leu Tyr Asp Glu Asp Ile 
                1605                1610                1615    
Leu Glu Arg Asp Pro Leu Arg Glu Gln Lys Asp Leu Ala Phe Ala Gln 
            1620                1625                1630        
Ala Tyr Leu Thr Arg Val Arg Glu Ala Leu Gln His Ile Pro Gly Lys 
        1635                1640                1645            
Tyr Glu Asp Phe Leu Gln Val Ile Tyr Glu Phe Glu Ser Ser Thr Gln 
    1650                1655                1660                
Arg Arg Thr Ala Val Asp Leu Tyr Lys Ser Leu Gln Ile Leu Leu Gln 
1665                1670                1675                1680
Asp Trp Pro Gln Leu Leu Lys Asp Phe Ala Ala Phe Leu Leu Pro Glu 
                1685                1690                1695    
Gln Ala Leu Ala Cys Gly Leu Phe Glu Glu Gln Gln Ala Phe Glu Lys 
            1700                1705                1710        
Ser Arg Lys Phe Leu Arg Gln Leu Glu Ile Cys Phe Ala Glu Asn Pro 
        1715                1720                1725            
Ser His His Gln Lys Ile Ile Lys Val Leu Gln Gly Cys Ala Asp Cys 
    1730                1735                1740                
Leu Pro Gln Glu Ile Thr Glu Leu Lys Thr Gln Met Trp Gln Leu Leu 
1745                1750                1755                1760
Lys Gly His Asp His Leu Gln Asp Glu Phe Ser Ile Phe Phe Asp His 
                1765                1770                1775    
Leu Arg Pro Ala Ala Ser Arg Met Gly Asp Phe Glu Glu Ile Asn Trp 
            1780                1785                1790        
Thr Glu Glu Lys Glu Tyr Glu Phe Asp Gly Phe Glu Glu Val Ala Leu 
        1795                1800                1805            
Pro Asp Val Glu Glu Glu Glu Glu Pro Pro Lys Ile Pro Thr Ala Ser 
    1810                1815                1820                
Lys Asn Lys Arg Lys Lys Glu Ile Gly Val Gln Asn His Asp Lys Glu 
1825                1830                1835                1840
Thr Glu Trp Pro Asp Gly Ala Lys Asp Cys Ala Cys Ser Cys His Glu 
                1845                1850                1855    
Gly Gly Pro Asp Ser Lys Leu Lys Lys Ser Lys Arg Arg Ser Cys Ser 
            1860                1865                1870        
His Cys Ser Ser Lys Val Cys Asp Ser Lys Ser Tyr Lys Ser Lys Glu 
        1875                1880                1885            
Pro His Glu Leu Val Gly Ser Ser Pro His Arg Glu Ala Ser Pro Met 
    1890                1895                1900                
Pro Gly Ala Lys Glu Ala Gly Gln Gly Lys Asp Met Met Glu Glu Glu 
1905                1910                1915                1920
Ala Pro Glu Glu Arg Glu Ser Thr Glu Ala Thr Gln Ser Arg Thr Val 
                1925                1930                1935    
Arg Thr Thr Arg Lys Gly Glu Met Pro Val Ser Ala Gly Leu Ala Val 
            1940                1945                1950        
Gly Ser Thr Leu Pro Ser Pro Arg Glu Val Thr Val Thr Glu Arg Leu 
        1955                1960                1965            
Leu Leu Asp Gly Pro Pro Pro His Ser Pro Glu Thr Pro Gln Phe Pro 
    1970                1975                1980                
Pro Thr Thr Gly Ala Val Leu Tyr Thr Val Lys Arg Asn Gln Val Gly 
1985                1990                1995                2000
Pro Glu Val Arg Ser Cys Pro Lys Ala Ser Pro Arg Leu Gln Lys Glu 
                2005                2010                2015    
Arg Glu Gly Gln Lys Ala Val Ser Glu Ser Glu Ala Leu Met Leu Val 
            2020                2025                2030        
Trp Asp Ala Ser Glu Thr Glu Lys Leu Pro Gly Thr Val Glu Pro Pro 
        2035                2040                2045            
Ala Ser Phe Leu Ser Pro Val Ser Ser Lys Thr Arg Asp Ala Gly Arg 
    2050                2055                2060                
Arg His Val Ser Gly Lys Pro Asp Thr Gln Glu Arg Trp Leu Pro Ser 
2065                2070                2075                2080
Ser Arg Ala Arg Val Lys Thr Arg Asp Arg Thr Cys Pro Val His Glu 
                2085                2090                2095    
Ser Pro Ser Gly Ile Asp Thr Ser Glu Thr Ser Pro Lys Ala Pro Arg 
            2100                2105                2110        
Gly Gly Leu Ala Lys Asp Ser Gly Thr Gln Ala Lys Gly Pro Glu Gly 
        2115                2120                2125            
Glu Gln Gln Pro Lys Ala Ala Glu Ala Thr Val Cys Ala Asn Asn Ser 
    2130                2135                2140                
Lys Val Ser Ser Thr Gly Glu Lys Val Val Leu Trp Thr Arg Glu Ala 
2145                2150                2155                2160
Asp Arg Val Ile Leu Thr Met Cys Gln Glu Gln Gly Ala Gln Pro Gln 
                2165                2170                2175    
Thr Phe Asn Ile Ile Ser Gln Gln Leu Gly Asn Lys Thr Pro Ala Glu 
            2180                2185                2190        
Val Ser His Arg Phe Arg Glu Leu Met Gln Leu Phe His Thr Ala Cys 
        2195                2200                2205            
Glu Ala Ser Ser Glu Asp Glu Asp Asp Ala Thr Ser Thr Ser Asn Ala 
    2210                2215                2220                
Asp Gln Leu Ser Asp His Gly Asp Leu Leu Ser Glu Glu Glu Leu Asp 
2225                2230                2235                2240
Glu 
    

<210> 3
<211> 1529
<212> PRT
<213> Homo sapiens


<220> 
<223> human GON4L isoform B, GenBank: AAR01262.1

<400> 3
Met Leu Pro Cys Lys Lys Arg Arg Thr Thr Val Thr Glu Ser Leu Gln 
1               5                   10                  15      
His Lys Gly Asn Gln Glu Glu Asn Asn Val Asp Leu Glu Ser Ala Val 
            20                  25                  30          
Lys Pro Glu Ser Asp Gln Val Lys Asp Leu Ser Ser Val Ser Leu Ser 
        35                  40                  45              
Trp Asp Pro Ser His Gly Arg Val Ala Gly Phe Glu Val Gln Ser Leu 
    50                  55                  60                  
Gln Asp Ala Gly Asn Gln Leu Gly Met Glu Asp Thr Ser Leu Ser Ser 
65                  70                  75                  80  
Gly Met Leu Thr Gln Asn Thr Asn Val Pro Ile Leu Glu Gly Val Asp 
                85                  90                  95      
Val Ala Ile Ser Gln Gly Ile Thr Leu Pro Ser Leu Glu Ser Phe His 
            100                 105                 110         
Pro Leu Asn Ile His Ile Gly Lys Gly Lys Leu His Ala Thr Gly Ser 
        115                 120                 125             
Lys Arg Gly Lys Lys Met Thr Leu Arg Pro Gly Pro Val Thr Gln Glu 
    130                 135                 140                 
Asp Arg Cys Asp His Leu Thr Leu Lys Glu Pro Phe Ser Gly Glu Pro 
145                 150                 155                 160 
Ser Glu Glu Val Lys Glu Glu Gly Gly Lys Pro Gln Met Asn Ser Glu 
                165                 170                 175     
Gly Glu Ile Pro Ser Leu Pro Ser Gly Ser Gln Ser Ala Lys Pro Val 
            180                 185                 190         
Ser Gln Pro Arg Lys Ser Thr Gln Pro Asp Val Cys Ala Ser Pro Gln 
        195                 200                 205             
Glu Lys Pro Leu Arg Thr Leu Phe His Gln Pro Glu Glu Glu Ile Glu 
    210                 215                 220                 
Asp Gly Gly Leu Phe Ile Pro Met Glu Glu Gln Asp Asn Glu Glu Ser 
225                 230                 235                 240 
Glu Lys Arg Arg Lys Lys Lys Lys Gly Thr Lys Arg Lys Arg Asp Gly 
                245                 250                 255     
Arg Gly Gln Glu Gly Thr Leu Ala Tyr Asp Leu Lys Leu Asp Asp Met 
            260                 265                 270         
Leu Asp Arg Thr Leu Glu Asp Gly Ala Lys Gln His Asn Leu Thr Ala 
        275                 280                 285             
Val Asn Val Arg Asn Ile Leu His Glu Val Ile Thr Asn Glu His Val 
    290                 295                 300                 
Val Ala Met Met Lys Ala Ala Ile Ser Glu Thr Glu Asp Met Pro Met 
305                 310                 315                 320 
Phe Glu Pro Lys Met Thr Arg Ser Lys Leu Lys Glu Val Val Glu Lys 
                325                 330                 335     
Gly Val Val Ile Pro Thr Trp Asn Ile Ser Pro Ile Lys Lys Ala Asn 
            340                 345                 350         
Glu Ile Lys Pro Pro Gln Phe Val Asp Ile His Leu Glu Glu Asp Asp 
        355                 360                 365             
Ser Ser Asp Glu Glu Tyr Gln Pro Asp Asp Glu Glu Glu Asp Glu Thr 
    370                 375                 380                 
Ala Glu Glu Ser Leu Leu Glu Ser Asp Val Glu Ser Thr Ala Ser Ser 
385                 390                 395                 400 
Pro Arg Gly Ala Lys Lys Ser Arg Leu Arg Gln Ser Ser Glu Met Thr 
                405                 410                 415     
Glu Thr Asp Glu Glu Ser Gly Ile Leu Ser Glu Ala Glu Lys Val Thr 
            420                 425                 430         
Ala Pro Ala Ile Arg His Ile Ser Ala Glu Val Val Pro Met Gly Pro 
        435                 440                 445             
Pro Pro Pro Pro Lys Pro Lys Gln Thr Arg Asp Ser Thr Phe Met Glu 
    450                 455                 460                 
Lys Leu His Ala Val Asp Glu Glu Leu Ala Ser Ser Pro Val Cys Met 
465                 470                 475                 480 
Asp Ser Phe Gln Pro Met Asp Asp Ser Leu Ile Ala Phe Arg Thr Arg 
                485                 490                 495     
Ser Lys Met Pro Leu Lys Asp Val Pro Leu Gly Gln Leu Glu Ala Glu 
            500                 505                 510         
Leu Gln Ala Pro Asp Ile Thr Pro Asp Met Tyr Asp Pro Asn Thr Ala 
        515                 520                 525             
Asp Asp Glu Asp Trp Lys Met Trp Leu Gly Gly Leu Met Asn Asp Asp 
    530                 535                 540                 
Val Gly Asn Glu Asp Glu Ala Asp Asp Asp Asp Asp Pro Glu Tyr Asn 
545                 550                 555                 560 
Phe Leu Glu Asp Leu Asp Glu Pro Asp Thr Glu Asp Phe Arg Thr Asp 
                565                 570                 575     
Arg Ala Val Arg Ile Thr Lys Lys Glu Val Asn Glu Leu Met Glu Glu 
            580                 585                 590         
Leu Phe Glu Thr Phe Gln Asp Glu Met Gly Phe Ser Asn Met Glu Asp 
        595                 600                 605             
Asp Gly Pro Glu Glu Glu Glu Cys Val Ala Glu Pro Arg Pro Asn Phe 
    610                 615                 620                 
Asn Thr Pro Gln Ala Leu Arg Phe Glu Glu Pro Leu Ala Asn Leu Leu 
625                 630                 635                 640 
Asn Glu Gln His Arg Thr Val Lys Glu Leu Phe Glu Gln Leu Lys Met 
                645                 650                 655     
Lys Lys Ser Ser Ala Lys Gln Leu Gln Glu Val Glu Lys Val Lys Pro 
            660                 665                 670         
Gln Ser Glu Lys Val His Gln Thr Leu Ile Leu Asp Pro Ala Gln Arg 
        675                 680                 685             
Lys Arg Leu Gln Gln Gln Met Gln Gln His Val Gln Leu Leu Thr Gln 
    690                 695                 700                 
Ile His Leu Leu Ala Thr Cys Asn Pro Asn Leu Asn Pro Glu Ala Thr 
705                 710                 715                 720 
Thr Thr Arg Ile Phe Leu Lys Glu Leu Gly Thr Phe Ala Gln Ser Ser 
                725                 730                 735     
Ile Ala Leu His His Gln Tyr Asn Pro Lys Phe Gln Thr Leu Phe Gln 
            740                 745                 750         
Pro Cys Asn Leu Met Gly Ala Met Gln Leu Ile Glu Asp Phe Ser Thr 
        755                 760                 765             
His Val Ser Ile Asp Cys Ser Pro His Lys Thr Val Lys Lys Thr Ala 
    770                 775                 780                 
Asn Glu Phe Pro Cys Leu Pro Lys Gln Val Ala Trp Ile Leu Ala Thr 
785                 790                 795                 800 
Ser Lys Val Phe Met Tyr Pro Glu Leu Leu Pro Val Cys Ser Leu Lys 
                805                 810                 815     
Ala Lys Asn Pro Gln Asp Lys Ile Val Phe Thr Lys Ala Glu Asp Asn 
            820                 825                 830         
Leu Leu Ala Leu Gly Leu Lys His Phe Glu Gly Thr Glu Phe Pro Asn 
        835                 840                 845             
Pro Leu Ile Ser Lys Tyr Leu Leu Thr Cys Lys Thr Ala His Gln Leu 
    850                 855                 860                 
Thr Val Arg Ile Lys Asn Leu Asn Met Asn Arg Ala Pro Asp Asn Ile 
865                 870                 875                 880 
Ile Lys Phe Tyr Lys Lys Thr Lys Gln Leu Pro Val Leu Gly Lys Cys 
                885                 890                 895     
Cys Glu Glu Ile Gln Pro His Gln Trp Lys Pro Pro Ile Glu Arg Glu 
            900                 905                 910         
Glu His Arg Leu Pro Phe Trp Leu Lys Ala Ser Leu Pro Ser Ile Gln 
        915                 920                 925             
Glu Glu Leu Arg His Met Ala Asp Gly Ala Arg Glu Val Gly Asn Met 
    930                 935                 940                 
Thr Gly Thr Thr Glu Ile Asn Ser Asp Arg Ser Leu Glu Lys Asp Asn 
945                 950                 955                 960 
Leu Glu Leu Gly Ser Glu Ser Arg Tyr Pro Leu Leu Leu Pro Lys Gly 
                965                 970                 975     
Val Val Leu Lys Leu Lys Pro Val Ala Thr Arg Ser Pro Arg Lys Ala 
            980                 985                 990         
Trp Arg Gln Lys Arg Ser Ser Val Leu Lys Pro Leu Leu Ile Gln Pro 
        995                 1000                1005            
Ser Pro Ser Leu Gln Pro Ser Phe Asn Pro Gly Lys Thr Pro Ala Arg 
    1010                1015                1020                
Ser Thr His Ser Glu Ala Pro Pro Ser Lys Met Val Leu Arg Ile Pro 
1025                1030                1035                1040
His Pro Ile Gln Pro Ala Thr Val Leu Gln Thr Val Pro Gly Val Pro 
                1045                1050                1055    
Pro Leu Gly Val Ser Gly Gly Glu Ser Phe Glu Ser Pro Ala Ala Leu 
            1060                1065                1070        
Pro Ala Val Pro Pro Glu Ala Arg Thr Ser Phe Pro Leu Ser Glu Ser 
        1075                1080                1085            
Gln Thr Leu Leu Ser Ser Ala Pro Val Pro Lys Val Met Leu Pro Ser 
    1090                1095                1100                
Leu Ala Pro Ser Lys Phe Arg Lys Pro Tyr Val Arg Arg Arg Pro Ser 
1105                1110                1115                1120
Lys Arg Arg Gly Val Lys Ala Ser Pro Cys Met Lys Pro Ala Pro Val 
                1125                1130                1135    
Ile His His Pro Ala Ser Val Ile Phe Thr Val Pro Ala Thr Thr Val 
            1140                1145                1150        
Lys Ile Val Ser Leu Gly Gly Gly Cys Asn Met Ile Gln Pro Val Asn 
        1155                1160                1165            
Ala Ala Val Ala Gln Ser Pro Gln Thr Ile Pro Ile Thr Thr Leu Leu 
    1170                1175                1180                
Val Asn Pro Thr Ser Phe Pro Cys Pro Leu Asn Gln Ser Leu Val Ala 
1185                1190                1195                1200
Ser Ser Val Ser Pro Leu Ile Val Ser Gly Asn Ser Val Asn Leu Pro 
                1205                1210                1215    
Ile Pro Ser Thr Pro Glu Asp Lys Ala His Val Asn Val Asp Ile Ala 
            1220                1225                1230        
Cys Ala Val Ala Asp Gly Glu Asn Ala Phe Gln Gly Leu Glu Pro Lys 
        1235                1240                1245            
Leu Glu Pro Gln Glu Leu Ser Pro Leu Ser Ala Thr Val Phe Pro Lys 
    1250                1255                1260                
Val Glu His Ser Pro Gly Pro Pro Leu Ala Asp Ala Glu Cys Gln Glu 
1265                1270                1275                1280
Gly Leu Ser Glu Asn Ser Ala Cys Arg Trp Thr Val Val Lys Thr Glu 
                1285                1290                1295    
Glu Gly Arg Gln Ala Leu Glu Pro Leu Pro Gln Gly Ile Gln Glu Ser 
            1300                1305                1310        
Leu Asn Asn Pro Thr Pro Gly Asp Leu Glu Glu Ile Val Lys Met Glu 
        1315                1320                1325            
Pro Glu Glu Ala Arg Glu Glu Ile Ser Gly Ser Pro Glu Arg Asp Ile 
    1330                1335                1340                
Cys Asp Asp Ile Lys Val Glu His Ala Val Glu Leu Asp Thr Gly Ala 
1345                1350                1355                1360
Pro Ser Glu Glu Leu Ser Ser Ala Gly Glu Val Thr Lys Gln Thr Val 
                1365                1370                1375    
Leu Gln Lys Glu Glu Gly Arg Ser Gln Pro Thr Lys Thr Pro Ser Ser 
            1380                1385                1390        
Ser Gln Glu Pro Pro Asp Glu Gly Thr Ser Gly Thr Asp Val Asn Lys 
        1395                1400                1405            
Gly Ser Ser Lys Asn Ala Leu Ser Ser Met Asp Pro Glu Val Arg Leu 
    1410                1415                1420                
Ser Ser Pro Pro Gly Lys Pro Glu Asp Ser Ser Ser Val Asp Gly Gln 
1425                1430                1435                1440
Ser Val Gly Thr Pro Val Gly Pro Glu Thr Gly Gly Glu Lys Asn Gly 
                1445                1450                1455    
Pro Glu Glu Glu Glu Glu Glu Asp Phe Asp Asp Leu Thr Gln Asp Glu 
            1460                1465                1470        
Glu Asp Glu Met Ser Ser Ala Ser Glu Glu Ser Val Leu Ser Val Pro 
        1475                1480                1485            
Glu Leu Gln Val Arg Ala Gly Glu Tyr Ser Gln Val Phe Arg Gly Leu 
    1490                1495                1500                
Ser Asn Met Tyr His Leu Leu Ile Cys His Leu Leu Ala Cys Cys Thr 
1505                1510                1515                1520
Met Asp Ser Pro Lys Ile Ile Cys Ile 
                1525                

<210> 4
<211> 2241
<212> PRT
<213> Homo sapiens


<220> 
<223> human GON4L isoform C, GenBank: AAR01261.1

<400> 4
Met Leu Pro Cys Lys Lys Arg Arg Thr Thr Val Thr Glu Ser Leu Gln 
1               5                   10                  15      
His Lys Gly Asn Gln Glu Glu Asn Asn Val Asp Leu Glu Ser Ala Val 
            20                  25                  30          
Lys Pro Glu Ser Asp Gln Val Lys Asp Leu Ser Ser Val Ser Leu Ser 
        35                  40                  45              
Trp Asp Pro Ser His Gly Arg Val Ala Gly Phe Glu Val Gln Ser Leu 
    50                  55                  60                  
Gln Asp Ala Gly Asn Gln Leu Gly Met Glu Asp Thr Ser Leu Ser Ser 
65                  70                  75                  80  
Gly Met Leu Thr Gln Asn Thr Asn Val Pro Ile Leu Glu Gly Val Asp 
                85                  90                  95      
Val Ala Ile Ser Gln Gly Ile Thr Leu Pro Ser Leu Glu Ser Phe His 
            100                 105                 110         
Pro Leu Asn Ile His Ile Gly Lys Gly Lys Leu His Ala Thr Gly Ser 
        115                 120                 125             
Lys Arg Gly Lys Lys Met Thr Leu Arg Pro Gly Pro Val Thr Gln Glu 
    130                 135                 140                 
Asp Arg Cys Asp His Leu Thr Leu Lys Glu Pro Phe Ser Gly Glu Pro 
145                 150                 155                 160 
Ser Glu Glu Val Lys Glu Glu Gly Gly Lys Pro Gln Met Asn Ser Glu 
                165                 170                 175     
Gly Glu Ile Pro Ser Leu Pro Ser Gly Ser Gln Ser Ala Lys Pro Val 
            180                 185                 190         
Ser Gln Pro Arg Lys Ser Thr Gln Pro Asp Val Cys Ala Ser Pro Gln 
        195                 200                 205             
Glu Lys Pro Leu Arg Thr Leu Phe His Gln Pro Glu Glu Glu Ile Glu 
    210                 215                 220                 
Asp Gly Gly Leu Phe Ile Pro Met Glu Glu Gln Asp Asn Glu Glu Ser 
225                 230                 235                 240 
Glu Lys Arg Arg Lys Lys Lys Lys Gly Thr Lys Arg Lys Arg Asp Gly 
                245                 250                 255     
Arg Gly Gln Glu Gly Thr Leu Ala Tyr Asp Leu Lys Leu Asp Asp Met 
            260                 265                 270         
Leu Asp Arg Thr Leu Glu Asp Gly Ala Lys Gln His Asn Leu Thr Ala 
        275                 280                 285             
Val Asn Val Arg Asn Ile Leu His Glu Val Ile Thr Asn Glu His Val 
    290                 295                 300                 
Val Ala Met Met Lys Ala Ala Ile Ser Glu Thr Glu Asp Met Pro Met 
305                 310                 315                 320 
Phe Glu Pro Lys Met Thr Arg Ser Lys Leu Lys Glu Val Val Glu Lys 
                325                 330                 335     
Gly Val Val Ile Pro Thr Trp Asn Ile Ser Pro Ile Lys Lys Ala Asn 
            340                 345                 350         
Glu Ile Lys Pro Pro Gln Phe Val Asp Ile His Leu Glu Glu Asp Asp 
        355                 360                 365             
Ser Ser Asp Glu Glu Tyr Gln Pro Asp Asp Glu Glu Glu Asp Glu Thr 
    370                 375                 380                 
Ala Glu Glu Ser Leu Leu Glu Ser Asp Val Glu Ser Thr Ala Ser Ser 
385                 390                 395                 400 
Pro Arg Gly Ala Lys Lys Ser Arg Leu Arg Gln Ser Ser Glu Met Thr 
                405                 410                 415     
Glu Thr Asp Glu Glu Ser Gly Ile Leu Ser Glu Ala Glu Lys Val Thr 
            420                 425                 430         
Thr Pro Ala Ile Arg His Ile Ser Ala Glu Val Val Pro Met Gly Pro 
        435                 440                 445             
Pro Pro Pro Pro Lys Pro Lys Gln Thr Arg Asp Ser Thr Phe Met Glu 
    450                 455                 460                 
Lys Leu His Ala Val Asp Glu Glu Leu Ala Ser Ser Pro Val Cys Met 
465                 470                 475                 480 
Asp Ser Phe Gln Pro Met Asp Asp Ser Leu Ile Ala Phe Arg Thr Arg 
                485                 490                 495     
Ser Lys Met Pro Leu Lys Asp Val Pro Leu Gly Gln Leu Glu Ala Glu 
            500                 505                 510         
Leu Gln Ala Pro Asp Ile Thr Pro Asp Met Tyr Asp Pro Asn Thr Ala 
        515                 520                 525             
Asp Asp Glu Asp Trp Lys Met Trp Leu Gly Gly Leu Met Asn Asp Asp 
    530                 535                 540                 
Val Gly Asn Glu Asp Glu Ala Asp Asp Asp Asp Asp Pro Glu Tyr Asn 
545                 550                 555                 560 
Phe Leu Glu Asp Leu Asp Glu Pro Asp Thr Glu Asp Phe Arg Thr Asp 
                565                 570                 575     
Arg Ala Val Arg Ile Thr Lys Lys Glu Val Asn Glu Leu Met Glu Glu 
            580                 585                 590         
Leu Phe Glu Thr Phe Gln Asp Glu Met Gly Phe Ser Asn Met Glu Asp 
        595                 600                 605             
Asp Gly Pro Glu Glu Glu Glu Cys Val Ala Glu Pro Arg Pro Asn Phe 
    610                 615                 620                 
Asn Thr Pro Gln Ala Leu Arg Phe Glu Glu Pro Leu Ala Asn Leu Leu 
625                 630                 635                 640 
Asn Glu Gln His Arg Thr Val Lys Glu Leu Phe Glu Gln Leu Lys Met 
                645                 650                 655     
Lys Lys Ser Ser Ala Lys Gln Leu Gln Glu Val Glu Lys Val Lys Pro 
            660                 665                 670         
Gln Ser Glu Lys Val His Gln Thr Leu Ile Leu Asp Pro Ala Gln Arg 
        675                 680                 685             
Lys Arg Leu Gln Gln Gln Met Gln Gln His Val Gln Leu Leu Thr Gln 
    690                 695                 700                 
Ile His Leu Leu Ala Thr Cys Asn Pro Asn Leu Asn Pro Glu Ala Thr 
705                 710                 715                 720 
Thr Thr Arg Ile Phe Leu Lys Glu Leu Gly Thr Phe Ala Gln Ser Ser 
                725                 730                 735     
Ile Ala Leu His His Gln Tyr Asn Pro Lys Phe Gln Thr Leu Phe Gln 
            740                 745                 750         
Pro Cys Asn Leu Met Gly Ala Met Gln Leu Ile Glu Asp Phe Ser Thr 
        755                 760                 765             
His Val Ser Ile Asp Cys Ser Pro His Lys Thr Val Lys Lys Thr Ala 
    770                 775                 780                 
Asn Glu Phe Pro Cys Leu Pro Lys Gln Val Ala Trp Ile Leu Ala Thr 
785                 790                 795                 800 
Ser Lys Val Phe Met Tyr Pro Glu Leu Leu Pro Val Cys Ser Leu Lys 
                805                 810                 815     
Ala Lys Asn Pro Gln Asp Lys Ile Val Phe Thr Lys Ala Glu Asp Asn 
            820                 825                 830         
Leu Leu Ala Leu Gly Leu Lys His Phe Glu Gly Thr Glu Phe Pro Asn 
        835                 840                 845             
Pro Leu Ile Ser Lys Tyr Leu Leu Thr Cys Lys Thr Ala His Gln Leu 
    850                 855                 860                 
Thr Val Arg Ile Lys Asn Leu Asn Met Asn Arg Ala Pro Asp Asn Ile 
865                 870                 875                 880 
Ile Lys Phe Tyr Lys Lys Thr Lys Gln Leu Pro Val Leu Gly Lys Cys 
                885                 890                 895     
Cys Glu Glu Ile Gln Pro His Gln Trp Lys Pro Pro Ile Glu Arg Glu 
            900                 905                 910         
Glu His Arg Leu Pro Phe Trp Leu Lys Ala Ser Leu Pro Ser Ile Gln 
        915                 920                 925             
Glu Glu Leu Arg His Met Ala Asp Gly Ala Arg Glu Val Gly Asn Met 
    930                 935                 940                 
Thr Gly Thr Thr Glu Ile Asn Ser Asp Arg Ser Leu Glu Lys Asp Asn 
945                 950                 955                 960 
Leu Glu Leu Gly Ser Glu Ser Arg Tyr Pro Leu Leu Leu Pro Lys Gly 
                965                 970                 975     
Val Val Leu Lys Leu Lys Pro Val Ala Thr Arg Phe Pro Arg Lys Ala 
            980                 985                 990         
Trp Arg Gln Lys Arg Ser Ser Val Leu Lys Pro Leu Leu Ile Gln Pro 
        995                 1000                1005            
Ser Pro Ser Leu Gln Pro Ser Phe Asn Pro Gly Lys Thr Pro Ala Arg 
    1010                1015                1020                
Ser Thr His Ser Glu Ala Pro Pro Ser Lys Met Val Leu Arg Ile Pro 
1025                1030                1035                1040
His Pro Ile Gln Pro Ala Thr Val Leu Gln Thr Val Pro Gly Val Pro 
                1045                1050                1055    
Pro Leu Gly Val Ser Gly Gly Glu Ser Phe Glu Ser Pro Ala Ala Leu 
            1060                1065                1070        
Pro Ala Val Pro Pro Glu Ala Arg Thr Ser Phe Pro Leu Ser Glu Ser 
        1075                1080                1085            
Gln Thr Leu Leu Ser Ser Ala Pro Val Pro Lys Val Met Leu Pro Ser 
    1090                1095                1100                
Leu Ala Pro Ser Lys Phe Arg Lys Pro Tyr Val Arg Arg Arg Pro Ser 
1105                1110                1115                1120
Lys Arg Arg Gly Val Lys Ala Ser Pro Cys Met Lys Pro Ala Pro Val 
                1125                1130                1135    
Ile His His Pro Ala Ser Val Ile Phe Thr Val Pro Ala Thr Thr Val 
            1140                1145                1150        
Lys Ile Val Ser Leu Gly Gly Gly Cys Asn Met Ile Gln Pro Val Asn 
        1155                1160                1165            
Ala Ala Val Ala Gln Ser Pro Gln Thr Ile Pro Ile Thr Thr Leu Leu 
    1170                1175                1180                
Val Asn Pro Thr Ser Phe Pro Cys Pro Leu Asn Gln Ser Leu Val Ala 
1185                1190                1195                1200
Ser Ser Val Ser Pro Leu Ile Val Ser Gly Asn Ser Val Asn Leu Pro 
                1205                1210                1215    
Ile Pro Ser Thr Pro Glu Asp Lys Ala His Val Asn Val Asp Ile Ala 
            1220                1225                1230        
Cys Ala Val Ala Asp Gly Glu Asn Ala Phe Gln Gly Leu Glu Pro Lys 
        1235                1240                1245            
Leu Glu Pro Gln Glu Leu Ser Pro Leu Ser Ala Thr Val Phe Pro Lys 
    1250                1255                1260                
Val Glu His Ser Pro Gly Pro Pro Leu Ala Asp Ala Glu Cys Gln Glu 
1265                1270                1275                1280
Gly Leu Ser Glu Asn Ser Ala Cys Arg Trp Thr Val Val Lys Thr Glu 
                1285                1290                1295    
Glu Gly Arg Gln Ala Leu Glu Pro Leu Pro Gln Gly Ile Gln Glu Ser 
            1300                1305                1310        
Leu Asn Asn Pro Thr Pro Gly Asp Leu Glu Glu Ile Val Lys Met Glu 
        1315                1320                1325            
Pro Glu Glu Ala Arg Glu Glu Ile Ser Gly Ser Pro Glu Arg Asp Ile 
    1330                1335                1340                
Cys Asp Asp Ile Lys Val Glu His Ala Val Glu Leu Asp Thr Gly Ala 
1345                1350                1355                1360
Pro Ser Glu Glu Leu Ser Ser Ala Gly Glu Val Thr Lys Gln Thr Val 
                1365                1370                1375    
Leu Gln Lys Glu Glu Glu Arg Ser Gln Pro Thr Lys Thr Pro Ser Ser 
            1380                1385                1390        
Ser Gln Glu Pro Pro Asp Glu Gly Thr Ser Gly Thr Asp Val Asn Lys 
        1395                1400                1405            
Gly Ser Ser Lys Asn Ala Leu Ser Ser Met Asp Pro Glu Val Arg Leu 
    1410                1415                1420                
Ser Ser Pro Pro Gly Lys Pro Glu Asp Ser Ser Ser Val Asp Gly Gln 
1425                1430                1435                1440
Ser Val Gly Thr Pro Val Gly Pro Glu Thr Gly Gly Glu Lys Asn Gly 
                1445                1450                1455    
Pro Glu Glu Glu Glu Glu Glu Asp Phe Asp Asp Leu Thr Gln Asp Glu 
            1460                1465                1470        
Glu Asp Glu Met Ser Ser Ala Ser Glu Glu Ser Val Leu Ser Val Pro 
        1475                1480                1485            
Glu Leu Gln Glu Thr Met Glu Lys Leu Thr Trp Leu Ala Ser Glu Arg 
    1490                1495                1500                
Arg Met Ser Gln Glu Gly Glu Ser Glu Glu Glu Asn Ser Gln Glu Glu 
1505                1510                1515                1520
Asn Ser Glu Pro Glu Glu Glu Glu Glu Glu Glu Ala Glu Gly Met Glu 
                1525                1530                1535    
Ser Leu Gln Lys Glu Asp Glu Met Thr Asp Glu Ala Val Gly Asp Ser 
            1540                1545                1550        
Ala Glu Lys Pro Pro Thr Phe Ala Ser Pro Glu Thr Ala Pro Glu Val 
        1555                1560                1565            
Glu Thr Ser Arg Thr Pro Pro Gly Glu Ser Ile Lys Ala Ala Gly Lys 
    1570                1575                1580                
Gly Arg Asn Asn His Arg Ala Arg Asn Lys Arg Gly Ser Arg Ala Arg 
1585                1590                1595                1600
Ala Ser Lys Asp Thr Ser Lys Leu Leu Leu Leu Tyr Asp Glu Asp Ile 
                1605                1610                1615    
Leu Glu Arg Asp Pro Leu Arg Glu Gln Lys Asp Leu Ala Phe Ala Gln 
            1620                1625                1630        
Ala Tyr Leu Thr Arg Val Arg Glu Ala Leu Gln His Ile Pro Gly Lys 
        1635                1640                1645            
Tyr Glu Asp Phe Leu Gln Val Ile Tyr Glu Phe Glu Ser Ser Thr Gln 
    1650                1655                1660                
Arg Arg Thr Ala Val Asp Leu Tyr Lys Ser Leu Gln Ile Leu Leu Gln 
1665                1670                1675                1680
Asp Trp Pro Gln Leu Leu Lys Asp Phe Ala Ala Phe Leu Leu Pro Glu 
                1685                1690                1695    
Gln Ala Leu Ala Cys Gly Leu Phe Glu Glu Gln Gln Ala Phe Glu Lys 
            1700                1705                1710        
Ser Arg Lys Phe Leu Arg Gln Leu Glu Ile Cys Phe Ala Glu Asn Pro 
        1715                1720                1725            
Ser His His Gln Lys Ile Ile Lys Val Leu Gln Gly Cys Ala Asp Cys 
    1730                1735                1740                
Leu Pro Gln Glu Ile Thr Glu Leu Lys Thr Gln Met Trp Gln Leu Leu 
1745                1750                1755                1760
Lys Gly His Asp His Leu Gln Asp Glu Phe Ser Ile Phe Phe Asp His 
                1765                1770                1775    
Leu Arg Pro Ala Ala Ser Arg Met Gly Asp Phe Glu Glu Ile Asn Trp 
            1780                1785                1790        
Thr Glu Glu Lys Glu Tyr Glu Phe Asp Gly Phe Glu Glu Val Ala Leu 
        1795                1800                1805            
Pro Asp Val Glu Glu Glu Glu Glu Pro Pro Lys Ile Pro Thr Ala Ser 
    1810                1815                1820                
Lys Asn Lys Arg Lys Lys Glu Ile Gly Val Gln Asn His Asp Lys Glu 
1825                1830                1835                1840
Thr Glu Trp Pro Asp Gly Ala Lys Asp Cys Ala Cys Ser Cys His Glu 
                1845                1850                1855    
Gly Gly Pro Asp Ser Lys Leu Lys Lys Ser Lys Arg Arg Ser Cys Ser 
            1860                1865                1870        
His Cys Ser Ser Lys Val Cys Asp Ser Lys Ser Tyr Lys Ser Lys Glu 
        1875                1880                1885            
Pro His Glu Leu Val Gly Ser Ser Pro His Arg Glu Ala Ser Pro Met 
    1890                1895                1900                
Pro Gly Ala Lys Glu Ala Gly Gln Gly Lys Asp Met Met Glu Glu Glu 
1905                1910                1915                1920
Ala Pro Glu Glu Arg Glu Ser Thr Glu Ala Thr Gln Ser Arg Thr Val 
                1925                1930                1935    
Arg Thr Thr Arg Lys Gly Glu Met Pro Val Ser Ala Gly Leu Ala Val 
            1940                1945                1950        
Gly Ser Thr Leu Pro Ser Pro Arg Glu Val Thr Val Thr Glu Arg Leu 
        1955                1960                1965            
Leu Leu Asp Gly Pro Pro Pro His Ser Pro Glu Thr Pro Gln Phe Pro 
    1970                1975                1980                
Pro Thr Thr Gly Ala Val Leu Tyr Thr Val Lys Arg Asn Gln Val Gly 
1985                1990                1995                2000
Pro Glu Val Arg Ser Cys Pro Lys Ala Ser Pro Arg Leu Gln Lys Glu 
                2005                2010                2015    
Arg Glu Gly Gln Lys Ala Val Ser Glu Ser Glu Ala Leu Met Leu Val 
            2020                2025                2030        
Trp Asp Ala Ser Glu Thr Glu Lys Leu Pro Gly Thr Val Glu Pro Pro 
        2035                2040                2045            
Ala Ser Phe Leu Ser Pro Val Ser Ser Lys Thr Arg Asp Ala Gly Arg 
    2050                2055                2060                
Arg His Val Ser Gly Lys Pro Asp Thr Gln Glu Arg Trp Leu Pro Ser 
2065                2070                2075                2080
Ser Arg Ala Arg Val Lys Thr Arg Asp Arg Thr Cys Pro Val His Glu 
                2085                2090                2095    
Ser Pro Ser Gly Ile Asp Thr Ser Glu Thr Ser Pro Lys Ala Pro Arg 
            2100                2105                2110        
Gly Gly Leu Ala Lys Asp Ser Gly Thr Gln Ala Lys Gly Pro Glu Gly 
        2115                2120                2125            
Glu Gln Gln Pro Lys Ala Ala Glu Ala Thr Val Cys Ala Asn Asn Ser 
    2130                2135                2140                
Lys Val Ser Ser Thr Gly Glu Lys Val Val Leu Trp Thr Arg Glu Ala 
2145                2150                2155                2160
Asp Arg Val Ile Leu Thr Met Cys Gln Glu Gln Gly Ala Gln Pro Gln 
                2165                2170                2175    
Thr Phe Asn Ile Ile Ser Gln Gln Leu Gly Asn Lys Thr Pro Ala Glu 
            2180                2185                2190        
Val Ser His Arg Phe Arg Glu Leu Met Gln Leu Phe His Thr Ala Cys 
        2195                2200                2205            
Glu Ala Ser Ser Glu Asp Glu Asp Asp Ala Thr Ser Thr Ser Asn Ala 
    2210                2215                2220                
Asp Gln Leu Ser Asp His Gly Asp Leu Leu Ser Glu Glu Glu Leu Asp 
2225                2230                2235                2240
Glu 
    

<210> 5
<211> 356
<212> PRT
<213> Homo sapiens


<220> 
<223> human NEUROD1, GenBank: BAJ84018.1

<400> 5
Met Thr Lys Ser Tyr Ser Glu Ser Gly Leu Met Gly Glu Pro Gln Pro 
1               5                   10                  15      
Gln Gly Pro Pro Ser Trp Thr Asp Glu Cys Leu Ser Ser Gln Asp Glu 
            20                  25                  30          
Glu His Glu Ala Asp Lys Lys Glu Asp Asp Leu Glu Ala Met Asn Ala 
        35                  40                  45              
Glu Glu Asp Ser Leu Arg Asn Gly Gly Glu Glu Glu Asp Glu Asp Glu 
    50                  55                  60                  
Asp Leu Glu Glu Glu Glu Glu Glu Glu Glu Glu Asp Asp Asp Gln Lys 
65                  70                  75                  80  
Pro Lys Arg Arg Gly Pro Lys Lys Lys Lys Met Thr Lys Ala Arg Leu 
                85                  90                  95      
Glu Arg Phe Lys Leu Arg Arg Met Lys Ala Asn Ala Arg Glu Arg Asn 
            100                 105                 110         
Arg Met His Gly Leu Asn Ala Ala Leu Asp Asn Leu Arg Lys Val Val 
        115                 120                 125             
Pro Cys Tyr Ser Lys Thr Gln Lys Leu Ser Lys Ile Glu Thr Leu Arg 
    130                 135                 140                 
Leu Ala Lys Asn Tyr Ile Trp Ala Leu Ser Glu Ile Leu Arg Ser Gly 
145                 150                 155                 160 
Lys Ser Pro Asp Leu Val Ser Phe Val Gln Thr Leu Cys Lys Gly Leu 
                165                 170                 175     
Ser Gln Pro Thr Thr Asn Leu Val Ala Gly Cys Leu Gln Leu Asn Pro 
            180                 185                 190         
Arg Thr Phe Leu Pro Glu Gln Asn Gln Asp Met Pro Pro His Leu Pro 
        195                 200                 205             
Thr Ala Ser Ala Ser Phe Pro Val His Pro Tyr Ser Tyr Gln Ser Pro 
    210                 215                 220                 
Gly Leu Pro Ser Pro Pro Tyr Gly Thr Met Asp Ser Ser His Val Phe 
225                 230                 235                 240 
His Val Lys Pro Pro Pro His Ala Tyr Ser Ala Ala Leu Glu Pro Phe 
                245                 250                 255     
Phe Glu Ser Pro Leu Thr Asp Cys Thr Ser Pro Ser Phe Asp Gly Pro 
            260                 265                 270         
Leu Ser Pro Pro Leu Ser Ile Asn Gly Asn Phe Ser Phe Lys His Glu 
        275                 280                 285             
Pro Ser Ala Glu Phe Glu Lys Asn Tyr Ala Phe Thr Met His Tyr Pro 
    290                 295                 300                 
Ala Ala Thr Leu Ala Gly Ala Gln Ser His Gly Ser Ile Phe Ser Gly 
305                 310                 315                 320 
Thr Ala Ala Pro Arg Cys Glu Ile Pro Ile Asp Asn Ile Met Ser Phe 
                325                 330                 335     
Asp Ser His Ser His His Glu Arg Val Met Ser Ala Gln Leu Asn Ala 
            340                 345                 350         
Ile Phe His Asp 
        355     

<210> 6
<211> 297
<212> PRT
<213> Homo sapiens


<220> 
<223> human OTX2 Isoform A, NCBI Reference Sequence: NP_068374.1

<400> 6
Met Met Ser Tyr Leu Lys Gln Pro Pro Tyr Ala Val Asn Gly Leu Ser 
1               5                   10                  15      
Leu Thr Thr Ser Gly Met Asp Leu Leu His Pro Ser Val Gly Tyr Pro 
            20                  25                  30          
Gly Pro Trp Ala Ser Cys Pro Ala Ala Thr Pro Arg Lys Gln Arg Arg 
        35                  40                  45              
Glu Arg Thr Thr Phe Thr Arg Ala Gln Leu Asp Val Leu Glu Ala Leu 
    50                  55                  60                  
Phe Ala Lys Thr Arg Tyr Pro Asp Ile Phe Met Arg Glu Glu Val Ala 
65                  70                  75                  80  
Leu Lys Ile Asn Leu Pro Glu Ser Arg Val Gln Val Trp Phe Lys Asn 
                85                  90                  95      
Arg Arg Ala Lys Cys Arg Gln Gln Gln Gln Gln Gln Gln Asn Gly Gly 
            100                 105                 110         
Gln Asn Lys Val Arg Pro Ala Lys Lys Lys Thr Ser Pro Ala Arg Glu 
        115                 120                 125             
Val Ser Ser Glu Ser Gly Thr Ser Gly Gln Phe Thr Pro Pro Ser Ser 
    130                 135                 140                 
Thr Ser Val Pro Thr Ile Ala Ser Ser Ser Ala Pro Val Ser Ile Trp 
145                 150                 155                 160 
Ser Pro Ala Ser Ile Ser Pro Leu Ser Asp Pro Leu Ser Thr Ser Ser 
                165                 170                 175     
Ser Cys Met Gln Arg Ser Tyr Pro Met Thr Tyr Thr Gln Ala Ser Gly 
            180                 185                 190         
Tyr Ser Gln Gly Tyr Ala Gly Ser Thr Ser Tyr Phe Gly Gly Met Asp 
        195                 200                 205             
Cys Gly Ser Tyr Leu Thr Pro Met His His Gln Leu Pro Gly Pro Gly 
    210                 215                 220                 
Ala Thr Leu Ser Pro Met Gly Thr Asn Ala Val Thr Ser His Leu Asn 
225                 230                 235                 240 
Gln Ser Pro Ala Ser Leu Ser Thr Gln Gly Tyr Gly Ala Ser Ser Leu 
                245                 250                 255     
Gly Phe Asn Ser Thr Thr Asp Cys Leu Asp Tyr Lys Asp Gln Thr Ala 
            260                 265                 270         
Ser Trp Lys Leu Asn Phe Asn Ala Asp Cys Leu Asp Tyr Lys Asp Gln 
        275                 280                 285             
Thr Ser Ser Trp Lys Phe Gln Val Leu 
    290                 295         

<210> 7
<211> 289
<212> PRT
<213> Homo sapiens


<220> 
<223> human OTX2 Isoform B, NCBI Reference Sequence: NP_001257453.1

<400> 7
Met Met Ser Tyr Leu Lys Gln Pro Pro Tyr Ala Val Asn Gly Leu Ser 
1               5                   10                  15      
Leu Thr Thr Ser Gly Met Asp Leu Leu His Pro Ser Val Gly Tyr Pro 
            20                  25                  30          
Ala Thr Pro Arg Lys Gln Arg Arg Glu Arg Thr Thr Phe Thr Arg Ala 
        35                  40                  45              
Gln Leu Asp Val Leu Glu Ala Leu Phe Ala Lys Thr Arg Tyr Pro Asp 
    50                  55                  60                  
Ile Phe Met Arg Glu Glu Val Ala Leu Lys Ile Asn Leu Pro Glu Ser 
65                  70                  75                  80  
Arg Val Gln Val Trp Phe Lys Asn Arg Arg Ala Lys Cys Arg Gln Gln 
                85                  90                  95      
Gln Gln Gln Gln Gln Asn Gly Gly Gln Asn Lys Val Arg Pro Ala Lys 
            100                 105                 110         
Lys Lys Thr Ser Pro Ala Arg Glu Val Ser Ser Glu Ser Gly Thr Ser 
        115                 120                 125             
Gly Gln Phe Thr Pro Pro Ser Ser Thr Ser Val Pro Thr Ile Ala Ser 
    130                 135                 140                 
Ser Ser Ala Pro Val Ser Ile Trp Ser Pro Ala Ser Ile Ser Pro Leu 
145                 150                 155                 160 
Ser Asp Pro Leu Ser Thr Ser Ser Ser Cys Met Gln Arg Ser Tyr Pro 
                165                 170                 175     
Met Thr Tyr Thr Gln Ala Ser Gly Tyr Ser Gln Gly Tyr Ala Gly Ser 
            180                 185                 190         
Thr Ser Tyr Phe Gly Gly Met Asp Cys Gly Ser Tyr Leu Thr Pro Met 
        195                 200                 205             
His His Gln Leu Pro Gly Pro Gly Ala Thr Leu Ser Pro Met Gly Thr 
    210                 215                 220                 
Asn Ala Val Thr Ser His Leu Asn Gln Ser Pro Ala Ser Leu Ser Thr 
225                 230                 235                 240 
Gln Gly Tyr Gly Ala Ser Ser Leu Gly Phe Asn Ser Thr Thr Asp Cys 
                245                 250                 255     
Leu Asp Tyr Lys Asp Gln Thr Ala Ser Trp Lys Leu Asn Phe Asn Ala 
            260                 265                 270         
Asp Cys Leu Asp Tyr Lys Asp Gln Thr Ser Ser Trp Lys Phe Gln Val 
        275                 280                 285             
Leu 
    

<210> 8
<211> 4587
<212> DNA
<213> Homo sapiens


<400> 8
atgttgccct gtaagaagag aagaactaca gtgacagagt ccctacagca taaaggcaat       60

caagaggaaa acaacgtaga cctagaatca gccgttaaac cagaatctga ccaggttaag      120

gacttgagtt cggtgtcact atcctgggat ccaagtcatg gcagagtagc tggcttcgaa      180

gtacagtctt tgcaggatgc aggaaatcag cttggtatgg aggatacatc tctgagctct      240

ggaatgctca cccagaacac aaatgtacca attctagaag gtgttgatgt ggccatctct      300

cagggaatca ccctaccttc cttggagtct tttcaccccc ttaatataca cattggtaaa      360

ggaaaactcc acgctactgg ctcaaagaga gggaaaaaaa tgacactcag gcctgggcca      420

gttacccaag aagacagatg tgatcatctt accctaaagg agcctttttc aggagagcct      480

agtgaagaag tcaaggaaga aggagggaaa cctcaaatga attctgaagg ggagatacct      540

tccctgccat caggcagcca atctgcaaaa ccagtaagcc agcccaggaa atcaacccag      600

ccagatgttt gtgcctctcc tcaagaaaag ccactcagga ctctgtttca ccaacctgag      660

gaagagatag aagatggtgg actcttcatt ccaatggaag aacaagacaa tgaagaaagt      720

gagaaaagga gaaaaaagaa aaagggtacc aagaggaaac gagatggaag gggtcaagaa      780

gggaccttgg catatgacct gaaactggat gacatgcttg accgtacctt ggaggatggt      840

gccaagcagc acaatctaac agcagtcaat gtccgaaaca tccttcatga agtaatcaca      900

aatgaacacg tggtagctat gatgaaagca gccatcagtg agacggaaga tatgccaatg      960

tttgagccta aaatgacacg ctctaaactg aaggaagtag tggaaaaagg agtggtaatt     1020

ccaacatgga atatttcacc aattaagaag gccaatgaaa ttaagcctcc tcagtttgtg     1080

gatatccacc ttgaagaaga tgattcctca gatgaagaat accagccgga tgatgaagaa     1140

gaagatgaaa ctgctgaaga gagcttattg gaaagtgatg ttgaaagcac tgcttcatct     1200

ccacgtgggg caaagaaatc cagattgagg cagtcttctg agatgactga aacagatgag     1260

gagagtggca tattatcaga ggctgagaaa gtcaccacac cagccatcag gcacatcagt     1320

gctgaggtag tgcccatggg gcccccgccc cctccaaagc cgaaacagac cagagatagt     1380

actttcatgg agaagttaca tgcggtagat gaggagctgg cttccagtcc agtctgcatg     1440

gattctttcc agcccatgga tgacagtctc attgcatttc gaacgcgttc taagatgccc     1500

ctgaaagatg ttcccctggg ccaattagag gcagagctcc aagctccaga catcactcca     1560

gatatgtatg accccaatac ggcagatgat gaggactgga agatgtggct ggggggactt     1620

atgaatgatg atgtggggaa tgaagatgaa gcagatgatg atgatgatcc agaatataat     1680

ttcctggaag acctcgatga accagacaca gaggatttcc ggactgaccg ggcagtgaga     1740

atcaccaaaa aggaagtaaa tgagctgatg gaagagctgt ttgaaacttt ccaagatgag     1800

atgggattct ccaacatgga agatgatggc ccagaagagg aggagtgtgt agctgagcct     1860

cgtcctaact ttaacacccc tcaagctcta cggtttgagg aaccactggc caacctgtta     1920

aatgaacaac atcggacagt gaaggagcta tttgaacagc tgaagatgaa gaaatcttca     1980

gccaaacagc tgcaggaagt agagaaggtt aaaccccaga gtgagaaagt tcatcagact     2040

ctgattctgg acccagcaca gaggaagaga ctccagcagc agatgcagca gcacgttcag     2100

ctcttgaccc aaatccacct tcttgccacc tgcaacccca acctcaatcc ggaggccact     2160

accaccagga tatttcttaa agagctggga acctttgctc aaagctccat cgcccttcac     2220

catcagtaca accccaagtt tcagaccctg ttccaaccct gtaacttgat gggagctatg     2280

cagctgattg aagacttcag cacacatgtc agcattgact gcagccctca taaaactgtc     2340

aagaagactg cgaatgaatt tccctgtttg ccaaagcaag tggcttggat tctggccaca     2400

agcaaggttt tcatgtatcc agagttactt ccagtgtgtt ccctgaaggc aaagaatccc     2460

caggataaga tcgtcttcac caaggctgag gacaatttgt tagctttagg actgaagcat     2520

tttgaaggaa ctgagtttcc taatcctcta atcagcaagt accttctaac ctgcaaaact     2580

gcccaccaac tgacagtgag aatcaagaac ctcaacatga acagagctcc tgacaacatc     2640

attaaatttt ataagaagac caaacagctg ccagtcctag gaaaatgctg tgaagagatc     2700

cagccacatc agtggaagcc acctatagag agagaagaac accggctccc attctggtta     2760

aaggccagtc tgccatccat ccaggaagaa ctgcggcaca tggctgatgg tgctagagag     2820

gtaggaaata tgactggaac cactgagatc aactcagatc gaagcctaga aaaagacaat     2880

ttggagttgg ggagtgaatc tcggtaccca ctgctattgc ctaagggtgt agtcctgaaa     2940

ctgaagccag ttgccacccg tttccccagg aaggcttgga gacagaagcg ttcatcagtc     3000

ctgaagcccc tccttatcca acccagcccc tctctccagc ccagcttcaa ccctgggaaa     3060

acaccagccc gatcaactca ttcagaagcc cctccgagca aaatggtgct ccggattcct     3120

cacccaatac agccagccac tgttttacag acagttccag gtgtccctcc actgggggtc     3180

agtggaggtg agagttttga gtctcctgca gcactgcctg ctgtgccccc tgaggccagg     3240

acaagcttcc ctctgtctga gtcccagact ttgctctctt ctgcccctgt gcccaaggta     3300

atgctgccct cccttgcccc ttctaagttt cgaaagccat atgtgagacg gagaccctca     3360

aagagaagag gagtcaaggc ctctccctgt atgaaacctg cccctgttat ccaccaccct     3420

gcatctgtta tcttcactgt tcctgctacc actgtgaaga ttgtgagcct tggcggtggc     3480

tgtaacatga tccagcctgt caatgcggct gtggcccaga gtccccagac tattcccatc     3540

actaccctct tggttaaccc tacttccttc ccctgtccat tgaaccagtc ccttgtggcc     3600

tcctctgtct cacccttaat tgtttctggc aattctgtga atcttcctat accatccacc     3660

cctgaagata aggcccacgt gaatgtggac attgcttgtg ctgtggctga tggggaaaat     3720

gcctttcagg gcctagaacc caaattagag ccccaggaac tatctcctct ctctgctact     3780

gttttcccga aagtggaaca tagcccaggg cctccactag cagatgcaga gtgccaagaa     3840

ggattgtcag agaatagtgc ctgtcgctgg accgttgtga aaacagagga ggggaggcaa     3900

gctctggagc cgctccctca gggcatccag gagtctctaa acaaccctac ccctggggat     3960

ttagaggaaa ttgtcaagat ggaacctgaa gaagctagag aggaaatcag tggatcccct     4020

gagcgtgata tttgtgatga catcaaagtg gaacatgctg tggaattgga cactggtgcc     4080

ccaagcgagg agttgagcag tgctggagaa gtaacgaaac agacagtctt acagaaggaa     4140

gaggagagga gtcagccaac taaaacccct tcatcttctc aagagccccc tgatgaagga     4200

acctcaggga cagatgtgaa caaaggatca tcaaagaatg ctttgtcctc aatggatcct     4260

gaagtgaggc ttagtagccc cccagggaag ccagaagatt catccagtgt tgatggtcag     4320

tcagtgggga ctccagttgg gccagaaact ggaggagaga agaatgggcc agaagaagag     4380

gaagaagagg actttgatga cctcacccaa gatgaggaag atgaaatgtc atcagcttct     4440

gaggaatctg tgctttctgt cccagaactc caggtgagag ctggagaata ttctcaagta     4500

tttcgtggac tcagtaatat gtatcactta ttgatatgcc acctgcttgc ttgctgcact     4560

atggatagtc ctaaaatcat ttgtatt                                         4587


<210> 9
<211> 4641
<212> DNA
<213> Homo sapiens


<400> 9
atgttgccct gtaagaagag aagaactaca gtgacagagt ccctacagca taaaggcaat       60

caagaggaaa acaacgtaga cctagaatca gccgttaaac cagaatctga ccaggttaag      120

gacttgagtt cggtgtcact atcctgggat ccaagtcatg gcagagtagc tggcttcgaa      180

gtacagtctt tgcaggatgc aggaaatcag cttggtatgg aggatacatc tctgagctct      240

ggaatgctca cccagaacac aaatgtacca attctagaag gtgttgatgt ggccatctct      300

cagggaatca ccctaccttc cttggagtct tttcaccccc ttaatataca cattggtaaa      360

ggaaaactcc acgctactgg ctcaaagaga gggaaaaaaa tgacactcag gcctgggcca      420

gttacccaag aagacagatg tgatcatctt accctaaagg agcctttttc aggagagcct      480

agtgaagaag tcaaggaaga aggagggaaa cctcaaatga attctgaagg ggagatacct      540

tccctgccat caggcagcca atctgcaaaa ccagtaagcc agcccaggaa atcaacccag      600

ccagatgttt gtgcctctcc tcaagaaaag ccactcagga ctctgtttca ccaacctgag      660

gaagagatag aagatggtgg actcttcatt ccaatggaag aacaagacaa tgaagaaagt      720

gagaaaagga gaaaaaagaa aaagggtacc aagaggaaac gagatggaag gggtcaagaa      780

gggaccttgg catatgacct gaaactggat gacatgcttg accgtacctt ggaggatggt      840

gccaagcagc acaatctaac agcagtcaat gtccgaaaca tccttcatga agtaatcaca      900

aatgaacacg tggtagctat gatgaaagca gccatcagtg agacggaaga tatgccaatg      960

tttgagccta aaatgacacg ctctaaactg aaggaagtag tggaaaaagg agtggtaatt     1020

ccaacatgga atatttcacc aattaagaag gccaatgaaa ttaagcctcc tcagtttgtg     1080

gatatccacc ttgaagaaga tgattcctca gatgaagaat accagccgga tgatgaagaa     1140

gaagatgaaa ctgctgaaga gagcttattg gaaagtgatg ttgaaagcac tgcttcatct     1200

ccacgtgggg caaagaaatc cagattgagg cagtcttctg agatgactga aacagatgag     1260

gagagtggca tattatcaga ggctgagaaa gtcaccacac cagccatcag gcacatcagt     1320

gctgaggtag tgcccatggg gcccccgccc cctccaaagc cgaaacagac cagagatagt     1380

actttcatgg agaagttaca tgcggtagat gaggagctgg cttccagtcc agtctgcatg     1440

gattctttcc agcccatgga tgacagtctc attgcatttc gaacgcgttc taagatgccc     1500

ctgaaagatg ttcccctggg ccaattagag gcagagctcc aagctccaga catcactcca     1560

gatatgtatg accccaatac ggcagatgat gaggactgga agatgtggct ggggggactt     1620

atgaatgatg atgtggggaa tgaagatgaa gcagatgatg atgatgatcc agaatataat     1680

ttcctggaag acctcgatga accagacaca gaggatttcc ggactgaccg ggcagtgaga     1740

atcaccaaaa aggaagtaaa tgagctgatg gaagagctgt ttgaaacttt ccaagatgag     1800

atgggattct ccaacatgga agatgatggc ccagaagagg aggagtgtgt agctgagcct     1860

cgtcctaact ttaacacccc tcaagctcta cggtttgagg aaccactggc caacctgtta     1920

aatgaacaac atcggacagt gaaggagcta tttgaacagc tgaagatgaa gaaatcttca     1980

gccaaacagc tgcaggaagt agagaaggtt aaaccccaga gtgagaaagt tcatcagact     2040

ctgattctgg acccagcaca gaggaagaga ctccagcagc agatgcagca gcacgttcag     2100

ctcttgaccc aaatccacct tcttgccacc tgcaacccca acctcaatcc ggaggccact     2160

accaccagga tatttcttaa agagctggga acctttgctc aaagctccat cgcccttcac     2220

catcagtaca accccaagtt tcagaccctg ttccaaccct gtaacttgat gggagctatg     2280

cagctgattg aagacttcag cacacatgtc agcattgact gcagccctca taaaactgtc     2340

aagaagactg cgaatgaatt tccctgtttg ccaaagcaag tggcttggat tctggccaca     2400

agcaaggttt tcatgtatcc agagttactt ccagtgtgtt ccctgaaggc aaagaatccc     2460

caggataaga tcgtcttcac caaggctgag gacaatttgt tagctttagg actgaagcat     2520

tttgaaggaa ctgagtttcc taatcctcta atcagcaagt accttctaac ctgcaaaact     2580

gcccaccaac tgacagtgag aatcaagaac ctcaacatga acagagctcc tgacaacatc     2640

attaaatttt ataagaagac caaacagctg ccagtcctag gaaaatgctg tgaagagatc     2700

cagccacatc agtggaagcc acctatagag agagaagaac accggctccc attctggtta     2760

aaggccagtc tgccatccat ccaggaagaa ctgcggcaca tggctgatgg tgctagagag     2820

gtaggaaata tgactggaac cactgagatc aactcagatc gaagcctaga aaaagacaat     2880

ttggagttgg ggagtgaatc tcggtaccca ctgctattgc ctaagggtgt agtcctgaaa     2940

ctgaagccag ttgccacccg tttccccagg aaggcttgga gacagaagcg ttcatcagtc     3000

ctgaagcccc tccttatcca acccagcccc tctctccagc ccagcttcaa ccctgggaaa     3060

acaccagccc gatcaactca ttcagaagcc cctccgagca aaatggtgct ccggattcct     3120

cacccaatac agccagccac tgttttacag acagttccag gtgtccctcc actgggggtc     3180

agtggaggtg agagttttga gtctcctgca gcactgcctg ctgtgccccc tgaggccagg     3240

acaagcttcc ctctgtctga gtcccagact ttgctctctt ctgcccctgt gcccaaggta     3300

atgctgccct cccttgcccc ttctaagttt cgaaagccat atgtgagacg gagaccctca     3360

aagagaagag gagtcaaggc ctctccctgt atgaaacctg cccctgttat ccaccaccct     3420

gcatctgtta tcttcactgt tcctgctacc actgtgaaga ttgtgagcct tggcggtggc     3480

tgtaacatga tccagcctgt caatgcggct gtggcccaga gtccccagac tattcccatc     3540

actaccctct tggttaaccc tacttccttc ccctgtccat tgaaccagtc ccttgtggcc     3600

tcctctgtct cacccttaat tgtttctggc aattctgtga atcttcctat accatccacc     3660

cctgaagata aggcccacgt gaatgtggac attgcttgtg ctgtggctga tggggaaaat     3720

gcctttcagg gcctagaacc caaattagag ccccaggaac tatctcctct ctctgctact     3780

gttttcccga aagtggaaca tagcccaggg cctccactag cagatgcaga gtgccaagaa     3840

ggattgtcag agaatagtgc ctgtcgctgg accgttgtga aaacagagga ggggaggcaa     3900

gctctggagc cgctccctca gggcatccag gagtctctaa acaaccctac ccctggggat     3960

ttagaggaaa ttgtcaagat ggaacctgaa gaagctagag aggaaatcag tggatcccct     4020

gagcgtgata tttgtgatga catcaaagtg gaacatgctg tggaattgga cactggtgcc     4080

ccaagcgagg agttgagcag tgctggagaa gtaacgaaac agacagtctt acagaaggaa     4140

gaggagagga gtcagccaac taaaacccct tcatcttctc aagagccccc tgatgaagga     4200

acctcaggga cagatgtgaa caaaggatca tcaaagaatg ctttgtcctc aatggatcct     4260

gaagtgaggc ttagtagccc cccagggaag ccagaagatt catccagtgt tgatggtcag     4320

tcagtgggga ctccagttgg gccagaaact ggaggagaga agaatgggcc agaagaagag     4380

gaagaagagg actttgatga cctcacccaa gatgaggaag atgaaatgtc atcagcttct     4440

gaggaatctg tgctttctgt cccagaactc caggtgagag ctggagaata ttctcaagta     4500

tttcgtggac tcagtaatat gtatcactta ttgatatgcc acctgcttgc ttgctgcact     4560

atggatagtc ctaaaatcat ttgtattctc gagggtaagc ctatccctaa ccctctcctc     4620

ggtctcgatt ctacgtaatg a                                               4641


<210> 10
<211> 1068
<212> DNA
<213> Homo sapiens


<400> 10
atgaccaaat cgtacagcga gagtgggctg atgggcgagc ctcagcccca aggtcctcca      60

agctggacag acgagtgtct cagttctcag gacgaggagc acgaggcaga caagaaggag     120

gacgacctcg aagccatgaa cgcagaggag gactcactga ggaacggggg agaggaggag     180

gacgaagatg aggacctgga agaggaggaa gaagaggaag aggaggatga cgatcaaaag     240

cccaagagac gcggccccaa aaagaagaag atgactaagg ctcgcctgga gcgttttaaa     300

ttgagacgca tgaaggctaa cgcccgggag cggaaccgca tgcacggact gaacgcggcg     360

ctagacaacc tgcgcaaggt ggtgccttgc tattctaaga cgcagaagct gtccaaaatc     420

gagactctgc gcttggccaa gaactacatc tgggctctgt cggagatcct gcgctcaggc     480

aaaagcccag acctggtctc cttcgttcag acgctttgca agggcttatc ccaacccacc     540

accaacctgg ttgcgggctg cctgcaactc aatcctcgga cttttctgcc tgagcagaac     600

caggacatgc ccccccacct gccgacggcc agcgcttcct tccctgtaca cccctactcc     660

taccagtcgc ctgggctgcc cagtccgcct tacggtacca tggacagctc ccatgtcttc     720

cacgttaagc ctccgccgca cgcctacagc gcagcgctgg agcccttctt tgaaagccct     780

ctgactgatt gcaccagccc ttcctttgat ggacccctca gcccgccgct cagcatcaat     840

ggcaacttct ctttcaaaca cgaaccgtcc gccgagtttg agaaaaatta tgcctttacc     900

atgcactatc ctgcagcgac actggcaggg gcccaaagcc acggatcaat cttctcaggc     960

accgctgccc ctcgctgcga gatccccata gacaatatta tgtccttcga tagccattca    1020

catcatgagc gagtcatgag tgcccagctc aatgccatat ttcatgat                 1068


<210> 11
<211> 1122
<212> DNA
<213> Homo sapiens


<400> 11
atgaccaaat cgtacagcga gagtgggctg atgggcgagc ctcagcccca aggtcctcca      60

agctggacag acgagtgtct cagttctcag gacgaggagc acgaggcaga caagaaggag     120

gacgacctcg aagccatgaa cgcagaggag gactcactga ggaacggggg agaggaggag     180

gacgaagatg aggacctgga agaggaggaa gaagaggaag aggaggatga cgatcaaaag     240

cccaagagac gcggccccaa aaagaagaag atgactaagg ctcgcctgga gcgttttaaa     300

ttgagacgca tgaaggctaa cgcccgggag cggaaccgca tgcacggact gaacgcggcg     360

ctagacaacc tgcgcaaggt ggtgccttgc tattctaaga cgcagaagct gtccaaaatc     420

gagactctgc gcttggccaa gaactacatc tgggctctgt cggagatcct gcgctcaggc     480

aaaagcccag acctggtctc cttcgttcag acgctttgca agggcttatc ccaacccacc     540

accaacctgg ttgcgggctg cctgcaactc aatcctcgga cttttctgcc tgagcagaac     600

caggacatgc ccccccacct gccgacggcc agcgcttcct tccctgtaca cccctactcc     660

taccagtcgc ctgggctgcc cagtccgcct tacggtacca tggacagctc ccatgtcttc     720

cacgttaagc ctccgccgca cgcctacagc gcagcgctgg agcccttctt tgaaagccct     780

ctgactgatt gcaccagccc ttcctttgat ggacccctca gcccgccgct cagcatcaat     840

ggcaacttct ctttcaaaca cgaaccgtcc gccgagtttg agaaaaatta tgcctttacc     900

atgcactatc ctgcagcgac actggcaggg gcccaaagcc acggatcaat cttctcaggc     960

accgctgccc ctcgctgcga gatccccata gacaatatta tgtccttcga tagccattca    1020

catcatgagc gagtcatgag tgcccagctc aatgccatat ttcatgatct cgagggtaag    1080

cctatcccta accctctcct cggtctcgat tctacgtaat ga                       1122


<210> 12
<211> 891
<212> DNA
<213> Homo sapiens


<400> 12
atgatgtctt atcttaagca accgccttac gcagtcaatg ggctgagtct gaccacttcg      60

ggtatggact tgctgcaccc ctccgtgggc tacccggggc cctgggcttc ttgtcccgca     120

gccacccccc ggaaacagcg ccgggagagg acgacgttca ctcgggcgca gctagatgtg     180

ctggaagcac tgtttgccaa gacccggtac ccagacatct tcatgcgaga ggaggtggca     240

ctgaaaatca acttgcccga gtcgagggtg caggtatggt ttaagaatcg aagagctaag     300

tgccgccaac aacagcaaca acagcagaat ggaggtcaaa acaaagtgag acctgccaaa     360

aagaagacat ctccagctcg ggaagtgagt tcagagagtg gaacaagtgg ccaattcact     420

cccccctcta gcacctcagt cccgaccatt gccagcagca gtgctcctgt gtctatctgg     480

agcccagctt ccatctcccc actgtcagat cccttgtcca cctcctcttc ctgcatgcag     540

aggtcctatc ccatgaccta tactcaggct tcaggttata gtcaaggata tgctggctca     600

acttcctact ttgggggcat ggactgtgga tcatatttga cccctatgca tcaccagctt     660

cccggaccag gggccacact cagtcccatg ggtaccaatg cagtcaccag ccatctcaat     720

cagtccccag cttctctttc cacccaggga tatggagctt caagcttggg ttttaactca     780

accactgatt gcttggatta taaggaccaa actgcctcct ggaagcttaa cttcaatgct     840

gactgcttgg attataaaga tcagacatcc tcgtggaaat tccaggtttt g              891


<210> 13
<211> 945
<212> DNA
<213> Homo sapiens


<400> 13
atgatgtctt atcttaagca accgccttac gcagtcaatg ggctgagtct gaccacttcg      60

ggtatggact tgctgcaccc ctccgtgggc tacccggggc cctgggcttc ttgtcccgca     120

gccacccccc ggaaacagcg ccgggagagg acgacgttca ctcgggcgca gctagatgtg     180

ctggaagcac tgtttgccaa gacccggtac ccagacatct tcatgcgaga ggaggtggca     240

ctgaaaatca acttgcccga gtcgagggtg caggtatggt ttaagaatcg aagagctaag     300

tgccgccaac aacagcaaca acagcagaat ggaggtcaaa acaaagtgag acctgccaaa     360

aagaagacat ctccagctcg ggaagtgagt tcagagagtg gaacaagtgg ccaattcact     420

cccccctcta gcacctcagt cccgaccatt gccagcagca gtgctcctgt gtctatctgg     480

agcccagctt ccatctcccc actgtcagat cccttgtcca cctcctcttc ctgcatgcag     540

aggtcctatc ccatgaccta tactcaggct tcaggttata gtcaaggata tgctggctca     600

acttcctact ttgggggcat ggactgtgga tcatatttga cccctatgca tcaccagctt     660

cccggaccag gggccacact cagtcccatg ggtaccaatg cagtcaccag ccatctcaat     720

cagtccccag cttctctttc cacccaggga tatggagctt caagcttggg ttttaactca     780

accactgatt gcttggatta taaggaccaa actgcctcct ggaagcttaa cttcaatgct     840

gactgcttgg attataaaga tcagacatcc tcgtggaaat tccaggtttt gctcgagggt     900

aagcctatcc ctaaccctct cctcggtctc gattctacgt aatga                     945


