                         SEQUENCE LISTING

<110>  The United States of America, as Represented by the 
       Secretary, Department of Health and Human Services
       Bar, Daniel Z
       Collins, Francis S
 
<120>  DETECTING PROTEIN INTERACTION SITES IN NUCLEIC ACID

<130>  E-031-2018 077867-1116029

<150>  US/62/615,233
<151>  2018-01-09

<160>  2     

<170>  PatentIn version 3.5

<210>  1
<211>  609
<212>  PRT
<213>  artificial sequence

<220>
<223>  synthetic construct

<400>  1

Met Thr Leu Lys Ala Leu Glu Leu Phe Ala Gly Ile Ala Gly Ile Thr 
1               5                   10                  15      


His Gly Leu Arg Gly Phe Val Glu Pro Val Ala Phe Val Glu Ile Asn 
            20                  25                  30          


Lys Asp Ala Gln Glu Phe Leu Ser Thr Lys Phe Pro Asp Lys Pro Val 
        35                  40                  45              


Phe Asp Asp Val Thr Lys Phe Ser Lys Arg Asp Phe Asp Glu Pro Ile 
    50                  55                  60                  


Asp Met Ile Thr Gly Gly Phe Pro Cys Thr Gly Phe Ser Ile Ala Gly 
65                  70                  75                  80  


Lys Arg Asn Gly Phe Glu His Ala Glu Ser Gly Leu Phe Gly Glu Val 
                85                  90                  95      


Val Arg Ile Thr Lys Glu Tyr Met Pro Lys Met Val Phe Leu Glu Asn 
            100                 105                 110         


Ser Gly Met Leu Ser His Lys Tyr Asn Leu Asp Ile Val Ile Arg Ser 
        115                 120                 125             


Met Asp Ser Leu Gly Tyr Asp Cys Arg Trp Val Thr Leu Arg Ala Thr 
    130                 135                 140                 


Val Val Gly Ala Leu His Thr Arg His Arg Trp Phe Cys Leu Cys Thr 
145                 150                 155                 160 


Arg Lys Asp His Ile Arg Glu Thr Leu Ile Cys Asp Arg Glu Val Thr 
                165                 170                 175     


Lys Phe Asp Trp Glu Asn Asp Arg Pro Pro Ile Gln Val Asp Ser Arg 
            180                 185                 190         


Ser Tyr Glu Asn Ser Arg Leu Val Arg Phe Ala Gly Tyr Ser Val Val 
        195                 200                 205             


Pro Asp Gln Ile Arg Tyr Ala Phe Thr Gly Leu Tyr Thr Gly Asn Phe 
    210                 215                 220                 


Ser Pro Ser Phe Ser Lys Thr Leu Val Pro Gly Ser Leu Glu Gly Ser 
225                 230                 235                 240 


Ile Cys Phe Asn Glu Asp Lys Ile Thr Asn Gly Tyr Tyr Lys Asp Gly 
                245                 250                 255     


Val Tyr Tyr Glu Phe Val Arg Thr Glu Thr His Arg Glu Pro Val Asn 
            260                 265                 270         


Ile Leu Leu Thr Pro Arg Glu Ile Pro Asn Lys His Asn Gly Lys Lys 
        275                 280                 285             


Leu Leu Thr Leu Pro Val Thr Lys Arg Tyr Trp Cys Thr Pro Cys Ala 
    290                 295                 300                 


Ser Tyr Gly Lys Gly Thr Ala Gly Gly Arg Val Leu Thr Asp Arg Ser 
305                 310                 315                 320 


Ser His Ser Leu Pro Thr Gln Val Lys Phe Ser Pro Glu Gly Glu Asp 
                325                 330                 335     


Gly Lys His Leu Ser Gly Lys Phe Cys Ala Trp Leu Met Gly Tyr Asp 
            340                 345                 350         


Lys Glu Tyr Leu Gly Asn Leu Leu Glu Tyr Gly Gly Ser Gly Gly Asp 
        355                 360                 365             


Tyr Lys Asp Asp Asp Asp Lys Glu Asn Leu Tyr Phe Gln Ser Gly Ser 
    370                 375                 380                 


Ala Gly Ser Ala Ala Gly Ser Gly Glu Phe Leu Pro Lys Thr Asp Thr 
385                 390                 395                 400 


Tyr Lys Leu Ile Leu Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr 
                405                 410                 415     


Glu Ala Val Asp Ala Ala Thr Ala Glu Lys Val Phe Lys Gln Tyr Ala 
            420                 425                 430         


Asn Asp Asn Gly Val Asp Gly Glu Trp Thr Tyr Asp Asp Ala Thr Lys 
        435                 440                 445             


Thr Phe Thr Val Thr Glu Lys Pro Glu Val Ile Asp Ala Ser Glu Leu 
    450                 455                 460                 


Thr Pro Ala Val Thr Thr Tyr Lys Leu Val Ile Asn Gly Lys Thr Leu 
465                 470                 475                 480 


Lys Gly Glu Thr Thr Thr Glu Ala Val Asp Ala Ala Thr Ala Glu Lys 
                485                 490                 495     


Val Phe Lys Gln Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu Trp Thr 
            500                 505                 510         


Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu Lys Pro Glu Val 
        515                 520                 525             


Ile Asp Ala Ser Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys Leu Val 
    530                 535                 540                 


Ile Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp 
545                 550                 555                 560 


Ala Glu Thr Ala Glu Lys Ala Phe Lys Gln Tyr Ala Asn Asp Asn Gly 
                565                 570                 575     


Val Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val 
            580                 585                 590         


Thr Glu Gly Gly Ser Gly Gly His His His His His His His His His 
        595                 600                 605             


His 
    


<210>  2
<211>  1830
<212>  DNA
<213>  artificial sequence

<220>
<223>  synthetic construct

<400>  2
atgacactta aagcgttgga gttatttgcc gggattgcgg gtatcacgca tgggctgcgc       60

ggttttgtcg agcctgtggc ctttgttgaa attaataagg acgcacaaga atttttatcg      120

accaagtttc cagataagcc cgtcttcgat gatgttacta agttttcgaa acgcgacttc      180

gatgaaccca tcgatatgat cacggggggg tttccatgca caggcttcag cattgcagga      240

aaacgtaatg ggtttgagca cgctgagtca ggtctgtttg gagaggtcgt ccgcattaca      300

aaagagtaca tgcctaagat ggtattcttg gaaaattcag gcatgttgag tcataagtat      360

aaccttgaca ttgtcatccg ctccatggat agcttggggt acgactgtcg ttgggttacg      420

cttcgcgcta ccgttgttgg agcccttcat acccgtcacc gctggttttg tttatgtact      480

cgcaaagacc acatccgtga aacattaatt tgtgatcgtg aagttaccaa gttcgattgg      540

gaaaatgatc gtccgcctat ccaagtagac tctcgctcgt acgaaaatag ccgcttggtc      600

cgcttcgctg gatatagtgt agttccagac cagattcgct acgcctttac cggtttatat      660

actggtaatt tctcgccatc gttctcgaag acattggtgc ctgggagcct ggaaggcagt      720

atttgtttta acgaagacaa aattactaat ggttactaca aggatggggt gtactatgag      780

tttgttcgca cagagactca tcgtgaacct gttaatattc ttttgacacc tcgtgagatc      840

cctaataaac ataacggtaa gaagctgctt accctgcccg tgaccaaacg ttattggtgt      900

acgccttgcg ccagttacgg caaagggacg gcaggtggtc gtgtcttgac ggaccgcagt      960

agccacagtt tacctaccca agttaagttt agcccggagg gggaagatgg aaagcacttg     1020

agcggtaagt tctgcgcatg gttaatgggt tacgataagg agtacttggg aaatctgtta     1080

gagtacggtg gatcgggagg ggattataaa gatgacgacg acaaggagaa cttgtatttc     1140

cagtcagggt cggcagggtc ggctgccggg tcaggagagt tccttccgaa gactgacacg     1200

tataaattaa ttttgaatgg caagactctg aagggtgaaa ctacaaccga agcggttgat     1260

gcggccacag cagagaaagt gtttaagcaa tacgccaatg ataacggagt agatggtgaa     1320

tggacgtatg atgacgcgac taaaaccttc acagttacgg agaaacccga agtaattgac     1380

gcatctgaat tgaccccagc agtaacgacg tataagcttg tgattaatgg taagacactt     1440

aaaggcgaaa caaccactga agcggtagat gccgcgacag ctgaaaaggt gtttaaacag     1500

tacgcaaatg acaacggcgt cgatggtgaa tggacttacg acgatgcaac taaaactttc     1560

acggtcacag aaaagcctga ggtaatcgat gcctccgaac ttacccccgc ggtgacgaca     1620

tacaaattgg ttatcaatgg caaaacactt aaaggggaga caactacaaa ggctgtggat     1680

gcggagacgg cggaaaaggc atttaaacag tatgcaaatg acaatggtgt agacggtgtt     1740

tggacttatg acgatgcaac caaaaccttt accgttaccg aaggaggcag tggaggtcac     1800

catcaccacc atcatcacca tcaccactga                                      1830


