                         SEQUENCE LISTING

<110>  UNIVERSITY OF SOUTHERN CALIFORNIA
 
<120>  GENOME ENGINEERING THE HUMAN IMMUNOGLOBULIN LOCUS TO EXPRESS 
       RECOMBINANT BINDING DOMAIN MOLECULES

<130>  00130-034WO1

<140>  Not yet assigned
<141>  2021-01-28

<150>  US 62/967,018
<151>  2020-01-28

<160>  33    

<170>  PatentIn version 3.5

<210>  1
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PAM sequence


<220>
<221>  misc_feature
<222>  (20)..(20)
<223>  N can be any nucleotide

<220>
<221>  misc_feature
<222>  (21)..(21)
<223>  n is a, c, g, or t

<400>  1
tccgggtgaa gaggcagacg ngg                                               23


<210>  2
<211>  17
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Artificial Guide Sequence

<400>  2
tccgggtgaa gaggcag                                                      17


<210>  3
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IGHG1 genomic locus partial sequence-Insert sequence

<400>  3
ttctggcttt ttccccaggc tctgggcagg cacaggctag gtgcccctaa atcgatgggg       60

ttggggtt                                                                68


<210>  4
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IGHG1 genomic locus partial sequence-Insert sequence

<400>  4
cgcatcccgg ctatgcagcc ccagtccagg gcagcaaggc aggccccgtc atcgatgggg       60

ttggggtt                                                                68


<210>  5
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IGHG1 genomic locus partial sequence-Insert sequence

<400>  5
acgcatcccg gctatgcagc cccagtccag ggcagcaagg caggccccgt atcgatgggg       60

ttggggtt                                                                68


<210>  6
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IGHG1 genomic locus partial sequence-Insert sequence

<400>  6
aggccaaact ctccactccc tcagctcgga caccttctct cctcccagat atcgatgggg       60

ttggggtt                                                                68


<210>  7
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IGHG1 genomic locus partial sequence-Insert sequence

<400>  7
ggctatgcag ccccagtcca gggcagcaag gcaggccccg tctgcctctt atcgatgggg       60

ttggggtt                                                                68


<210>  8
<211>  75
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IGHG1 genomic locus partial sequence-Insert sequence

<400>  8
gcagccccag tccagggcag caaggcaggc cccgtatcga ttaaaccggt gagtttcatg       60

gttacttgcc tgaga                                                        75


<210>  9
<211>  65
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IGHG1 genomic locus partial sequence-Insert sequence

<400>  9
gcagccccag tccagggcag caaggcaggc cccgtatcga tggggttggg gttgcgcctt       60

ttcca                                                                   65


<210>  10
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CDR3 - AID motif


<220>
<221>  misc_feature
<222>  (14)..(14)
<223>  N can be A, G or C

<220>
<221>  misc_feature
<222>  (26)..(26)
<223>  N can be A, G or C

<220>
<221>  misc_feature
<222>  (30)..(30)
<223>  N can be C or T

<220>
<221>  misc_feature
<222>  (35)..(35)
<223>  N can be A, G or C

<220>
<221>  misc_feature
<222>  (41)..(41)
<223>  N can be A, G or C

<400>  10
gccaggagca agancaccta catcanctan aacancaacg nctacgacta c                51


<210>  11
<211>  17
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Exemplary hypermutation sites


<220>
<221>  MISC_FEATURE
<222>  (3)..(3)
<223>  X can be S or N

<220>
<221>  MISC_FEATURE
<222>  (5)..(5)
<223>  X can be S, T or N

<220>
<221>  MISC_FEATURE
<222>  (9)..(9)
<223>  X can be S, T or N

<220>
<221>  MISC_FEATURE
<222>  (12)..(12)
<223>  X can be S, T or N

<220>
<221>  MISC_FEATURE
<222>  (14)..(14)
<223>  X can be G, D or A

<400>  11

Ala Arg Xaa Lys Xaa Thr Tyr Ile Xaa Tyr Asn Xaa Asn Xaa Tyr Asp 
1               5                   10                  15      


Tyr 
    


<210>  12
<211>  1364
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Antigen Recognition Cassette

<400>  12
taaaccggtg agtttcatgg ttacttgcct gagaagatta aaaaaagtaa tgctacctta       60

tgagggagag tcccagggac caagatagca actgtcatag caaccgtcac actgctttgg      120

tcaaggagaa gaccctttgg ggaactgaaa acagaacctt gagcacatct gttgctttcg      180

ctcccatcct cctccaacag ggctgggtgg agcactccac accctttcac cggtcgtacg      240

gctcagccag agtaaaaatc acacccatga cctggccact gagggcttga tcaattcact      300

ttgaatttgg cattaaatac cattaaggta tattaactga ttttaaaata agatatattc      360

gtgaccatgt ttttaacttt caaaaatgta gctgccagtg tgtgatttta tttcagttgt      420

acaaaatatc taaacctata gcaatgtgat taataaaaac ttaaacatat tttccagtac      480

cttaattctg tgataggaaa attttaatct gagtatttta atttcataat ctctaaaata      540

gtttaatgat ttgtcattgt gttgctgtcg tttaccccag ctgatctcaa aagtgatatt      600

taaggagatt attttggtct gcaacaactt gatagggctc agcctctccc acccaacggg      660

tggaatcccc cagaggggga tttccaagag gccacctggc agttgctgag ggtcagaagt      720

gaagctagcc acttcctctt aggcaggtgg ccaagattac agttgacccg tacgtgcagc      780

tgtgcccagc ctgccccatc ccctgctcat ttgcatgttc ccagagcaca acctcctgcc      840

ctgaagcctt attaataggc tggtcacact ttgtgcagga gtcagactca gtcaggacac      900

agctggatcc actagtccag tgtggtggaa ttcaccatgg agtttgggct gagctggctt      960

tttcttgtgg ctattttaaa aggtgtccag tgtgaggtgc agctggtgga gtctggggga     1020

ggcttggtac aggccggggg gttcctgaga ctctcctgtg agctgagggg aagcatcttt     1080

aaccagtatg ccatggcctg gttccgccag gctccaggga aggagaggga gttcgtcgcc     1140

ggcatgggcg ccgtgcccca ctacggcgag ttcgtgaagg gccggttcac catctccaga     1200

gacaatgcca agagcacggt gtatctgcaa atgagcagcc tgaagcccga ggacacggcc     1260

atctatttct gtgccaggag caagagcacc tacatcagct acaacagcaa cggctacgac     1320

tactggggca ggggaaccca ggtcaccgtc tcctcaggtg agag                      1364


<210>  13
<211>  2373
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Antigen Recognition Cassette with Homology Arms

<400>  13
gaacctcgcg gacagttaag aacccagggg cctctgcgcc ctgggcccag ctctgtccca       60

caccgcggtc acatggcacc acctctcttg cagcctccac caagggccca tcggtcttcc      120

ccctggcacc ctcctccaag agcacctctg ggggcacagc ggccctgggc tgcctggtca      180

aggactactt ccccgaaccg gtgacggtgt cgtggaactc aggcgccctg accagcggcg      240

tgcacacctt cccggctgtc ctacagtcct caggactcta ctccctcagc agcgtggtga      300

ccgtgccctc cagcagcttg ggcacccaga cctacatctg caacgtgaat cacaagccca      360

gcaacaccaa ggtggacaag aaagttggtg agaggccagc acagggaggg agggtgtctg      420

ctggaagcca ggctcagcgc tcctgcctgg acgcatcccg gctatgcagc cccagtccag      480

ggcagcaagg caggccccgt atcgattaaa ccggtgagtt tcatggttac ttgcctgaga      540

agattaaaaa aagtaatgct accttatgag ggagagtccc agggaccaag atagcaactg      600

tcatagcaac cgtcacactg ctttggtcaa ggagaagacc ctttggggaa ctgaaaacag      660

aaccttgagc acatctgttg ctttcgctcc catcctcctc caacagggct gggtggagca      720

ctccacaccc tttcaccggt cgtacggctc agccagagta aaaatcacac ccatgacctg      780

gccactgagg gcttgatcaa ttcactttga atttggcatt aaataccatt aaggtatatt      840

aactgatttt aaaataagat atattcgtga ccatgttttt aactttcaaa aatgtagctg      900

ccagtgtgtg attttatttc agttgtacaa aatatctaaa cctatagcaa tgtgattaat      960

aaaaacttaa acatattttc cagtacctta attctgtgat aggaaaattt taatctgagt     1020

attttaattt cataatctct aaaatagttt aatgatttgt cattgtgttg ctgtcgttta     1080

ccccagctga tctcaaaagt gatatttaag gagattattt tggtctgcaa caacttgata     1140

gggctcagcc tctcccaccc aacgggtgga atcccccaga gggggatttc caagaggcca     1200

cctggcagtt gctgagggtc agaagtgaag ctagccactt cctcttaggc aggtggccaa     1260

gattacagtt gacccgtacg tgcagctgtg cccagcctgc cccatcccct gctcatttgc     1320

atgttcccag agcacaacct cctgccctga agccttatta ataggctggt cacactttgt     1380

gcaggagtca gactcagtca ggacacagct ggatccacta gtccagtgtg gtggaattca     1440

ccatggagtt tgggctgagc tggctttttc ttgtggctat tttaaaaggt gtccagtgtg     1500

aggtgcagct ggtggagtct gggggaggct tggtacaggc cggggggttc ctgagactct     1560

cctgtgagct gaggggaagc atctttaacc agtatgccat ggcctggttc cgccaggctc     1620

cagggaagga gagggagttc gtcgccggca tgggcgccgt gccccactac ggcgagttcg     1680

tgaagggccg gttcaccatc tccagagaca atgccaagag cacggtgtat ctgcaaatga     1740

gcagcctgaa gcccgaggac acggccatct atttctgtgc caggagcaag agcacctaca     1800

tcagctacaa cagcaacggc tacgactact ggggcagggg aacccaggtc accgtctcct     1860

caggtgagag ctcctgcctc ttcacccgga ggcctctgcc cgccccactc atgctcaggg     1920

agagggtctt ctggcttttt ccccaggctc tgggcaggca caggctaggt gcccctaacc     1980

caggccctgc acacaaaggg gcaggtgctg ggctcagacc tgccaagagc catatccggg     2040

aggaccctgc ccctgaccta agcccacccc aaaggccaaa ctctccactc cctcagctcg     2100

gacaccttct ctcctcccag attccagtaa ctcccaatct tctctctgca gagcccaaat     2160

cttgtgacaa aactcacaca tgcccaccgt gcccaggtaa gccagcccag gcctcgccct     2220

ccagctcaag gcgggacagg tgccctagag tagcctgcat ccagggacag gccccagccg     2280

ggtgctgaca cgtccacctc catctcttcc tcagcacctg aactcctggg gggaccgtca     2340

gtcttcctct tccccccaaa acccaaggac acc                                  2373


<210>  14
<211>  3207
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Antigen Recognition Cassette with Homology Arms and ITRs

<400>  14
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct gcggccgcac gcgtctgagg ccaagctaga gacactggac tgtgctgact      180

cccggcaggc acagagcgct gacctggctg ccgagccccg cctcctaggc tgcaggggtg      240

cctgcagaag ggcaccacag ggccaccggt cctgcaagct ttctggggca ggccaggcct      300

gaccttggct ttggggcagg gagggggcta aggtgaggca ggtggcgcca gccaggtgca      360

cacccaatgc ccatgagccc agacactgga cgctgaacct cgcggacagt taagaaccca      420

ggggcctctg cgccctgggc ccagctctgt cccacaccgc ggtcacatgg caccacctct      480

cttgcagcct ccaccaaggg cccatcggtc ttccccctgg caccctcctc caagagcacc      540

tctgggggca cagcggccct gggctgcctg gtcaaggact acttccccga accggtgacg      600

gtgtcgtgga actcaggcgc cctgaccagc ggcgtgcaca ccttcccggc tgtcctacag      660

tcctcaggac tctactccct cagcagcgtg gtgaccgtgc cctccagcag cttgggcacc      720

cagacctaca tctgcaacgt gaatcacaag cccagcaaca ccaaggtgga caagaaagtt      780

ggtgagaggc cagcacaggg agggagggtg tctgctggaa gccaggctca gcgctcctgc      840

ctggacgcat cccggctatg cagccccagt ccagggcagc aaggcaggcc ccgtatcgat      900

taaaccggtg agtttcatgg ttacttgcct gagaagatta aaaaaagtaa tgctacctta      960

tgagggagag tcccagggac caagatagca actgtcatag caaccgtcac actgctttgg     1020

tcaaggagaa gaccctttgg ggaactgaaa acagaacctt gagcacatct gttgctttcg     1080

ctcccatcct cctccaacag ggctgggtgg agcactccac accctttcac cggtcgtacg     1140

gctcagccag agtaaaaatc acacccatga cctggccact gagggcttga tcaattcact     1200

ttgaatttgg cattaaatac cattaaggta tattaactga ttttaaaata agatatattc     1260

gtgaccatgt ttttaacttt caaaaatgta gctgccagtg tgtgatttta tttcagttgt     1320

acaaaatatc taaacctata gcaatgtgat taataaaaac ttaaacatat tttccagtac     1380

cttaattctg tgataggaaa attttaatct gagtatttta atttcataat ctctaaaata     1440

gtttaatgat ttgtcattgt gttgctgtcg tttaccccag ctgatctcaa aagtgatatt     1500

taaggagatt attttggtct gcaacaactt gatagggctc agcctctccc acccaacggg     1560

tggaatcccc cagaggggga tttccaagag gccacctggc agttgctgag ggtcagaagt     1620

gaagctagcc acttcctctt aggcaggtgg ccaagattac agttgacccg tacgtgcagc     1680

tgtgcccagc ctgccccatc ccctgctcat ttgcatgttc ccagagcaca acctcctgcc     1740

ctgaagcctt attaataggc tggtcacact ttgtgcagga gtcagactca gtcaggacac     1800

agctggatcc actagtccag tgtggtggaa ttcaccatgg agtttgggct gagctggctt     1860

tttcttgtgg ctattttaaa aggtgtccag tgtgaggtgc agctggtgga gtctggggga     1920

ggcttggtac aggccggggg gttcctgaga ctctcctgtg agctgagggg aagcatcttt     1980

aaccagtatg ccatggcctg gttccgccag gctccaggga aggagaggga gttcgtcgcc     2040

ggcatgggcg ccgtgcccca ctacggcgag ttcgtgaagg gccggttcac catctccaga     2100

gacaatgcca agagcacggt gtatctgcaa atgagcagcc tgaagcccga ggacacggcc     2160

atctatttct gtgccaggag caagagcacc tacatcagct acaacagcaa cggctacgac     2220

tactggggca ggggaaccca ggtcaccgtc tcctcaggtg agagctcctg cctcttcacc     2280

cggaggcctc tgcccgcccc actcatgctc agggagaggg tcttctggct ttttccccag     2340

gctctgggca ggcacaggct aggtgcccct aacccaggcc ctgcacacaa aggggcaggt     2400

gctgggctca gacctgccaa gagccatatc cgggaggacc ctgcccctga cctaagccca     2460

ccccaaaggc caaactctcc actccctcag ctcggacacc ttctctcctc ccagattcca     2520

gtaactccca atcttctctc tgcagagccc aaatcttgtg acaaaactca cacatgccca     2580

ccgtgcccag gtaagccagc ccaggcctcg ccctccagct caaggcggga caggtgccct     2640

agagtagcct gcatccaggg acaggcccca gccgggtgct gacacgtcca cctccatctc     2700

ttcctcagca cctgaactcc tggggggacc gtcagtcttc ctcttccccc caaaacccaa     2760

ggacaccctc atgatctccc ggacccctga ggtcacatgc gtggtggtgg acgtgagcca     2820

cgaagaccct gaggtcaagt tcaactggta cgtggacggc gtggaggtgc ataatgccaa     2880

gacaaagccg cgggaggagc agtacaacag cacgtaccgt gtggtcagcg tcctcaccgt     2940

cctgcaccag gactggctga atggcaagga gtacaagtgc aaggtctcca acaaagccct     3000

cccagccccc atcgagagtc gacctgcaga agcttgcctc gagcagcgtg cggaccgagc     3060

ggccgcagga acccctagtg atggagttgg ccactccctc tctgcgcgct cgctcgctca     3120

ctgaggccgg gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga     3180

gcgagcgagc gcgcagctgc ctgcagg                                         3207


<210>  15
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  15
aggctaggtg cccctaaccc                                                   20


<210>  16
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  16
tagccgggat gcgtccaggc                                                   20


<210>  17
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  17
tgcatagccg ggatgcgtcc                                                   20


<210>  18
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  18
ctccgggtga agaggcagac                                                   20


<210>  19
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  19
tccgggtgaa gaggcagacg                                                   20


<210>  20
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  20
acccaggccc tgcacacaaa                                                   20


<210>  21
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  21
gattgggagt tactggaatc                                                   20


<210>  22
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  22
gcagaggcct ccgggtgaag                                                   20


<210>  23
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  23
gccccgtctg cctcttcacc                                                   20


<210>  24
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  24
ccgtctgcct cttcacccgg                                                   20


<210>  25
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  25
tccccaggct ctgggcaggc a                                                 21


<210>  26
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  26
ccccaggctc tgggcaggca c                                                 21


<210>  27
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  27
cccaggctct gggcaggcac a                                                 21


<210>  28
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  28
tgtgcagggc ctgggttagg g                                                 21


<210>  29
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  29
atgtggccct cgcaccccac                                                   20


<210>  30
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  30
aagccaaagg tgggacccgt                                                   20


<210>  31
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  31
agccaaaggt gggacccgtg                                                   20


<210>  32
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  32
gtgggacccg tggggtgcga                                                   20


<210>  33
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Genomic sequence targeted by gRNA

<400>  33
catgtggccc tcgcacccca                                                   20


