                         SEQUENCE LISTING

<110>  The Trustees of the University of Pennsylvania
 
<120>  COMPOSITIONS USEFUL IN TREATMENT OF OTC DEFICIENCY

<130>  UPN-14-7037APCT

<150>  61/950,157
<151>  2014-03-09

<160>  16    

<170>  PatentIn version 3.5

<210>  1
<211>  1062
<212>  DNA
<213>  homo sapiens


<220>
<221>  CDS
<222>  (1)..(1062)

<400>  1
atg ctg ttt aat ctg agg atc ctg tta aac aat gca gct ttt aga aat         48
Met Leu Phe Asn Leu Arg Ile Leu Leu Asn Asn Ala Ala Phe Arg Asn           
1               5                   10                  15                

ggt cac aac ttc atg gtt cga aat ttt cgg tgt gga caa cca cta caa         96
Gly His Asn Phe Met Val Arg Asn Phe Arg Cys Gly Gln Pro Leu Gln           
            20                  25                  30                    

aat aaa gtg cag ctg aag ggc cgt gac ctt ctc act cta aaa aac ttt        144
Asn Lys Val Gln Leu Lys Gly Arg Asp Leu Leu Thr Leu Lys Asn Phe           
        35                  40                  45                        

acc gga gaa gaa att aaa tat atg cta tgg cta tca gca gat ctg aaa        192
Thr Gly Glu Glu Ile Lys Tyr Met Leu Trp Leu Ser Ala Asp Leu Lys           
    50                  55                  60                            

ttt agg ata aaa cag aaa gga gag tat ttg cct tta ttg caa ggg aag        240
Phe Arg Ile Lys Gln Lys Gly Glu Tyr Leu Pro Leu Leu Gln Gly Lys           
65                  70                  75                  80            

tcc tta ggc atg att ttt gag aaa aga agt act cga aca aga ttg tct        288
Ser Leu Gly Met Ile Phe Glu Lys Arg Ser Thr Arg Thr Arg Leu Ser           
                85                  90                  95                

aca gaa aca ggc ttt gca ctt ctg gga gga cat cct tgt ttt ctt acc        336
Thr Glu Thr Gly Phe Ala Leu Leu Gly Gly His Pro Cys Phe Leu Thr           
            100                 105                 110                   

aca caa gat att cat ttg ggt gtg aat gaa agt ctc acg gac acg gcc        384
Thr Gln Asp Ile His Leu Gly Val Asn Glu Ser Leu Thr Asp Thr Ala           
        115                 120                 125                       

cgt gta ttg tct agc atg gca gat gca gta ttg gct cga gtg tat aaa        432
Arg Val Leu Ser Ser Met Ala Asp Ala Val Leu Ala Arg Val Tyr Lys           
    130                 135                 140                           

caa tca gat ttg gac acc ctg gct aaa gaa gca tcc atc cca att atc        480
Gln Ser Asp Leu Asp Thr Leu Ala Lys Glu Ala Ser Ile Pro Ile Ile           
145                 150                 155                 160           

aat ggg ctg tca gat ttg tac cat cct atc cag atc ctg gct gat tac        528
Asn Gly Leu Ser Asp Leu Tyr His Pro Ile Gln Ile Leu Ala Asp Tyr           
                165                 170                 175               

ctc acg ctc cag gaa cac tat agc tct ctg aaa ggt ctt acc ctc agc        576
Leu Thr Leu Gln Glu His Tyr Ser Ser Leu Lys Gly Leu Thr Leu Ser           
            180                 185                 190                   

tgg atc ggg gat ggg aac aat atc ctg cac tcc atc atg atg agc gca        624
Trp Ile Gly Asp Gly Asn Asn Ile Leu His Ser Ile Met Met Ser Ala           
        195                 200                 205                       

gcg aaa ttc gga atg cac ctt cag gca gct act cca aag ggt tat gag        672
Ala Lys Phe Gly Met His Leu Gln Ala Ala Thr Pro Lys Gly Tyr Glu           
    210                 215                 220                           

ccg gat gct agt gta acc aag ttg gca gag cag tat gcc aaa gag aat        720
Pro Asp Ala Ser Val Thr Lys Leu Ala Glu Gln Tyr Ala Lys Glu Asn           
225                 230                 235                 240           

ggt acc aag ctg ttg ctg aca aat gat cca ttg gaa gca gcg cat gga        768
Gly Thr Lys Leu Leu Leu Thr Asn Asp Pro Leu Glu Ala Ala His Gly           
                245                 250                 255               

ggc aat gta tta att aca gac act tgg ata agc atg gga caa gaa gag        816
Gly Asn Val Leu Ile Thr Asp Thr Trp Ile Ser Met Gly Gln Glu Glu           
            260                 265                 270                   

gag aag aaa aag cgg ctc cag gct ttc caa ggt tac cag gtt aca atg        864
Glu Lys Lys Lys Arg Leu Gln Ala Phe Gln Gly Tyr Gln Val Thr Met           
        275                 280                 285                       

aag act gct aaa gtt gct gcc tct gac tgg aca ttt tta cac tgc ttg        912
Lys Thr Ala Lys Val Ala Ala Ser Asp Trp Thr Phe Leu His Cys Leu           
    290                 295                 300                           

ccc aga aag cca gaa gaa gtg gat gat gaa gtc ttt tat tct cct cga        960
Pro Arg Lys Pro Glu Glu Val Asp Asp Glu Val Phe Tyr Ser Pro Arg           
305                 310                 315                 320           

tca cta gtg ttc cca gag gca gaa aac aga aag tgg aca atc atg gct       1008
Ser Leu Val Phe Pro Glu Ala Glu Asn Arg Lys Trp Thr Ile Met Ala           
                325                 330                 335               

gtc atg gtg tcc ctg ctg aca gat tac tca cct cag ctc cag aag cct       1056
Val Met Val Ser Leu Leu Thr Asp Tyr Ser Pro Gln Leu Gln Lys Pro           
            340                 345                 350                   

aaa ttt                                                               1062
Lys Phe                                                                   
                                                                          


<210>  2
<211>  354
<212>  PRT
<213>  homo sapiens

<400>  2

Met Leu Phe Asn Leu Arg Ile Leu Leu Asn Asn Ala Ala Phe Arg Asn 
1               5                   10                  15      


Gly His Asn Phe Met Val Arg Asn Phe Arg Cys Gly Gln Pro Leu Gln 
            20                  25                  30          


Asn Lys Val Gln Leu Lys Gly Arg Asp Leu Leu Thr Leu Lys Asn Phe 
        35                  40                  45              


Thr Gly Glu Glu Ile Lys Tyr Met Leu Trp Leu Ser Ala Asp Leu Lys 
    50                  55                  60                  


Phe Arg Ile Lys Gln Lys Gly Glu Tyr Leu Pro Leu Leu Gln Gly Lys 
65                  70                  75                  80  


Ser Leu Gly Met Ile Phe Glu Lys Arg Ser Thr Arg Thr Arg Leu Ser 
                85                  90                  95      


Thr Glu Thr Gly Phe Ala Leu Leu Gly Gly His Pro Cys Phe Leu Thr 
            100                 105                 110         


Thr Gln Asp Ile His Leu Gly Val Asn Glu Ser Leu Thr Asp Thr Ala 
        115                 120                 125             


Arg Val Leu Ser Ser Met Ala Asp Ala Val Leu Ala Arg Val Tyr Lys 
    130                 135                 140                 


Gln Ser Asp Leu Asp Thr Leu Ala Lys Glu Ala Ser Ile Pro Ile Ile 
145                 150                 155                 160 


Asn Gly Leu Ser Asp Leu Tyr His Pro Ile Gln Ile Leu Ala Asp Tyr 
                165                 170                 175     


Leu Thr Leu Gln Glu His Tyr Ser Ser Leu Lys Gly Leu Thr Leu Ser 
            180                 185                 190         


Trp Ile Gly Asp Gly Asn Asn Ile Leu His Ser Ile Met Met Ser Ala 
        195                 200                 205             


Ala Lys Phe Gly Met His Leu Gln Ala Ala Thr Pro Lys Gly Tyr Glu 
    210                 215                 220                 


Pro Asp Ala Ser Val Thr Lys Leu Ala Glu Gln Tyr Ala Lys Glu Asn 
225                 230                 235                 240 


Gly Thr Lys Leu Leu Leu Thr Asn Asp Pro Leu Glu Ala Ala His Gly 
                245                 250                 255     


Gly Asn Val Leu Ile Thr Asp Thr Trp Ile Ser Met Gly Gln Glu Glu 
            260                 265                 270         


Glu Lys Lys Lys Arg Leu Gln Ala Phe Gln Gly Tyr Gln Val Thr Met 
        275                 280                 285             


Lys Thr Ala Lys Val Ala Ala Ser Asp Trp Thr Phe Leu His Cys Leu 
    290                 295                 300                 


Pro Arg Lys Pro Glu Glu Val Asp Asp Glu Val Phe Tyr Ser Pro Arg 
305                 310                 315                 320 


Ser Leu Val Phe Pro Glu Ala Glu Asn Arg Lys Trp Thr Ile Met Ala 
                325                 330                 335     


Val Met Val Ser Leu Leu Thr Asp Tyr Ser Pro Gln Leu Gln Lys Pro 
            340                 345                 350         


Lys Phe 
        


<210>  3
<211>  1068
<212>  DNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC

<400>  3
atgctgttca acctgcgaat cctgctgaac aatgccgctt ttcggaacgg gcacaatttc       60

atggtgagga actttcgctg cggacagccc ctccagaaca aggtccagct gaagggcagg      120

gacctgctga ccctgaaaaa tttcacaggg gaggaaatca agtacatgct gtggctgtca      180

gccgatctga agttccggat caagcagaag ggcgaatatc tgcctctgct ccagggcaaa      240

agcctgggga tgatcttcga aaagcgcagt actcggacca gactgtcaac agagactgga      300

ttcgcactgc tgggaggaca cccatgtttt ctgaccacac aggacattca tctgggagtg      360

aacgagtccc tgaccgacac agcacgcgtc ctgagctcca tggctgatgc agtgctggct      420

cgagtctaca aacagtctga cctggatacc ctggccaagg aagcttctat cccaatcatt      480

aatggcctga gtgacctgta tcaccccatc cagattctgg ccgattacct gaccctccag      540

gagcattatt ctagtctgaa agggctgaca ctgagctgga ttggggacgg aaacaatatc      600

ctgcactcca ttatgatgag cgccgccaag tttggaatgc acctccaggc tgcaacccca      660

aaaggctacg aacccgatgc ctccgtgaca aagctggcag aacagtatgc caaagagaac      720

ggcactaagc tgctgctgac caatgaccct ctggaggccg ctcacggagg caacgtgctg      780

atcactgata cctggattag tatgggacag gaggaagaga agaagaagcg gctccaggcc      840

ttccagggct accaggtgac aatgaaaact gctaaggtcg cagccagcga ctggaccttt      900

ctgcattgcc tgcccagaaa gcctgaagag gtggacgatg aggtcttcta ctcacccaga      960

agcctggtgt ttcctgaagc tgagaatagg aagtggacaa tcatggcagt gatggtcagc     1020

ctgctgactg attattcccc tcagctccag aaaccaaagt tctgataa                  1068


<210>  4
<211>  1068
<212>  DNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC coding sequence

<400>  4
atgctgttca acctgcgaat cctgctgaac aacgccgctt ttcggaacgg gcacaacttt       60

atggtgagga actttcgctg cggacagccc ctccagaata aggtccagct gaagggcagg      120

gacctgctga ccctgaaaaa tttcacaggg gaggaaatca agtatatgct gtggctgtca      180

gctgatctga agttccggat caagcagaag ggcgaatatc tgcctctgct ccagggcaaa      240

agcctgggga tgatcttcga aaagcgcagt actcggacca gactgtcaac cgagactgga      300

ttcgctctgc tgggaggaca cccttgtttt ctgaccactc aggacattca cctgggagtg      360

aacgagtccc tgaccgacac tgctcgcgtc ctgagctcta tggccgacgc tgtgctagct      420

cgagtctaca aacagtccga cctggatacc ctggccaagg aagcttctat cccaattatt      480

aacggcctgt cagacctgta tcaccccatc cagattctgg ccgattacct gaccctccag      540

gagcactatt ctagtctgaa agggctgaca ctgagttgga ttggggacgg aaacaatatc      600

ctgcactcta ttatgatgtc agccgccaag tttggaatgc acctccaggc tgcaacccca      660

aaaggctacg aacccgatgc ctcagtgaca aagctggctg aacagtacgc caaagagaac      720

ggcactaagc tgctgctgac caacgaccct ctggaggccg ctcacggagg caacgtgctg      780

atcaccgata cctggattag tatgggacag gaggaagaga agaagaagcg gctccaggcc      840

ttccagggct accaggtgac aatgaaaacc gctaaggtcg cagccagcga ttggaccttt      900

ctgcactgcc tgcccagaaa gcccgaagag gtggacgacg aggtcttcta ctctcccaga      960

agcctggtgt ttcccgaagc tgagaatagg aagtggacaa ttatggcagt gatggtcagc     1020

ctgctgactg attattcacc tcagctccag aaaccaaagt tctgataa                  1068


<210>  5
<211>  1068
<212>  DNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC

<400>  5
atgctgttca acctgcgaat cctgctgaac aacgccgctt ttcggaacgg gcacaacttt       60

atggtgagga actttcgctg cggacagccc ctccagaata aggtccagct gaagggcagg      120

gacctgctga ccctgaaaaa tttcacaggg gaggaaatca agtatatgct gtggctgtca      180

gctgatctga agttccggat caagcagaag ggcgaatatc tgcctctgct ccagggcaaa      240

agcctgggga tgatcttcga aaagcgcagt actcggacca gactgtcaac cgagactgga      300

ttcgctctgc tgggaggaca cccttgtttt ctgaccactc aggacattca cctgggagtg      360

aacgagtccc tgaccgacac tgctcgcgtc ctgagctcta tggccgacgc tgtgctggct      420

cgagtctaca aacagtccga cctggatacc ctggccaagg aagcttctat cccaattatt      480

aacggcctgt cagacctgta tcaccccatc cagattctgg ccgattacct gaccctccag      540

gagcactatt ctagtctgaa agggctgaca ctgagttgga ttggggacgg aaacaatatc      600

ctgcactcta ttatgatgtc agccgccaag tttggaatgc acctccaggc tgcaacccca      660

aaaggctacg aacccgatgc ctcagtgaca aagctggctg aacagtacgc caaagagaac      720

ggcactaagc tgctgctgac caacgaccct ctggaggccg ctcacggagg caacgtgctg      780

atcaccgata cctggattag tatgggacag gaggaagaga agaagaagcg gctccaggcc      840

ttccagggct accaggtgac aatgaaaacc gctaaggtcg cagccagcga ttggaccttt      900

ctgcactgcc tgcccagaaa gcccgaagag gtggacgacg aggtcttcta ctctcccaga      960

agcctggtgt ttcccgaagc tgagaatagg aagtggacaa ttatggcagt gatggtcagc     1020

ctgctgactg attattcacc tcagctccag aaaccaaagt tctgataa                  1068


<210>  6
<211>  5195
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Plasmid pscAAVTBGhOTCLW


<220>
<221>  repeat_region
<222>  (5)..(109)
<223>  ITR

<220>
<221>  TATA_signal
<222>  (851)..(854)

<220>
<221>  CDS
<222>  (976)..(2037)
<223>  hOTCco-LW4

<220>
<221>  polyA
<222>  (2046)..(2182)
<223>  ITR, located on complement

<220>
<221>  repeat_region
<222>  (2211)..(2378)
<223>  ITR, located on complement

<220>
<221>  misc_feature
<222>  (4172)..(4760)
<223>  TBG/promoter

<400>  6
taggctgcgc gctcgctcgc tcactgaggc cgcccgggca aagcccgggc gtcgggcgac       60

ctttggtcgc ccggcctcag tgagcgagcg agcgcgcaga gagggagtgt agccatgctc      120

taggaagatc aattcaattc acgcgtggta cctagaacta tagctagaat tcgcccttaa      180

gctagcaggt taatttttaa aaagcagtca aaagtccaag tggcccttgg cagcatttac      240

tctctctgtt tgctctggtt aataatctca ggagcacaaa cattccagat ccaggttaat      300

ttttaaaaag cagtcaaaag tccaagtggc ccttggcagc atttactctc tctgtttgct      360

ctggttaata atctcaggag cacaaacatt ccagatccgg cgcgccaggg ctggaagcta      420

cctttgacat catttcctct gcgaatgcat gtataatttc tacagaacct attagaaagg      480

atcacccagc ctctgctttt gtacaacttt cccttaaaaa actgccaatt ccactgctgt      540

ttggcccaat agtgagaact ttttcctgct gcctcttggt gcttttgcct atggccccta      600

ttctgcctgc tgaagacact cttgccagca tggacttaaa cccctccagc tctgacaatc      660

ctctttctct tttgttttac atgaagggtc tggcagccaa agcaatcact caaagttcaa      720

accttatcat tttttgcttt gttcctcttg gccttggttt tgtacatcag ctttgaaaat      780

accatcccag ggttaatgct ggggttaatt tataactaag agtgctctag ttttgcaata      840

caggacatgc tataaaaatg gaaagatgtt gctttctgag agacagcttt attgcggtag      900

tttatcacag ttaaattgct aacgcagtca gtgcttctga cacaacagtc tcgaacttaa      960

gctgcagccg ccacc atg ctg ttc aac ctg cga atc ctg ctg aac aac gcc      1011
                 Met Leu Phe Asn Leu Arg Ile Leu Leu Asn Asn Ala          
                 1               5                   10                   

gct ttt cgg aac ggg cac aac ttt atg gtg agg aac ttt cgc tgc gga       1059
Ala Phe Arg Asn Gly His Asn Phe Met Val Arg Asn Phe Arg Cys Gly           
        15                  20                  25                        

cag ccc ctc cag aat aag gtc cag ctg aag ggc agg gac ctg ctg acc       1107
Gln Pro Leu Gln Asn Lys Val Gln Leu Lys Gly Arg Asp Leu Leu Thr           
    30                  35                  40                            

ctg aaa aat ttc aca ggg gag gaa atc aag tat atg ctg tgg ctg tca       1155
Leu Lys Asn Phe Thr Gly Glu Glu Ile Lys Tyr Met Leu Trp Leu Ser           
45                  50                  55                  60            

gct gat ctg aag ttc cgg atc aag cag aag ggc gaa tat ctg cct ctg       1203
Ala Asp Leu Lys Phe Arg Ile Lys Gln Lys Gly Glu Tyr Leu Pro Leu           
                65                  70                  75                

ctc cag ggc aaa agc ctg ggg atg atc ttc gaa aag cgc agt act cgg       1251
Leu Gln Gly Lys Ser Leu Gly Met Ile Phe Glu Lys Arg Ser Thr Arg           
            80                  85                  90                    

acc aga ctg tca acc gag act gga ttc gct ctg ctg gga gga cac cct       1299
Thr Arg Leu Ser Thr Glu Thr Gly Phe Ala Leu Leu Gly Gly His Pro           
        95                  100                 105                       

tgt ttt ctg acc act cag gac att cac ctg gga gtg aac gag tcc ctg       1347
Cys Phe Leu Thr Thr Gln Asp Ile His Leu Gly Val Asn Glu Ser Leu           
    110                 115                 120                           

acc gac act gct cgc gtc ctg agc tct atg gcc gac gct gtg ctg gct       1395
Thr Asp Thr Ala Arg Val Leu Ser Ser Met Ala Asp Ala Val Leu Ala           
125                 130                 135                 140           

cga gtc tac aaa cag tcc gac ctg gat acc ctg gcc aag gaa gct tct       1443
Arg Val Tyr Lys Gln Ser Asp Leu Asp Thr Leu Ala Lys Glu Ala Ser           
                145                 150                 155               

atc cca att att aac ggc ctg tca gac ctg tat cac ccc atc cag att       1491
Ile Pro Ile Ile Asn Gly Leu Ser Asp Leu Tyr His Pro Ile Gln Ile           
            160                 165                 170                   

ctg gcc gat tac ctg acc ctc cag gag cac tat tct agt ctg aaa ggg       1539
Leu Ala Asp Tyr Leu Thr Leu Gln Glu His Tyr Ser Ser Leu Lys Gly           
        175                 180                 185                       

ctg aca ctg agt tgg att ggg gac gga aac aat atc ctg cac tct att       1587
Leu Thr Leu Ser Trp Ile Gly Asp Gly Asn Asn Ile Leu His Ser Ile           
    190                 195                 200                           

atg atg tca gcc gcc aag ttt gga atg cac ctc cag gct gca acc cca       1635
Met Met Ser Ala Ala Lys Phe Gly Met His Leu Gln Ala Ala Thr Pro           
205                 210                 215                 220           

aaa ggc tac gaa ccc gat gcc tca gtg aca aag ctg gct gaa cag tac       1683
Lys Gly Tyr Glu Pro Asp Ala Ser Val Thr Lys Leu Ala Glu Gln Tyr           
                225                 230                 235               

gcc aaa gag aac ggc act aag ctg ctg ctg acc aac gac cct ctg gag       1731
Ala Lys Glu Asn Gly Thr Lys Leu Leu Leu Thr Asn Asp Pro Leu Glu           
            240                 245                 250                   

gcc gct cac gga ggc aac gtg ctg atc acc gat acc tgg att agt atg       1779
Ala Ala His Gly Gly Asn Val Leu Ile Thr Asp Thr Trp Ile Ser Met           
        255                 260                 265                       

gga cag gag gaa gag aag aag aag cgg ctc cag gcc ttc cag ggc tac       1827
Gly Gln Glu Glu Glu Lys Lys Lys Arg Leu Gln Ala Phe Gln Gly Tyr           
    270                 275                 280                           

cag gtg aca atg aaa acc gct aag gtc gca gcc agc gat tgg acc ttt       1875
Gln Val Thr Met Lys Thr Ala Lys Val Ala Ala Ser Asp Trp Thr Phe           
285                 290                 295                 300           

ctg cac tgc ctg ccc aga aag ccc gaa gag gtg gac gac gag gtc ttc       1923
Leu His Cys Leu Pro Arg Lys Pro Glu Glu Val Asp Asp Glu Val Phe           
                305                 310                 315               

tac tct ccc aga agc ctg gtg ttt ccc gaa gct gag aat agg aag tgg       1971
Tyr Ser Pro Arg Ser Leu Val Phe Pro Glu Ala Glu Asn Arg Lys Trp           
            320                 325                 330                   

aca att atg gca gtg atg gtc agc ctg ctg act gat tat tca cct cag       2019
Thr Ile Met Ala Val Met Val Ser Leu Leu Thr Asp Tyr Ser Pro Gln           
        335                 340                 345                       

ctc cag aaa cca aag ttc tgataagcgg ccgctatttg tgaaatttgt              2067
Leu Gln Lys Pro Lys Phe                                                   
    350                                                                   

gatgctattg ctttatttgt aaccattata agctgcaata aacaagttaa caacaacaat     2127

tgcattcatt ttatgtttca ggttcagggg gaggtgtggg aggtttttta ggcatcgata     2187

aggatcttcc tagagcatgg ctacgtagat aagtagcatg gcgggttaat cattaactac     2247

aaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag     2307

gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag     2367

cgagcgcgca gccttaatta acctaattca ctggccgtcg ttttacaacg tcgtgactgg     2427

gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg     2487

cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc     2547

gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc     2607

gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt     2667

ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc     2727

cgatttagtg ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt     2787

agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt     2847

aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt     2907

gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa     2967

aaatttaacg cgaattttaa caaaatatta acgcttacaa tttaggtggc acttttcggg     3027

gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc     3087

tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta     3147

ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg     3207

ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg     3267

gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac     3327

gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg     3387

acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt     3447

actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg     3507

ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac     3567

cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt     3627

gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag     3687

caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc     3747

aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc     3807

ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta     3867

tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg     3927

ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga     3987

ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac     4047

ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa     4107

tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat     4167

cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc     4227

taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg     4287

gcttcagcag agcgcagata ccaaatactg ttcttctagt gtagccgtag ttaggccacc     4347

acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg     4407

ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg     4467

ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa     4527

cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg     4587

aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga     4647

gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct     4707

gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca     4767

gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc     4827

ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg     4887

ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc     4947

caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca     5007

ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc     5067

attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga     5127

gcggataaca atttcacaca ggaaacagct atgaccatga ttacgccaga tttaattaag     5187

gccttaat                                                              5195


<210>  7
<211>  354
<212>  PRT
<213>  Artificial sequence

<220>
<223>  engineered hOTC

<400>  7

Met Leu Phe Asn Leu Arg Ile Leu Leu Asn Asn Ala Ala Phe Arg Asn 
1               5                   10                  15      


Gly His Asn Phe Met Val Arg Asn Phe Arg Cys Gly Gln Pro Leu Gln 
            20                  25                  30          


Asn Lys Val Gln Leu Lys Gly Arg Asp Leu Leu Thr Leu Lys Asn Phe 
        35                  40                  45              


Thr Gly Glu Glu Ile Lys Tyr Met Leu Trp Leu Ser Ala Asp Leu Lys 
    50                  55                  60                  


Phe Arg Ile Lys Gln Lys Gly Glu Tyr Leu Pro Leu Leu Gln Gly Lys 
65                  70                  75                  80  


Ser Leu Gly Met Ile Phe Glu Lys Arg Ser Thr Arg Thr Arg Leu Ser 
                85                  90                  95      


Thr Glu Thr Gly Phe Ala Leu Leu Gly Gly His Pro Cys Phe Leu Thr 
            100                 105                 110         


Thr Gln Asp Ile His Leu Gly Val Asn Glu Ser Leu Thr Asp Thr Ala 
        115                 120                 125             


Arg Val Leu Ser Ser Met Ala Asp Ala Val Leu Ala Arg Val Tyr Lys 
    130                 135                 140                 


Gln Ser Asp Leu Asp Thr Leu Ala Lys Glu Ala Ser Ile Pro Ile Ile 
145                 150                 155                 160 


Asn Gly Leu Ser Asp Leu Tyr His Pro Ile Gln Ile Leu Ala Asp Tyr 
                165                 170                 175     


Leu Thr Leu Gln Glu His Tyr Ser Ser Leu Lys Gly Leu Thr Leu Ser 
            180                 185                 190         


Trp Ile Gly Asp Gly Asn Asn Ile Leu His Ser Ile Met Met Ser Ala 
        195                 200                 205             


Ala Lys Phe Gly Met His Leu Gln Ala Ala Thr Pro Lys Gly Tyr Glu 
    210                 215                 220                 


Pro Asp Ala Ser Val Thr Lys Leu Ala Glu Gln Tyr Ala Lys Glu Asn 
225                 230                 235                 240 


Gly Thr Lys Leu Leu Leu Thr Asn Asp Pro Leu Glu Ala Ala His Gly 
                245                 250                 255     


Gly Asn Val Leu Ile Thr Asp Thr Trp Ile Ser Met Gly Gln Glu Glu 
            260                 265                 270         


Glu Lys Lys Lys Arg Leu Gln Ala Phe Gln Gly Tyr Gln Val Thr Met 
        275                 280                 285             


Lys Thr Ala Lys Val Ala Ala Ser Asp Trp Thr Phe Leu His Cys Leu 
    290                 295                 300                 


Pro Arg Lys Pro Glu Glu Val Asp Asp Glu Val Phe Tyr Ser Pro Arg 
305                 310                 315                 320 


Ser Leu Val Phe Pro Glu Ala Glu Asn Arg Lys Trp Thr Ile Met Ala 
                325                 330                 335     


Val Met Val Ser Leu Leu Thr Asp Tyr Ser Pro Gln Leu Gln Lys Pro 
            340                 345                 350         


Lys Phe 
        


<210>  8
<211>  1065
<212>  DNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC 

<400>  8
atgctgttca acctgagaat cctgctgaac aacgccgcct tcagaaacgg ccacaacttc       60

atggtgagaa acttcagatg cggccagccc ctgcagaaca aggtgcagct gaagggcaga      120

gacctgctga ccctgaagaa cttcaccggc gaggagatca agtacatgct gtggctgagc      180

gccgacctga agttcagaat caagcagaag ggcgagtacc tgcccctgct gcagggcaag      240

agcctgggca tgatcttcga gaagagaagc accagaacca gactgagcac cgagaccggc      300

ctggccctgc tgggcggcca cccctgcttc ctgaccaccc aggacatcca cctgggcgtg      360

aacgagagcc tgaccgacac cgccagagtg ctgagcagca tggccgacgc cgtgctggcc      420

agagtgtaca agcagagcga cctggacacc ctggccaagg aggccagcat ccccatcatc      480

aacggcctga gcgacctgta ccaccccatc cagatcctgg ccgactacct gaccctgcag      540

gagcactaca gcagcctgaa gggcctgacc ctgagctgga tcggcgacgg caacaacatc      600

ctgcacagca tcatgatgag cgccgccaag ttcggcatgc acctgcaggc cgccaccccc      660

aagggctacg agcccgacgc cagcgtgacc aagctggccg agcagtacgc caaggagaac      720

ggcaccaagc tgctgctgac caacgacccc ctggaggccg cccacggcgg caacgtgctg      780

atcaccgaca cctggatcag catgggccag gaggaggaga agaagaagag actgcaggcc      840

ttccagggct accaggtgac catgaagacc gccaaggtgg ccgccagcga ctggaccttc      900

ctgcactgcc tgcccagaaa gcccgaggag gtggacgacg aggtgttcta cagccccaga      960

agcctggtgt tccccgaggc cgagaacaga aagtggacca tcatggccgt gatggtgagc     1020

ctgctgaccg actacagccc ccagctgcag aagcccaagt tctga                     1065


<210>  9
<211>  1065
<212>  DNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC 

<400>  9
atgctgttca acctgcgcat cctgctgaac aacgccgcct tccgcaacgg ccacaacttc       60

atggtgcgca acttccgctg cggccagccc ctgcagaaca aggtgcagct gaagggccgc      120

gacctgctga ccctgaagaa cttcaccggc gaggagatca agtacatgct gtggctgagc      180

gccgacctga agttccgcat caagcagaag ggcgagtacc tgcccctgct gcagggcaag      240

agcctgggca tgatcttcga gaagcgcagc acccgcaccc gcctgagcac cgagaccggc      300

ctggccctgc tgggcggcca cccctgcttc ctgaccaccc aggacatcca cctgggcgtg      360

aacgagagcc tgaccgacac cgcccgcgtg ctgagcagca tggccgacgc cgtgctggcc      420

cgcgtgtaca agcagagcga cctggacacc ctggccaagg aggccagcat ccccatcatc      480

aacggcctga gcgacctgta ccaccccatc cagatcctgg ccgactacct gaccctgcag      540

gagcactaca gcagcctgaa gggcctgacc ctgagctgga tcggcgacgg caacaacatc      600

ctgcacagca tcatgatgag cgccgccaag ttcggcatgc acctgcaggc cgccaccccc      660

aagggctacg agcccgacgc cagcgtgacc aagctggccg agcagtacgc caaggagaac      720

ggcaccaagc tgctgctgac caacgacccc ctggaggccg cccacggcgg caacgtgctg      780

atcaccgaca cctggatcag catgggccag gaggaggaga agaagaagcg cctgcaggcc      840

ttccagggct accaggtgac catgaagacc gccaaggtgg ccgccagcga ctggaccttc      900

ctgcactgcc tgccccgcaa gcccgaggag gtggacgacg aggtgttcta cagcccccgc      960

agcctggtgt tccccgaggc cgagaaccgc aagtggacca tcatggccgt gatggtgagc     1020

ctgctgaccg actacagccc ccagctgcag aagcccaagt tctga                     1065


<210>  10
<211>  1068
<212>  RNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC RNA sequence

<400>  10
augcuguuca accugcgaau ccugcugaac aacgccgcuu uucggaacgg gcacaacuuu       60

auggugagga acuuucgcug cggacagccc cuccagaaua agguccagcu gaagggcagg      120

gaccugcuga cccugaaaaa uuucacaggg gaggaaauca aguauaugcu guggcuguca      180

gcugaucuga aguuccggau caagcagaag ggcgaauauc ugccucugcu ccagggcaaa      240

agccugggga ugaucuucga aaagcgcagu acucggacca gacugucaac cgagacugga      300

uucgcucugc ugggaggaca cccuuguuuu cugaccacuc aggacauuca ccugggagug      360

aacgaguccc ugaccgacac ugcucgcguc cugagcucua uggccgacgc ugugcuagcu      420

cgagucuaca aacaguccga ccuggauacc cuggccaagg aagcuucuau cccaauuauu      480

aacggccugu cagaccugua ucaccccauc cagauucugg ccgauuaccu gacccuccag      540

gagcacuauu cuagucugaa agggcugaca cugaguugga uuggggacgg aaacaauauc      600

cugcacucua uuaugauguc agccgccaag uuuggaaugc accuccaggc ugcaacccca      660

aaaggcuacg aacccgaugc cucagugaca aagcuggcug aacaguacgc caaagagaac      720

ggcacuaagc ugcugcugac caacgacccu cuggaggccg cucacggagg caacgugcug      780

aucaccgaua ccuggauuag uaugggacag gaggaagaga agaagaagcg gcuccaggcc      840

uuccagggcu accaggugac aaugaaaacc gcuaaggucg cagccagcga uuggaccuuu      900

cugcacugcc ugcccagaaa gcccgaagag guggacgacg aggucuucua cucucccaga      960

agccuggugu uucccgaagc ugagaauagg aaguggacaa uuauggcagu gauggucagc     1020

cugcugacug auuauucacc ucagcuccag aaaccaaagu ucugauaa                  1068


<210>  11
<211>  1068
<212>  RNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC RNA

<400>  11
augcuguuca accugcgaau ccugcugaac aacgccgcuu uucggaacgg gcacaacuuu       60

auggugagga acuuucgcug cggacagccc cuccagaaua agguccagcu gaagggcagg      120

gaccugcuga cccugaaaaa uuucacaggg gaggaaauca aguauaugcu guggcuguca      180

gcugaucuga aguuccggau caagcagaag ggcgaauauc ugccucugcu ccagggcaaa      240

agccugggga ugaucuucga aaagcgcagu acucggacca gacugucaac cgagacugga      300

uucgcucugc ugggaggaca cccuuguuuu cugaccacuc aggacauuca ccugggagug      360

aacgaguccc ugaccgacac ugcucgcguc cugagcucua uggccgacgc ugugcuggcu      420

cgagucuaca aacaguccga ccuggauacc cuggccaagg aagcuucuau cccaauuauu      480

aacggccugu cagaccugua ucaccccauc cagauucugg ccgauuaccu gacccuccag      540

gagcacuauu cuagucugaa agggcugaca cugaguugga uuggggacgg aaacaauauc      600

cugcacucua uuaugauguc agccgccaag uuuggaaugc accuccaggc ugcaacccca      660

aaaggcuacg aacccgaugc cucagugaca aagcuggcug aacaguacgc caaagagaac      720

ggcacuaagc ugcugcugac caacgacccu cuggaggccg cucacggagg caacgugcug      780

aucaccgaua ccuggauuag uaugggacag gaggaagaga agaagaagcg gcuccaggcc      840

uuccagggcu accaggugac aaugaaaacc gcuaaggucg cagccagcga uuggaccuuu      900

cugcacugcc ugcccagaaa gcccgaagag guggacgacg aggucuucua cucucccaga      960

agccuggugu uucccgaagc ugagaauagg aaguggacaa uuauggcagu gauggucagc     1020

cugcugacug auuauucacc ucagcuccag aaaccaaagu ucugauaa                  1068


<210>  12
<211>  1065
<212>  RNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC

<400>  12
augcuguuca accugcgcau ccugcugaac aacgccgccu uccgcaacgg ccacaacuuc       60

auggugcgca acuuccgcug cggccagccc cugcagaaca aggugcagcu gaagggccgc      120

gaccugcuga cccugaagaa cuucaccggc gaggagauca aguacaugcu guggcugagc      180

gccgaccuga aguuccgcau caagcagaag ggcgaguacc ugccccugcu gcagggcaag      240

agccugggca ugaucuucga gaagcgcagc acccgcaccc gccugagcac cgagaccggc      300

cuggcccugc ugggcggcca ccccugcuuc cugaccaccc aggacaucca ccugggcgug      360

aacgagagcc ugaccgacac cgcccgcgug cugagcagca uggccgacgc cgugcuggcc      420

cgcguguaca agcagagcga ccuggacacc cuggccaagg aggccagcau ccccaucauc      480

aacggccuga gcgaccugua ccaccccauc cagauccugg ccgacuaccu gacccugcag      540

gagcacuaca gcagccugaa gggccugacc cugagcugga ucggcgacgg caacaacauc      600

cugcacagca ucaugaugag cgccgccaag uucggcaugc accugcaggc cgccaccccc      660

aagggcuacg agcccgacgc cagcgugacc aagcuggccg agcaguacgc caaggagaac      720

ggcaccaagc ugcugcugac caacgacccc cuggaggccg cccacggcgg caacgugcug      780

aucaccgaca ccuggaucag caugggccag gaggaggaga agaagaagcg ccugcaggcc      840

uuccagggcu accaggugac caugaagacc gccaaggugg ccgccagcga cuggaccuuc      900

cugcacugcc ugccccgcaa gcccgaggag guggacgacg agguguucua cagcccccgc      960

agccuggugu uccccgaggc cgagaaccgc aaguggacca ucauggccgu gauggugagc     1020

cugcugaccg acuacagccc ccagcugcag aagcccaagu ucuga                     1065


<210>  13
<211>  1065
<212>  RNA
<213>  Artificial sequence

<220>
<223>  engineered hOTC 


<400>  13
augcuguuca accugagaau ccugcugaac aacgccgccu ucagaaacgg ccacaacuuc       60

auggugagaa acuucagaug cggccagccc cugcagaaca aggugcagcu gaagggcaga      120

gaccugcuga cccugaagaa cuucaccggc gaggagauca aguacaugcu guggcugagc      180

gccgaccuga aguucagaau caagcagaag ggcgaguacc ugccccugcu gcagggcaag      240

agccugggca ugaucuucga gaagagaagc accagaacca gacugagcac cgagaccggc      300

cuggcccugc ugggcggcca ccccugcuuc cugaccaccc aggacaucca ccugggcgug      360

aacgagagcc ugaccgacac cgccagagug cugagcagca uggccgacgc cgugcuggcc      420

agaguguaca agcagagcga ccuggacacc cuggccaagg aggccagcau ccccaucauc      480

aacggccuga gcgaccugua ccaccccauc cagauccugg ccgacuaccu gacccugcag      540

gagcacuaca gcagccugaa gggccugacc cugagcugga ucggcgacgg caacaacauc      600

cugcacagca ucaugaugag cgccgccaag uucggcaugc accugcaggc cgccaccccc      660

aagggcuacg agcccgacgc cagcgugacc aagcuggccg agcaguacgc caaggagaac      720

ggcaccaagc ugcugcugac caacgacccc cuggaggccg cccacggcgg caacgugcug      780

aucaccgaca ccuggaucag caugggccag gaggaggaga agaagaagag acugcaggcc      840

uuccagggcu accaggugac caugaagacc gccaaggugg ccgccagcga cuggaccuuc      900

cugcacugcc ugcccagaaa gcccgaggag guggacgacg agguguucua cagccccaga      960

agccuggugu uccccgaggc cgagaacaga aaguggacca ucauggccgu gauggugagc     1020

cugcugaccg acuacagccc ccagcugcag aagcccaagu ucuga                     1065


<210>  14
<211>  21
<212>  DNA
<213>  Artificial sequence

<220>
<223>  PCR forward primer

<400>  14
aaactgccaa ttccactgct g                                                 21


<210>  15
<211>  21
<212>  DNA
<213>  Artificial sequence

<220>
<223>  PCR reverse primer

<400>  15
ccataggcaa aagcaccaag a                                                 21


<210>  16
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Probe

<400>  16
ttggcccaat agtgagaact ttttcctgc                                         29


