!!NA_SEQUENCE 1.0 LOCUS TN5 5818 bp DNA BCT 18-MAY-1994 DEFINITION Complete sequence of E. coli transposon Tn5. ACCESSION U00004 L19385 VERSION U00004.1 GI:405822 KEYWORDS transposon Tn5; aminoglycoside-3'-O-phosphotransferase; kanamycin resistance; neomycin resistance; bleomycin resistance; streptomycin resistance; streptomycin phosphotransferase; transposase; transposition inhibitor. SOURCE Escherichia coli DNA. ORGANISM Escherichia coli Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 1400; 4219 to 5818) AUTHORS Auerswald,E.A., Ludwig,G. and Schaller,H. TITLE Structural analysis of Tn5 JOURNAL Cold Spring Harb. Symp. Quant. Biol. 45, 107-113 (1981) MEDLINE 82049482 REFERENCE 2 (bases 1401 to 2300) AUTHORS Beck,E., Ludwig,G., Auerswald,E.A., Reiss,B. and Schaller,H. TITLE Nucleotide sequence and exact localization of the neomycin phosphotransferase gene from Tn5 JOURNAL Gene 19, 327-336 (1982) MEDLINE 83106478 REFERENCE 3 (bases 2301 to 4218) AUTHORS Mazodier,P., Cossart,P., Giraud,E. and Gasser,F. TITLE Completion of the nucleotide sequence of the central region of Tn5 confirms the presence of three resistance genes JOURNAL Nucleic Acids Research 13, 195-205 (1985) MEDLINE 85215465 REFERENCE 4 (sites) AUTHORS Johnson,R.C., Yin,J.C.P. and Reznikoff,W.S. TITLE Control of Tn5 transposition in Escherichia coli is mediated by protein from the right repeat JOURNAL Cell 30, 873-882 (1982) MEDLINE 83050973 REFERENCE 5 (sites) AUTHORS Rothstein,S.J., Jorgensen,R.A., Yin,J.C.-P., Yong-Di,Z., Johnson,R.C. and Reznikoff,W.S. TITLE Genetic organization of Tn5 JOURNAL Cold Spring Harb. Symp. Quant. Biol. 45, 99-105 (1981) MEDLINE 82049532 REFERENCE 6 (sites) AUTHORS Rothstein,S.J. and Reznikoff,W.S. TITLE The functional differences in the inverted repeats of Tn5 are caused by a single base pair nonhomology JOURNAL Cell 23, 191-199 (1981) MEDLINE 81162719 REFERENCE 7 (sites) AUTHORS Johnson,R.C. and Reznikoff,W.S. TITLE DNA sequences at the ends of transposon Tn5 required for transposition JOURNAL Nature 304, 280-282 (1983) MEDLINE 83245055 REFERENCE 8 (sites) AUTHORS Berg,D.E. TITLE Transposon Tn5 JOURNAL (in) Berg,D.E. and Howe,M. (Eds.); MOBILE DNA: 185-210; American Society for Microbiology Press, Washington, D.C. (1989) COMMENT On May 17, 1994 this sequence version replaced gi:310980. These data kindly submitted in computer readable form by: Paul N. Hengen Laboratory of Mathematical Biology P.O. Box B National Cancer Institute Frederick Cancer Research and Development Center Frederick, Maryland 21702-1201 U.S.A. Phone: 301-846-5581 Fax: 301-846-5598 Email: pnh@ncifcrf.gov -and- Andrew R. Bradbury International School for Advanced Studies (SISSA) Via Beirut 2-4, Trieste 34013, Italy. Phone: +39 40 398995 Fax: +39 40 398991 Email: bradbury@icgeb.trieste.it. FEATURES Location/Qualifiers source 1. .5818 /organism="Escherichia coli" /db_xref="taxon:562" repeat_region order(1. .19,5800. .5818) /note="minimum length required for transposition" /citation=[7] /function="outside end" /rpt_type=inverted /evidence=experimental mRNA (1.80). .(1500.1550) repeat_region 1. .1534 /standard_name="IS50L" /note="IS50L is non-functional due to mutation at 1442" /citation=[1] /function="insertion sequence IS50L" /rpt_type=inverted /evidence=experimental CDS 92. .1444 /note="A Thymidine at position 1442 causes an ochre stop codon which prematurely terminates the protein at this point. Ochre suppressing strains allow readthrough and express a functional product identical to the transposase." /citation=[4] /codon_start=1 /transl_table=11 /function="non-functional" /evidence=experimental /product="protein #3" /protein_id="AAA73388.1" /db_xref="GI:405967" /translation="MITSALHRAADWAKSVFSSAALGDPRRTARLVNVAAQLAKYSGK SITISSEGSEAMQEGAYRFYRNPNVSAEAIRKAGAMQTVKLAQEFPELLAIEDTTSLS YRHQVAEELGKLGSIQDKSRGWWVHSVLLLEATTFRTVGLLHQEWWMRPDDPADADEK ESGKWLAAAATSRLRMGSMMSNVIAVCDREADIHAYLQDRLAHNERFVVRSKHPRKDV ESGLYLIDHLKNQPELGGYQISIPQKGVVDKRGKRKNRPARKASLSLRSGRITLKQGN ITLNAVLAEEINPPKGETPLKWLLLTGEPVESLAQALRVIDIYTHRWRIEEFHKAWKT GAGAERQRMEEPDNLERMVSILSFVAVRLLQLRESFTLPQALRAQGLLKEAEHVESQS AETVLTPDECQLLGYLDKGKRKRKEKAGSLQWAYMAIARLGGFMDSKRTGIASWGALW " CDS 257. .1444 /note="A Thymidine at position 1442 causes an ochre stop codon which prematurely terminates the protein at this point. Ochre suppressing strains allow readthrough and express a functional product identical to the transposition inhibitor." /citation=[4] /codon_start=1 /transl_table=11 /function="non-functional" /evidence=experimental /product="protein #4" /protein_id="AAA73389.1" /db_xref="GI:405968" /translation="MQEGAYRFYRNPNVSAEAIRKAGAMQTVKLAQEFPELLAIEDTT SLSYRHQVAEELGKLGSIQDKSRGWWVHSVLLLEATTFRTVGLLHQEWWMRPDDPADA DEKESGKWLAAAATSRLRMGSMMSNVIAVCDREADIHAYLQDRLAHNERFVVRSKHPR KDVESGLYLIDHLKNQPELGGYQISIPQKGVVDKRGKRKNRPARKASLSLRSGRITLK QGNITLNAVLAEEINPPKGETPLKWLLLTGEPVESLAQALRVIDIYTHRWRIEEFHKA WKTGAGAERQRMEEPDNLERMVSILSFVAVRLLQLRESFTLPQALRAQGLLKEAEHVE SQSAETVLTPDECQLLGYLDKGKRKRKEKAGSLQWAYMAIARLGGFMDSKRTGIASWG ALW" promoter 1413. .1462 /citation=[5] /function="aminoglycoside phosphotransferase promoter region" /evidence=experimental variation 1442 /note="A single base pair difference exists as a 'T' at position 1442 and a 'G' at position 4377 within the inverted repeat regions of IS50L and IS50R. This mutation renders IS50L inactive by prematurely terminating the transposase." /citation=[6] mRNA (1460.1540). .(3600.3650) repeat_region order(1515. .1533,4286. .4304) /note="minimum length required for transposition" /citation=[7] /function="inside end" /rpt_type=inverted /evidence=experimental CDS 1551. .2345 /citation=[2] /codon_start=1 /transl_table=11 /function="neomycin/kanamycin resistance" /evidence=experimental /product="aminoglycoside-3'-O-phosphotransferase" /protein_id="AAA73390.1" /db_xref="GI:405969" /translation="MIEQDGLHAGSPAAWVERLFGYDWAQQTIGCSDAAVFRLSAQGR PVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDVVTEAGRDWLLLGEVPGQDL LSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRIERARTRMEAGLVDQDDLDE EHQGLAPAELFARLKARMPDGEDLVVTHGDACLPNIMVENGRFSGFIDCGRLGVADRY QDIALATRDIAEELGGEWADRFLVLYGIAAPDSQRIAFYRLLDEFF" conflict 1684 /note="A missing 'C' in the sequence published by Auerswald et al.(1981) is corrected within that published by Beck et al. (1982)." /citation=[1] CDS 2366. .2746 /citation=[3] /codon_start=1 /transl_table=11 /function="bleomycin resistance" /evidence=experimental /product="bleomycin resistance" /protein_id="AAA73391.1" /db_xref="GI:405970" /translation="MTDQATPNLPSRDFDSTAAFYERLGFGIVFRDAGWMILQRGDLM LEFFAHPGLDPLASWFSCCLRLDDLAEFYRQCKSVGIQETSSGYPRIHAPELQEWGGT MAALVDPDGTLLRLIQNELLAGIS" CDS 2785. .3585 /note="This protein confers streptomycin resistance in some species of Gram-negative bacteria, but is cryptic in Escherichia coli." /citation=[3] /codon_start=1 /transl_table=11 /function="streptomycin resistance" /evidence=experimental /product="streptomycin phosphotransferase" /protein_id="AAA73392.1" /db_xref="GI:405971" /translation="MERWRLLRDGELLTTHSSWILPVRQGDMPAMLKVARIPDEEAGY RLLTWWDGQGAARVFASAAGALLMERASGAGDLAQIAWSGQDDEACRILCDTAARLHA PRSGPPPDLHPLQEWFQPLFRLAAEHAALAPAASVARQLLAAPREVCPLHGDLHHENV LDFGDRGWLAIDPHGLLGERTFDYANIFTNPDLSDPGRPLAILPGRLEARLSIVVATT GFEPERLLRWIIAWTGLSAAWFIGDGDGEGEGAAIDLAVNAMARRLLD" repeat_region 4285. .5818 /standard_name="IS50R" /note="IS50R is functional in transposition" /citation=[1] /function="insertion sequence IS50R" /rpt_type=inverted /evidence=experimental CDS complement(4297. .5562) /citation=[4] /codon_start=1 /transl_table=11 /function="transposition inhibitor" /evidence=experimental /product="protein #2" /protein_id="AAA73394.1" /db_xref="GI:405973" /translation="MQEGAYRFYRNPNVSAEAIRKAGAMQTVKLAQEFPELLAIEDTT SLSYRHQVAEELGKLGSIQDKSRGWWVHSVLLLEATTFRTVGLLHQEWWMRPDDPADA DEKESGKWLAAAATSRLRMGSMMSNVIAVCDREADIHAYLQDRLAHNERFVVRSKHPR KDVESGLYLIDHLKNQPELGGYQISIPQKGVVDKRGKRKNRPARKASLSLRSGRITLK QGNITLNAVLAEEINPPKGETPLKWLLLTGEPVESLAQALRVIDIYTHRWRIEEFHKA WKTGAGAERQRMEEPDNLERMVSILSFVAVRLLQLRESFTLPQALRAQGLLKEAEHVE SQSAETVLTPDECQLLGYLDKGKRKRKEKAGSLQWAYMAIARLGGFMDSKRTGIASWG ALWEGWEALQSKLDGFLAAKDLMAQGIKI" CDS complement(4297. .5727) /citation=[4] /codon_start=1 /transl_table=11 /function="transposase" /evidence=experimental /product="protein #1" /protein_id="AAA73393.1" /db_xref="GI:405972" /translation="MITSALHRAADWAKSVFSSAALGDPRRTARLVNVAAQLAKYSGK SITISSEGSEAMQEGAYRFYRNPNVSAEAIRKAGAMQTVKLAQEFPELLAIEDTTSLS YRHQVAEELGKLGSIQDKSRGWWVHSVLLLEATTFRTVGLLHQEWWMRPDDPADADEK ESGKWLAAAATSRLRMGSMMSNVIAVCDREADIHAYLQDRLAHNERFVVRSKHPRKDV ESGLYLIDHLKNQPELGGYQISIPQKGVVDKRGKRKNRPARKASLSLRSGRITLKQGN ITLNAVLAEEINPPKGETPLKWLLLTGEPVESLAQALRVIDIYTHRWRIEEFHKAWKT GAGAERQRMEEPDNLERMVSILSFVAVRLLQLRESFTLPQALRAQGLLKEAEHVESQS AETVLTPDECQLLGYLDKGKRKRKEKAGSLQWAYMAIARLGGFMDSKRTGIASWGALW EGWEALQSKLDGFLAAKDLMAQGIKI" mRNA complement((4300.4350). .(5738.5818)) BASE COUNT 1125 a 1719 c 1777 g 1197 t ORIGIN U00004 Length: 5818 November 3, 2003 17:19 Type: N Check: 5005 .. 1 CTGACTCTTA TACACAAGTA GCGTCCTGAA CGGAACCTTT CCCGTTTTCC 51 AGGATCTGAC TTCCATGTGA CCTCCTAACA TGGTAACGTT CATGATAACT 101 TCTGCTCTTC ATCGTGCGGC CGACTGGGCT AAATCTGTGT TCTCTTCGGC 151 GGCGCTGGGT GATCCTCGCC GTACTGCCCG CTTGGTTAAC GTCGCCGCCC 201 AATTGGCAAA ATATTCTGGT AAATCAATAA CCATCTCATC AGAGGGTAGT 251 GAAGCCATGC AGGAAGGCGC TTACCGATTT TACCGCAATC CCAACGTTTC 301 TGCCGAGGCG ATCAGAAAGG CTGGCGCCAT GCAAACAGTC AAGTTGGCTC 351 AGGAGTTTCC CGAACTGCTG GCCATTGAGG ACACCACCTC TTTGAGTTAT 401 CGCCACCAGG TCGCCGAAGA GCTTGGCAAG CTGGGCTCTA TTCAGGATAA 451 ATCCCGCGGA TGGTGGGTTC ACTCCGTTCT CTTGCTCGAG GCCACCACAT 501 TCCGCACCGT AGGATTACTG CATCAGGAGT GGTGGATGCG CCCGGATGAC 551 CCTGCCGATG CGGATGAAAA GGAGAGTGGC AAATGGCTGG CAGCGGCCGC 601 AACTAGCCGG TTACGCATGG GCAGCATGAT GAGCAACGTG ATTGCGGTCT 651 GTGACCGCGA AGCCGATATT CATGCTTATC TGCAGGACAG GCTGGCGCAT 701 AACGAGCGCT TCGTGGTGCG CTCCAAGCAC CCACGCAAGG ACGTAGAGTC 751 TGGGTTGTAT CTGATCGACC ATCTGAAGAA CCAACCGGAG TTGGGTGGCT 801 ATCAGATCAG CATTCCGCAA AAGGGCGTGG TGGATAAACG CGGTAAACGT 851 AAAAATCGAC CAGCCCGCAA GGCGAGCTTG AGCCTGCGCA GTGGGCGCAT 901 CACGCTAAAA CAGGGGAATA TCACGCTCAA CGCGGTGCTG GCCGAGGAGA 951 TTAACCCGCC CAAGGGTGAG ACCCCGTTGA AATGGTTGTT GCTGACCGGC 1001 GAACCGGTCG AGTCGCTAGC CCAAGCCTTG CGCGTCATCG ACATTTATAC 1051 CCATCGCTGG CGGATCGAGG AGTTCCATAA GGCATGGAAA ACCGGAGCAG 1101 GAGCCGAGAG GCAACGCATG GAGGAGCCGG ATAATCTGGA GCGGATGGTC 1151 TCGATCCTCT CGTTTGTTGC GGTCAGGCTG TTACAGCTCA GAGAAAGCTT 1201 CACGCTGCCG CAAGCACTCA GGGCGCAAGG GCTGCTAAAG GAAGCGGAAC 1251 ACGTAGAAAG CCAGTCCGCA GAAACGGTGC TGACCCCGGA TGAATGTCAG 1301 CTACTGGGCT ATCTGGACAA GGGAAAACGC AAGCGCAAAG AGAAAGCAGG 1351 TAGCTTGCAG TGGGCTTACA TGGCGATAGC TAGACTGGGC GGTTTTATGG 1401 ACAGCAAGCG AACCGGAATT GCCAGCTGGG GCGCCCTCTG GTAAGGTTGG 1451 GAAGCCCTGC AAAGTAAACT GGATGGCTTT CTTGCCGCCA AGGATCTGAT 1501 GGCGCAGGGG ATCAAGATCT GATCAAGAGA CAGGATGAGG ATCGTTTCGC 1551 ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA 1601 GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC TGCTCTGATG 1651 CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT TTTTGTCAAG 1701 ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG CAGCGCGGCT 1751 ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG 1801 TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG 1851 GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC 1901 TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC TGCCCATTCG 1951 ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG GATGGAAGCC 2001 GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG GGCTCGCGCC 2051 AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC GGCGAGGATC 2101 TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT GGTGGAAAAT 2151 GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG 2201 CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA GAGCTTGGCG 2251 GCGAATGGGC TGACCGCTTC CTCGTGCTTT ACGGTATCGC CGCTCCCGAT 2301 TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT TCTGAGCGGG 2351 ACTCTGGGGT TCGAAATGAC CGACCAAGCG ACGCCCAACC TGCCATCACG 2401 AGATTTCGAT TCCACCGCCG CCTTCTATGA AAGGTTGGGC TTCGGAATCG 2451 TTTTCCGGGA CGCCGGCTGG ATGATCCTCC AGCGCGGGGA TCTCATGCTG 2501 GAGTTCTTCG CCCACCCCGG GCTCGATCCC CTCGCGAGTT GGTTCAGCTG 2551 CTGCCTGAGG CTGGACGACC TCGCGGAGTT CTACCGGCAG TGCAAATCCG 2601 TCGGCATCCA GGAAACCAGC AGCGGCTATC CGCGCATCCA TGCCCCCGAA 2651 CTGCAGGAGT GGGGAGGCAC GATGGCCGCT TTGGTCGACC CGGACGGGAC 2701 GCTCCTGCGC CTGATACAGA ACGAATTGCT TGCAGGCATC TCATGAGTGT 2751 GTCTTCCCGT TTTCCGCCTG AGGTCACTGC GTGGATGGAG CGCTGGCGCC 2801 TGCTGCGCGA CGGCGAGCTG CTCACCACCC ACTCGAGCTG GATACTTCCC 2851 GTCCGCCAGG GGGACATGCC GGCGATGCTG AAGGTCGCGC GCATTCCCGA 2901 TGAAGAGGCC GGTTACCGCC TGTTGACCTG GTGGGACGGG CAGGGCGCCG 2951 CCCGAGTCTT CGCCTCGGCG GCGGGCGCTC TGCTCATGGA GCGCGCGTCC 3001 GGGGCCGGGG ACCTTGCACA GATAGCGTGG TCCGGCCAGG ACGACGAGGC 3051 TTGCAGGATC CTCTGCGACA CCGCCGCTCG TCTGCACGCG CCGCGGTCCG 3101 GACCGCCGCC CGATCTCCAT CCGCTACAGG AATGGTTCCA GCCGCTTTTC 3151 CGGTTGGCCG CTGAGCACGC GGCACTTGCG CCCGCCGCCA GCGTAGCGCG 3201 CCAACTTCTG GCGGCGCCGC GCGAGGTGTG CCCGCTCCAC GGCGACCTGC 3251 ACCACGAGAA CGTGCTCGAC TTCGGCGACC GCGGCTGGCT GGCCATCGAC 3301 CCGCACGGAC TGCTCGGCGA GCGCACCTTC GACTATGCCA ACATCTTCAC 3351 GAATCCCGAT CTCAGCGACC CCGGTCGCCC GCTTGCGATC CTGCCGGGCA 3401 GGCTGGAGGC TCGACTCAGC ATTGTGGTCG CGACGACCGG GTTTGAGCCC 3451 GAACGGCTTC TTCGCTGGAT CATTGCATGG ACGGGCTTGT CGGCAGCCTG 3501 GTTCATCGGC GACGGCGACG GCGAGGGCGA GGGCGCTGCG ATTGATCTGG 3551 CCGTAAACGC CATGGCACGC CGGTTGCTTG ACTAGCGCGG TCACCGATCT 3601 CACCTGGTCG TCGAGCTAGG TCAGGCCGTG TCGGGCGTGA TCCGCTGGAA 3651 GTCGTTGCGG GCCACACCCG CCGCCTCGAA GCCCTGCACC AGGCCGGCAT 3701 CGTGGTGTGC GTGGCCGAGG GACTATGGAA GGTGCCGGAC GATCTGCCCG 3751 AGCAGGGCCG CCGCTATGAC GCCCAGCGTC TTGGTGGCGT GACGGTGGAG 3801 CTGAAATCGC ACCTGCCCAT CGAGCGGCAG GCCCGCGTGA TCGGTGCCAC 3851 CTGGCTTGAC CAGCAGTTGA TCGACGGTGG CTCGGGCTTG GGCGACCTGG 3901 GCTTTAGCAG TGAGGCCAAG TAGGCGATAC AGCAGCGCGC GGACTTCCTG 3951 GCCGAACAGG GACTGGCCGA GCGGCGCGGG CAGCGCGTGA TCCTCACCGG 4001 AATCTGCTCG GCAGCAGCGG GCTCGGGAAC TGGCGCAGGC CGCGAAGGAC 4051 ATTGCCGCCG ATACCGGCCT GGAGCATCGC CCCGTGGCCG ACGGCCAGCG 4101 CGTTGCCGGC GTCTACCGGC GCCCCGTCAT GCTCGCCAGC GGGCGAAATG 4151 GGATGCTTGA TGACGCCAAG GGGTCCAGCC TCGTGCGGTG GAAGCCCATC 4201 GAACAGCGGC TTGGGGAGCA GCTCGCCGCG ACGGTGCGCG GTGGCGGCGT 4251 GTCTTGGGAG ATTGGACGAC AGCGTGGGCC GGCCCCTGTC TCTTGATCAG 4301 ATCTTGATCC CCTGCGCCAT CAGATCCTTG GCGGCAAGAA AGCCATCCAG 4351 TTTACTTTGC AGGGCTTCCC AACCTTCCCA GAGGGCGCCC CAGCTGGCAA 4401 TTCCGGTTCG CTTGCTGTCC ATAAAACCGC CCAGTCTAGC TATCGCCATG 4451 TAAGCCCACT GCAAGCTACC TGCTTTCTCT TTGCGCTTGC GTTTTCCCTT 4501 GTCCAGATAG CCCAGTAGCT GACATTCATC CGGGGTCAGC ACCGTTTCTG 4551 CGGACTGGCT TTCTACGTGT TCCGCTTCCT TTAGCAGCCC TTGCGCCCTG 4601 AGTGCTTGCG GCAGCGTGAA GCTTTCTCTG AGCTGTAACA GCCTGACCGC 4651 AACAAACGAG AGGATCGAGA CCATCCGCTC CAGATTATCC GGCTCCTCCA 4701 TGCGTTGCCT CTCGGCTCCT GCTCCGGTTT TCCATGCCTT ATGGAACTCC 4751 TCGATCCGCC AGCGATGGGT ATAAATGTCG ATGACGCGCA AGGCTTGGGC 4801 TAGCGACTCG ACCGGTTCGC CGGTCAGCAA CAACCATTTC AACGGGGTCT 4851 CACCCTTGGG CGGGTTAATC TCCTCGGCCA GCACCGCGTT GAGCGTGATA 4901 TTCCCCTGTT TTAGCGTGAT GCGCCCACTG CGCAGGCTCA AGCTCGCCTT 4951 GCGGGCTGGT CGATTTTTAC GTTTACCGCG TTTATCCACC ACGCCCTTTT 5001 GCGGAATGCT GATCTGATAG CCACCCAACT CCGGTTGGTT CTTCAGATGG 5051 TCGATCAGAT ACAACCCAGA CTCTACGTCC TTGCGTGGGT GCTTGGAGCG 5101 CACCACGAAG CGCTCGTTAT GCGCCAGCCT GTCCTGCAGA TAAGCATGAA 5151 TATCGGCTTC GCGGTCACAG ACCGCAATCA CGTTGCTCAT CATGCTGCCC 5201 ATGCGTAACC GGCTAGTTGC GGCCGCTGCC AGCCATTTGC CACTCTCCTT 5251 TTCATCCGCA TCGGCAGGGT CATCCGGGCG CATCCACCAC TCCTGATGCA 5301 GTAATCCTAC GGTGCGGAAT GTGGTGGCCT CGAGCAAGAG AACGGAGTGA 5351 ACCCACCATC CGCGGGATTT ATCCTGAATA GAGCCCAGCT TGCCAAGCTC 5401 TTCGGCGACC TGGTGGCGAT AACTCAAAGA GGTGGTGTCC TCAATGGCCA 5451 GCAGTTCGGG AAACTCCTGA GCCAACTTGA CTGTTTGCAT GGCGCCAGCC 5501 TTTCTGATCG CCTCGGCAGA AACGTTGGGA TTGCGGTAAA ATCGGTAAGC 5551 GCCTTCCTGC ATGGCTTCAC TACCCTCTGA TGAGATGGTT ATTGATTTAC 5601 CAGAATATTT TGCCAATTGG GCGGCGACGT TAACCAAGCG GGCAGTACGG 5651 CGAGGATCAC CCAGCGCCGC CGAAGAGAAC ACAGATTTAG CCCAGTCGGC 5701 CGCACGATGA AGAGCAGAAG TTATCATGAA CGTTACCATG TTAGGAGGTC 5751 ACATGGAAGT CAGATCCTGG AAAACGGGAA AGGTTCCGTT CAGGACGCTA 5801 CTTGTGTATA AGAGTCAG