LOCUS NZ_EQ973490 23343 bp DNA linear CON 04-MAY-2021 DEFINITION Bacteroides cellulosilyticus DSM 14838 Scfld4, whole genome shotgun sequence. ACCESSION NZ_EQ973490 REGION: 1329075..1352417 VERSION NZ_EQ973490.1 DBLINK BioProject: PRJNA224116 BioSample: SAMN00008759 Assembly: GCF_000158035.1 KEYWORDS WGS; RefSeq. SOURCE Bacteroides cellulosilyticus DSM 14838 ORGANISM Bacteroides cellulosilyticus DSM 14838 Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides. REFERENCE 1 (bases 1 to 23343) AUTHORS Sudarsanam,P., Ley,R., Guruge,J., Turnbaugh,P.J., Mahowald,M., Liep,D. and Gordon,J. TITLE Draft genome sequence of Bacteroides cellulosilyticus (DSM 14838) JOURNAL Unpublished REFERENCE 2 (bases 1 to 23343) AUTHORS Fulton,L., Clifton,S., Fulton,B., Xu,J., Minx,P., Pepin,K.H., Johnson,M., Bhonagiri,V., Nash,W.E., Mardis,E.R. and Wilson,R.K. TITLE Direct Submission JOURNAL Submitted (18-DEC-2008) Genome Sequencing Center, Washington University School of Medicine, 4444 Forest Park, St. Louis, MO 63108, USA COMMENT REFSEQ INFORMATION: The reference sequence is identical to EQ973490.1. Bacteroides cellulosilyticus (GenBank Accession Number for 16S rDNA gene: AJ583243) is a member of the Bacteroidetes division of the domain bacteria and has been isolated from human feces. The sequenced strain was obtained from Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ) (DSM 14838). This is a Newbler assembly (http://www.454.com/enabling-technology/the-software.asp) comprised of one full plate FLX PE 454 Data with a Q20 coverage of 16.3X. This sequenced strain is part of a comprehensive, sequence-based survey of members of the normal human gut microbiota. 
A joint effort of the WU-GSC and the Center for Genome Sciences at Washington University School of Medicine, the purpose of this survey is to provide the general scientific community with a broad view of the gene content of 100 representatives of the major divisions represented in the intestine's microbial community. This information should provide a frame of reference for analyzing metagenomic studies of the human gut microbiome. Further details of this effort are described in a white paper entitled 'Extending Our View of Self: the Human Gut Microbiome Initiative (HGMI)' (http://www.genome.gov/Pages/Research/Sequencing/SeqProposals/HGMISeq.pdf). These studies are supported by National Human Genome Research Institute. For answers to your questions regarding this assembly or project, or any other GSC genome project, please visit our Genome Groups web page (http://genome.wustl.edu/genome_group_index.cgi) and email the designated contact person. 
Coding sequences were predicted using GeneMark v3.3 and Glimmer2 v2.13. Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. tRNA genes were determined using tRNAscan-SE 1.23 and non-coding RNA genes by RNAmmer-1.2 and Rfam v8.0. Gene names are generated at the contig level and may not necessarily reflect any known order or orientation between contigs. Annotation was added to the contigs in March 2009. This is a reference genome for the Human Microbiome Project. This project is co-owned with the Human Microbiome Project DACC. Product names were updated in June 2013. The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). 
Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Date :: 05/04/2021 07:17:00 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 5.1 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,145 CDSs (total) :: 5,090 Genes (coding) :: 4,764 CDSs (with protein) :: 4,764 Genes (RNA) :: 55 rRNAs :: 1, 1, 1 (5S, 16S, 23S) complete rRNAs :: 1, 1, 1 (5S, 16S, 23S) tRNAs :: 50 ncRNAs :: 2 Pseudo Genes (total) :: 326 CDSs (without protein) :: 326 Pseudo Genes (ambiguous residues) :: 19 of 326 Pseudo Genes (frameshifted) :: 133 of 326 Pseudo Genes (incomplete) :: 191 of 326 Pseudo Genes (internal stop) :: 15 of 326 Pseudo Genes (multiple problems) :: 29 of 326 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..23343 /organism="Bacteroides cellulosilyticus DSM 14838" /mol_type="genomic DNA" /submitter_seqid="Scfld4" /strain="DSM 14838" /isolation_source="biological product [ENVO:02000043]" /host="Homo sapiens" /culture_collection="DSM:14838" /type_material="type strain of Bacteroides cellulosilyticus" /db_xref="taxon:537012" gene complement(1..4095) /locus_tag="BACCELL_RS09475" /old_locus_tag="BACCELL_02140" CDS complement(1..4095) /locus_tag="BACCELL_RS09475" /old_locus_tag="BACCELL_02140" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007661009.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="response regulator" /protein_id="WP_149949311.1" /translation="MKYTKTIALITTLLCFLQGQGLALAQDNPNQYNYLYLTIRNGLC DNSIRTIHKDHNSFMWFGTSNGLDRYDGYELKHYSTAPRQPYQFIESNYINDIDEDDN NYLWVASEAGIMSIDLLHENLNFYKEYSGKNNNVLYSPVQALLVDDFNNLWVGKSDGL AYIILNEERQIKDIRILKKDVDIKTIVKHGSDIWAGGDKCLLHFTPSGKQDYSNIPVI TNLDTSQLIFNRLFSYGDYLWIGTQSGLYCYNTQNQFCILYQHNPNNPHSISSNFITD IDKNSSGDIIIGTRNGVNIYQRNDQFVTFSRGIQARSLNDNIVNRIFVDKNDNIWVGT DFGGINIMAPQRITFNYSLQGYEKGVPNIISTVLEDKEGNILAGIVDGGLAIKRKGAD SFSLFKHNPGDPHSLAHNNISDIIQDLHGNYWISTIGGGLDKLDKNNLSHPVFEHYNS LNSSLTSDDIHDIALDSARNALWICSGSHINTLDFSTGAINRLKFYTQSKEPVHNMNT IFIDSQSRLWIGGNGVYVIDLENSRNTYECIYYQHKLDDPESKINEKITCIFETKKGE IYLGSLGNGIYLLEDNGNNEKYTFKNYAVRCGLSDTSISNILDDENGNLWISTLKGIY FFDINTKRAFKFDEGDGLLVPQFYKRSGCKTINHNMLLGTIDGFVTFSPLVNLPKQKQ RTITLTSVVCNGSQLIPYLNSDNLPVSISTTKELHLYPPQNSFEITFSCLDYIEQEKV FYFHRIHELEEHWNTGLVKRNAKYTNLPPGKYTLEIRCTNHDNTWSTEATCLSIIVHP PFYKTGWFYSLIAILIVSILLYIIYWYNARQQHIQKLLKEKIESMSEQMEAINKEKLS YFTNLAHEFKTPLTLIQGPASQLVRQTTDPKEKEDLQIINRNAQYLLSLVNQLIDLRK IDTQNLTLNYSQFNFSKFLNTTITDFSNLMKERNISFEKIYRLKSDHIYSDKENLHKI LFNLLSNAIKHTPDKGKITLHANQFIDKSNKLMQFISVTNSGSIIAPDEIDKIFNRFY RIPEQNKYTNYGQSSTGIGLHIVKELINLLDGTIKVKSSEKEGVSFRLYFPITLADAA ENEIHEEYKEPVPVEDKIEPFIPIDRSKPTLLLVEDNPDMRHYIKNMLKEKYNIAEAN NGEQGYKTAQNIVPDFIVSDLMMPICDGSDFCKRLREDKLLSHIPFLLLTANSSETAR IESYENGVDGYITKPFEQSVLLAHIDSILKNRDLRQKKFVEQDLNPILLEVGQPDQQF MNEVMNILEKNYADPQFGVKDLTERLNISYTVIYKKFVSLTGLPPVRFIQLYRLQIAK KILESSTNNVIVSEIAYRVGFNDPKYFTRCFVKQYKQTPSSFFK" gene complement(4115..5446) /locus_tag="BACCELL_RS09480" /old_locus_tag="BACCELL_02141" CDS complement(4115..5446) /locus_tag="BACCELL_RS09480" /old_locus_tag="BACCELL_02141" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007661008.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="family 43 glycosylhydrolase" /protein_id="WP_007211515.1" /translation="MKNKRLFLLSAFLVIIGTLKAQNPIITDQFTADPTAKVFEGKMY VYPSHDIPSPIERLKEWFCMADYHVFSSDNLIDWTDHGVILSQENVPWVAPDSYSMWA PECVYKNGKYYFYFPSTPKGEGKRGFSIGVAIADKPYGPFTPQATPIEGVNGIDPCVL IDKDGQAYIYWSGRGMSVAKLKDNMLELASEPMQIQGLPEGFKEGPFAFERNGKYYFT FPWVKEKTEVLAYAMGDNPMGPFEFKGIIMDESPTDCWTNHHSLVEYKDQWYLFYHHN DYSPEFDKNRSARVDSLFFNPDGTIQKVIPTLRGVGITDARKEIQLDRYSRLSNQGAA IDYLNQANKFEGWKTLLSKNGAWVQYNRVNFGDSTPRKVKARVTSDNGGTLQIRINGT NGPVISEIKVPKTDKWTNIESSVRSAQTGIHDLYVSLKGNGKVEIDWISFQ" gene complement(<5695..8280) /locus_tag="BACCELL_RS09485" /old_locus_tag="BACCELL_02142" /pseudo CDS complement(<5695..8280) /locus_tag="BACCELL_RS09485" /old_locus_tag="BACCELL_02142" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007661007.1" /note="incomplete; partial in the middle of a contig; missing C-terminus; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="glycoside hydrolase family 3 C-terminal domain-containing protein" gene complement(8329..9387) /locus_tag="BACCELL_RS09490" /old_locus_tag="BACCELL_02143" CDS complement(8329..9387) /locus_tag="BACCELL_RS09490" /old_locus_tag="BACCELL_02143" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007661006.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="glycoside hydrolase family 43 protein" /protein_id="WP_007211517.1" /translation="MKILFHFTITLFVLSLVACNQTKSPEPVKQYTFIGGKEGDKMDT TSFDSLQLSDPFILADEETQMYYLVGSGGSLWKSTNLKMWTGPYQYITVDTTSWIGTA PRIWAPELHKYKGKYYCFVTFTNPKIIVDTVPNRYNVQRRATHILTSDKAAGPYHPIS GTDYLPEDWSTLDGSFWEEDGVPYMVFCHEWMQTVNGMINYIQLAPDLSESIGTATTL FRASDAPWPREMKSIGELTFGMALEGYVADGPFLFRTGTGRLGMLWSSWSNSRCAQGV AYSGSGKLAGPWIQCNTPLIPNNSGHGMLFRTFDGKLLMCLHHQSLDSENPGPRRPTL FEADISGDEIKILGRYHP" gene complement(9434..10588) /locus_tag="BACCELL_RS09495" /old_locus_tag="BACCELL_02144" CDS complement(9434..10588) /locus_tag="BACCELL_RS09495" /old_locus_tag="BACCELL_02144" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007216414.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="esterase" /protein_id="WP_007211518.1" /translation="MKLKSILYTTLIASLITGCAQKPKEKEATAPVSASTNILGISYP VVNPDLSVTVRVNAPDADSVKLDLMKKYPMTKNADGIWEVTSDPQVPGFHYYFLEIDG CSVADPSSELFYGCGRMSSGIEIPEEGVDYYLPQHVPHGEIRTQMYYSDITQAWRKCL VYTPAGYDENNTQKYPVLYLQHGSGEDETGWSNQGKADNILDNLIASQKAVPMLVVMD RGYATDPKEKETGQKGRFNFNTFERVVINELIPMIDKNYRTLPDREHRAIAGLSMGGF QAVSIGLAHLDKFAHIGGFSGGGRMNSNELNTAYNGVFADAEAFNQKVKTLYISLGTE EAARFKNVTEFHDVLTQANINHIYYESPGTAHEWLTWRRSLHQFAGLIFK" gene complement(10596..11714) /locus_tag="BACCELL_RS09500" /old_locus_tag="BACCELL_02145" CDS complement(10596..11714) /locus_tag="BACCELL_RS09500" /old_locus_tag="BACCELL_02145" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004293186.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="WP_081450808.1" /translation="MIFCSIITGTMWAQGIPSVTNIPNAQYPMVNPDRTVTFKVKAPD AKSVQIVLDRPYEMQKGQDGTWELTSSPQVPGFHYYSVQIGGLQVTDASTYSYFGMSR QASAFEVPEEKVDFYLPQKGVPQGALRSRRFYSKVCDEWRRMYVYTPAEYEKNPDKRY PVLYLQHGGGEDERGWPNQGCIGAILDNLIASGKAKPMIVVMNCGYAVYAGNKYPEQQ PNARSSVDAFVAFEDMMIKDVIPLVDSTYRTLTDKENRAMAGLSWGGKQTLETTLHHR DLFSYIGSFSGALPIDKNTDIDALYEGAFKDPVTFNKDFKFIWLGIGEEEGPNAELLH AALSKKGINSLFYQSPGTAHEWLTWRRCLYQFTPLIFK" gene complement(11999..14503) /locus_tag="BACCELL_RS09510" /old_locus_tag="BACCELL_02147" CDS complement(11999..14503) /locus_tag="BACCELL_RS09510" /old_locus_tag="BACCELL_02147" /inference="COORDINATES: protein motif:HMM:NF012961.1,HMM:NF014924.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycoside hydrolase family 9 protein" /protein_id="WP_081450809.1" /translation="MKLKAIILCMGVVGCLIFTGCGKSTSFSRNSHLALNDSNYFETR GLNFFVFSNLYDATFDDSKISAVEIIHHGIRTATNGDVRMNPTPGQWNKLPKFVERMP DKEHNRIDVILEYPEYNFEYKLTGEARDGGFYLSVNVDKPLPEALHGIAGLNMEFMPP VFFGHSYIIDGKHGLFPTSPADFMTTINGEVEPTPMATGKLIDIAPDEPLKHITIRTT DDNNLSLFDGRNKQQNGNFVVRTLLPAGKTGKIAEWFITAETQSDWVRKPVVSYSQVG YHPTQKKVAVVELDKNDKPLSTVSLYKVCSDGSLSKALSGTPKTWGMYTRYNYLQFDF SSVTEPGIYMLEYGDQRTAPFPIATDVFQKAWFPTLDVFFPVQMDHMFVREAYRVWHG AAHLDDALQAPVNRIHWDGWRQGPTTGNKYKPLEHIPGLNVGGWFDAGDFDIQTGSQH AVVQAFALLWESFGVNRDETTINQKTRYTEIHVPDGKPDVLQQIEHGVLQLVAQVNAI GYAIPGINESHLYQYRHLGDAVNKTDNLVYNPRLDSLQTDGRTSGTPDDRWAFTNRTP HMNYGTAISLAAASRALKDYNPELSKEALRVARFIWDDEHNHQANPEEQTYSGRFSNP EFMKSIECRAAFELWRSTDYEGYKTKMNELLPTLLEQFNRNASVIAQMIPFMDNAFKK QVRPLVEAYAGELAEVDKENPYGVRISTAGWAGNRNIVQACITNYLLHRSYPELINPE YIYRGLNYLYGCHPCHNLSFVSGVGAQPKKVAYGSNRADFSFIPGGVVPGIRILKPDF PENREDYPFLWSENEYVIDLAASYIYLVNAVNSLLK" gene complement(<14832..16566) /locus_tag="BACCELL_RS09515" /old_locus_tag="BACCELL_02148" CDS complement(<14832..16566) /locus_tag="BACCELL_RS09515" /old_locus_tag="BACCELL_02148" 
/inference="COORDINATES: similar to AA sequence:RefSeq:WP_007661000.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="RagB/SusD family nutrient uptake outer membrane protein" /protein_id="WP_007211522.1" /translation="MKKYIYSFAVCAALFCTSCDESRLNIEQKAVSSTENFYKTDQDA VDAMTAAYAQFLDNVCSSAGIYQPQFTLLNYSADDVFSAGTNYADHMDMRVFDEFRYD TGNAPLKELYQRYYKSIYASNLVITYVDPAASTVMARCVAEARVLRAYCHMMAALTFQ RPPKVDKLLGADDKPTNVESQKSLLEWCASECEEAVNDLDSRKGPNDVEGAFKVTKGF AYFVAGKSAVFAGDMARAERNLKPLVESPNYALVPGNRFRDLFHVEGNGCEEKIFEAN CFGNIDKGFWGNIQRGRWMTNNVLNWRYDRLGSRASICGANDGWGGGAINWKFAEKMY QNDGDGPRRKATFLTPDEFLYDSKLCGWETDFDADGNEMSLADKKKDPKRGIKSTEGL FAHGVYMEVKNIESPNDRNMTISDTHNSASFRIARLGEAYLLYAEACLETGNLDEGKK YLNAIQKRAEAPETELTTQTLRDEKQYELWFETCRFHDIVRWGIAKECQDAVVDQVPQ LYDDFFIQGTPYYGKEHHLRAELTHPLSVDQNLPASSYGFVKGKHEYFPFPKDVIDLN KELHQLNGWANN" gene complement(16590..19741) /locus_tag="BACCELL_RS09520" /old_locus_tag="BACCELL_02150" /pseudo CDS complement(16590..19741) /locus_tag="BACCELL_RS09520" /old_locus_tag="BACCELL_02150" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007216410.1" /note="frameshifted; too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="TonB-dependent receptor" gene complement(20005..21456) /locus_tag="BACCELL_RS09525" /old_locus_tag="BACCELL_02152" CDS complement(20005..21456) /locus_tag="BACCELL_RS09525" /old_locus_tag="BACCELL_02152" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004292093.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="family 43 glycosylhydrolase" /protein_id="WP_044153754.1" /translation="MKTKAILLGTAALMFVLTSEKAIAQIGTPYIHDPSTIMECDGKY YTFGTGGGGLISEDGWTWNGGGVRPGGGAAPDAVKIGDRYLIAYSATGGGLGGGHSGK VLTMWNKTLDPNSPDFKYTEPIEVASSVNDEDCDAIDAGLLLDPTDGRLWLSYGTYFG FIRLVELDPATGKRMEGNKEIDIAIDCEATDLIYRDGWYYLLGTHGTCCDGPNSTYNI VVGRSRKVTGPYVDNMGRKMLEGGGKMVIAAGNRQTGPGHFGLFKVADGVEKMSCHFE ADFDRGGRSVLGIRPLLWKNGWPVAGEVFKEGTYEIESERRGYALELAVDFVRMEYTR HRFWEKDDTPVVPLKSQTLEDVIGTWPQGKINVRIGDYMFRPHQKWTITAVPDGGGYL GGPYYKIVIEGTNRALAATTDAEVVTVPEFTGAPEQLWRIDQLTDGTYRIMPKEVPGT DKELVLVSVGDSTPSLGEFDMNSDNSKWNFRDH" gene complement(21459..22157) /locus_tag="BACCELL_RS09530" /old_locus_tag="BACCELL_02153" CDS complement(21459..22157) /locus_tag="BACCELL_RS09530" /old_locus_tag="BACCELL_02153" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004292094.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="WP_007211527.1" /translation="MNIFHRKSLLAACIAAMIGLHGYAQKNEASPRLSDYFSPATTNT MSPDSEGFIQRWLLLEPIDKPNRSNTVFTDSYIREAFATEYFPNQFTVLPKDGDKVKV GKQKLTWHALDSKLFNVKLFRFASGLKKQVYGVLFWAVTFIECPEDMENIRMSVGSNS ASMWWLNGEEAVILSGDRRMVKDDCLSRRLTLKKGKNIIRGAIINGPGMSDFCVRFLN ENGTPVNNITISYK" gene complement(22186..23343) /locus_tag="BACCELL_RS09535" /old_locus_tag="BACCELL_02154" CDS complement(22186..23343) /locus_tag="BACCELL_RS09535" /old_locus_tag="BACCELL_02154" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007216407.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="esterase" /protein_id="WP_044153756.1" /translation="MKNVLKLIFSSLLVFPFTMNAQQQDFPVGTTPNEHNINGADYPR IGEDRRVHFRIHAPNAQKVEISFRGEMTKEADGYWSLVSKEPEVIGFHYYQVIIDGVS AADPNGKPFFGMGKWVSGIEIPEKGVDYYSIKNVPHGLISQSWYYSDIRKEWRRCIVY TPAEYDKNPTKKYPVLYLQHGMGENETSWANQGKMNFIMDNLIAEGKAKPMIVVMDNG NIEVFKTNPGETPDGARKRFGAEFPAILVNEIIPHIESNFRTLTDRDNRAMAGLSWGG LLTFNTTLNNLDKFAYIGGFSGAGSIDLKQLDTVYGGVFKNRKAFNDKVHVFFLGIGS EEHPERTKNLSDGLQAAGINTIYYESPGTAHEFLTWRRCLKEFVPLLFKTK" ORIGIN 1 ctacttgaag aatgaactgg gcgtttgctt atattgtttc acaaagcaac gggtgaagta 61 cttggggtca ttaaatccca cccggtaggc tatttctgat acaatgacgt tattggtgct 121 actctctaat attttcttgg caatctgtag cctgtataat tggatgaaac gtaccggcgg 181 aagtccggtg agtgatacga atttcttata aatcactgta taactgatgt tcagcctttc 241 tgtcagatcc tttacaccga attgaggatc agcatagttc ttttccagaa tattcataac 301 ttcgttcatg aactgctggt cgggttgtcc cacttccagc aggataggat tcaagtcttg 361 ttctacgaat ttcttctgac gcaaatccct gttcttcaag atggagtcta tgtgggctaa 421 taatacggat tgttcgaaag gcttggtgat atatccgtct actccgtttt catagctttc 481 tattcgggca gtctcacttg aatttgccgt gagaagcagg aaaggaatgt gacttaaaag 541 cttgtcctcg cgcagtcttt tgcagaagtc ggaaccatcg catatgggca tcattagatc 601 ggatacaata aaatcgggca cgatattctg ggctgtctta tagccttgct caccgttgtt 661 tgcttcagcg atattgtatt tttctttcag catattcttg atataatgac gcatatcggg 721 attgtcttct accaataaca aagtcggctt gctcctgtct ataggaatga acggctcgat 781 tttgtcttca acgggtacgg gctctttata ttcctcatga atttcatttt ccgctgcgtc 841 cgccagcgta atggggaaat ataatctgaa agatacacct tctttttccg aactcttgac 901 tttaatcgtt ccgtctaata ggttgataag ctcttttacg atgtgaagtc cgatgcccgt 961 actgctctgt ccgtagtttg tatatttgtt ctgttcgggt atccggtaga aacggttgaa 1021 tattttgtct atttcgtcag gagctattat gctgccactg ttggtgacgg agataaactg 1081 catgagtttg ttcgatttgt cgatgaattg gttggcatgc aatgtgattt ttcctttatc 1141 gggagtatgc ttgatggcgt tggataatag attgaacagt atcttatgca ggttttcctt 1201 atcagaatag atatggtcac tcttcaggcg gtatatcttt tcaaagctga tgtttcgttc 1261 tttcatcagg ttggaaaagt 
cggttattgt tgtgttcagg aacttgctga agttgaattg 1321 agagtaattg agcgttaagt tctgggtatc gatcttgcgc aagtcaatca gttggttgac 1381 cagtgaaagc aggtattgag cgttccggtt gataatctgt aagtcctctt tttctttggg 1441 atcggtggtt tgcctgacaa gctggcttgc cggtccttgt atcagggtca ggggcgtctt 1501 aaactcatgt gcgaggttgg tgaaatagga tagcttctct ttgttgatag cctccatctg 1561 ctcggacatg ctttctatct tttctttcag aagtttctgg atatgttgtt gccttgcatt 1621 ataccaatag atgatataaa gcagaatgga cactatcaat atagcgataa gtgagtagaa 1681 ccatcctgtt ttatagaatg ggggatgtac aataatgctg agacaggtgg cttcggtaga 1741 ccaggtgttg tcgtggttgg tgcatcgtat ttccagcgta tattttcccg gcggcagatt 1801 ggtgtacttg gcgtttctct ttaccaatcc tgtgttccag tgctcctcca gttcatgaat 1861 gcgatggaag tagaacacct tttcctgttc gatataatcc aggcagctga atgtgatttc 1921 aaaggaattc tgtggtggat ataaatgcag ttccttggtt gttgagatac ttacgggcag 1981 attgtcgctg ttcagataag gtatcagctg gctgccgtta cagacaacgc tggtgagggt 2041 tatcgtccgc tgcttttgct tgggcaggtt taccagtggg ctgaatgtta cgaaaccatc 2101 gatagttccc agtaacatat tgtgattgat ggtcttacaa ccggatcttt tatagaattg 2161 gggtaccagt aacccgtctc cttcgtcgaa cttgaaggcc cgtttggtat tgatatcgaa 2221 gaagtaaatg ccttttaggg tactgatcca taagttcccg ttctcatcgt ccagaatgtt 2281 cgagatgctg gtgtccgaaa gaccgcagcg tacggcatag ttcttgaaag tatatttttc 2341 gttgttaccg ttgtcttcca acaggtagat gccatttccg aggctaccca gataaatctc 2401 ccctttttta gtttcgaata tacaagtgat cttttcattg attttgcttt cgggatcatc 2461 cagtttgtgc tgatagtata tacattcgta ggtattcctg gaattttcca ggtcgatgac 2521 gtacacgccg tttccaccaa tccataatct tgactgtgag tcgatgaaga ttgtattcat 2581 gttatgaacg ggttctttgg actgtgtgta gaactttagc cgattgatag ctcccgtaga 2641 gaaatcgagt gtattgatat ggctgccgct gcatatccac agtgcattcc gggcagagtc 2701 cagggctatg tcatggatgt catcggaggt cagggaggaa ttcaaactgt tgtaatgttc 2761 gaatacggga tgtgacagat tgtttttatc cagtttatcc agccctccgc ctattgttga 2821 tatccagtaa ttgccatgca ggtcctgaat gatatccgag atgttgttgt gagccagtga 2881 gtgcggatct cccgggttgt gtttgaagag cgaaaaggag tcagcacctt tcctcttaat 2941 ggcaagtccg ccatctacaa ttcctgccag 
gatgtttccc tctttgtctt cgagcaccgt 3001 gctgatgatg ttgggtactc ctttctcata cccttgcagt gaataattga aagttatgcg 3061 ttgcggagcc atgatattaa ttccgccgaa gtcggtccct acccagatat tatcattctt 3121 atcaacaaat atccggttga cgatattatc attcagggac cgggcttgta tgcctctgct 3181 gaaagtaacg aattggtcat ttcgctggta tatgttcact ccgtttctgg ttccgataat 3241 gatgtcgcct gatgaatttt tatcaatatc tgtgatgaag ttggaggaaa tgctgtgagg 3301 attattggga ttatgctgat agagtatgca gaactgattc tgggtgttat agcagtataa 3361 gccactttgg gtgccgatcc acagatagtc accatacgaa aacaggcggt tgaagattaa 3421 ctgtgaagta tctaagtttg taattacggg gatattgctg taatcctgtt tgcccgaagg 3481 agtgaaatgc aataaacact tgtcgcctcc cgcccagata tcactaccgt gttttacgat 3541 ggttttgata tccacatctt tcttcaatat ccggatatct tttatctgcc tttcttcgtt 3601 cagaatgatg taggccagtc cgtcactttt gcctacccag agattattga aatcgtcgac 3661 aagcagggcc tgaacgggag agtaaagcac attattgttc ttaccggagt attccttata 3721 gaagttgaga ttttcgtgaa ggagatcaat gctcatgatg ccggcttcgg aagctaccca 3781 caaatagttg ttgtcgtctt cgtcaatgtc attgatgtaa ttactttcaa taaactgata 3841 cggttgcctg ggagctgtgg agtagtgttt cagttcgtaa ccatcgtaac ggtcgagccc 3901 gttggaggtt ccgaaccaca tgaatgagtt gtgatctttg tgtatggttc ggatagagtt 3961 gtcgcataat ccgttccgga tagtaagata aaggtaattg tattggtttg gattatcctg 4021 ggcaagcgct aaaccctgcc cttgcaggaa gcagagcagg gttgtgatta gtgctattgt 4081 cttcgtatac ttcatatccg gcatggagat ttagttattg aaaactgatc cagtcgattt 4141 caactttacc gtttcctttt agagacacgt ataaatcgtg gataccggtc tgagcggaac 4201 gaaccgagga ttcaatattc gtccatttat cggttttggg aacttttatt tccgagataa 4261 ccggaccgtt tgttccgttt atacgtattt gcagtgttcc tccgttgtcc gaggttactc 4321 ttgctttcac tttccggggg gtgctatctc cgaagttcac ccggttgtac tgcacccatg 4381 caccgttttt gctcaataag gttttccagc cttcgaattt gtttgcttgg ttcaggtaat 4441 cgatggctgc tccctgattg cttaaccggc tgtaacgatc cagttgaatt tctttccggg 4501 catctgttat gcccactccg cgcagggtgg ggattacttt ttggatagtg ccgtccgggt 4561 tgaagaataa ggaatcgact cgtgccgatc tgtttttatc gaattcgggg gagtaatcgt 4621 tgtggtgata gaacaggtac cactggtctt tgtattccac 
caatgagtgg tgattggtcc 4681 agcaatcggt tggtgattca tccataatga tacccttgaa ctcaaaggga cccatgggat 4741 tgtctcccat cgcataagcc agaacttccg ttttctcctt tacccagggg aatgtgaagt 4801 aatattttcc gttgcgttca aaggcgaaag gtccttcttt gaatccttcg ggtaatcctt 4861 gaatttgcat gggttctgag gcgagttcaa gcatattgtc tttcagtttt gctactgaca 4921 ttccgcggcc tgaccagtat atgtaggctt gcccgtcttt atcaatgagc acgcaagggt 4981 cgatgccatt gactccttca atgggagtgg cctgcggggt gaaagggccg taaggcttgt 5041 cggctatggc aactccaatg ctgaaaccgc gtttgccttc gcctttaggg gtggatggga 5101 agtagaaata atatttccca ttcttgtata cacattcggg ggcccacata gaataggagt 5161 caggagctac ccaaggtaca ttttcctggc ttaggatcac tccgtggtct gtccagtcaa 5221 tcagattgtc tgacgagaaa acatggtaat ctgccataca gaaccattct ttcagacgct 5281 cgatggggct cggaatgtca tgtgaaggat agacatacat cttgccttca aaaactttgg 5341 cggtgggatc ggcagtgaac tgatccgtaa taatggggtt ctgagctttt aatgtaccta 5401 ttattaccag aaatgcggat aataaaaata gtcttttgtt tttcataagt tgactgtttt 5461 taatgtgtac gggctttcta tttcttcatc cggacacaaa gataaaataa aaattcgttg 5521 gcggaattaa tttctttctt ttcggttatg aatataattt taggcgcgga ttacgcggat 5581 tacacggatt acacggatta aaggctgcgc tacaacattg tactacagaa aaaaatccgt 5641 gtaatccgcg taatccgcgc ctaaattatt atcagcattt agttcattac tatcccgtaa 5701 agtgactttg acagattgca gatcccgggc tgcggaactc gttccataga ataactcgta 5761 ttcacccggt gccacacgca tggtatttgt agaaggatca aaacattcga atgattcgga 5821 tggcaactct atcttgacat cctgtttggc tcctgctgcg agttctacac gtgcataggc 5881 tttcagggac tttaatggtc cgtcagcatc atctgttttg cgtatgtaaa cctgtacgac 5941 ttcagttcca gagcgcttgc ctgtgttgga gaccggtaca gtcaaggtca gcgactcatc 6001 tgtatggagt tgggttttgt tgcaactggc tgtacctacg gcgaaatccg tgtaacttaa 6061 tccgaatcca aatgggaaca gagggtcgga catataccgg tatgtacgtc ctttcatgga 6121 gtaatcctca tagtcgggaa gttgtttcgt acttttgtag aaagtgacgg gcagcttgcc 6181 ggagggatta taatcaccaa acagcacgtc ggctacagca tatccaccca gttctcctcc 6241 ataccatgcc tggagaatag cgtcgcaact ttccgtttcg ggcagcaatg ccatagacga 6301 tccggaacag ttgacaaata ctacttgttt tccggcatct ttcagagctt 
tcaggaagtt 6361 tcgctgtaca gcgggtaatt ctatatccgt acggtcaccc cctttgaaac cgggaatatt 6421 cacgggcatt tcttctcctt ccagttgtgg cgaaatgcca cctacgaaga ccactacatc 6481 tatgccttta agtttcgaaa tgctttcact gtaattgatc gggaattctt tgccgaggtt 6541 gaacttcagg tttgcagccc agttttctac ttgtgcatag agtatttcta tcttatactc 6601 tttccctttc tccacttgta gcatggtgcg ggtatcggtg gttctccaag tgcggtgttt 6661 agtcatcgac ttgccgttta ccaataattc aaagtaactg caaccttcga gattgaggag 6721 aatttcgcct gattccttcg gagttaatac ggtttcgtat ttggctgaga acttttcaag 6781 atttactccg gggccaaaag tatgtaatcc gtatgtagtc acgttgatgg gctgggtggt 6841 acggagcgaa gaaacaggct ctccctggcg ttccggattg ttccagaatg tagcttttat 6901 gccgggtttc ccatctatgc tgcattggtc aagataggat tccaatgttt ggtcattcac 6961 caggtcacag cctttcatgt agacgatctg attctttttc agtttacttt tgaagccgtc 7021 caggatggta atggtctgat tcggagttcc gttgtagttg ccccacagca taggtttgtc 7081 atcggcgtta ggaccgataa ctgcaatctt gcggatggat ttacttaaag gcaatacatt 7141 gtttttgttc tgtaagagtg tcattgtctg gcgtgacata ttcagtgaca aatctttgtg 7201 agccttacag ttgactaccg acatcggtat ttttgtccag ttcactaatg acgggtcgtc 7261 catttctccg agttcaaagc gtccttccat caggcgaagc acatgtttgt ctacttcttc 7321 ttccgttatc aatcctttgg atacggcttc cggcagtttc tgataagcat atccgtatcc 7381 acattccaca tctgtacctg ccatggttcc tttgacggct gcatgtacgg catcggagga 7441 acttttgtgg gaagtccaga agtcggcaat ggctccacaa tcggagacta cgagatattt 7501 gaatccccac tcatcacgga gtatctgttg aagcagacgg gtgtttccac aacaaggatc 7561 atcgtccagg cgctggtagg cacacatcac ttcgcgtaca tcggcttttt gtaccagtgc 7621 cttgaacgca ggcaggtaag tttcccacaa gtcgcgggga ctgacgttgt tcagattggc 7681 ggtatgacgg ctccattccg gaccgctgtg tacggcataa tgtttggcac atgcaagcag 7741 tttgcgatac ttctcatttt caggtccctg caatcctttt accactgcaa tacccatttg 7801 tgaagtcagg taaggatctt ccccgtaagt ttcctgacca cgtccccaac ggggatcacg 7861 gaagatgttt acgttgggag tccagacgga gagactgtgg aaacggacgt cctccagtcc 7921 gttgcgtacg cgctcgttgt gcttggcacg catttcatcg gagactgcat tgaaaatgtc 7981 gaataccagt ttgtcattga atgaagcggc cattcctacc ggttcgggga agactgttac 
8041 atttccctga ttggctaccc cgtgaagggc ttcactccac caattgaact tctttattcc 8101 caggcgtgga atggcttccg aatcatcaca catcaagagc gctttctctt ctaaagtcaa 8161 gcgtttcact aaatccttcg ctctttctgc cggacttagt tcgggattct gatacggcaa 8221 tgtttgtccc caccccgata gggagaataa caaaccacag gttactaata cctttttcat 8281 aataatgttg ttttcatggt ataaaattat gtttttattt cttggttgct aaggatgata 8341 tctccccagt attttaattt catctcctga aatatcggct tcgaacagtg tgggtcttcg 8401 tggacccggg ttctcggagt cgaggctttg gtgatgcagg cacatcagca actttccatc 8461 gaatgtacgg aataacatac catgccctga attattaggt atcagaggag tattgcactg 8521 aatccagggg ccggcaagtt tacccgaccc ggaataagcc actccctgcg cacaacggct 8581 gttgctccaa ctggaccaca acattcctaa acgtcccgta cctgtccgga acaggaacgg 8641 accatctgcc acatagcctt ccagtgccat gccgaaggtg agttcaccta tcgatttcat 8701 ttctcttggc catggggcat cggatgctct gaaaagggta gtagcggtcc ctattgattc 8761 tgacaggtcc ggagcaagct ggatgtagtt gatcattccg tttacggttt gcatccattc 8821 gtgacaaaag accatataag gtactccgtc ttcttcccag aaactaccgt cgagtgtaga 8881 ccagtcttcc ggcagatagt ctgtgccgct gatggggtgg tagggacctg ctgctttgtc 8941 ggatgtaagg atatgggtgg cacgacgttg cacattgtat ctgttgggta cagtgtctac 9001 tatgatcttg ggattggtaa aggtgacgaa acaatagtat ttacctttat atttatggag 9061 ctccggggcc catatccttg gagcggttcc aatccaggaa gttgtatcca cagtaatgta 9121 ttgataggga cctgtccaca tctttagatt cgtgcttttc cataaactgc cgccacttcc 9181 aaccagataa tacatctgtg tttcttcatc ggccaggata aagggatcgc ttaactgcaa 9241 agagtcaaag gaagtggtgt ccatcttatc tccttccttg ccgccgataa atgtatattg 9301 ctttaccggt tccgggctct tagtctgatt acaggcgacc agactaagaa cgaacagagt 9361 tattgtgaag tgaaacagta tcttcatagt tgcaagtatt gttaaaacta catctgtatt 9421 atctgaagca tcatcatttg aatattaagc ctgcaaattg atgcaatgaa cgccgccagg 9481 tgagccattc gtgggctgta ccgggagact cgtaatagat atggttgatg tttgcttgag 9541 ttagtacatc atggaattcc gttacatttt taaagcgtgc ggcttcttcg gttcccaagc 9601 tgatatacag tgttttcact ttttgattga aggcttcggc atcagcaaaa acgccgttgt 9661 aggctgtatt cagttcattg ctgttcatcc ttcctccgcc actgaatcca ccgatatgag 9721 
caaatttatc taaatgggcc agcccgatgc ttacagcctg gaaacctccc attgacaagc 9781 cggctatggc acggtgttcc ctgtcgggca aagtacggta gtttttatca atcatgggta 9841 tcagttcatt gataaccacc cgttcaaaag tattgaagtt gaacctgcct ttttgtcccg 9901 tttctttttc tttggggtct gtagcatatc ctctgtccat gactaccagc atgggaacgg 9961 ctttttggct ggcaatcagg ttatccagaa tgttgtctgc ctttccttgg ttggaccatc 10021 cggtttcgtc ttctccgcta ccatgttgca gatataatac gggatacttc tgagtgttgt 10081 tctcgtcata acctgcagga gtatatacga ggcactttct ccatgcctga gtgatgtcgg 10141 agtagtacat ctgtgtgcgg atttcaccgt gtggaacgtg ctgaggtaaa tagtaatcca 10201 ctccttcttc aggaatttca atgccgcttg acatccggcc acatccgtag aacaattcgc 10261 tggaaggatc ggctacggaa cacccgtcta tctccaggaa ataatagtgg aaaccgggca 10321 cttgcggatc ggaggttact tcccaaatac cgtctgcatt cttggtcatg ggatatttct 10381 tcattaagtc cagttttacc gagtctgcat cgggagcatt taccctgacg gttactgata 10441 agtccggatt taccaccgga taactgatac cgaggatatt ggtcgatgcc gataccgggg 10501 cagttgcttc tttttctttg ggtttctgtg cacatccggt aatcagggaa gcgatcagag 10561 ttgtatataa gatggatttc agtttcatag tctgattatt taaagataag aggtgtaaat 10621 tgatagagac agcgccgcca ggtgagccat tcgtgggctg tgccgggtga ctgatagaat 10681 aggctgttga ttcccttctt gctgagtgct gcatgcagaa gttcggcatt gggaccttct 10741 tcttcaccga ttcccagcca gatgaacttg aaatctttat tgaatgtgac gggatctttg 10801 aaagcgcctt cgtagagtgc gtctatatcc gtatttttgt ctatgggcaa tgctccgctg 10861 aatgatccga tgtaagagaa taaatcccgg tggtgcagag tggtttccag agtttgttta 10921 cctccccatg acaaacctgc catggcacgg ttctctttgt ccgtcaatgt gcggtaagtc 10981 gaatctacaa gtggaattac atcttttatc atcatgtctt cgaaggcgac gaatgcgtct 11041 accgatgagc gggcattcgg ctgttgttcg ggatatttgt ttcctgcata tacggcatat 11101 ccgcagttca tcactacaat catcggtttg gcttttccgc tggctatcag gttgtcgagg 11161 atggcaccga tacatccttg gttgggccaa cctctttcgt cttcgccacc tccatgctgc 11221 aagtacagga cggggtaacg cttatccgga ttcttctcat actcggcagg ggtatatacg 11281 tacattctgc gccattcgtc gcatactttc gagtagaatc tgcgggaacg gagtgctcct 11341 tgcggtaccc ctttctgcgg taagtagaaa tctacttttt cctcgggcac 
ttcgaaggca 11401 cttgcctggc ggctcatccc gaagtaggaa taggttgagg cgtcagttac ctgcaagcct 11461 cctatttgta ccgagtaata gtggaaaccg ggcacttgcg gagaagaggt gagttcccac 11521 gttccgtcct gtcctttttg catttcgtag ggacggtcga gaacgatttg tacgctcttt 11581 gcatcgggag cttttacttt gaaagtcaca gttcggtcgg ggttcaccat aggatattgg 11641 gcatttggaa tattggtaac tgagggtatt ccctgtgccc acatcgtccc tgtgataatg 11701 gaacaaaata tcattgcgct tcccgtcttg ataattgcac ttgataattt ctttcttttc 11761 attgttcgtc tttttatact tttcaaaaaa actattcaaa tattaatgat aactgcaaaa 11821 gtagtatgta accctcttaa ggaggggtat aaaactgttt ttgaggagga cttttcttgc 11881 tggcagggca tttatgtaac acttgtgtta catttgcagg acacaagtgt tctgtttgtt 11941 caacacaagt gttctacaaa tgaaatgcaa gtggcctgta aatggaggac cggagacctt 12001 atttcagcag gctgttgact gcattcacca ggtagatgta gcttgctgcc agatctatta 12061 catactcatt ctcgctccaa agaaaaggat agtcctccct gttctcgggg aagtccggtt 12121 tcaatatcct gatgcccggc accacacctc cggggatgaa actgaagtcg gcccggttgc 12181 tcccgtaagc tacctttttg ggctgtgcgc ctactcccga aacaaaagac agattatggc 12241 acggatggca accgtacaga tagttcaatc ctctgtatat gtattccggg tttatcagct 12301 cgggatacga acgatggagc aggtagttcg taatacacgc ctgaacaatg tttctgttac 12361 ctgcccatcc ggcagtgctg atccttactc catacggatt ttctttgtct acttcggcaa 12421 gttctccggc ataagcttct accaagggcc ttacttgctt cttgaacgca ttatccatga 12481 aaggaatcat ttgtgcaatg acagaggcat tccggttgaa ttgttccaaa agggtgggga 12541 gcagttcgtt catttttgtc ttgtaacctt catagtccgt tgagcgccag agttcgaaag 12601 cggctctgca ttcgatggat ttcatgaatt caggattgct gaatcgtccg ctatacgttt 12661 gttcttcggg atttgcctgg tggttgtgtt cgtcgtccca gatgaagcga gctactcgca 12721 gggcttcttt gctcagttcc ggattgtaat ccttcaatgc acgcgaagcg gctgccagcg 12781 aaatggctgt gccataattc atgtgcggtg tccggttggt gaatgcccaa cggtcatcgg 12841 gagtaccact tgtacgcccg tcggtttgca ggctatccag gcgagggtta taaaccaggt 12901 tatccgtctt gtttaccgca tctcccaaat ggcggtactg atataaatga gactcgttga 12961 ttcccggaat ggcatagcct atcgcattca cctgcgctac taattgcaga acgccgtgtt 13021 caatctgttg caatacatcg ggtttgccgt 
cgggtacatg tatttccgta tagcgtgtct 13081 tttgattaat ggtggtttca tcgcggttta ccccaaaact ctcccacaaa agggcgaaag 13141 cctgtactac ggcatgttgt gatccggtct ggatatcgaa gtctcctgca tcgaaccatc 13201 cgccgacgtt cagaccgggg atatgctcca gtggtttgta cttgttgccg gtggtcggtc 13261 cttgtctcca accgtcccaa tggatacggt tgacgggagc ttgcagggca tcatccagat 13321 gtgcggcgcc atgccatacg cgataggctt cgcggacaaa catatgatcc atctgtacag 13381 ggaagaatac gtccagtgtc gggaaccatg ctttctgaaa cacatctgtt gcgatgggga 13441 aaggtgctgt gcgctggtcg ccatattcca gcatatagat gcccggttct gtgacagagg 13501 agaaatcaaa ttgaaggtaa ttgtagcgtg tgtacattcc ccaggttttg ggagtgccgg 13561 acagggcttt gctgaggcta ccgtcactac acaccttata taaagagacg gtagagagtg 13621 gcttgtcatt tttatccagt tccactacgg ccaccttctt ttgtgtgggg tggtaaccta 13681 cttgtgagta gctcactacg ggcttgcgca cccagtcgga ttgggtctcc gctgtgataa 13741 accattcggc tatcttgccg gtctttcctg ccggaagcag ggtacgcaca acgaagtttc 13801 cgttctgttg tttgttcctt ccgtcaaaca atgacaggtt attgtcatcg gtggtgcgga 13861 tggtgatgtg tttcaacggt tcgtccggtg caatgtctat cagtttgccg gtggccatgg 13921 gagtgggttc cacttcaccg ttgatggtag tcataaagtc ggccggagaa gtggggaaca 13981 gaccgtgctt gccgtctatt atgtaggaat gaccgaagaa gaccgggggc atgaattcca 14041 tattcagtcc ggctatgcca tgcaatgctt cgggtaaagg tttgtctacg tttacgctaa 14101 ggtagaaacc gccgtcacgg gcttctcctg tcaacttgta ttcgaagttg tattcgggat 14161 attccagtat gacgtcgatc cggttatgtt ctttatccgg catacgctcg acgaactttg 14221 gcaacttgtt ccattgtccg ggtgtcggat tcatgcggac atctccgttg gtagcggtgc 14281 ggatgccgtg atgaatgatt tctacggcac tgatcttgga atcatcgaag gtagcatcgt 14341 ataggttgct gaatacgaag aagttcaacc cgcgtgtttc gaagtagtta gagtcattta 14401 atgccagatg gctgttgcgg gaaaaggagg tactttttcc gcaaccggtg aatatcaggc 14461 atcccactac tcccatacaa agaatgattg cttttaattt catactattc attttcgtaa 14521 tcggttcact tatcaaataa ttaatcatgc gaatcaggag ttgaattgat tcatatcnnn 14581 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14641 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14701 nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14761 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14821 nnnnnnnnnn naattatttg cccatccgtt aagctgatga agttctttgt ttaagtctat 14881 aacatctttt gggaatggga agtattcatg tttacccttt acgaagccat aagagcttgc 14941 aggtaaattt tgatctactg ataaaggatg agtcaactca gcacgtagat gatgttcttt 15001 tccatagtat ggtgtacctt ggatgaagaa gtcgtcatac agttgcggaa cctgatcaac 15061 gacagcatcc tgacactctt tagcaatgcc ccaacgaacg atgtcgtgaa aacgacaagt 15121 ctcgaaccaa agttcgtact gcttttcatc gcgaagagtc tgagttgtga gttctgtttc 15181 aggagcttct gcacgtttct gaatagcatt cagatatttt ttaccttcat ccaaattacc 15241 tgtttcaaga caagcctctg cataaagcag atacgcttct cccaatctgg cgatacggaa 15301 tgatgcagag ttatgagtgt cgctaatagt catattgcgg tcgttaggag attctatatt 15361 cttgacttcc atatatacgc catgtgcgaa caggccttct gtactcttga taccacgttt 15421 tggatctttt ttcttgtcgg caagagacat ctcatttccg tcagcatcga aatccgtttc 15481 ccaaccgcag agtttagagt cgtataagaa ctcgtccgga gtgaggaatg tagctttgcg 15541 acgcggaccg tcaccatcat tttgatacat cttttcggca aacttccagt tgatggcacc 15601 accgccccaa ccatcgttag cgccgcagat ggatgcgcga gagcctaagc ggtcataacg 15661 ccagttcaat acgttgtttg tcatccaacg gccacgttgg atattacccc aaaaaccttt 15721 atcaatgtta ccgaagcaat tagcctcgaa gattttttct tcgcagccat taccctctac 15781 atggaatagg tcacggaaac gatttcccgg aacaagtgca tagttgggag attcaaccaa 15841 aggcttaaga ttcctctctg cacgagccat atcgccggcg aatacggcag acttacccgc 15901 tacgaaatat gcaaaacctt ttgttacttt aaaagcacct tctacatcat taggaccttt 15961 acgtgagtca agatcattta cagcttcttc acactcggaa gcacaccact caagcagaga 16021 tttctggctt tcgacatttg taggtttgtc atcggcaccc agcagtttgt cgacttttgg 16081 aggacgttgg aatgtgagag ccgccatcat gtgacaataa gcgcgtaata cgcgagcctc 16141 tgcgacacaa cgggccataa ccgtacttgc agccggatct acatatgtta taacaaggtt 16201 acttgcataa atagacttat agtaacgctg atataattct ttaagaggag cgttgcctgt 16261 atcatagcgg aactcatcga atacacgcat atccatgtgg tcagcataat ttgtaccggc 16321 agagaaaaca tcatctgctg aataattgag taaagtgaac tgcggctgat agattccggc 
16381 agacgaacat acattgtcaa ggaactgcgc ataagcggca gtcatagcat caacagcatc 16441 ttgatcggtt ttatagaagt tttcagtaga gctaacagcc ttctgttcga tgttcaaacg 16501 actttcatcg caagatgtgc agaacagtgc ggcacatacc gcaaacgaat atatatattt 16561 tttcattttc gtcagtatta atggttggtt tagaatgtta cattaagacc aaatgtcatc 16621 ttgcgcattg taggatatgc accggcatca agacctacat tattaccact tcttgaagca 16681 atttcagggt caagacccgg ataagatgta aatgtgaaga agtcatcaag tgacacgtaa 16741 gcacgaaggt tctctataaa tacctttttt gtgatagact ttggcaatgt gtatcctaat 16801 tgtaattgct taatcttgaa gtatgatccg tcaaatacaa gagcagaaga tgaccagtag 16861 attttatcac ctgcaattgt gttaagactg ggcatattcc ccttcttggc ttcatcgtag 16921 aaatgcttgg agatattgtt gtagcctgta cggtacatac cgtagtagac atcatttcct 16981 accgtacctg taccgaatac cgagaagtcg aaacccttgt actcaagatt gattgtaatg 17041 ccgtatgtaa gatcaggaat acctttaccg atattgaatt ggtcttcatc acccggagtt 17101 aatgtctcat tgccttcttt gtcaagataa agaggctgtc cttctgaatt cnnnnnnnnn 17161 nnnnnnnnnn nttacctata tatttgtaac cacgcatata ccataccggc atgccttgtt 17221 cgaagttggt gtatatagta ttgtttgcac ctactatact tccacccgat atacgtgtga 17281 tacggtcatc aaggtaagtt acttcattct tcaatgttgc aaagttgcct gatactgagt 17341 aattcagcct gcctatgttg cctttataag ttgcttcaaa ctcaaaacct ttgttctgta 17401 cctcacctgc attgatcatc tgtgaagata cacctgtctc aggcagacat ggagcgtcta 17461 caagaaggtc tttggtcgtt ttcttatagt agtccatgct gattgtcaac gcatcgttga 17521 ataaacgtaa gtcaagaccg aagtccagct gatcggaagt ttcccaagtg aggttaggat 17581 tagctaaacc gcttggcatg gaaccatttg taacagttgt accattacta tattgatacc 17641 actggccacc agttgcaata gaagaagaat atttatagcc gctaagtaca ttgatattac 17701 cgttacgtcc ccaagatgca cgcagcttgg caaatgaaac aatgttatca ggaactaaat 17761 ttttaaagaa cttctcatta ctgagtgtcc agcctgctga taatgatggg aagtaacccc 17821 aacgtttatc tgctggaagt tttgaggtat cgaatgcgtc agcacggaaa ttggcttgca 17881 ggctatagcg atcgtcataa gtatatccta aacggccgaa gtatgaaatg cttgaggact 17941 tacccggtgt tccactgaat gtacgggaaa ctttatctga tacaagtaca caatccaagt 18001 aacggaaatt ttcttcgtat gaagaaagca attcttcacc 
tgacgcacct gctgtcgtgc 18061 cgttactttc gctttggatg tatgacatac ctgccattgc cgaaatactg tgcttgccga 18121 ttgaagtcat gtagttggca aagttctccc actgatagta taggccgttg cttgtagtct 18181 ggctgataga gtatttggta tcacttacac gaccggtgat atagtaaggt tctgcgtagc 18241 tgtaagtatt gccctgagta atgcgagtac caaaacgaga agtaaatgta aagcccttga 18301 atggggtcag gtttgcgaag aacgtactat gtacattgat gccgccattt ttattgttag 18361 cattcttgtc acgctgtgca aatggagtgc caccggcaag ttctgtatta aagtatgaag 18421 tagcatacat acccctatca tcgccgtaga aacgatagtt tgtatcagaa gttccattct 18481 gaactgcgtc ataaacagaa cctgtggcag cgagatattg actgcggtct gtccagtgtg 18541 taggagtcag aggatccatt acaagaagca tttcaaaagc ggagccgtaa ccatattctg 18601 atacgctgcc tctacgccat ttctctatag aagtgttggt acctacctgc aaccagggtt 18661 ttattttata atcagcatta atctgagcgg ttaagcgttc gtagtagtcg tttttgcctt 18721 ttacgatgcc atcctgatta acgtatgtga ggtttgcaaa atactgccct ttgtcattgc 18781 taccctggaa tgaaagagag tgctgcttgc tccaagaagt tcccataaac tcgtcgaacc 18841 aatttgtatc agtattttga tcccaattat aatcgcgaat agcatccgac atggcagcct 18901 cgacgttata tccttgcatg cccatccagt ccttgaactc ttccgcattc agtaattgtg 18961 gaacatgtcc aagttggttc atagtataac gtacattata tgtgatcttg ccggtacctt 19021 ttcctccccc attcttggtt gtgataagta ctacaccgtt acctgcttca gaaccataga 19081 ttgctgcgga tgcggcatct ttcaatactt ccatagactc gatcaagcct ggatcaagat 19141 actggatgct tgaaaccttc aaaccatcca caatcaagag aggaccaata tttccggaat 19201 tggaggaata accacgtaca cggatttcag caccgctacc cggagcacct gatgaattca 19261 agatttgaat acctgcggct ttaccttgaa gggcggcggc agcgtcggtt gttgccaagc 19321 ctttaaggtc tttagagtta accgaggcaa cagcacctgt caggtcactc ttgcgctgaa 19381 caccataacc tacaaccaca acctcatcaa gcacttgtgt atcttcttcc aaacgaaagt 19441 tgattacgct acggccattg actttaacag tctgcgtttt gtagcctata aaagaagcgg 19501 taagggtagc attactgtta cttacctgaa tagagtaatt accattgaga tcggtaatac 19561 gaccgttgct ggttccttct tcaagaatag ccacaccgat cattggctca gaatcattgg 19621 ctgagacaac agtacccttg atttggattt gcgctgatgc tgatatgcac gtaaaaaaca 19681 caagcacaag cataaacaga 
aaaggaatct tctgaatttt ctggactaca tttttcatca 19741 taattagttt aagaattaat aattgcgttt ttcattcatc attagctacc gctacaatgg 19801 ttctctaacc atcatcatac atacattcca tagttttcaa ttatttaaga ttaacataaa 19861 aagaaaggca attcattctt taattctgaa tcggcaaggc aactgctgag tcgcattatt 19921 aattcaaatt ggttgcctga gccgattcgt tatttttgag tggaattacc cggtaatagt 19981 actttagagt aatagggctt gttcctaatg atcgcggaaa ttccatttgg aattgtcgct 20041 gttcatatcg aattcgccga gagaaggagt gctgtcacct acggatacca gaaccaattc 20101 cttgtcagtt cccggcactt ctttgggcat gatccggtag gtgccgtctg tcaactggtc 20161 gattcgccac agttgttcgg gggctcccgt gaactcagga actgtcacga cttccgcatc 20221 tgtcgtggcg gccagtgcgc gattggttcc ttcaatcacg atcttgtaat aagggccgcc 20281 caagtatccg cctccatcgg gcacggcagt aatggtccac ttctgatgag gacggaacat 20341 ataatcacca attcttacgt tgatttttcc ttgtggccaa gtcccgatca catcttccaa 20401 cgtctgcgat ttcaaaggta cgacgggcgt gtcgtctttt tcccagaagc ggtgtctggt 20461 atattccata cgtacaaaat caacggcaag ttcaagggca taccctctac gttcggattc 20521 aatctcatac gttccctctt tgaagacctc gccggcaacg ggccagccgt tcttccataa 20581 aagcgggcgt atgcccagca cactgcggcc accccggtcg aaatcggctt caaagtggca 20641 tgacatcttc tctactccgt cggctacttt gaaaagcccg aagtgtcccg gaccggtttg 20701 gcggttacct gcggcaatca ccatcttgcc accgccttcc agcatttttc ttcccatatt 20761 atcgacatat ggccccgtca ctttacgcga acggccgaca acaatattgt aggtagagtt 20821 cgggccatcg cagcaagtgc cgtgtgttcc cagaagataa taccagccgt cacgatagat 20881 cagatcggtt gcttcacagt caatggcgat gtcgatttct ttgttgccct ccatccgttt 20941 gccggttgcg gggtcgagtt ctacaagcct gataaagccg aaatacgtgc cgtaagacaa 21001 ccacaagcgt ccgtctgttg ggtcgagcag aagtccggca tcaatggcat cgcaatcttc 21061 atcattcact gaggaggcaa cttctatagg ctccgtatat ttaaaatccg gagaattggg 21121 gtctaaggtt ttattccaca tggttaatac tttaccgctg tgtcctccac cgagtcctcc 21181 tcctgtggca ctgtaagcaa tcaggtagcg gtcaccgatt ttcacggcat cgggcgcggc 21241 gcctccgccg ggtcttactc cgccaccgtt ccatgtccag ccgtcttcag atatcaatcc 21301 gcctccacct gtgccgaaag tataatattt accatcgcac tccatgatag tggacgggtc 21361 
gtggatgtat ggagtgccga tttgtgcaat tgctttttca gatgtcaata caaacatcag 21421 ggctgccgtg cccagaagta tagcttttgt tttcatgatc atttatagct gatagtaatg 21481 ttgttaacgg gtgttccatt ctcattgagg aagcgaacac agaaatcact cattccggga 21541 ccgttgatga tggctccgcg aatgatgttc ttgcccttct ttaaagtcag ccggcgcgag 21601 aggcagtcat ccttgaccat acgccggtcg cccgaaagaa ttacggcttc ctcaccgttc 21661 agccaccaca tggatgcgga gttggaacct actgacatcc ggatgttctc catgtcttcg 21721 ggacattcta tgaatgtgac tgcccagaac agcacgccgt acacttgttt cttcaggccg 21781 gaggcaaaac ggaagagctt cacgttgaac agcttgctgt cgagcgcgtg ccaggttagc 21841 ttttgtttgc ctactttcac tttatccccg tctttgggca gaacggtgaa ttgattgggg 21901 aaatattccg ttgcaaatgc ttcgcgaatg taactgtcgg taaagaccgt gttggaacgg 21961 ttgggcttgt cgatgggttc cagcaacagc cagcgctgga taaagccctc actgtcgggt 22021 gacatcgtat tcgtggttgc cggagaaaag tagtctgaca accgggggga cgcttcattc 22081 ttttgagcat agccatgtaa accgatcata gccgcgatac aagccgccaa caatgatttt 22141 cgatggaaga tattcatcgt tatagtctgt tttattgatt atacattatt tcgttttaaa 22201 taatagagga acaaactctt tcagacatct gcgccaggtc aggaactcgt gggcagtgcc 22261 cggagactca taatagattg tattgattcc cgcagcctgc aagccgtcac tgaggttctt 22321 ggttctttcg ggatgttctt ccgaaccgat acccaggaag aagacgtgta ctttgtcgtt 22381 gaatgcttta cggtttttga ataccccgcc gtacacggta tcgagttgct tcagatcgat 22441 gctgcctgct ccgctgaatc cgccaatata ggcaaactta tccaagttgt tcagggtagt 22501 gttgaaagta agaagtccgc cccaggaaag tccggccata gcacggttgt ccctatcagt 22561 cagtgttcgg aaattggatt cgatatgagg aatgatttca ttcactaaga tagcaggaaa 22621 ttctgcaccg aatctttttc ttgcaccgtc gggagtttca cccggattgg tcttaaacac 22681 ctcgatattg ccattgtcca tgactacgat cattggtttg gctttacctt cggcaatcag 22741 gttatccata atgaaattca tcttgccctg gtttgcccaa ctcgtttcgt tttctcccat 22801 accgtgttgc aggtagagca ccggatactt cttggtaggg ttcttgtcat attccgccgg 22861 agtatagacg atacatctgc gccattcttt gcggatatca gagtagtacc agctttggct 22921 gatcagcccg tgaggcacgt tcttgatgga atagtagtcc acacctttct ccgggatttc 22981 aataccgctt acccatttgc ccataccgaa gaacggtttg ccattgggat 
cggcggcact 23041 tactccgtct attataacct gatagtagtg gaagcctatt acttcaggtt cttttgatac 23101 gagcgaccaa tatccgtctg cctctttggt catttcacct cggaaactga tttctacttt 23161 ctgggcgttg ggagcgtgta tcctgaagtg tacacgtctg tcctcaccta tgcggggata 23221 atcagcccca ttgatgttat gctcattggg agtggttcct accggaaaat cttgttgctg 23281 ggcattcata gtgaatggga acactaacaa agaagaaaaa ataagtttca atacattttt 23341 cat //