##gff-version 3
##sequence-region DS264584 1 12368
# conversion-by bp_genbank2gff3.pl
# organism Bacteroides ovatus ATCC 8483
# Note Bacteroides ovatus ATCC 8483 Scfld0231 genomic scaffold, whole genome shotgun sequence.
# date 25-MAY-2007
DS264584	GenBank	region	1	12368	.	+	1	ID=DS264584;Dbxref=BioProject:PRJNA18191,ATCC:8483,taxon:411476;Name=DS264584;Note=Bacteroides ovatus ATCC 8483 Scfld0231 genomic scaffold%2C whole genome shotgun sequence.,Bacteroides ovatus (GenBank Accession Number for 16S rDNA gene: X83952) is a member of the division Bacteroidetes. In one comprehensive 16S rDNA sequence-based enumeration of the colonic microbiota of three healthy adult humans,it represents,on average,0.034%25 of all 16S rDNA sequences and 0.071%25 of the sequences in its division (Eckburg et. al. (2005)). The sequenced strain was obtained from ATCC (ATCC 8483T). We have collected 6.9X coverage in plasmid end reads and 454 reads. We will be performing one round of automated sequence improvement (pre-finishing). Sequencing/Assembly: The genomic DNA was purified from liquid culture derived from a single bacterial colony. A hybrid sequencing strategy that utilized reads from both 454 GS-20 and ABI 3730xl sequencers was devised and implemented to generate the draft genome sequences. 454 reads were assembled using Newbler (454 Life Sciences) into 454 de novo contigs. These de novo contigs were converted in silico to 800 base paired reads ('superreads') with 400 base overlaps with neighboring superreads. Finally,PCAP (Huang,et al,Genome Research,13:2164,(2003)) was used to assemble the super-reads and the conventional 3730xl capillary reads. This sequenced strain is part of a comprehensive,sequence-based survey of members of the normal human gut microbiota. A joint effort of the WU-GSC and the Center for Genome Sciences at Washington University School of Medicine,the purpose of this survey is to provide the general scientific community with a broad view of the gene content of 100 representatives of the major divisions represented in the intestine's microbial community. This information should provide a frame of reference for analyzing metagenomic studies of the human gut microbiome. Further details of this effort are described in a white paper entitled 'Extending Our View of Self: the Human Gut Microbiome Initiative (HGMI)' (http://www.genome.gov/Pages/Research/Sequencing/SeqProposals/HGMIS eq.pdf). These studies are supported by National Human Genome Research Institute For answers to your questions regarding this assembly or project,or any other GSC genome project,please visit our Genome Groups web page (http://genome.wustl.edu/genome_group_index.cgi) and email the designated contact person. ;comment1=Bacteroides ovatus (GenBank Accession Number for 16S rDNA gene: X83952) is a member of the division Bacteroidetes. In one comprehensive 16S rDNA sequence-based enumeration of the colonic microbiota of three healthy adult humans%2C it represents%2C on average%2C 0.034%25 of all 16S rDNA sequences and 0.071%25 of the sequences in its division (Eckburg et. al. (2005)). The sequenced strain was obtained from ATCC (ATCC 8483T). We have collected 6.9X coverage in plasmid end reads and 454 reads. We will be performing one round of automated sequence improvement (pre-finishing). Sequencing/Assembly: The genomic DNA was purified from liquid culture derived from a single bacterial colony. A hybrid sequencing strategy that utilized reads from both 454 GS-20 and ABI 3730xl sequencers was devised and implemented to generate the draft genome sequences. 454 reads were assembled using Newbler (454 Life Sciences) into 454 de novo contigs. These de novo contigs were converted in silico to 800 base paired reads ('superreads') with 400 base overlaps with neighboring superreads. Finally%2C PCAP (Huang%2C et al%2C Genome Research%2C 13:2164%2C (2003)) was used to assemble the super-reads and the conventional 3730xl capillary reads. This sequenced strain is part of a comprehensive%2C sequence-based survey of members of the normal human gut microbiota. A joint effort of the WU-GSC and the Center for Genome Sciences at Washington University School of Medicine%2C the purpose of this survey is to provide the general scientific community with a broad view of the gene content of 100 representatives of the major divisions represented in the intestine's microbial community. This information should provide a frame of reference for analyzing metagenomic studies of the human gut microbiome. Further details of this effort are described in a white paper entitled 'Extending Our View of Self: the Human Gut Microbiome Initiative (HGMI)' (http://www.genome.gov/Pages/Research/Sequencing/SeqProposals/HGMIS eq.pdf). These studies are supported by National Human Genome Research Institute For answers to your questions regarding this assembly or project%2C or any other GSC genome project%2C please visit our Genome Groups web page (http://genome.wustl.edu/genome_group_index.cgi) and email the designated contact person. ;date=25-MAY-2007;mol_type=genomic DNA;organism=Bacteroides ovatus ATCC 8483;strain=ATCC 8483;type_material=type strain of Bacteroides ovatus
DS264584	GenBank	gene	1	747	.	-	1	ID=BACOVA_04719;Name=BACOVA_04719
DS264584	GenBank	mRNA	1	747	.	-	1	ID=BACOVA_04719;Parent=BACOVA_04719
DS264584	GenBank	CDS	1	747	.	-	1	ID=BACOVA_04719;Parent=BACOVA_04719;Dbxref=GI:156107118,InterPro:IPR013610;Name=BACOVA_04719;Note=KEGG: vfi:VFB02 1.1e-21 traC%3B DNA primase TraC%3B COG: COG4227 Antirestriction protein;codon_start=1;inference=protein motif:HMMPfam:IPR013610,similar to AA sequence:INSD:BAD49638.1;product=hypothetical protein;protein_id=EDO08863.1;transl_table=11;translation=length.562
DS264584	GenBank	exon	1	747	.	-	1	Parent=BACOVA_04719
DS264584	GenBank	gene	753	1409	.	-	1	ID=BACOVA_04720;Name=BACOVA_04720
DS264584	GenBank	mRNA	753	1409	.	-	1	ID=BACOVA_04720;Parent=BACOVA_04720
DS264584	GenBank	CDS	753	1409	.	-	1	ID=BACOVA_04720;Parent=BACOVA_04720;Dbxref=GI:156107119,InterPro:IPR001209,InterPro:IPR008994;Name=BACOVA_04720;codon_start=1;inference=protein motif:ScanRegExp:IPR001209,protein motif:superfamily:IPR008994,similar to AA sequence:INSD:BAD49637.1;product=hypothetical protein;protein_id=EDO08864.1;transl_table=11;translation=length.218
DS264584	GenBank	exon	753	1409	.	-	1	Parent=BACOVA_04720
DS264584	GenBank	gene	1444	1620	.	-	1	ID=BACOVA_04721;Name=BACOVA_04721
DS264584	GenBank	mRNA	1444	1620	.	-	1	ID=BACOVA_04721;Parent=BACOVA_04721
DS264584	GenBank	CDS	1444	1620	.	-	1	ID=BACOVA_04721;Parent=BACOVA_04721;Dbxref=GI:156107120;Name=BACOVA_04721;codon_start=1;inference=similar to AA sequence:INSD:BAD49636.1;product=hypothetical protein;protein_id=EDO08865.1;transl_table=11;translation=length.58
DS264584	GenBank	exon	1444	1620	.	-	1	Parent=BACOVA_04721
DS264584	GenBank	gene	1633	3189	.	-	1	ID=BACOVA_04722;Name=BACOVA_04722
DS264584	GenBank	mRNA	1633	3189	.	-	1	ID=BACOVA_04722;Parent=BACOVA_04722
DS264584	GenBank	CDS	1633	3189	.	-	1	ID=BACOVA_04722;Parent=BACOVA_04722;Dbxref=GI:156107121,InterPro:IPR002886,InterPro:IPR002901,InterPro:IPR011054,InterPro:IPR013338;Name=BACOVA_04722;Note=KEGG: tbd:Tbd_1629 1.4e-14 mannosyl-glycoprotein endo-beta-N-acetylglucosamidase K02395%3B COG: COG0739 Membrane proteins related to metalloendopeptidases;codon_start=1;inference=protein motif:HMMPfam:IPR002886,protein motif:HMMPfam:IPR002901,protein motif:HMMSmart:IPR013338,protein motif:superfamily:IPR011054,similar to AA sequence:INSD:BAD49635.1;product=mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase;protein_id=EDO08866.1;transl_table=11;translation=length.518
DS264584	GenBank	exon	1633	3189	.	-	1	Parent=BACOVA_04722
DS264584	GenBank	gene	3202	5463	.	-	1	ID=BACOVA_04723;Name=BACOVA_04723
DS264584	GenBank	mRNA	3202	5463	.	-	1	ID=BACOVA_04723;Parent=BACOVA_04723
DS264584	GenBank	CDS	3202	5463	.	-	1	ID=BACOVA_04723;Parent=BACOVA_04723;Dbxref=GI:156107122,InterPro:IPR007695;Name=BACOVA_04723;Note=COG: COG0249 Mismatch repair ATPase (MutS family)%3B Psort location: Cytoplasmic%2C score:8.96;codon_start=1;inference=protein motif:HMMPfam:IPR007695;product=MutS domain I protein;protein_id=EDO08867.1;transl_table=11;translation=length.753
DS264584	GenBank	exon	3202	5463	.	-	1	Parent=BACOVA_04723
DS264584	GenBank	gene	5491	7290	.	-	1	ID=BACOVA_04724;Name=BACOVA_04724
DS264584	GenBank	mRNA	5491	7290	.	-	1	ID=BACOVA_04724;Parent=BACOVA_04724
DS264584	GenBank	CDS	5491	7290	.	-	1	ID=BACOVA_04724;Parent=BACOVA_04724;Dbxref=GI:156107123;Name=BACOVA_04724;codon_start=1;inference=similar to AA sequence:INSD:BAD49633.1;product=hypothetical protein;protein_id=EDO08868.1;transl_table=11;translation=length.599
DS264584	GenBank	exon	5491	7290	.	-	1	Parent=BACOVA_04724
DS264584	GenBank	gene	7299	7580	.	-	1	ID=BACOVA_04725;Name=BACOVA_04725
DS264584	GenBank	mRNA	7299	7580	.	-	1	ID=BACOVA_04725;Parent=BACOVA_04725
DS264584	GenBank	CDS	7299	7580	.	-	1	ID=BACOVA_04725;Parent=BACOVA_04725;Dbxref=GI:156107124;Name=BACOVA_04725;Note=Psort location: Cytoplasmic%2C score:8.96;codon_start=1;inference=similar to AA sequence:INSD:ABP57344.1;product=hypothetical protein;protein_id=EDO08869.1;transl_table=11;translation=length.93
DS264584	GenBank	exon	7299	7580	.	-	1	Parent=BACOVA_04725
DS264584	GenBank	gene	7604	8434	.	-	1	ID=BACOVA_04726;Name=BACOVA_04726
DS264584	GenBank	mRNA	7604	8434	.	-	1	ID=BACOVA_04726;Parent=BACOVA_04726
DS264584	GenBank	CDS	7604	8434	.	-	1	ID=BACOVA_04726;Parent=BACOVA_04726;Dbxref=GI:156107125,InterPro:IPR002886,InterPro:IPR011054;Name=BACOVA_04726;Note=KEGG: ava:Ava_C0210 3.3e-08 peptidase M23B K08259%3B COG: COG0739 Membrane proteins related to metalloendopeptidases;codon_start=1;inference=protein motif:HMMPfam:IPR002886,protein motif:superfamily:IPR011054,similar to AA sequence:INSD:ABP57343.1;product=peptidase%2C M23 family;protein_id=EDO08870.1;transl_table=11;translation=length.276
DS264584	GenBank	exon	7604	8434	.	-	1	Parent=BACOVA_04726
DS264584	GenBank	gene	8459	9124	.	-	1	ID=BACOVA_04727;Name=BACOVA_04727
DS264584	GenBank	mRNA	8459	9124	.	-	1	ID=BACOVA_04727;Parent=BACOVA_04727
DS264584	GenBank	CDS	8459	9124	.	-	1	ID=BACOVA_04727;Parent=BACOVA_04727;Dbxref=GI:156107126;Name=BACOVA_04727;codon_start=1;inference=similar to AA sequence:INSD:ABP57342.1;product=hypothetical protein;protein_id=EDO08871.1;transl_table=11;translation=length.221
DS264584	GenBank	exon	8459	9124	.	-	1	Parent=BACOVA_04727
DS264584	GenBank	gene	9139	9822	.	-	1	ID=BACOVA_04728;Name=BACOVA_04728
DS264584	GenBank	mRNA	9139	9822	.	-	1	ID=BACOVA_04728;Parent=BACOVA_04728
DS264584	GenBank	CDS	9139	9822	.	-	1	ID=BACOVA_04728;Parent=BACOVA_04728;Dbxref=GI:156107127;Name=BACOVA_04728;codon_start=1;inference=similar to AA sequence:INSD:BAD49629.1;product=hypothetical protein;protein_id=EDO08872.1;transl_table=11;translation=length.227
DS264584	GenBank	exon	9139	9822	.	-	1	Parent=BACOVA_04728
DS264584	GenBank	gene	9837	10526	.	-	1	ID=BACOVA_04729;Name=BACOVA_04729
DS264584	GenBank	mRNA	9837	10526	.	-	1	ID=BACOVA_04729;Parent=BACOVA_04729
DS264584	GenBank	CDS	9837	10526	.	-	1	ID=BACOVA_04729;Parent=BACOVA_04729;Dbxref=GI:156107128;Name=BACOVA_04729;codon_start=1;inference=similar to AA sequence:INSD:BAD49628.1;product=hypothetical protein;protein_id=EDO08873.1;transl_table=11;translation=length.229
DS264584	GenBank	exon	9837	10526	.	-	1	Parent=BACOVA_04729
DS264584	GenBank	gene	10507	11004	.	-	1	ID=BACOVA_04730;Name=BACOVA_04730
DS264584	GenBank	mRNA	10507	11004	.	-	1	ID=BACOVA_04730;Parent=BACOVA_04730
DS264584	GenBank	CDS	10507	11004	.	-	1	ID=BACOVA_04730;Parent=BACOVA_04730;Dbxref=GI:156107129;Name=BACOVA_04730;codon_start=1;inference=similar to AA sequence:INSD:ABP57339.1;product=hypothetical protein;protein_id=EDO08874.1;transl_table=11;translation=length.165
DS264584	GenBank	exon	10507	11004	.	-	1	Parent=BACOVA_04730
DS264584	GenBank	gene	11001	12368	.	-	1	ID=BACOVA_04731;Name=BACOVA_04731
DS264584	GenBank	mRNA	11001	12368	.	-	1	ID=BACOVA_04731;Parent=BACOVA_04731
DS264584	GenBank	CDS	11001	12368	.	-	1	ID=BACOVA_04731;Parent=BACOVA_04731;Dbxref=GI:156107130;Name=BACOVA_04731;Note=COG: NOG33027 non supervised orthologous group%3B Psort location: Cytoplasmic%2C score:8.96;codon_start=1;inference=similar to AA sequence:INSD:BAD49626.1;product=hypothetical protein;protein_id=EDO08875.1;transl_table=11;translation=length.462
DS264584	GenBank	exon	11001	12368	.	-	1	Parent=BACOVA_04731