LOCUS       NC_007946               6176 bp    DNA     linear   CON 25-AUG-2019
DEFINITION  Escherichia coli UTI89, complete genome.
ACCESSION   NC_007946
VERSION     NC_007946.1
DBLINK      BioProject: PRJNA224116
            BioSample: SAMN00000110
            Assembly: GCF_000013265.1
KEYWORDS    RefSeq.
SOURCE      Escherichia coli UTI89
  ORGANISM  Escherichia coli UTI89
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 6176)
  AUTHORS   Chen,S.L., Hung,C.S., Xu,J., Reigstad,C.S., Magrini,V., Sabo,A.,
            Blasiar,D., Bieri,T., Meyer,R.R., Ozersky,P., Armstrong,J.R.,
            Fulton,R.S., Latreille,J.P., Spieth,J., Hooton,T.M., Mardis,E.R.,
            Hultgren,S.J. and Gordon,J.I.
  TITLE     Identification of genes subject to positive selection in
            uropathogenic strains of Escherichia coli: a comparative genomics
            approach
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 103 (15), 5977-5982 (2006)
   PUBMED   16585510
REFERENCE   2  (bases 1 to 6176)
  AUTHORS   Chen,S.L., Hung,C.-S., Xu,J., Reigstad,C.S., Magrini,V., Sabo,A.,
            Blasiar,D., Bieri,T., Meyer,R.R., Ozersky,P., Armstrong,J.R.,
            Fulton,R.S., Latreille,J.P., Spieth,J., Hooton,T.M., Mardis,E.R.,
            Hultgren,S.J. and Gordon,J.I.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JAN-2006) Molecular Microbiology, Genetics, and
            Molecular Biology and Pharmacology, Washington University School of
            Medicine, 660 South Euclid Avenue, Saint Louis, MO 63110, USA
COMMENT     ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI RefSeq
            Annotation Date                   :: 08/25/2019 16:43:27
            Annotation Pipeline               :: NCBI Prokaryotic Genome
            Annotation Method                 :: Best-placed reference protein
            Annotation Software revision      :: 4.9
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
            Genes (total)                     :: 5,147
            CDSs (total)                      :: 5,030
            Genes (coding)                    :: 4,863
            CDSs (with protein)               :: 4,863
            Genes (RNA)                       :: 117
            rRNAs                             :: 8, 7, 7 (5S, 16S, 23S)
            complete rRNAs                    :: 8, 7, 7 (5S, 16S, 23S)
            tRNAs                             :: 89
            ncRNAs                            :: 6
            Pseudo Genes (total)              :: 167
            CDSs (without protein)            :: 167
            Pseudo Genes (ambiguous residues) :: 0 of 167
            Pseudo Genes (frameshifted)       :: 91 of 167
            Pseudo Genes (incomplete)         :: 79 of 167
            Pseudo Genes (internal stop)      :: 35 of 167
            Pseudo Genes (multiple problems)  :: 34 of 167
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
            REFSEQ INFORMATION: The reference sequence was derived from
            CP000243.
            The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            Bacteria is available by contacting Scott Hultgren
            (hultgren@borcim.wustl.edu).
            Annotation Pipeline (PGAP)
            set; GeneMarkS-2+
            repeat_region
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..6176
                     /organism="Escherichia coli UTI89"
                     /mol_type="genomic DNA"
                     /strain="UTI89"
                     /db_xref="taxon:364106"
     gene            complement(1..1440)
                     /locus_tag="UTI89_RS08480"
                     /old_locus_tag="UTI89_C1752"
     CDS             complement(1..1440)
                     /locus_tag="UTI89_RS08480"
                     /old_locus_tag="UTI89_C1752"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_000012594.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="GH1"
                     /protein_id="WP_000012592.1"
                     /translation="MSGFKEDFLWGGAVAAHQLEGGWNEGGKGISIADVMTAGAHGVPR
                     EVTEGVIDGLNYPNHEAIDFYHRYKTDIQLFAGMGFKCFRTSIAWTRIFPQGDEQEPNE
                     EGLQFYDDLFDECLKQGMEPVVTLSHFEMPYHLVTKYGGWRNRKLIDFFIHFASTVFTR
                     YKAKVKYWMTFNEINNQVNFSESLCPFTNSGILYSPEEDLNEREQIMYQAVHYELVASA
                     LAVQTGKLINPEFNIGCMIAMCPIYPLTCAPNDMMMATKAMHRRYWFTDVHARGYYPQH
                     MLNYFARKGFNLDITPDDNAILARGCVDFIGFSYYMSFTTQFSPDNPQLDYVEPRDLVS
                     NPYIDTSEWGWQIDPAGLRYSLNWFWDHFQLPLFIVENGFGAVDQRQADGTVNDHYRID
                     YFSSHIREMKKAVVEDGVDLIGYTPWGCIDLVSAGTGEMKKRYGMIYVDKDNEGKGTLE
                     RIRKASFYWYRDLIANNGENI"
     gene            complement(1464..3137)
                     /locus_tag="UTI89_RS08485"
                     /old_locus_tag="UTI89_C1753"
     CDS             complement(1464..3137)
                     /locus_tag="UTI89_RS08485"
                     /old_locus_tag="UTI89_C1753"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016233185.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="gnl|TC-DB|P26218|1.B.3.1.3"
                     /protein_id="WP_001022795.1"
                     /translation="MNIKTLNVSLLSISIITALFPLNAMATKLTIEQRLELLENELSQN
                     KQELKATQNELGVYKFRLSTLQKSITENKYQSASLAEISATSPVADNIKNENGEQNSFA
                     AAHTINGSQQIAVIESKGDKTTIESVTLKDISKYIKDDIGFSYQGYFRSGWGTGNHGSP
                     QTYAAGSLGRFGNEMSGWFDLTLNQRVYNQDGRTANAVVTYDGNVGQQYNDAWFGDSAN
                     ENIMQFSDIYLTTRGFLPFAREAEFWVGKHKLPQYEIQMLDWKTLTTDVAAGVGIENWA
                     LGVGLFDMSLSRDDVDVYSRDFTRTSQMNTNSVDVRYRNIPLWDDATLSLMAKYSAPNK
                     TDQQQDNENDDSYFEMKDSWMLASVLRQNLQRDTFNEFTLQVANNSYASSFASFSDASN
                     TMAHGRYYYGDHTNGIAWRLISQGEMYLTDNIIMANALVYSHGEDVYSYESGAHSDFDS
                     IRTVIRPAWIWNTWNQTGLELGWFKQQNKTQQGVTLNESAYKTTLWHALKVGESILGSR
                     PEIRFYGTYINILDNELSNFKFNENSKDEFMAGIQAEVWW"
     gene            complement(3192..3503)
                     /locus_tag="UTI89_RS08490"
                     /old_locus_tag="UTI89_C1754"
     CDS             complement(3192..3503)
                     /locus_tag="UTI89_RS08490"
                     /old_locus_tag="UTI89_C1754"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_001314746.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="gnl|TC-DB|U5MLJ3|4.A.3.2.9"
                     /protein_id="WP_001304363.1"
                     /translation="MFADEELVMELLINAGQARSDAMEAIRCAGQKDWQGATKLMASSE
                     SACLQAHKIQTALISQDEGCGKIEVNLILIHAQDHLMNAILCQDLAREIISLRKELHA"
     gene            complement(3531..4853)
                     /locus_tag="UTI89_RS08495"
                     /old_locus_tag="UTI89_C1755"
     CDS             complement(3531..4853)
                     /locus_tag="UTI89_RS08495"
                     /old_locus_tag="UTI89_C1755"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_001736301.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="gnl|TC-DB|C4X745|4.A.3.2.7"
                     /protein_id="WP_001332138.1"
                     /translation="MGLMASFERGMERFLVPVAIKLNSQKHVAAVRDGFVFTFPIIMAS
                     SLIILINFAILSPDGFIAGLLHLNSIFPNLEKAQAIFTPVMNGSVNIMSIMIAFLVARN
                     VAISYEQDDLLCGLTAIGAFFIVYTPYQMIDGQAFLTTKYLGAQGLFVAVIVALITSEI
                     FCRLARNPKITITMPAAVPPAVARSFKVLLPIFFVMVFFSALNYCLTLISPAGLNDLIY
                     TLIQTPLKHMGTNIFAVIILGAVGNFLWVLGIHGPNTTSAIRETVFSEANLENLSWAAQ
                     HGTTWGAPYPITWTSINDAFANCGGSGMTLGLLLAIFIASKRAEYRDLAKMSFIPGIFN
                     INEPIMFGLPIVLNPIMMVPFIMVPIVNCAIGYFFVSMEIIPPVAYAVPWTTPGPLIAF
                     LGTGGNWLALLVGFLCLGVAIMIYLPFVIAANKVNNMATNG"
     gene            complement(4968..5279)
                     /locus_tag="UTI89_RS08500"
                     /old_locus_tag="UTI89_C1756"
     CDS             complement(4968..5279)
                     /locus_tag="UTI89_RS08500"
                     /old_locus_tag="UTI89_C1756"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_000722572.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="gnl|TC-DB|U5MIE1|4.A.3.2.9"
                     /protein_id="WP_000722574.1"
                     /translation="MKKILLVCAAGMSTSMLVKRMIDHATAISLEVNISALAIAEAKGK
                     IKNNEVDVVLLGPQVRFQKPEIEAVAQGKMPVAVIEMKDYGTMNGQAVLEFAMKLLQE"
     gene            5478..6176
                     /locus_tag="UTI89_RS08505"
                     /old_locus_tag="UTI89_C1757"
     CDS             5478..6176
                     /locus_tag="UTI89_RS08505"
                     /old_locus_tag="UTI89_C1757"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_000577178.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="STP|GntR"
                     /protein_id="WP_000577184.1"
                     /translation="MIFQKIARLLKSEINGNSWHVGDLLPSEAELAVRYNVSRNTLRKA
                     LSLLEGEGIIHRKHGSGTYIQKKNFVAHIDHMNSFSEIAHKSGKEAGSQIMKFEVQDAS
                     PTIATELNLVTGEQVYYIKRLRFIEDNAAQLEETWMSVARFPDLTVSHMQKSKFSYIEN
                     ECGIKIIGTFETFSPTFPTPEIASILRISPRDPILKIQTQAVDSNSIPLDYSLLYSNIF
                     EFQVKYFFPR"
ORIGIN
        1 tcatatattt tcgccattgt tggcgatgag atcccgatac cagtaaaacg atgctttacg
       61 tatccgttcc agcgttccct tcccttcgtt gtctttatcg acataaatca ttccgtagcg
      121 ttttttcatt tctcctgttc cggcagaaac caggtcaatg caaccccacg gggtgtagcc
      181 aattaagtca acaccatctt caactacggc ttttttcatt tcccgaatat gggaggaaaa
      241 gtaatcaatg cgatagtgat cgttcaccgt gccgtcagct tgtctctggt caaccgcacc
      301 aaatccattt tcgacaataa acagcggcaa ctggaaatga tcccagaacc agttgagtga
      361 ataacgtagc cctgccggat caatttgcca tccccattcg gatgtatcga tataagggtt
      421 gctgaccaaa tctcgtggtt caacataatc cagttgcgga ttatctggcg aaaattgcgt
      481 cgtaaaagac atgtagtagc taaagccgat aaagtcgaca caacctctgg caagaatcgc
      541 gttatcatct ggtgtgatat cgaggttgaa tcctttcctt gcaaagtaat tcagcatatg
      601 ttgcggataa tatccacgag catgaacatc agtaaaccag taacgacgat gcatcgcttt
      661 cgtggccatc atcatatcgt tgggtgcaca cgtcagagga tagatggggc acatagcaat
      721 catacagccg atattaaatt caggattgat cagttttcca gtctgtaccg ccagggcact
      781 ggcaactaac tcataatgta ccgcctggta cattatttgt tcgcgctcat tgagatcttc
      841 ctctggcgaa tacaagatac cggaattagt aaatggacac aggctttcgc tgaaattcac
      901 ctgattattg atttcgttaa acgtcatcca gtactttact tttgctttat agcgcgtgaa
      961 gaccgttgat gcgaagtgga taaaaaagtc gatcagttta cggtttcgcc agccaccata
     1021 ttttgtcacc agatgataag gcatctcaaa atgcgaaagc gtcaccacag gttccattcc
     1081 ctgcttcagg cattcatcga agagatcatc ataaaattgt aaaccctctt cattcggctc
     1141 ctgttcgtca ccttgcggaa agattcgtgt ccaggcaatg gaagttcgaa agcatttgaa
     1201 tcccatcccg gcaaataact gaatatctgt tttatagcga tgataaaaat caattgcttc
     1261 atgattggga taattaagtc cgtcgataac gccttctgtc acttcacgcg gcaccccgtg
     1321 agcgccagca gtcattacat cagcgatact gatgcctttt cctccttcat tccagccacc
     1381 ttccaattga tgtgcggcta ccgcaccgcc ccataaaaaa tcttctttaa atcctgacat
     1441 acacaactcc ttaactaaat agattaccac cagacttccg cctggatgcc ggccataaac
     1501 tcgtctttgc tgttctcatt aaacttaaaa ttcgataatt cgttatccag aatattgata
     1561 tacgtcccgt agaagcgaat ttctggtcgt gaacctaaaa tactttcacc cactttcaat
     1621 gcatgccaga gtgtcgtttt ataagccgat tcatttagcg ttaccccctg ctgagttttg
     1681 ttctgttgct taaaccagcc taattcaagc cccgtctgat tccatgtatt ccagatccag
     1741 gccggtctta ttacggtgcg aatactgtca aaatcactat gagcgccact ttcataacta
     1801 taaacatctt cgccatgaga atagacaagc gcgttagcca taataatatt gtcagtcaga
     1861 tacatctcgc cctgagagat taaacgccag gcgatcccat tggtatggtc accatagtaa
     1921 tagcgaccat gcgccatcgt attactggca tctgagaaac tggcaaaact gctggcatag
     1981 gaattattgg caacctgtaa cgtaaattca ttaaacgtat cgcgttgcaa gttttgccgt
     2041 aaaacagaag ccagcatcca gctatctttc atttcaaaat aactgtcgtc attttcatta
     2101 tcttgttgtt gatccgtttt attaggtgcg gaatatttag ccattaatga caacgtcgca
     2161 tcatcccata acgggatatt gcgataacga acatccacag aattggtatt catctgactg
     2221 gtacgcgtaa aatcacgaga gtaaacatcg acatcatctc ggcttaagga catatcaaac
     2281 agccctacac caagcgccca gttttcaatc cccacacccg cggcaacatc cgtggttaag
     2341 gttttccagt ccagcatttg gatctcatat tgcgggagtt tatgtttgcc gacccagaat
     2401 tctgcctctc gtgcgaaggg taaaaaacct cgcgttgtca gataaatatc actgaactgc
     2461 atgatatttt cattggcact gtcaccaaac caggcatcgt tatactgctg acctacgttt
     2521 ccatcatagg taacgaccgc atttgccgtt ctaccgtcct gattataaac acgctgattt
     2581 aaggtcaggt caaaccaacc actcatctcg ttaccaaaac gtcccagaga acccgctgca
     2641 taagtttgtg gtgagccgtg attaccggtg ccccagcctg agcggaagta cccctgatag
     2701 ctgaatccaa tatcatcctt tatatattta ctgatatctt tcagggtcac gctttcgata
     2761 gtggttttat cgcctttact ttcaataacg gcaatttgct gcgatccatt tatagtatgt
     2821 gctgcggcaa acgagttctg ttcaccgttt tcatttttga tgttatcagc aacgggagat
     2881 gtcgctgata tttcggcaag cgaggccgat tgatatttat tttctgtgat gcttttttgt
     2941 aatgtcgaaa gtcggaattt atatactccc agttcattct gtgttgcttt cagctcttgt
     3001 ttattttgcg acaattcatt ttcaagcaat tcaaggcgct gctctatggt taattttgtt
     3061 gccatcgcgt tcaacggaaa caatgctgta ataatagaaa tagacaaaag gctgacgtta
     3121 agcgtcttta tattcatgac ttatccttaa gagtgtagta atttcctcca gatatcgcac
     3181 tggagaaaaa atcaggcatg gagctctttt cttaatgaaa taatttccct ggctaaatcc
     3241 tggcataata ttgcattcat caaatgatcc tgtgcatgta tcagaattaa attaacttct
     3301 attttgccac agccttcatc ctgactaatt aacgcggtct gaatcttatg cgcctgaaga
     3361 caggcagatt cagagctggc cattagttta gttgcgccct gccagtcttt ttgcccggca
     3421 caacgtatgg cttccatcgc atcagaacga gcctgtcccg cgttgatcag taactccata
     3481 acaagttctt catctgcaaa catcacaatc tccttcctgg gtaaatggaa ttatccgtta
     3541 gttgccatgt tattgacttt gttggcggcg ataacaaaag gtaaatagat cattatcgcc
     3601 acacctaaac ataaaaaacc aaccagtaat gccagccagt ttcccccggt tccgaggaaa
     3661 gcaattaaag gtccgggcgt agtccagggc acggcataag caaccggtgg aataatttcc
     3721 atcgaaacaa agaagtaacc aatggcacag ttaacaatgg gaaccataat aaacggcacc
     3781 atcatgatgg ggttaagtac aataggaagg ccgaacatta tcggttcatt gatattgaaa
     3841 ataccgggga taaatgacat ttttgccaga tcacggtatt ccgcacgctt agaagcgata
     3901 aaaatagcca acaataaccc caacgtcata cctgaaccgc cgcagttggc gaatgcatca
     3961 ttaatagaag tccaggtaat cggatatggc gcgccccagg tagtgccgtg ttgagcggcc
     4021 caggagagat tctccagatt agcctcagaa aaaacggttt ctcgaattgc cgacgttgta
     4081 ttaggtccgt ggatccccag cacccagagg aaattaccca cagcccccag gataattacc
     4141 gcaaagatat tcgttcccat atgtttgagc ggcgtctgga ttaatgtgta aatgagatca
     4201 tttaatcctg ccggggatat caatgtcagg caataattaa gtgcggaaaa gaacaccatg
     4261 acaaaaaata ttggcaataa aactttaaat gaacgcgcta ccgcaggagg tacagctgcc
     4321 ggcatcgtta tggtgatttt ggggtttcga gctaagcgac aaaatatttc actggtgatc
     4381 aatgcaacga taacagcaac aaacaacccc tgcgcgccga gatatttggt cgtcaggaat
     4441 gcctgcccat ctatcatctg gtatggggta taaacaataa aaaatgctcc tattgccgtt
     4501 aatccgcata aaagatcatc ttgctcatag ctaatcgcca cattcctggc gaccaggaaa
     4561 gcaatcataa ttgacatgat atttacagaa ccattcatta ccggagtaaa aatagcttgt
     4621 gctttttcaa ggttggggaa aatgctgttc agatgcagta atccggcaat aaaaccgtcg
     4681 ggcgataata tggcaaagtt aattaatata attaatgagc ttgccatgat aattggaaac
     4741 gtaaaaacga atccatctct caccgctgca acatgttttt gtgagtttaa cttgatagca
     4801 actggaacaa gaaaacgttc cattccacgt tcgaatgatg ccattaatcc cattttttta
     4861 tacctgttgt tctggatatt tatacatccc aataaagcat gtatgccata tccattttat
     4921 gatgtgtgaa aagttagttc aggtaatagt ttaataatat tagattatta ctcttgcagt
     4981 agtttcatcg caaattcaag aactgcctgt ccgttcattg tgccatagtc tttcatctcg
     5041 ataacggcta caggcatttt cccctgtgca acggcttcaa tctctggttt ctgaaagcga
     5101 acttgcggac caagtaaaac aacgtcaact tcgttatttt taattttccc tttagcctct
     5161 gcaatcgcca atgcggaaat attaacttca agtgaaatag cggtagcatg atcaatcata
     5221 cgtttaacca gcatactggt tgacatgccc gcagcgcaaa ctaacaatat ctttttcata
     5281 tttttcctta ctggtatata acagactaca ttgtgtaaat atcatatccc acagtagcta
     5341 cacgtcctgg tacattgatc acagtaatac agcaataaag ttgttcaaca acagggtgat
     5401 aattaaacga ctcggccata cgataaacta atcaggtctc aactccttgt tttcaatggc
     5461 tcttaaaagg tgcaaaagtg atttttcaaa agattgcccg cttacttaaa tccgagatca
     5521 atggcaattc atggcatgtt ggtgatttgt tgccgtcaga agcggaactc gctgttcgtt
     5581 ataatgtttc gcgtaatacc ctgcgtaagg cattatccct gctggaaggc gaaggaatta
     5641 ttcacagaaa gcatggttca ggaacataca ttcaaaaaaa gaattttgtt gcacacattg
     5701 atcacatgaa cagtttcagt gaaattgcac ataaaagcgg caaagaggca ggaagtcaga
     5761 ttatgaaatt tgaagtgcag gatgcttctc ctactattgc gactgagctg aatttagtga
     5821 ctggtgagca ggtttattac ataaagagac tgcgattcat tgaggataat gcagcacaat
     5881 tagaagaaac gtggatgtca gtggcacgtt ttcctgattt aaccgtatcg catatgcaaa
     5941 aatccaaatt ttcgtatatt gagaacgaat gcgggatcaa aatcattggc acctttgaaa
     6001 ctttctcccc gacttttcct accccagaaa tcgccagtat tttacggatc agcccacggg
     6061 atcccatact taaaattcag acccaggctg tggatagtaa ctctattccg ctggattatt
     6121 cgttacttta cagcaatatt ttcgagttcc aggtaaagta cttttttccg cgataa
//
