You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000004517_00202

Basic Information

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; UBA5884;
CAZyme ID MGYG000004517_00202
CAZy Family GT2
CAZyme Description D-alanine--poly(phosphoribitol) ligase subunit 1
CAZyme Property
Protein Length 2562
CGC MGYG000004517_2|CGC1
Molecular Weight 290607.31
Isoelectric Point 5.0636
Genome Property
Genome Assembly ID MGYG000004517
Genome Size 2174745
Genome Type MAG
Country Israel
Continent Asia
Gene Location Start: 18897;  End: 26585  Strand: -
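
As a quick consistency check, the gene coordinates can be compared with the listed protein length. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the span includes a stop codon; neither is stated on this page.

```python
# Values copied from this record.
start, end = 18897, 26585
protein_length = 2562

# Assumption: coordinates are 1-based and inclusive.
gene_length_nt = end - start + 1
codons, remainder = divmod(gene_length_nt, 3)

print(gene_length_nt)           # 7689
print(codons, remainder)        # 2563 0
print(codons - protein_length)  # 1, consistent with one stop codon
```

Under those assumptions the span holds 2563 codons, one more than the 2562 residues reported, which is what a single terminal stop codon would account for.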

Full Sequence

MSENKDLMKK  ALLEIQRLKK  LLSEKTISNE  PTAIIGMACR  LPNGIKNTQD  FWQSLESGQD60
DIINVPENRW  NEYDKDELLK  NPYLQKAGFL  TEDIEEFDSR  LFKIPPKEAS  YIDPQQRMVL120
KVCWEVLENA  GYSPTELKGK  KIGVFCGVAQ  TDFLNENITR  NEHDIDVTGM  NVSFISGRIS180
YFFGFHGPAL  TIDTACSASA  SAINQAIKSL  RDNDCEMAIV  CGVNIMFSPE  TTKKLSALNI240
LSKTCELKSF  DKFANGTVRG  EGCCAILLKN  LSKATADNDS  IHCVIKGSYI  NHDGASTSLT300
APNGFAQEEL  LKSAWIKSGI  TPNDIDYIET  HGTGTALGDP  IEIKSISNVI  GSNRTEPLYI360
GSLKASIGHL  EPASGIASVI  KTALMLEHKK  IVKSINYNVP  SEYVNWNAIS  VKVAQNLMDW420
NKSNGKNRVA  GVSSFGLSGT  NVHIVLEEYN  KDTVTTQDMP  VYPFMFSAVT  ENALIEQMKK480
FLAYVETCKD  INLTNLSYTQ  NVTRAILEVR  TLIMAKNLDD  LKSLLNKAIS  GQMDSNIFTQ540
KGLSKKKIVF  TFTGQGSQYN  NMCKDFYKNP  YFKQAFDMCD  KYYNQLTGGS  LIDLVFSEKY600
DLSQTVYTQP  AIFSVEYSLA  YMYQKYGINP  SIVFGHSIGE  YVSACISGVF  SLEDAMKLVV660
TRGKAIQEKA  IFGKMMAIFT  DKECIKDIIK  PYNDVYISLS  NSQQQTVIAG  SEKSILEIDG720
ILNAKDIKHT  ILNTTRPFHT  PFMQDVSNDF  FEVAKTVTYS  KPKLNIISNV  TSKAETTLFT780
TADYWKQHIV  SEVRFLDSVL  NLGNLSEYLF  LEVGAMPILS  GLIGRISNGN  ADCIYSASKD840
TSADYKIAEI  LSSMFLYAIP  FNMRGYYKSF  NAQTCDIPNY  AFDTKKIPYI  HNLYEKVSYT900
TPVINEVEET  SNSSLELTDE  KISEYIVNLL  VYHLGIDKSD  IDENTNLLTL  GLSSLNGVKI960
VGDIKKHFNV  EINLNDFFNN  CTVSGWKELL  KGLTVSNSAD  KQVNQVTIDE  QHRYDMFDLN1020
DIQYAYWAGR  DNKNMLLSNN  ACCTYFELDM  PNLDIEKFKH  TLKCLEQRHD  MLRCRMNKDA1080
KQYIVKDEYS  TVVVYDYKSI  SDMQKHLDNI  RTTMSYEILP  IDKPMFKVAI  TKLDDINYRI1140
HFSIDFMIAD  AMSLYIFWKD  MGKLYNGEIL  EPLTITYKDY  LNYLNNSSDI  KAKHDVDKKY1200
WLSKIEDFPK  SPELPFKSSE  FVDNKHKFVR  RRKNINPEQW  KKFTQNCAKY  GLTPSSALFT1260
LYAEVLSAFG  GGSSFAIMMT  VFNRENINKD  VQKIIGDFTK  LALIDVHRKD  TTVSKNALDI1320
QSTIMNSISH  TEYSATEFVG  ELRKHFNEDR  MYTVVFTSAI  GIDDLNKDIS  DNDAIFLKSS1380
NSLISSTPQV  CIDHQIFYED  KDIVLSWDTL  DGVFMDNVVS  AMFDVYTSLV  DKAIENPEFF1440
NQTLIDLRPS  YQVEVHDKAN  MTTIEYTPVT  LLTGFNKNVL  ENPNNVAVVT  NGNSYTYRQL1500
DECSNKVANQ  LIQDGVTIGD  RVLVELPKSF  EQIYSIVGIL  KAGAVYVPIT  YKQPKNRTIN1560
IIEKSIPKAI  IGDNKFNDMG  ITYYSIIDFE  SSSSDGTNLP  TISASDGAYI  IYTSGSTGNP1620
KGVYIAHGSA  MNTIDDVTRK  YNITSNDSTI  AISSLSFDLS  VYDVFGMLSV  GGKIVIPTEQ1680
ERIDPKSQYQ  LVKHNDVTVW  NTVPAIMDLY  LDFLNKKSLT  SESIRKVILS  GDWIPLNLLD1740
KLYKALPNAE  LTSMGGATEA  SIWSNYFNVD  TLNPEWNSIP  YGYPLANQRF  YILDDFNRRC1800
PDYVSGKLHI  AGDGLAECYY  NEPSLTENAF  YVHKSIGERL  YNTGDYGKYI  SDGVIEFLGR1860
KDGQIKINGY  RIEIGEVISA  IKKCGIDSKA  IIMPVGNSRK  KIVAFLIQKD  TIDQELLKQE1920
LGKYLPKYFI  PDKIICIEDL  PVTANGKSDM  KKLYQIYEDI  KNSSKASNTS  NENVNPILKK1980
IREILDISEI  TEDDSFSAFG  VSSVDMIRLA  DELESIYGER  PSISDMISYK  SVSELINFFD2040
GKVANDNNNF  VQQQDNSQLD  DDDNFVYTTT  ELQENPLEKY  SNQGIELYLE  DGKLKFKASK2100
GKMTPQLKAE  LIANKQSLIE  YLTVKAEQEK  HDAEYFKENS  FLLTPLQKAY  LLGRSNYYEL2160
GGTSAHYYTE  IEWNNLDLVK  LEKSVNKVIR  YNDILQTVVF  SNGTQAVLEN  MPYYKVKVSK2220
VSNEEFLQVR  STWSEHKYEI  GKWPMFDIIV  SDLGNDNYIV  HFSFDCILLD  GWSANMMISE2280
IFEVYEGRTI  KKPQLSFKDY  VTNESIFLED  KEYHKKAIEY  WENSVQSLPN  APELKYKVDF2340
SQVETPHFKR  KRFVLDKNKT  HLLNEKIKKY  GLTASAVICT  AYMSVLSKYS  NTHDFTLNLT2400
LFNRLPLHND  VWNILGDFTN  ITLIPYFESK  NSTFMDSLTS  TKNYLVEAIE  HRTYNGLELL2460
KHFSDDNILK  AVMPVVFTSM  IFGNLESSSN  ENDIFDSVKE  VYSISQTPQV  SLDHQALVRN2520
NELVLIWDYV  SELFDAEIID  NMFNDYINFI  NTIANSDNWE  IL2562
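
The sequence above is shown in blocks of ten residues with a running position counter fused to the end of each line. A minimal sketch for recovering a plain residue string is given here; the helper name `clean_sequence` is hypothetical, and it assumes the display contains only residue letters, whitespace and digits.

```python
import re


def clean_sequence(display_text: str) -> str:
    """Strip whitespace and position counters from a blocked sequence display."""
    return re.sub(r"[\s\d]+", "", display_text)


# Last line of the display, copied from this record.
last_line = "NELVLIWDYV  SELFDAEIID  NMFNDYINFI  NTIANSDNWE  IL2562"
residues = clean_sequence(last_line)

print(residues)       # NELVLIWDYVSELFDAEIIDNMFNDYINFINTIANSDNWEIL
print(len(residues))  # 42
```

The preceding line ends at position 2520, so 42 residues on the last line agree with the reported protein length of 2562.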

Enzyme Prediction

No EC number was predicted for MGYG000004517_00202.

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
COG3321 PksD 0.0 28 887 2 870
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism].
cd00833 PKS 2.69e-177 30 446 1 421
Polyketide synthases (PKSs) polymerize simple fatty acids into a large variety of different products, called polyketides, by successive decarboxylating Claisen condensations. PKSs can be divided into two groups: modular type I PKSs, consisting of one or more large multifunctional proteins, and iterative type II PKSs, complexes of several monofunctional subunits.
cd12114 A_NRPS_TlmIV_like 1.26e-170 1483 1953 1 477
The adenylation domain of nonribosomal peptide synthetases (NRPS), including Streptoalloteichus tallysomycin biosynthesis genes. The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions. This family includes the TLM biosynthetic gene cluster from Streptoalloteichus that consists of nine NRPS genes; the N-terminal module of TlmVI (NRPS-5) and the starter module of BlmVI (NRPS-5) are comprised of the acyl CoA ligase (AL) and acyl carrier protein (ACP)-like domains, which are thought to be involved in the biosynthesis of the beta-aminoalaninamide moiety.
cd05930 A_NRPS 2.49e-134 1483 1953 1 444
The adenylation domain of nonribosomal peptide synthetases (NRPS). The adenylation (A) domain of NRPS recognizes a specific amino acid or hydroxy acid and activates it as an (amino) acyl adenylate by hydrolysis of ATP. The activated acyl moiety then forms a thioester bond to the enzyme-bound cofactor phosphopantetheine of a peptidyl carrier protein domain. NRPSs are large multifunctional enzymes which synthesize many therapeutically useful peptides in bacteria and fungi via a template-directed, nucleic acid independent nonribosomal mechanism. These natural products include antibiotics, immunosuppressants, plant and animal toxins, and enzyme inhibitors. NRPS has a distinct modular structure in which each module is responsible for the recognition, activation, and in some cases, modification of a single amino acid residue of the final peptide product. The modules can be subdivided into domains that catalyze specific biochemical reactions.
smart00825 PKS_KS 5.11e-128 33 448 2 298
Beta-ketoacyl synthase. The structure of beta-ketoacyl synthase is similar to that of the thiolase family and also chalcone synthase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
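
Several of the CDD hits above overlap on the query, so it can help to collapse them into distinct regions. The sketch below merges the listed qStart/qEnd intervals; the helper name `merge_intervals` is hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# (domain, qStart, qEnd) copied from the CDD table in this record.
hits = [
    ("PksD", 28, 887),
    ("PKS", 30, 446),
    ("A_NRPS_TlmIV_like", 1483, 1953),
    ("A_NRPS", 1483, 1953),
    ("PKS_KS", 33, 448),
]
protein_length = 2562


def merge_intervals(intervals):
    """Merge overlapping inclusive intervals into sorted, non-overlapping regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


regions = merge_intervals([(s, e) for _, s, e in hits])
covered = sum(e - s + 1 for s, e in regions)

print(regions)  # [(28, 887), (1483, 1953)]
print(covered)  # 1331
print(round(100 * covered / protein_length, 1))  # 52.0
```

The five hits reduce to two regions: an N-terminal stretch matched by the PKS-type models and a C-terminal stretch matched by the NRPS adenylation-domain models.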

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
BAZ00088.1 9.80e-195 33 2035 1215 3288
BAZ75991.1 9.80e-195 33 2035 1215 3288
BAY90071.1 4.08e-193 33 1953 1214 3196
BAY30132.1 1.74e-191 33 1953 1217 3207
AFY93865.1 1.24e-112 24 982 953 1968

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4MZ0_A 2.20e-181 20 882 29 929
Chain A, CurL [Moorena producens 3L]; 4MZ0_B Chain B, CurL [Moorena producens 3L]
7LY7_A 1.64e-157 1017 1953 11 932
Chain A, BmdB, Bacillamide NRPS [Thermoactinomyces vulgaris]
7LY4_E 1.68e-157 1017 1953 11 932
Chain E, BmdB, bacillamide NRPS [Thermoactinomyces vulgaris]
7VEE_A 5.75e-138 25 886 16 909
Chain A, Polyketide synthase [Streptomyces graminofaciens]; 7VEF_A Chain A, Polyketide synthase [Streptomyces graminofaciens]
7S6B_A 4.32e-133 5 820 32 861
Chain A, Polyketide synthase [Streptomyces lasalocidi]; 7S6B_B Chain B, Polyketide synthase [Streptomyces lasalocidi]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P48633 1.64e-183 930 2561 28 1911
High-molecular-weight protein 2 OS=Yersinia enterocolitica serotype O:8 / biotype 1B (strain NCTC 13174 / 8081) OX=393305 GN=irp2 PE=3 SV=1
D4AU31 5.44e-133 28 923 623 1545
PKS-NRPS hybrid synthetase swnK OS=Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) OX=663331 GN=swnK PE=3 SV=1
Q1B6A7 9.76e-133 934 1951 19 1031
Phenyloxazoline synthase MbtB OS=Mycobacterium sp. (strain MCS) OX=164756 GN=mbtB PE=3 SV=1
B2HIL7 2.63e-131 8 886 19 912
Phenolphthiocerol synthesis polyketide synthase type I Pks15/1 OS=Mycobacterium marinum (strain ATCC BAA-535 / M) OX=216594 GN=pks15/1 PE=1 SV=1
E9F8M3 3.80e-131 28 882 615 1494
PKS-NRPS hybrid synthetase swnK OS=Metarhizium robertsii (strain ARSEF 23 / ATCC MYA-3075) OX=655844 GN=swnK PE=1 SV=1

SignalP and LipoP Annotations

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000051 0.000000 0.000000 0.000000 0.000000 0.000000

TMHMM Annotations

There are no predicted transmembrane helices in MGYG000004517_00202.