CAZyme3D



Information for CAZyme ID: CAJ76260.1

Basic Information

GenBank ID: CAJ76260.1
Family: GT14
Sequence Length: 876
UniProt ID: Q2KT98 (100,100)
Average pLDDT: 90.09
CAZy50 ID: 19993
CAZy50 Rep: No, CAD7082870.1
Structure Cluster: -
EC Number(s): -
Substrate(s): -

Taxonomy

Tax ID: 7240
Kingdom: Eukaryota
Phylum: Arthropoda
Class: Insecta
Order: Diptera
Family: Drosophilidae
Genus: Drosophila
Species: Drosophila simulans

Protein Sequence:
90 < pLDDT <= 100;
70 < pLDDT <= 90;
50 < pLDDT <= 70;
0 <= pLDDT <= 50

MEQSVSARWL  KRYRAFFLIL  LLIVAIQLFL  AYKSLDIVGG  GSGSGFDAAE  APASPPPPHS  60
QARVQPPTRT  KLTAQQLGFQ  PECDILAREA  ISALQRAKTK  DCREHIAHIA  CAIQAGRFYA  120
PQLRSSCPAG  NHTANVSLGC  FKDEKDRRLL  AGYYSSSKTS  NSPAKCVELC  LQSGYPYAGV  180
QYGRECFCGY  DTPPRAAKLP  DSSCNTKCLG  NAKEICGGFY  AMNIYETGIA  KFTAQLAATT  240
PPEETKRVRI  AFLLTLNGRA  LRQVHRLLKA  LYAPEHVYYI  HVDERQDYLY  RKLLELESKF  300
PNIRLARKRF  STIWGGASLL  TMLLQCMEDL  LQSNWHWDFV  INLSESDFPV  KTLDKLVDFL  360
SANPGRNFVK  GHGRETQKFI  QKQGLDKTFV  ECDTHMWRIG  DRKLPAGIQV  DGGSDWVALS  420
RPFVAYVTHP  REDDELLQAL  LKLFRHTLLP  AESFFHTVLR  NTKHCTSYVD  NNLHVTNWKR  480
KQGCKCQYKH  VVDWCGCSPN  DFKPEDWPRL  QATEQKSLFF  ARKFEPVINQ  AVLLQLEEWL  540
YGPYTSEYAN  LHGYWQSLYH  HEDVHGAGDD  LARSIGDSVM  RLSARQAKLD  PLELIELTHY  600
LHRDQYKGFL  VRYRARGSTG  KPLHLETRVR  PTQQGKLARN  ARFSKRLRNF  EVSTDFDQKE  660
QIARNFGKLL  GPQSDLLLSY  TLQANADSGA  ASHSYNLTLL  WIDPLGRLQD  FNELHVEDST  720
SDVINHSKTL  LKHPITPGIW  TAKLIGRNSI  YAQLKFLIAP  LAYYKGYPLA  KSSEAEALNA  780
GLTVALPEDF  EMPVEWQQHL  QTDDEQFTMR  EESLAKGKML  GQELHSWIDG  LVGQFFQLRE  840
SCVVEADSEV  SLPLCSDAPW  SSLAPDPKSD  VDALLK  876
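
The sequence above is shown in blocks of ten residues, and each line ends with the position number of its last residue. As a minimal sketch (the function name `parse_blocked_sequence` is a hypothetical helper, not part of CAZyme3D), the block layout can be flattened into a plain sequence string like this:

```python
import re


def parse_blocked_sequence(lines):
    """Join blocked sequence lines into one string.

    Whitespace between the ten-residue blocks and the trailing
    position counter on each line (e.g. 60, 876) are dropped.
    """
    residues = []
    for line in lines:
        # Strip the trailing position counter, whether fused or space-separated.
        without_counter = re.sub(r"\d+\s*$", "", line.strip())
        # Remove the gaps between residue blocks.
        residues.append(re.sub(r"\s+", "", without_counter))
    return "".join(residues)


# The last two lines of the record, copied verbatim for illustration.
tail_lines = [
    "GLTVALPEDF  EMPVEWQQHL  QTDDEQFTMR  EESLAKGKML  GQELHSWIDG  LVGQFFQLRE840",
    "SCVVEADSEV  SLPLCSDAPW  SSLAPDPKSD  VDALLK876",
]
tail = parse_blocked_sequence(tail_lines)
print(len(tail), tail[-6:])
```

Applied to all fifteen lines, the flattened string should have the length reported under Basic Information (876), which gives a quick consistency check on the copy.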

Predicted 3D structure by AlphaFold2 with pLDDT = 90.09

pLDDT is a per-residue confidence score (0 to 100) for the predicted structure. A higher value indicates greater confidence in the prediction at that residue. For more detail, please see AlphaFold.

Residues are colored according to pLDDT (blue: high quality; red: low quality).
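
To illustrate how per-residue scores relate to the four pLDDT ranges in the sequence legend and to the average reported for the entry, here is a minimal sketch. The function names and the example scores are hypothetical and are not taken from this entry's structure file.

```python
def plddt_band(score):
    """Return the legend range that a single pLDDT score falls into."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {score}")
    if score > 90:
        return "90 < pLDDT <= 100"
    if score > 70:
        return "70 < pLDDT <= 90"
    if score > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


def summarize_plddt(scores):
    """Return the mean pLDDT and the number of residues in each range."""
    counts = {}
    for score in scores:
        band = plddt_band(score)
        counts[band] = counts.get(band, 0) + 1
    mean = sum(scores) / len(scores)
    return mean, counts


# Hypothetical per-residue scores, for illustration only.
example_scores = [95.2, 88.0, 71.5, 49.9, 90.0]
mean_plddt, band_counts = summarize_plddt(example_scores)
print(round(mean_plddt, 2), band_counts)
```

Note that the boundaries follow the legend exactly: a score of exactly 90 belongs to the 70 to 90 range, not the top one.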

Carbohydrate-binding residues predicted by CAPSIF

Binding site residues are not predicted, since this is not a representative ID (CAZyme3D-ID50).

Full Sequence:
AA; CE; PL; GH; GT; CBM

dbCAN3 predicted domain(s): GT14(250-495)


Predicted CAZyme domains from dbCAN

Domains were colored according to CAZyme classification: (AA), (CE), (PL), (GH), (GT), (CBM), & (Null)
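
The domain annotation for this entry, GT14(250-495), can be turned into a subsequence once the protein sequence is available as a plain string. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page; `parse_domain` and `slice_domain` are hypothetical helper names.

```python
import re


def parse_domain(annotation):
    """Split an annotation such as 'GT14(250-495)' into (family, start, end)."""
    match = re.fullmatch(r"\s*(\w+)\((\d+)-(\d+)\)\s*", annotation)
    if match is None:
        raise ValueError(f"Unrecognized domain annotation: {annotation!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def slice_domain(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("Domain coordinates fall outside the sequence")
    # Convert the 1-based inclusive range to Python's 0-based half-open slice.
    return sequence[start - 1:end]


family, start, end = parse_domain("GT14(250-495)")
domain_length = end - start + 1
print(family, start, end, domain_length)
```

Under that assumption the GT14 domain covers 246 of the 876 residues.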

dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation.

Details:
dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆HMMER search for CAZyme family annotation vs. dbCAN CAZyme domain HMM database
⋆DIAMOND search for BLAST hits in the CAZy database
⋆HMMER search for CAZyme subfamily annotation vs. dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)
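
This page does not say how the results of the three tools are combined. One possible approach, assumed here purely for illustration, is to keep a family only when at least two of the three tools report it. The function name, the `min_tools` parameter, and the example inputs are hypothetical, and the inputs are assumed to be already reduced to family-level labels.

```python
from collections import Counter


def consensus_families(hmmer_hits, diamond_hits, dbcan_sub_hits, min_tools=2):
    """Return families reported by at least `min_tools` of the three tools.

    Each argument is an iterable of family labels (e.g. "GT14") from one tool.
    The two-of-three rule is an assumption, not something stated on this page.
    """
    votes = Counter()
    for hits in (hmmer_hits, diamond_hits, dbcan_sub_hits):
        # A tool counts once per family, however many hits it reports.
        votes.update(set(hits))
    return sorted(family for family, n in votes.items() if n >= min_tools)


# Hypothetical tool outputs, for illustration only.
agreed = consensus_families(["GT14"], ["GT14", "GT2"], [])
print(agreed)
```

Raising `min_tools` to 3 makes the call stricter, and lowering it to 1 returns the union of all reported families.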

For more details, please see dbCAN3.

Similarities between sequences in the same cluster, from DIAMOND