You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003949_00380

Basic Information

Species: Eubacterium_F sp900539115
Lineage: Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Eubacterium_F; Eubacterium_F sp900539115
CAZyme ID: MGYG000003949_00380
CAZy Family: GH5
CAZyme Description: hypothetical protein
CAZyme Property
Protein Length (aa): 771
CGC: MGYG000003949_3|CGC1
Molecular Weight (Da): 83154.49
Isoelectric Point: 8.6109
Genome Property
Genome Assembly ID: MGYG000003949
Genome Size (bp): 2807569
Genome Type: MAG
Country: United Kingdom
Continent: Europe
Gene Location: Start: 6312; End: 8627; Strand: -
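The gene coordinates and the protein length can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; neither assumption is stated in the record itself.

```python
# Values copied from the record; the +1 assumes 1-based inclusive coordinates.
start, end = 6312, 8627
protein_length = 771

gene_length_nt = end - start + 1   # nucleotides spanned by the gene
codons = gene_length_nt // 3       # whole codons in that span

print(gene_length_nt)                  # 2316
print(gene_length_nt % 3 == 0)         # True: the span is a whole number of codons
print(codons - 1 == protein_length)    # True: 772 codons = 771 residues + 1 stop
```

Under those assumptions the 2316 nt span is consistent with a 771-residue protein followed by a stop codon.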

Enzyme Prediction

EC: 3.2.1.4; 3.2.1.-

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH5 203 442 1.1e-86 0.9873417721518988

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 2.00e-63 201 450 2 272
Cellulase (glycosyl hydrolase family 5).
sd00036 LRR_3 8.84e-13 657 745 63 139
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 1.01e-12 657 745 17 93
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306 LRR_5 1.94e-11 657 745 14 89
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
COG2730 BglC 2.14e-11 178 420 33 330
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
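Several of the CDD hits above overlap on the query. As a rough illustration, the sketch below merges overlapping query intervals (qStart, qEnd) into distinct regions; `merge_regions` is a hypothetical helper written for this example, not part of any CDD tooling.

```python
# (Cdd ID, qStart, qEnd) copied from the CDD Domains table.
hits = [
    ("pfam00150", 201, 450),
    ("sd00036", 657, 745),
    ("sd00036", 657, 745),
    ("pfam13306", 657, 745),
    ("COG2730", 178, 420),
]

def merge_regions(hits):
    """Merge hits whose query intervals overlap into (start, end, ids) regions."""
    regions = []
    for name, start, end in sorted(hits, key=lambda h: h[1]):
        if regions and start <= regions[-1][1]:
            # Overlaps the current region: extend it and record the ID.
            regions[-1][1] = max(regions[-1][1], end)
            regions[-1][2].add(name)
        else:
            regions.append([start, end, {name}])
    return [(s, e, sorted(ids)) for s, e, ids in regions]

regions = merge_regions(hits)
for start, end, ids in regions:
    print(start, end, ids)
```

On these values the hits collapse into two regions: residues 178-450 (the cellulase/BglC hits) and residues 657-745 (the leucine-rich repeat hits).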

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
AAC06196.1 1.15e-103 170 483 15 338
CBK74991.1 1.06e-99 171 493 32 370
AGH41463.1 1.99e-99 170 483 222 545
AAC06197.1 4.07e-98 165 483 44 372
AGH40913.1 6.02e-98 165 483 63 391

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6GJF_A 4.55e-87 184 483 5 301
Ancestral endocellulase Cel5A [synthetic construct], 6GJF_B Ancestral endocellulase Cel5A [synthetic construct], 6GJF_C Ancestral endocellulase Cel5A [synthetic construct], 6GJF_D Ancestral endocellulase Cel5A [synthetic construct], 6GJF_E Ancestral endocellulase Cel5A [synthetic construct], 6GJF_F Ancestral endocellulase Cel5A [synthetic construct]
4XZB_A 6.10e-82 182 478 2 299
endo-glucanase GsCelA P1 [Geobacillus sp. 70PC53]
3PZT_A 3.85e-79 184 478 29 320
Structure of the endo-1,4-beta-glucanase from Bacillus subtilis 168 with manganese(II) ion [Bacillus subtilis subsp. subtilis str. 168], 3PZT_B Structure of the endo-1,4-beta-glucanase from Bacillus subtilis 168 with manganese(II) ion [Bacillus subtilis subsp. subtilis str. 168], 3PZU_A P212121 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168], 3PZU_B P212121 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168], 3PZV_A C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168], 3PZV_B C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168], 3PZV_C C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168], 3PZV_D C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168]
1H11_A 3.70e-78 183 483 3 301
2-DEOXY-2-FLURO-B-D-CELLOTRIOSYL/ENZYME INTERMEDIATE COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION [Salipaludibacillus agaradhaerens], 1H2J_A ENDOGLUCANASE CEL5A IN COMPLEX WITH UNHYDROLYSED AND COVALENTLY LINKED 2,4-DINITROPHENYL-2-DEOXY-2-FLUORO-CELLOBIOSIDE AT 1.15 A RESOLUTION [Salipaludibacillus agaradhaerens], 1HF6_A ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHAERENS IN THE ORTHORHOMBIC CRYSTAL FORM IN COMPLEX WITH CELLOTRIOSE [Salipaludibacillus agaradhaerens], 1OCQ_A COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION with cellobio-derived isofagomine [Salipaludibacillus agaradhaerens], 1W3K_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellobio Derived-tetrahydrooxazine [Salipaludibacillus agaradhaerens], 1W3L_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellotri Derived-Tetrahydrooxazine [Salipaludibacillus agaradhaerens], 4A3H_A 2',4' Dinitrophenyl-2-Deoxy-2-Fluro-B-D-Cellobioside Complex Of The Endoglucanase Cel5a From Bacillus Agaradhaerens At 1.6 A Resolution [Salipaludibacillus agaradhaerens], 5A3H_A 2-Deoxy-2-Fluro-B-D-CellobiosylENZYME INTERMEDIATE COMPLEX Of The Endoglucanase Cel5a From Bacillus Agaradhearans At 1.8 Angstroms Resolution [Salipaludibacillus agaradhaerens], 6A3H_A 2-Deoxy-2-Fluro-B-D-CellotriosylENZYME INTERMEDIATE COMPLEX OF THE Endoglucanase Cel5a From Bacillus Agaradhearans At 1.6 Angstrom Resolution [Salipaludibacillus agaradhaerens], 7A3H_A Native Endoglucanase Cel5a Catalytic Core Domain At 0.95 Angstroms Resolution [Salipaludibacillus agaradhaerens], 8A3H_A Cellobiose-derived imidazole complex of the endoglucanase cel5A from Bacillus agaradhaerens at 0.97 A resolution [Salipaludibacillus agaradhaerens]
1E5J_A 3.94e-78 183 483 3 301
Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Methyl-4ii-S-Alpha-Cellobiosyl-4ii-Thio Beta-Cellobioside [Salipaludibacillus agaradhaerens], 1QHZ_A Native Tetragonal Structure Of The Endoglucanase Cel5a From Bacillus Agaradhaerens [Salipaludibacillus agaradhaerens], 1QI0_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Cellobiose [Salipaludibacillus agaradhaerens], 1QI2_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With 2',4'-Dinitrophenyl 2-Deoxy-2-Fluoro-B- D-Cellotrioside [Salipaludibacillus agaradhaerens], 2V38_A Family 5 endoglucanase Cel5A from Bacillus agaradhaerens in complex with cellobio-derived noeuromycin [Salipaludibacillus agaradhaerens]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P07983 2.00e-78 184 483 34 330
Endoglucanase OS=Bacillus subtilis OX=1423 GN=bglC PE=3 SV=2
Q59394 3.41e-77 184 490 33 340
Endoglucanase N OS=Pectobacterium atrosepticum OX=29471 GN=celN PE=3 SV=1
P06565 3.44e-77 188 483 34 327
Endoglucanase B OS=Evansella cellulosilytica (strain ATCC 21833 / DSM 2522 / FERM P-1141 / JCM 9156 / N-4) OX=649639 GN=celB PE=3 SV=1
Q47096 9.04e-77 184 490 33 340
Endoglucanase 5 OS=Pectobacterium carotovorum subsp. carotovorum OX=555 GN=celV PE=1 SV=1
O85465 1.01e-76 183 499 29 344
Endoglucanase 5A OS=Salipaludibacillus agaradhaerens OX=76935 GN=cel5A PE=1 SV=1
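The Swiss-Prot description lines pack several fields into one string (OS = organism, OX = taxonomy ID, GN = gene name, PE = protein existence level, SV = sequence version). The following is a minimal parsing sketch; `parse_description` and the regular expression are hypothetical and only cover the field layout seen in this record.

```python
import re

# Hypothetical pattern: covers only the fields that appear in the hits above.
HEADER = re.compile(
    r"^(?P<name>.+?) OS=(?P<organism>.+?) OX=(?P<taxid>\d+)"
    r"(?: GN=(?P<gene>\S+))? PE=(?P<pe>\d) SV=(?P<sv>\d+)$"
)

def parse_description(line):
    """Split a Swiss-Prot style description into its named fields."""
    match = HEADER.match(line)
    if match is None:
        raise ValueError(f"unrecognised description: {line!r}")
    return match.groupdict()

# Three of the five hits, copied verbatim from the table.
descriptions = {
    "P07983": "Endoglucanase OS=Bacillus subtilis OX=1423 GN=bglC PE=3 SV=2",
    "P06565": "Endoglucanase B OS=Evansella cellulosilytica (strain ATCC 21833 / "
              "DSM 2522 / FERM P-1141 / JCM 9156 / N-4) OX=649639 GN=celB PE=3 SV=1",
    "O85465": "Endoglucanase 5A OS=Salipaludibacillus agaradhaerens OX=76935 "
              "GN=cel5A PE=1 SV=1",
}

parsed = {acc: parse_description(text) for acc, text in descriptions.items()}

# PE=1 means the protein's existence is supported by evidence at protein level.
protein_level = [acc for acc, fields in parsed.items() if fields["pe"] == "1"]
print(protein_level)
```

Filtering on the PE field is one simple way to separate hits with protein-level evidence (PE=1) from those inferred by homology (PE=3).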

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.002107 0.941566 0.055260 0.000400 0.000319 0.000300
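The predicted class corresponds to the highest-scoring column in the table above. The sketch below pairs each label with its score and picks the maximum; it assumes the scores are probabilities that should sum to roughly 1, which holds for these values up to rounding.

```python
# Labels and scores copied from the SignalP and Lipop table.
labels = ["Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
          "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII"]
scores = [0.002107, 0.941566, 0.055260, 0.000400, 0.000319, 0.000300]

prediction = dict(zip(labels, scores))

# The call is simply the label with the highest score.
best_label = max(prediction, key=prediction.get)
total = sum(scores)

print(best_label)        # SP_Sec_SPI
print(round(total, 6))   # close to 1 if the scores are probabilities
```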

TMHMM Annotations

start end
7 26