logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001777_00359

You are here: Home > Sequence: MGYG000001777_00359

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Eubacterium_F sp000434115
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Eubacterium_F; Eubacterium_F sp000434115
CAZyme ID MGYG000001777_00359
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
729 MGYG000001777_3|CGC2 79104.04 8.8999
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001777 2510144 MAG Denmark Europe
Gene Location Start: 41673;  End: 43862  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4 3.2.1.-

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 205 444 5.3e-85 0.9873417721518988

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 2.03e-61 203 452 2 272
Cellulase (glycosyl hydrolase family 5).
sd00036 LRR_3 3.95e-13 615 703 17 93
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 8.14e-13 615 703 63 139
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306 LRR_5 6.73e-12 615 703 14 89
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
sd00036 LRR_3 1.52e-11 632 703 2 70
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AAC06196.1 2.22e-103 172 485 15 338
AGH41463.1 9.90e-100 172 485 222 545
CBK74991.1 1.23e-97 173 495 32 370
AAC06197.1 8.38e-97 167 485 44 372
AGH40913.1 2.35e-96 167 485 63 391

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6GJF_A 3.69e-87 186 485 5 301
Ancestralendocellulase Cel5A [synthetic construct],6GJF_B Ancestral endocellulase Cel5A [synthetic construct],6GJF_C Ancestral endocellulase Cel5A [synthetic construct],6GJF_D Ancestral endocellulase Cel5A [synthetic construct],6GJF_E Ancestral endocellulase Cel5A [synthetic construct],6GJF_F Ancestral endocellulase Cel5A [synthetic construct]
4XZB_A 6.02e-85 186 480 4 299
endo-glucanaseGsCelA P1 [Geobacillus sp. 70PC53]
1LF1_A 2.45e-80 190 485 8 301
CrystalStructure of Cel5 from Alkalophilic Bacillus sp. [Bacillus subtilis]
1H11_A 4.10e-80 185 485 3 301
2-DEOXY-2-FLURO-B-D-CELLOTRIOSYL/ENZYMEINTERMEDIATE COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION [Salipaludibacillus agaradhaerens],1H2J_A ENDOGLUCANASE CEL5A IN COMPLEX WITH UNHYDROLYSED AND COVALENTLY LINKED 2,4-DINITROPHENYL-2-DEOXY-2-FLUORO-CELLOBIOSIDE AT 1.15 A RESOLUTION [Salipaludibacillus agaradhaerens],1HF6_A ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHAERENS IN THE ORTHORHOMBIC CRYSTAL FORM IN COMPLEX WITH CELLOTRIOSE [Salipaludibacillus agaradhaerens],1OCQ_A COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION with cellobio-derived isofagomine [Salipaludibacillus agaradhaerens],1W3K_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellobio Derived-tetrahydrooxazine [Salipaludibacillus agaradhaerens],1W3L_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellotri Derived-Tetrahydrooxazine [Salipaludibacillus agaradhaerens],4A3H_A 2',4' Dinitrophenyl-2-Deoxy-2-Fluro-B-D-Cellobioside Complex Of The Endoglucanase Cel5a From Bacillus Agaradhaerens At 1.6 A Resolution [Salipaludibacillus agaradhaerens],5A3H_A 2-Deoxy-2-Fluro-B-D-CellobiosylENZYME INTERMEDIATE COMPLEX Of The Endoglucanase Cel5a From Bacillus Agaradhearans At 1.8 Angstroms Resolution [Salipaludibacillus agaradhaerens],6A3H_A 2-Deoxy-2-Fluro-B-D-CellotriosylENZYME INTERMEDIATE COMPLEX OF THE Endoglucanase Cel5a From Bacillus Agaradhearans At 1.6 Angstrom Resolution [Salipaludibacillus agaradhaerens],7A3H_A Native Endoglucanase Cel5a Catalytic Core Domain At 0.95 Angstroms Resolution [Salipaludibacillus agaradhaerens],8A3H_A Cellobiose-derived imidazole complex of the endoglucanase cel5A from Bacillus agaradhaerens at 0.97 A resolution [Salipaludibacillus agaradhaerens]
1E5J_A 4.37e-80 185 485 3 301
EndoglucanaseCel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Methyl-4ii-S-Alpha-Cellobiosyl-4ii-Thio Beta-Cellobioside [Salipaludibacillus agaradhaerens],1QHZ_A Native Tetragonal Structure Of The Endoglucanase Cel5a From Bacillus Agaradhaerens [Salipaludibacillus agaradhaerens],1QI0_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Cellobiose [Salipaludibacillus agaradhaerens],1QI2_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With 2',4'-Dinitrophenyl 2-Deoxy-2-Fluoro-B- D-Cellotrioside [Salipaludibacillus agaradhaerens],2V38_A Family 5 endoglucanase Cel5A from Bacillus agaradhaerens in complex with cellobio-derived noeuromycin [Salipaludibacillus agaradhaerens]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P06565 5.35e-79 190 493 34 335
Endoglucanase B OS=Evansella cellulosilytica (strain ATCC 21833 / DSM 2522 / FERM P-1141 / JCM 9156 / N-4) OX=649639 GN=celB PE=3 SV=1
P07983 8.69e-79 186 485 34 330
Endoglucanase OS=Bacillus subtilis OX=1423 GN=bglC PE=3 SV=2
O85465 3.08e-78 185 493 29 335
Endoglucanase 5A OS=Salipaludibacillus agaradhaerens OX=76935 GN=cel5A PE=1 SV=1
P10475 9.26e-77 186 480 34 325
Endoglucanase OS=Bacillus subtilis (strain 168) OX=224308 GN=eglS PE=1 SV=1
Q59394 1.62e-75 186 492 33 340
Endoglucanase N OS=Pectobacterium atrosepticum OX=29471 GN=celN PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.002042 0.535659 0.461121 0.000652 0.000286 0.000227

TMHMM  Annotations      download full data without filtering help

start end
7 29