You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000001748_03212



Basic Information

Species CAG-56 sp900762665
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; CAG-56; CAG-56 sp900762665
CAZyme ID MGYG000001748_03212
CAZy Family CBM32
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 1874 aa
CGC (no value listed)
Molecular Weight 210592.37 Da
Isoelectric Point 4.833
Genome Property
Genome Assembly ID MGYG000001748
Genome Size 4218051 bp
Genome Type MAG
Country Sweden
Continent Europe
Gene Location Start: 10914;  End: 16538  Strand: -
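
As a quick consistency check, the gene coordinates can be compared with the reported protein length. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene ends with a single stop codon, neither of which is stated in this record. The function name is hypothetical.

```python
def protein_length_from_gene(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Return the expected protein length (residues) for a gene span.

    Assumption: coordinates are 1-based and inclusive.
    """
    gene_length = end - start + 1  # nucleotides covered by the gene
    if gene_length % 3 != 0:
        raise ValueError(f"gene length {gene_length} is not a multiple of 3")
    codons = gene_length // 3
    # Assumption: the final codon is a stop codon and is not translated.
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Location line of this record.
expected_length = protein_length_from_gene(10914, 16538)
print(expected_length)  # 1874, matching the reported Protein Length
```

Under these assumptions, the 5625-nucleotide span corresponds to 1875 codons, or 1874 residues once the stop codon is excluded.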


Enzyme Prediction

No EC number is predicted for MGYG000001748_03212.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH95 435 1096 1.2e-47 0.8476454293628809
CBM32 132 269 3e-17 0.8790322580645161
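
The table lists domains in the order reported, not in their order along the protein. The following minimal sketch sorts the two rows by start position to show the domain architecture. The `SignatureDomain` type and the parsing helper are hypothetical names introduced for illustration, and the span length assumes inclusive coordinates.

```python
from typing import NamedTuple


class SignatureDomain(NamedTuple):
    family: str
    start: int
    end: int
    evalue: float
    coverage: float


def parse_domain_row(row: str) -> SignatureDomain:
    """Parse one whitespace-separated row of the signature domain table."""
    family, start, end, evalue, coverage = row.split()
    return SignatureDomain(family, int(start), int(end), float(evalue), float(coverage))


# Rows copied from the table above.
rows = [
    "GH95 435 1096 1.2e-47 0.8476454293628809",
    "CBM32 132 269 3e-17 0.8790322580645161",
]

# Sort by start position to get the N- to C-terminal order.
domains = sorted((parse_domain_row(r) for r in rows), key=lambda d: d.start)
for d in domains:
    # Assumption: start and end are inclusive residue positions.
    span = d.end - d.start + 1
    print(f"{d.family}: residues {d.start}-{d.end} ({span} aa)")
```

Read this way, the CBM32 module lies N-terminal to the GH95 catalytic domain.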

CDD Domains

CDD ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00754 F5_F8_type_C 9.78e-16 125 259 1 126
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam00754 F5_F8_type_C 6.15e-11 301 425 11 126
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
COG5492 YjdB 7.15e-11 1160 1300 191 320
Uncharacterized conserved protein YjdB, contains Ig-like domain [General function prediction only].
sd00036 LRR_3 3.69e-08 1760 1849 14 116
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam02368 Big_2 9.44e-08 1253 1303 18 68
Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins.

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QNK55922.1 0.0 110 1141 1 1037
QHW32096.1 1.85e-100 41 1141 45 1082
AWS40658.1 3.78e-73 343 1140 178 926
QXE33659.1 6.02e-70 334 1140 196 952
BCB76148.1 2.67e-65 334 1141 124 880

PDB Hits

MGYG000001748_03212 has no PDB hit.

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P33747 7.95e-11 1126 1341 13 229
Uncharacterized protein CA_P0160 OS=Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) OX=272562 GN=CA_P0160 PE=3 SV=2

SignalP and LipoP Annotations

This protein is predicted to carry a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000471 0.979780 0.018989 0.000295 0.000241 0.000191
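
The SP call can be read off the scores above by taking the highest-scoring class. The sketch below illustrates that step only; the helper name is hypothetical, and it assumes the reported call is the highest-scoring class, which this page does not state explicitly.

```python
def top_class(labels: list[str], scores: list[float]) -> tuple[str, float]:
    """Return the label with the highest score and that score."""
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    best_index = max(range(len(scores)), key=scores.__getitem__)
    return labels[best_index], scores[best_index]


# Header and score rows copied from the table above.
labels = "Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII".split()
scores = [float(x) for x in "0.000471 0.979780 0.018989 0.000295 0.000241 0.000191".split()]

best_label, best_score = top_class(labels, scores)
print(best_label, best_score)
```

For this record the highest-scoring class is SP_Sec_SPI, consistent with the SP prediction stated above.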

TMHMM Annotations

There are no transmembrane helices in MGYG000001748_03212.