Home · Families · GH · GH4

GH4 GH GH family — see CAZy reference

2,108
target proteins
2,018
CGCs
999 / 2,108
tree leaves
210
co-occurring families

Overview

ClassGH
Annotation sourcedbCAN consensus (HMM + DIAMOND, "recommended-result" column)
SSN sourceSubset of global_ssn.sqlite (DIAMOND e ≤ 1e-30)
Tree inputMUSCLE-super5 alignment of gh4_proteins.faa (subsampled to 999 sequences, seed 42)
Tree methodFastTreeMP −lg −nosupport

Phylogeny interactive · search MAG / protein

Sequence Similarity Network Cytoscape-openable session · 50.7 MB

Subset of the global SSN restricted to proteins from GH4 CGCs.

SSN
.cys
GH4_SSN.cys
Full GH4 SSN at e ≤ 1e-30. Node attributes: Leiden community, enzyme category, top annotation, global x/y. Edge attributes: bitscore, e-value.
Cytoscape session bundle · 50.7 MB
Download
To open in Cytoscape Desktop (free): File → Open and pick the .cys.

SCoNe co-occurrence community-level

Taxonomy sunburst GTDB drilldown

Host distribution CGCs per genome by group

GH4 host-group violin

Co-occurring subfamilies first 15 of 210 · .tsv

SubfamilyClassCGCs# genes# SSN clusters
GH4GH2,108
GH1GH128
GH42GH79
S1_8S140
GH3GH36
S1_6S136
GH170GH34
GH84GH34
GH77GH33
GH105GH32
GH38GH25
S1_2S125
GH13_31GH24
GH28GH24
S1_4S124

Downloads

Full SSN (Cytoscape)GH4_SSN.cys · 50.7 MB
Newickgh4_phylogeny.max999.nwk
MUSCLE alignmentgh4_phylogeny.aln.faa
Target proteins (FASTA)gh4_proteins.faa
All proteins in associated CGCssequences.faa
CGC listcgcs.txt
Co-occurring familiesranked_families.tsv