S1_4 · S1 · S1 family (see CAZy reference)

Target proteins: 2,244
CGCs: 2,217
Tree leaves: 999 / 2,244
Co-occurring families: 405

Overview

Class: S1
Annotation source: dbCAN consensus (HMM + DIAMOND, "recommended-result" column)
SSN source: Subset of global_ssn.sqlite (DIAMOND e ≤ 1e-30)
Tree input: MUSCLE-super5 alignment of s1_4_proteins.faa (subsampled to 999 sequences, seed 42)
Tree method: FastTreeMP -lg -nosupport
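
The subsampling named under tree input can be sketched as follows. This is a minimal illustration, assuming uniform random sampling without replacement using Python's `random` module. The page does not state which tool or sampling scheme was actually used, and `read_fasta` and `subsample` are hypothetical helper names.

```python
import random
from typing import Iterable, List, Tuple

Record = Tuple[str, str]  # (header without '>', sequence)


def read_fasta(lines: Iterable[str]) -> List[Record]:
    """Parse FASTA text lines into (header, sequence) pairs."""
    records: List[Record] = []
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def subsample(records: List[Record], k: int = 999, seed: int = 42) -> List[Record]:
    """Return at most k records, drawn uniformly without replacement.

    Assumption: a fixed seed makes the draw repeatable for the same input
    order; this is not necessarily the scheme used to build the S1_4 tree.
    """
    if len(records) <= k:
        return list(records)
    rng = random.Random(seed)
    return rng.sample(records, k)
```

With `records = read_fasta(open("s1_4_proteins.faa"))`, calling `subsample(records)` returns 999 records whenever the input holds more than 999 sequences.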

Phylogeny interactive · search MAG / protein

Sequence Similarity Network Cytoscape-openable session · 20.6 MB

Subset of the global SSN restricted to proteins from S1_4 CGCs.

S1_4_SSN.cys
Full S1_4 SSN at e ≤ 1e-30. Node attributes: Leiden community, enzyme category, top annotation, global x/y. Edge attributes: bitscore, e-value.
To open in Cytoscape Desktop (free): File → Open and pick the .cys.
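
The e-value cutoff behind the SSN edges can be illustrated with a short sketch. It assumes DIAMOND hits in the default 12-column tabular layout (query, subject, ..., e-value in column 11, bitscore in column 12). The schema of global_ssn.sqlite is not shown on this page, so this is not a description of how the session file was built, and `ssn_edges` is a hypothetical name.

```python
from typing import Dict, Iterable, Tuple

Edge = Tuple[str, str]           # unordered pair, stored in sorted order
Attrs = Tuple[float, float]      # (bitscore, e-value)


def ssn_edges(lines: Iterable[str], max_evalue: float = 1e-30) -> Dict[Edge, Attrs]:
    """Collect undirected edges from tabular hits with e-value <= max_evalue.

    Self-hits are dropped. When both directions (A->B and B->A) pass the
    cutoff, the hit with the higher bitscore is kept.
    """
    edges: Dict[Edge, Attrs] = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        if query == subject or evalue > max_evalue:
            continue
        key = (query, subject) if query < subject else (subject, query)
        best = edges.get(key)
        if best is None or bitscore > best[0]:
            edges[key] = (bitscore, evalue)
    return edges
```

Each surviving pair carries the same two edge attributes listed for the session file, bitscore and e-value.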

SCoNe co-occurrence community-level

Taxonomy sunburst GTDB drilldown

Host distribution CGCs per genome by group

[Figure: S1_4 host-group violin plot]

Co-occurring subfamilies first 15 of 405 · .tsv

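Subfamily | Class | CGCs | # genes | # SSN clusters
S1_4 | S1 | 2,244
GT2 | GT | 493
GT4 | GT | 130
GH3 | GH | 119
GH92 | GH | 109
S1_8 | S1 | 88
GH29 | GH | 87
GH43_26 | GH | 84
S1_7 | S1 | 80
GT53 | GT | 80
S1_16 | S1 | 75
AA3_2 | AA | 74
GH2_18 | GH | 70
CE20 | CE | 61
GH20 | GH | 61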
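
A ranking of this kind can be sketched as below. The sketch counts, for each family, the number of CGCs that contain both that family and the target at least once. The page does not say whether its ranking uses a CGC count or a gene count, so treat this as one plausible reading. The input mapping and the name `rank_cooccurring` are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple


def rank_cooccurring(cgc_families: Dict[str, List[str]], target: str) -> List[Tuple[str, int]]:
    """Rank families by the number of target-containing CGCs they appear in.

    cgc_families maps a CGC identifier to the family annotations of its
    genes (duplicates allowed). Ties are broken alphabetically.
    """
    counts: Counter = Counter()
    for families in cgc_families.values():
        present = set(families)      # count each family once per CGC
        if target not in present:
            continue                 # only CGCs that contain the target
        counts.update(present)
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
```

The target family itself always ranks first under this counting, since it is present in every CGC considered.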

Downloads

Full SSN (Cytoscape): S1_4_SSN.cys · 20.6 MB
Newick: s1_4_phylogeny.max999.nwk
MUSCLE alignment: s1_4_phylogeny.aln.faa
Target proteins (FASTA): s1_4_proteins.faa
All proteins in associated CGCs: sequences.faa
CGC list: cgcs.txt
Co-occurring families: ranked_families.tsv