Home · Families · S1 · S1_2

S1_2 S1 S1 family — see CAZy reference

3,066
target proteins
3,031
CGCs
999 / 3,066
tree leaves
385
co-occurring families

Overview

ClassS1
Annotation sourcedbCAN consensus (HMM + DIAMOND, "recommended-result" column)
SSN sourceSubset of global_ssn.sqlite (DIAMOND e ≤ 1e-30)
Tree inputMUSCLE-super5 alignment of s1_2_proteins.faa (subsampled to 999 sequences, seed 42)
Tree methodFastTreeMP −lg −nosupport

Phylogeny interactive · search MAG / protein

Sequence Similarity Network Cytoscape-openable session · 22.1 MB

Subset of the global SSN restricted to proteins from S1_2 CGCs.

SSN
.cys
S1_2_SSN.cys
Full S1_2 SSN at e ≤ 1e-30. Node attributes: Leiden community, enzyme category, top annotation, global x/y. Edge attributes: bitscore, e-value.
Cytoscape session bundle · 22.1 MB
Download
To open in Cytoscape Desktop (free): File → Open and pick the .cys.

SCoNe co-occurrence community-level

Taxonomy sunburst GTDB drilldown

Host distribution CGCs per genome by group

S1_2 host-group violin

Co-occurring subfamilies first 15 of 385 · .tsv

SubfamilyClassCGCs# genes# SSN clusters
S1_2S13,066
GT4GT445
GT2GT396
GT30GT189
AA3_2AA133
GH3GH111
GH23GH100
GT51GT80
GH25GH78
GH103GH78
S1_6S177
GH15GH67
GH77GH66
PL6PL62
PL17_2PL61

Downloads

Full SSN (Cytoscape)S1_2_SSN.cys · 22.1 MB
Newicks1_2_phylogeny.max999.nwk
MUSCLE alignments1_2_phylogeny.aln.faa
Target proteins (FASTA)s1_2_proteins.faa
All proteins in associated CGCssequences.faa
CGC listcgcs.txt
Co-occurring familiesranked_families.tsv