
S1_20 · class S1 · S1 family (see CAZy reference)

Target proteins: 798
CGCs: 770
Tree leaves: 999 / 798
Co-occurring families: 280

Overview

Class: S1
Annotation source: dbCAN consensus (HMM + DIAMOND, "recommended-result" column)
SSN source: Subset of global_ssn.sqlite (DIAMOND e ≤ 1e-30)
Tree input: MUSCLE-super5 alignment of s1_20_proteins.faa (subsampled to 999 sequences, seed 42)
Tree method: FastTreeMP -lg -nosupport
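The tree input and tree method entries above can be sketched as a small script. This is a minimal illustration, not the pipeline actually used for this page: the helper names (`read_fasta`, `subsample`, `tree_commands`) are hypothetical, the page does not say how the subsample was drawn (a seeded uniform draw is assumed here), and the `muscle -super5 ... -output ...` and `FastTreeMP ... -out ...` invocations assume MUSCLE v5 and a standard FastTree build on the PATH.

```python
import random


def read_fasta(path):
    """Return a list of (header, sequence) tuples from a FASTA file."""
    records, header, chunks = [], None, []
    with open(path) as handle:
        for line in handle:
            line = line.rstrip("\n")
            if line.startswith(">"):
                if header is not None:
                    records.append((header, "".join(chunks)))
                header, chunks = line[1:], []
            elif line:
                chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def subsample(records, max_n=999, seed=42):
    """Keep at most max_n records, chosen reproducibly with a fixed seed.

    Assumption: a uniform random draw; the page only states the cap and seed.
    """
    if len(records) <= max_n:
        return list(records)
    return random.Random(seed).sample(records, max_n)


def tree_commands(fasta, aln, nwk):
    """Build (but do not run) the alignment and tree commands as argument lists."""
    muscle = ["muscle", "-super5", fasta, "-output", aln]
    # -lg selects the LG amino-acid model; -nosupport skips local support values.
    fasttree = ["FastTreeMP", "-lg", "-nosupport", "-out", nwk, aln]
    return muscle, fasttree
```

The two argument lists can be passed to `subprocess.run(cmd, check=True)` once the subsampled FASTA has been written to disk.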

Phylogeny (interactive; search by MAG or protein)

Sequence Similarity Network Cytoscape-openable session · 6.0 MB

Subset of the global SSN restricted to proteins from S1_20 CGCs.

S1_20_SSN.cys (Cytoscape session bundle, 6.0 MB)
Full S1_20 SSN at e ≤ 1e-30. Node attributes: Leiden community, enzyme category, top annotation, global x/y. Edge attributes: bitscore, e-value.
To open it in Cytoscape Desktop (free), choose File → Open and select the .cys file.
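The SSN described above keeps DIAMOND hits at e ≤ 1e-30 and stores bitscore and e-value on each edge. A rough sketch of that filtering step is shown below. It is an illustration under assumptions: the input is taken to be DIAMOND's default 12-column tabular output (e-value in column 11, bitscore in column 12), and dropping self-hits and collapsing reciprocal hits to the best bitscore are choices made here, not something the page documents. The function name `ssn_edges` is hypothetical.

```python
import csv
import io

EVALUE_CUTOFF = 1e-30  # threshold stated on this page


def ssn_edges(tabular_text, cutoff=EVALUE_CUTOFF):
    """Turn DIAMOND tabular hits into undirected SSN edges.

    Returns a dict mapping (protein_a, protein_b) to its edge attributes.
    """
    edges = {}
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if len(row) < 12:
            continue  # skip malformed or truncated lines
        query, subject = row[0], row[1]
        evalue, bitscore = float(row[10]), float(row[11])
        if query == subject or evalue > cutoff:
            continue  # drop self-hits and hits weaker than the cutoff
        key = tuple(sorted((query, subject)))  # undirected edge
        best = edges.get(key)
        if best is None or bitscore > best["bitscore"]:
            edges[key] = {"bitscore": bitscore, "evalue": evalue}
    return edges
```

The resulting dictionary can be written out as an edge table and imported into Cytoscape, or loaded into a graph library for community detection.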

SCoNe co-occurrence community-level

Taxonomy sunburst GTDB drilldown

Host distribution CGCs per genome by group

S1_20 host-group violin

Co-occurring subfamilies: first 15 of 280 (full list available as .tsv)

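Subfamily · Class · CGCs · # genes · # SSN clusters
S1_20 · S1 · 798
S1_16 · S1 · 125
GT4 · GT · 111
GH29 · GH · 109
S1_8 · S1 · 79
GT2 · GT · 65
GH92 · GH · 64
GH20 · GH · 63
GH109 · GH · 58
S1_15 · S1 · 56
GH2_10 · GH · 55
GH3 · GH · 54
S1_7 · S1 · 53
GH33 · GH · 47
GH2_1 · GH · 41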

Downloads

Full SSN (Cytoscape): S1_20_SSN.cys · 6.0 MB
Newick: s1_20_phylogeny.max999.nwk
MUSCLE alignment: s1_20_phylogeny.aln.faa
Target proteins (FASTA): s1_20_proteins.faa
All proteins in associated CGCs: sequences.faa
CGC list: cgcs.txt
Co-occurring families: ranked_families.tsv
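After downloading, the files can be checked against the counts reported at the top of this page. The helpers below are a hedged sketch with hypothetical names. They assume one FASTA record per `>` line, one CGC identifier per non-blank line in cgcs.txt, and a Newick tree whose leaf labels contain no commas (so that the number of leaves equals the number of commas plus one).

```python
def count_fasta_records(lines):
    """Count FASTA records: one per header line starting with '>'."""
    return sum(1 for line in lines if line.startswith(">"))


def count_newick_leaves(newick):
    """Count leaves in a Newick string.

    An internal node with k children contributes k - 1 commas, so a tree
    with L leaves has L - 1 commas (assuming no commas inside labels).
    """
    return newick.count(",") + 1


def count_entries(lines):
    """Count non-blank lines, e.g. identifiers in a plain-text list."""
    return sum(1 for line in lines if line.strip())


def summarize(fasta_path, newick_path, cgc_path):
    """Collect the three counts for a set of downloaded files."""
    with open(fasta_path) as fasta, open(newick_path) as tree, open(cgc_path) as cgcs:
        return {
            "target_proteins": count_fasta_records(fasta),
            "tree_leaves": count_newick_leaves(tree.read()),
            "cgcs": count_entries(cgcs),
        }
```

For example, `summarize("s1_20_proteins.faa", "s1_20_phylogeny.max999.nwk", "cgcs.txt")` returns a dictionary whose values can be compared with the target protein, tree leaf, and CGC figures shown in the summary.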