↵
S1_20 S1 S1 family — see CAZy reference
798
target proteins
770
CGCs
999 / 798
tree leaves
280
co-occurring families
Overview
| Class | S1 |
|---|---|
| Annotation source | dbCAN consensus (HMM + DIAMOND, "recommended-result" column) |
| SSN source | Subset of global_ssn.sqlite (DIAMOND e ≤ 1e-30) |
| Tree input | MUSCLE-super5 alignment of s1_20_proteins.faa (subsampled to 999 sequences, seed 42) |
| Tree method | FastTreeMP −lg −nosupport |
Phylogeny
Sequence Similarity Network
Subset of the global SSN restricted to proteins from S1_20 CGCs.
SSN
.cys
Download
.cys
To open in Cytoscape Desktop (free):
File → Open and pick the .cys.SCoNe co-occurrence
Taxonomy sunburst
Host distribution
Co-occurring subfamilies
| Subfamily | Class | CGCs | # genes | # SSN clusters |
|---|---|---|---|---|
| S1_20 | S1 | 798 | — | — |
| S1_16 | S1 | 125 | — | — |
| GT4 | GT | 111 | — | — |
| GH29 | GH | 109 | — | — |
| S1_8 | S1 | 79 | — | — |
| GT2 | GT | 65 | — | — |
| GH92 | GH | 64 | — | — |
| GH20 | GH | 63 | — | — |
| GH109 | GH | 58 | — | — |
| S1_15 | S1 | 56 | — | — |
| GH2_10 | GH | 55 | — | — |
| GH3 | GH | 54 | — | — |
| S1_7 | S1 | 53 | — | — |
| GH33 | GH | 47 | — | — |
| GH2_1 | GH | 41 | — | — |
Downloads
| Full SSN (Cytoscape) | S1_20_SSN.cys · 6.0 MB |
|---|---|
| Newick | s1_20_phylogeny.max999.nwk |
| MUSCLE alignment | s1_20_phylogeny.aln.faa |
| Target proteins (FASTA) | s1_20_proteins.faa |
| All proteins in associated CGCs | sequences.faa |
| CGC list | cgcs.txt |
| Co-occurring families | ranked_families.tsv |