↵
S1_2 S1 S1 family — see CAZy reference
3,066
target proteins
3,031
CGCs
999 / 3,066
tree leaves
385
co-occurring families
Overview
| Class | S1 |
|---|---|
| Annotation source | dbCAN consensus (HMM + DIAMOND, "recommended-result" column) |
| SSN source | Subset of global_ssn.sqlite (DIAMOND e ≤ 1e-30) |
| Tree input | MUSCLE-super5 alignment of s1_2_proteins.faa (subsampled to 999 sequences, seed 42) |
| Tree method | FastTreeMP −lg −nosupport |
Phylogeny
Sequence Similarity Network
Subset of the global SSN restricted to proteins from S1_2 CGCs.
SSN
.cys
Download
.cys
To open in Cytoscape Desktop (free):
File → Open and pick the .cys.SCoNe co-occurrence
Taxonomy sunburst
Host distribution
Co-occurring subfamilies
| Subfamily | Class | CGCs | # genes | # SSN clusters |
|---|---|---|---|---|
| S1_2 | S1 | 3,066 | — | — |
| GT4 | GT | 445 | — | — |
| GT2 | GT | 396 | — | — |
| GT30 | GT | 189 | — | — |
| AA3_2 | AA | 133 | — | — |
| GH3 | GH | 111 | — | — |
| GH23 | GH | 100 | — | — |
| GT51 | GT | 80 | — | — |
| GH25 | GH | 78 | — | — |
| GH103 | GH | 78 | — | — |
| S1_6 | S1 | 77 | — | — |
| GH15 | GH | 67 | — | — |
| GH77 | GH | 66 | — | — |
| PL6 | PL | 62 | — | — |
| PL17_2 | PL | 61 | — | — |
Downloads
| Full SSN (Cytoscape) | S1_2_SSN.cys · 22.1 MB |
|---|---|
| Newick | s1_2_phylogeny.max999.nwk |
| MUSCLE alignment | s1_2_phylogeny.aln.faa |
| Target proteins (FASTA) | s1_2_proteins.faa |
| All proteins in associated CGCs | sequences.faa |
| CGC list | cgcs.txt |
| Co-occurring families | ranked_families.tsv |