Annotates sequences with Eggnog-mapper and hhblits against PDB70

See config/ for configuration information.

This workflow takes as input a set of protein sequences. It clusters them and then
functionally annotates the clusters’ representatives using Eggnog DB, and picks those without KO annotations
to
continue the process. These “hypothetical proteins” get aligned by hhblits against Uniclust30 and then against PDB70.

Testing

Decompress selected_seqs_by_size.tar.gz and use that path in the config file (already set).

To see the commands being executed (-p) without an actual execution of the workflow, use -n.
-r prints the “reason” for execution of each rule.

snakemake  --cores 16 -r -p -n

--cores N specify the max. number of cores used by the whole workflow, so if a

To finish reading, please visit source site