CAZyme Gene Cluster Visualization#

Introduction#

Hint

The CGC visualization module is currently in beta testing. We welcome feedback and suggestions for improvement.

After identifying CAZyme Gene Clusters (CGCs), visualizing their genomic distribution can provide valuable insights into their organization and potential functional relationships. The cgc_circle_plot module generates circular genome plots that display currently:

  • Position of CGCs around the genome

  • Distribution of CAZyme

These visualizations help researchers identify patterns in CGC distribution and understand their genomic context.

Basic Usage#

The basic command to generate CGC circular plots is:

run_dbcan cgc_circle_plot --output_dir <OUTPUT_DIRECTORY>

Examples for Different Genome Types#

Prokaryotic genome analysis#
run_dbcan cgc_circle_plot --output_dir output_EscheriaColiK12MG1655_faa
Eukaryotic genome analysis#
run_dbcan cgc_circle_plot --output_dir output_Xylona_heveae_TC161_faa

Parameters#

Parameter

Description

--output_dir

Directory containing CGC data and for output files

Output Files#

The visualization module generates several output files:

  • cgc_circos/ The output of the CGC circle plot, including the SVG files. It will be generated in the output directory.For each contig/scaffold in the genome, a separate SVG file is created, and there’s a whole genome overview SVG file.

Tip

The SVG format allows for editing in vector graphics programs like Inkscape` or Adobe Illustrator, which is useful for preparing publication-quality figures.