Nextflow Basics
Running the Workflows
Since TarGene uses Nextflow, all workflows can be run in the same way from the command line:
nextflow run https://github.com/TARGENE/targene-pipeline/ -r TARGENE_VERSION -entry WORKFLOW_NAME -profile P -resume
where:
TARGENE_VERSION
is the latest TarGene version, e.g.v0.11.1
WORKFLOW_NAME
is any of the TarGene workflowsP
is an optional Nextflow profile describing the computing platform (see Platform Configuration).
Additional Nextflow command line arguments can be found in the official documentation, for example important options are:
-resume
: Tells Nextflow to try to resume the pipeline if an error occurred during the execution (if you forgot to specify a parameter for instance)-with-trace
and-with-report
will generate additional report files.
Workflows Configurations
There are mainly two parts to configuring a workflow run. The first part describes the computing environment, it tells Nextflow how it should execute the various processes of the workflow. The second part provides the actual inputs to the TarGene workflows. When running the nextflow run
command, Nextflow will look for a nextflow.config
configuration file in your current directory. If you are new to Nextflow, you can use this file to setup both the platform and project configurations. As your project grows you may want to split them in distinct files. This is so that the platform configuration can be easily reused across many TarGene runs.
Platform Configuration
It is likely that you will run TarGene on a HPC platform, in particular the Executors and Singularity configurations are required. Since Nextflow is so widespread, it is probable that such a configuration file is already available from your HPC administrators. Since this configuration only describes de computing platform and not your project, it is often described as a Profile.
If you are using the University of Edinburgh Eddie cluster, you can simply use TarGene with the -profile eddie
option.
Project Configuration
These are the configuration details associated with your project, this is usually done in a nextflow.config
file living at the root of your project's directory. The configuration parameters are specific to each workflow and described in the following sections.