Run a quality control workflow on mzML files from mass spec based proteomics.
- A python 3 environment with (eg. anaconda)
- snakemake
- jinja2
- qproject (eg.
pip install git+https://github.com/qbicsoftware/qproject) - xtandem
- R
- openms 2.0 (there is a conda package that can be installed with
conda install -c aseyboldt openmsthat might or might not work)
##Usage Create the directory structure of the workdir (it should not exist):
qproject create -t path/to/workdir -w github:qbicsoftware/qcprot This will create the <workdir> and some directories inside and clone qcprot to <workdir>/src. Copy the mzML files to <workdir>/data and your fasta file(s) to <workdir>/ref.
Copy the ini files to etc:
cd <workdir>/src/inis <workdir>/etc and modify if necessary.
Execute the workflow with
cd <workdir>/src snakemake You can also adjust the jobscript and use qproject to execute the workflow:
qproject run -t <workdir> If qproject is used to execute qcprot some data about the run is stored in <workdir>/archive for reproducibility. This is still very much work in progress though' The output of snakemake will end up in <workdir>/logs/snake.err.