DeepVariant

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data. DeepVariant relies on Nucleus, a library of Python and C++ code for reading and writing data in common genomics file formats (like SAM and VCF) designed for painless integration with the TensorFlow machine learning framework.

CPU 版本的 Singularity DeepVariant

CPU版安装

申请计算节点,然后制作 singularity 镜像

$ srun -p cpu -N 1 --exclusive --pty /bin/bash
$ singularity build deepvariant.simg docker://google/deepvariant

用SLURM脚本提交CPU版DeepVariant作业

使用 CPU 版本的 singluarity 镜像的 slurm.sh 如下:

#!/bin/bash

#SBATCH -J DeepVariant
#SBATCH -p small
#SBATCH -n 1
#SBATCH --ntasks-per-node=1
#SBATCH -o %j.out
#SBATCH -e %j.err

ulimit -s unlimited
ulimit -l unlimited

IMAGE_PATH=/安装路径/deepvariant.simg

singularity run $IMAGE_PATH /opt/deepvariant/bin/make_examples

并使用如下指令提交:

$ sbatch slurm.sh

交互式提交CPU版DeepVariant作业

srun -p cpu -N 1 --exclusive --pty /bin/bash
export IMAGE_PATH=/安装路径/deepvariant.simg
singularity run $IMAGE_PATH /opt/deepvariant/bin/make_examples

GPU 版本的 Singularity DeepVariant

GPU版安装

申请计算节点,然后制作 singularity 镜像

$ srun -p cpu -N 1 --exclusive --pty /bin/bash
$ singularity build deepvariant.gpu.simg docker://google/deepvariant:0.10.0-gpu

用SLURM脚本提交GPU版作业

使用GPU版本的 singluarity 镜像的 slurm.sh 如下:

#!/bin/bash

#SBATCH -J DeepVariant
#SBATCH -p dgx2
#SBATCH -N 1
#SBATCH --ntasks-per-node=1
#SBATCH --cpus-per-task=6
#SBATCH --gres=gpu:1
#SBATCH --mem=MaxMemPerNode
#SBATCH -o %j.out
#SBATCH -e %j.err

ulimit -s unlimited
ulimit -l unlimited

IMAGE_PATH=/安装路径/deepvariant.gpu.simg

singularity run $IMAGE_PATH /opt/deepvariant/bin/make_examples

并使用如下指令提交:

$ sbatch slurm.sh

交互式提交GPU版deepvarant作业

srun --ntasks-per-node=1 -p dgx2 --gres=gpu:1 -N 1 --pty /bin/bash
export IMAGE_PATH=/安装路径/deepvariant.gpu.simg
singularity run $IMAGE_PATH /opt/deepvariant/bin/make_examples

参考资料


最后更新: 2024 年 11 月 22 日