Tutorial:Analysis of Smart-Seq3 data with kallisto-bustools
0
3
Entering edit mode
22 months ago
firestar ★ 1.6k

Based on @dsull's answer.

The smart-seq3 analysis is handled by the kallisto-bustools pipeline.

Create a conda environment if needed. Create the conda environment file and save it as env_kb.yml.

# conda environment for kallisto bustools pipeline

name: kb

channels:
 - bioconda
 - conda-forge

dependencies:
 - python=3.8
 - kallisto=0.48.0
 - bustools=0.42.0
 - pip
 - pip:
    - kb-python==0.27.3

Create the conda environment and activate it.

mamba env create -f env_kb.yml
conda activate kb

Kallisto bustools tools should be now available.

Create index:

kb ref \
  -i index.idx \
  -g t2g.txt \
  -f1 transcriptome.fa \
  /path/genome.fa \
  /path/genes.gtf \
  --kallisto kallisto \
  --bustools bustools

Quantify:

kb count \
  -i index.idx \
  -g t2g.txt \
  -t 20 \
  -m 100G \
  -o out/ \
  -x SMARTSEQ3 \
  --gene-names \
  --h5ad \
  --kallisto kallisto \
  --bustools bustools \
  -w whitelisted_barcodes.txt \
  /path/K45_9001_S1_I1_001.fastq.gz \
  /path/K45_9001_S1_I2_001.fastq.gz \
  /path/K45_9001_S1_R1_001.fastq.gz \
  /path/K45_9001_S1_R2_001.fastq.gz

The four fastq files must be specified in that same order: I1, I2, R1 and R2.

single-cell kallisto transcriptomics smart-seq3 pipeline • 1.4k views
ADD COMMENT

Login before adding your answer.

Traffic: 2899 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6