Only two fastqs for 10X scATAC-seq data deposited on GEO database.
1
0
Entering edit mode
4 months ago
1048686151 • 0

I am trying to replicate the upstream process of ATAC sequencing in an article (SRR26674024), but the uploaded raw data only contains two fastq files like SRR26674024_1.fastq.gz and SRR26674024_2.fastq.gz. I'm just wondering how to let the two files adapted to cellranger pipline. The first sequencing fragment of the two files shown as follows: @SRR26674024.1 A00869:659:H2M35DRX2:1:2101:29170:1000/3 NTACAAACTAAATGTGCCCTTGATGGAAAAGGAACCTTATGGAATGATGA + ?????????????????????????????????????????????????? and @SRR26674024.1 A00869:659:H2M35DRX2:1:2101:29170:1000/4 GCTGAGGCGGGAGAATCAGTTGAACCCAGGGGGCAGAGGTTGCAGTGAGC + ??????????????????????????????????????????????????

scATAC-seq cellranger single-cell 10X • 440 views
ADD COMMENT
2
Entering edit mode
4 months ago
GenoMax 148k

Unfortunately 10x data on SRA is bit of a hit and miss thing.

It is possible to get the four files using fastq-dump by doing

$ fastq-dump -F --split-files SRR26674024

This should give you

$ more SRR26674024*fastq
::::::::::::::
SRR26674024_1.fastq
::::::::::::::
@A00869:659:H2M35DRX2:1:2101:29170:1000
AAACTCAT
+A00869:659:H2M35DRX2:1:2101:29170:1000
FFFFFFFF
::::::::::::::
SRR26674024_2.fastq
::::::::::::::
@A00869:659:H2M35DRX2:1:2101:29170:1000
GGGGTGCCCGCGGTAA
+A00869:659:H2M35DRX2:1:2101:29170:1000
,FFFFFF,FFFFF:FF
::::::::::::::
SRR26674024_3.fastq
::::::::::::::
@A00869:659:H2M35DRX2:1:2101:29170:1000
NTACAAACTAAATGTGCCCTTGATGGAAAAGGAACCTTATGGAATGATGA
+A00869:659:H2M35DRX2:1:2101:29170:1000
#FFF:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFF,FFFFFF:F
::::::::::::::
SRR26674024_4.fastq
::::::::::::::
@A00869:659:H2M35DRX2:1:2101:29170:1000
GCTGAGGCGGGAGAATCAGTTGAACCCAGGGGGCAGAGGTTGCAGTGAGC
+A00869:659:H2M35DRX2:1:2101:29170:1000
FFFFFFFFFFF,FFFFFFFFFFF:FFFFFFFFFF,FFFFFFFFFFFFFFF

These files are available for download via cloud SRA (e.g. https://trace.ncbi.nlm.nih.gov/Traces/?view=run_browser&acc=SRR26674024&display=data-access under original data) but unfortunately that process would not be free since you will need access to a AWS/GCP cloud bucket and a paid account. NCBI does not charge for the data directly.

ADD COMMENT
0
Entering edit mode

That works, thanks a lot. Notice that another optional argument is --gzip compression.

ADD REPLY

Login before adding your answer.

Traffic: 1639 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6