Hi, I was doing RNAseq for CD31 selected human endothelial cells. Unfortunately, My cells didn't express CD31. I don't know why. The RNAseq method is SMART-Seq Stranded Kit (Takara).
I followed the general pipeline, Quality Control-Cutadapt-HISAT2-Stringtie. For the Stringtie, I used the reference file (hg19) to guide assembly.
For the gene abundance estimate of the Stringtie, the CD31(PECAM1) expression(FPKM) is 0.
Has anyone ever met this problem? What's the possible issue? Thanks.
Why did you use Stringtie after HISAT2? For the purpose meassuring gene expression it might be sufficient to apply the tool featureCounts on the BAM files from your alignment with HISAT2? This would give you the gene count matrix per sample.
Hi, thank you for your response. I just followed some papers that they had used Stringtie after HISAT2. I will try the FeatureCounts.
Yeah, that "tuxedo" pipeline hisat2-stringtie-ballgown was published in a high impact journal by a well-known group but was actually (at that time) a pipeline for transcript-level rather than gene-level analysis. It is (imho) overly complicated and mostly unnecessary for gene-level analysis. These days the lightweight quantifiers such as salmon and kallisto seem to be preferred by many users, which quantify reads directly against a transcriptome, and then aggregate transcript abundance estimation to the gene level, e.g. via tximport from Bioconductor. No need for stringtie assembly when working with well-annotated organisms unless you are specifically interested in novel transcripts.
I used the featurecounts, however, the geneid of the PECAM1(CD31) is 5175, right? I did search on NCBI. However, the aligned reads for this one is 0.
Is this single-cell RNA-seq? If yes, what platform (e.g. 10X Chromium)?
Thank you for your response.
Not sc RNAseq, but by followed SMART-Seq Stranded Kit (Takara).
Just to double-check, was alignment also done with hg19 as reference? As suggested below try to simply plug the bam file into
featureCounts
making sure that the genome version of the GTF is the same as your reference genome and then see whether CD31 comes out with some counts. I assume CD31 was confirmed (or even used as selection marker) via FACS?Yes, I used hg19, a gtf file. I am trying the featureCounts, I never used it before. I am not sure what method they used to confirm the CD31, I only know that they selected the endothelial cells based on CD31 marker.