Entering edit mode
8.2 years ago
Mike
★
1.9k
Hi All,
I have 3'-end RNAseq data and going to analyse. Could you please recommand specific protocol/pipeline or any precausion/suggesion for analyse 3'-end RNAseq data.
Thanks a lot.
Thanks WouterDeCoster,
yes, after lots of serach I also agree that " STAR, featureCounts/htseq and DESeq2/edgeR/limma-voom should be fine", but still Im not sure about trimming poly-A tail.
I think you definitely have to trim the poly A tail, and for that I use prinseq.
Actually for trimming I used Cutadapt
But when I used this trmmed fastq in STAR , I got following error,
STAR is working fine for un-trimmed reads
there is nothing left of that read...
Thanks,
so I should define minimum length of reads and discard trimmed reads that are shorter than minimum length.
That would prevent this issue, probably.
Thanks, its works,
Sorry I have one more question regarding featureCounts output.
Acutally I want to compare HTseq and featureCounts read count results, but featureCounts output result is in different format, in HTseq there is only two column gene name and corresponding count, but in case of featureCounts output there are seven column file, so how to compare count with HTseq.
Thanks
Based on the manual, these are the columns: ‘Geneid’, ‘Chr’, ‘Start’, ‘End’, ‘Strand’, ‘Length’ and read counts for each gene in each library. Comparing that to htseq count should be a trivial analysis.
Thanks, why header of 7th colum is name of input bam file in featureCounts ??
Is it read count or something else??
That's most likely as a surrogate for sample identifier and I assume that indeed contains read counts.
Thanks , highly appreciated for you help.
Is there any difference in read count normalization and DEA using DESeq2/edgeR in case of 3'-end seq data?
I am using DESeq2/edgeR for DEA and my codes:
How to compare between two samples CR & TR ? (each sample having 3 technical replicate)
It would be great help if you please check my code.
DESeq2 code:
And EdgeR code:
Is this OK?
Thanks