Dear all,
I want to filtered the Trinity outfile for RNAseq assemble. I know this is not the best choice, and there are some scripts online. but they are all not good to use in my data, Could you give me some suggestion on this?
my data like this: (Start with > for name)
TRINITY_DN66086_c0_g1_i1 len=1158 path=[1258:0-49 1259:50-66 1268:67-201 1262:202-225 1267:226-656 1257:657-1157] [-1, 1258, 1259, 1268, 1262, 1267, 1257, -2] CTAGCCTACGGGGCTTTATTGGGTCACTTTTTCCAATCGAGTTTGCGGAAAGGCCTCTTT CGGCGGAGCCCTTACGCCTCGTGAAACGAGATATTCAGGGCTCCCCGAAAGGGCACGGCT
TRINITY_DN66086_c0_g1_i2 len=1207 path=[1253:0-49 1254:50-115 1268:116-250 1262:251-274 1267:275-705 1257:706-1206] [-1, 1253, 1254, 1268, 1262, 1267, 1257, -2] CTAGCCTACGGGGCTTTATTGGGTCACTTTTTCCAATCGAGTTTGCGGAAGAGGCTTTTC CACCATTTGGCCCAGTCTAGCCGGGGCCACATGGCGGAAAGGCCTCTTTCGGCGGAGCCC TTACGCCTCGTGAAACGAGATATTCAGGGCTCCCCGAAAGGGCACGGCTGCTCTTCCTTG G
How can I get the longest one for the same gene cluster?
thanks
Dear All
I am using "get_longest_isoform_seq_per_trinity_gene.pl" script
Please help to solve this error. I also have installed the perl module Bio::Index::Fasta but the error is still there.
Looks like your
$PERL5LIB
variable is not correctly set. Something like:export PERL5LIB=/path_to/trinity/:${PERL5LIB}
would be needed.Or use conda to install the trinity, as it takes full care of dependencies and paths.
https://anaconda.org/bioconda/trinity
I have trinity installed by conda and trinity is working correctly for assembly