I use this command to extract a few particular fields of all hard and soft clipped reads from a SAM alignment (Illumina, paired-end):
awk -Ft '$6 ~ /H|S/{print $2,$4,$9}' input.sam> output.txt
However, I would like to extract only reads that have been clipped on the 5-prime end. Any suggestions?
TP
Never used pysam before, but when I tried I got this error:
Missing libraries?
I tried to uninstall it by running
pip uninstall pysam
but got this:You might want to reinstall pysam.
Already tried:
But I've figured out another way around my problem using
awk
(see below). Eventually, I'll have to figure out this pysam problem since I guess I'll need it eventually.You'll need to delete
./Documents/Tools/HTSeq-0.9.1/.eggs/pysam-0.11.2.2-py2.7-macosx-10.6-intel.egg
.