Entering edit mode
6.5 years ago
onkar
▴
10
I have a fasta/bed file in which there are some sequences which overlap with each other.
for eg:
1 57267 59067
1 63165 63758
1 63298 64137
1 67285 67596
here you can notice "1 63298 64137" is overlapping with "1 63165 63758". Now I want to obtain one file where minimum start position and maximun end position of these are reported instead of all these regions expected output:
1 57267 59067
1 63165 64137
1 67285 67596
this is just an example from that file. there are some locations where 4-5 sequence regions overlaps.
Kindly help
Thank you Nicolas for this suggestion. It worked
I found one more solution which was quite helpful.
Hope this will help others too.
As you tagged the question with the R tag I guessed it should be in R. But using bedools is also good