Hello. I have two databases (.fas) that I have to unify in only one. Some sequences are present in both DB, I want to keep one copy and delete the other. Instead of removing manually one by one, is there any way to select all duplicates and remove them? Thank you very much.
'.fas' = fasta file ? (= just a file, not a database)
Yes, I have two files
so this question has been asked a gazillon times on biostars: How To Remove The Same Sequences In The Fasta Files? , Remove Redundant Sequences Fasta , How To Remove The Identical Sequences In The Fasta Files? , Useful Bash Commands To Handle Fasta Files , Remove Redundant Sequences Fasta , ....