Hi! I have some fasta files with a couple of sequences each. They are like this:
>WP_012937914.1 NADH-dependent butanol dehydrogenase [Acidaminococcus fermentans]
MLGNFTYCNPTKLYFGKDALQNLPAELAKYGKKVVLVYGSGSIKKNGIYDAVTKILAEAGKDVAEIAGVMSNPTIHKLRE
GIAIARKHGADFILAVGGGSVIDYSKALSVSVNCDEDPWEKYFVRFEEPTCETIPVGTVLTMVGTGSEMNAGAVITYPEK
KLKMGKVFADEKIMPRFSILNPVYTMTLPKYQMVAGIYDIFNHICEQYFSGTDDNTSDYLAEGLMRNVVEASRTAVKNPQ
DYEARSNLMWDATWALNTLIAKGKSTDWMVHMLGQAAGGVTNATHGMTLAAVSLPYYRFILKDGLPKFVRFAKVVWDVRP
EGKTDEEIAKEGLEKMEDWMQQLGLVLHLKDLGATEDMLDDLVNGTLILTGGYRVLTKEEIREIFRRAM
How can I extract,from this files, only the sequences that have in that first line "NADH" ? I need that first line as well as the sequence into another fasta file.
I just tried this out, but although the python module installs just fine, I'm getting "command not found" when I try and run the script. Maybe it's getting copied to somewhere that isn't in my path (or not getting copied anywhere)?
OK, when I try it without the --user flag, it works, so I reckon it's that.
Yes, I should have mentioned that the
--user
flag places everything in$HOME/.local/bin
, and so that's where thefaidx
script would go, instead of/usr/local/bin
. Glad you've figured it out!