Hello, I have a file like this:
ENTRY EC 1.1.1.1 Enzyme
NAME alcohol dehydrogenase;
CLASS Oxidoreductases;
SYSNAME alcohol:NAD+ oxidoreductase
REACTION (1) a primary alcohol + NAD+ = an aldehyde + NADH + H+ [RN:R00623];
ALL_REAC R00623 > R00754 R02124 R02878 R04805 R04880 R05233 R05234 R06917 R06927 R08281 R08306 R08557 R08558 R10783;
SUBSTRATE primary alcohol [CPD:C00226];
PRODUCT aldehyde [CPD:C00071];
ENTRY EC 1.1.1.157 Enzyme
NAME 3-hydroxybutyryl-CoA dehydrogenase;
CLASS Oxidoreductases;
SYSNAME (S)-3-hydroxybutanoyl-CoA:NADP+ oxidoreductase
REACTION (S)-3-hydroxybutanoyl-CoA + NADP+ = 3-acetoacetyl-CoA + NADPH + H+ [RN:R01976]
ALL_REAC R01976;
SUBSTRATE (S)-3-hydroxybutanoyl-CoA [CPD:C01144];
PRODUCT 3-acetoacetyl-CoA [CPD:C00332];
I need to convert it to a table with the header
ENTRY NAME CLASS SYSNAME REACTION ALL_REAC SUBSTRATE PRODUCT
and the corresponding values in rows, one row per entry. Can anybody help me write a script for this purpose?
output (tab separated):
This command gives the correct output for the first entry only. Could you please adapt it to handle the entire file? I am not proficient in awk.
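If awk is not a requirement, here is a minimal Python sketch that handles every entry in the file, not just the first one. It is not the command discussed above, only one possible approach. It assumes that each field sits on a single line, that the first word of the line is the field name, and that a line starting with ENTRY opens a new record. The names `COLUMNS`, `parse_records` and `to_tsv` are made up for this example.

```python
# Column order taken from the header requested in the question.
COLUMNS = ["ENTRY", "NAME", "CLASS", "SYSNAME", "REACTION",
           "ALL_REAC", "SUBSTRATE", "PRODUCT"]


def parse_records(lines):
    """Yield one dict per record; an ENTRY line starts a new record."""
    record = None
    for line in lines:
        parts = line.strip().split(None, 1)
        if not parts:
            continue  # skip blank lines
        key = parts[0]
        value = parts[1] if len(parts) > 1 else ""
        if key == "ENTRY":
            if record is not None:
                yield record
            record = {}
        if record is None or key not in COLUMNS:
            continue  # assumption: anything else is ignored
        value = value.strip().rstrip(";")
        # If a field name repeats inside one record, join the values.
        record[key] = f"{record[key]}; {value}" if key in record else value
    if record is not None:
        yield record


def to_tsv(lines):
    """Return a tab-separated table: header row, then one row per record."""
    rows = ["\t".join(COLUMNS)]
    for rec in parse_records(lines):
        rows.append("\t".join(rec.get(col, "") for col in COLUMNS))
    return "\n".join(rows)


# Small demonstration using two shortened entries from the question.
SAMPLE = """ENTRY EC 1.1.1.1 Enzyme
NAME alcohol dehydrogenase;
CLASS Oxidoreductases;
ENTRY EC 1.1.1.157 Enzyme
NAME 3-hydroxybutyryl-CoA dehydrogenase;
CLASS Oxidoreductases;
"""
print(to_tsv(SAMPLE.splitlines()))
```

To run it on a real file, pass an open file object instead of the sample, for example `to_tsv(open("enzymes.txt"))`, where `enzymes.txt` is a placeholder name. Fields missing from an entry come out as empty cells, so every row keeps the same eight columns.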
input:
command:
output:
Miller can be installed from the Ubuntu (up to Xenial 16.04) and Mint (Sonya 18.2) repositories. However, you need the latest version of Miller, so compile it from the Miller GitHub repository.