4.4 years ago
anasjamshed
I just downloaded data from TCGA and GDC. I have two files: one is annotations.txt and the other is a .maf file. I want to extract mutations from these files with a Python script and analyze them in Python. Please help me.
MAF is already a mutation file. What do you mean by "extract mutations"?
Yes, but my file contains scattered data. How can I load this scattered data into Python and analyze it?
What does "scattered data" mean?
Also, do not add answers unless you're answering the top level post. If you're doing that to bump the post, that is bad etiquette and the post will be closed.
Scattered means raw data that is not organized the way it would be in Excel.
Do you have an MAF file or raw data? MAF files are processed files. What is the exact format of your data? Also, Excel is not a good tool in bioinformatics. Please give us the exact file name of this data file you're referring to.
I downloaded the file annotations.txt from TCGA, which contains data like this:
That's not the MAF file, it's the MANIFEST file with metadata.
So which one is the MAF file? I downloaded a tar archive that contains 3 files.
Look at the description of MAF files online (on the GDC/NCI website) and compare the three files you have, that should help you pick the right file.
There are 3 files in the tar archive: the first is the manifest, the second is annotations, and the third is the MAF file, which is built in MS Access and does not open on my PC.
No, it's not.
Do not use double-click to open. Use either the Windows Subsystem for Linux or a Linux computer to read these files. They are plain text files and can be read either using Linux commands (such as `head`, `tail`, `cat`) or Notepad++ (which is a GUI application and could crash if it attempts to open a HUGE file).
Please contact someone who knows Linux or bioinformatics to help you with this task; we cannot hand-hold you through it.
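For anyone who would rather stay in Python than use `head`, here is a minimal sketch that prints the first few lines of a file without loading the whole thing into memory. The helper name `peek` and the demo file are made up for illustration, and it assumes the file is plain text or gzip-compressed text, detected only by a `.gz` suffix.

```python
import gzip
import itertools
import os
import tempfile


def peek(path, n=5):
    """Return the first n lines of a plain-text or gzip-compressed text file."""
    # Assumption: a ".gz" suffix means gzip; anything else is treated as plain text.
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt", encoding="utf-8", errors="replace") as handle:
        # islice stops after n lines, so a huge file is never read in full.
        return [line.rstrip("\n") for line in itertools.islice(handle, n)]


# Demo on a throwaway file with made-up content, so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo.maf")
    with open(demo_path, "w", encoding="utf-8") as out:
        out.write(
            "#version gdc-1.0.0\n"
            "Hugo_Symbol\tVariant_Classification\n"
            "TP53\tMissense_Mutation\n"
        )
    first_lines = peek(demo_path, n=2)

print(first_lines)
```

Point `peek` at each of the three files from the archive in turn; the one whose first non-comment line is a row of tab-separated column names is most likely the MAF.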
Sir, I am also a bioinformatician, but when I try to open this MAF file it opens directly in MS Access and shows errors.
I apologize, I meant for you to check with someone that knows linux better than you do, not suggest that you're not one of us.
How are you opening it? Are you using linux commands or a point and click interface? If it's the latter, it's time to switch to linux commands.
Now I have successfully opened it in Notepad++. I also have a Linux OS alongside Windows on my PC.
That's good progress. Remember to use linux as much as you can - it will only help.
Hi, I have the same problem as you. Can you help me with how to read the .maf file in a Python notebook?
A MAF file is a tab-delimited file. You should be able to use `pandas` to read it into a data frame.
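As a rough sketch of that suggestion, something like the following should work in a notebook. It assumes the file starts with comment lines beginning with `#` (for example a version line) and uses the usual MAF column names such as `Hugo_Symbol`; check both against your own file. The helper name `read_maf` and the sample rows are made up for illustration, and `comment="#"` would also cut off any data field that happens to contain a `#`.

```python
import io

import pandas as pd


def read_maf(source):
    """Read a tab-delimited MAF into a DataFrame (source: path or file-like)."""
    return pd.read_csv(
        source,
        sep="\t",          # MAF is tab-delimited
        comment="#",       # assumption: header comment lines start with "#"
        low_memory=False,  # avoid mixed-dtype guesses on wide files
    )


# Made-up sample standing in for a real file; replace with your own path,
# e.g. read_maf("your_file.maf") (a .gz path is decompressed automatically).
sample = io.StringIO(
    "#version gdc-1.0.0\n"
    "Hugo_Symbol\tVariant_Classification\tTumor_Sample_Barcode\n"
    "TP53\tMissense_Mutation\tSAMPLE-A\n"
    "KRAS\tMissense_Mutation\tSAMPLE-A\n"
    "TP53\tNonsense_Mutation\tSAMPLE-B\n"
)

maf = read_maf(sample)

# Count how many mutation records each gene has.
gene_counts = maf["Hugo_Symbol"].value_counts()

print(maf.shape)
print(gene_counts)
```

From there the usual pandas tools apply, for example `maf.groupby("Tumor_Sample_Barcode").size()` for mutations per sample.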