Entering edit mode
9.9 years ago
rus2dil
▴
20
I am new to bioinformatics and recently I started a research project to compare promoters of a particular gene (not decided yet). I have several raw reads of WGS deposited in ENA site. How should I use these raw reads to extract a particular region? For example, sequence of the rice LEAFY gene.
Thank you very much Ashutosh Pandey. I think using galaxy platform I can do this? Right?
Yes, upload FASTQ, align to ref genome. Then use SAM manipulation tools and a BED file to extract the region.
Thank you RamRS. Should this BED file only contain the sequence of interest? If so, how to create it from FASTQ or FATSA file. Because I have the FASTA file of the gene of interest downloaded from the reference genome.
BED is for co-ordinates of your gene of interest. Use UCSC genome browser and beware the index differences in tools (Check out this post for more details)