Entering edit mode
6.1 years ago
tinyskinn
•
0
I am working in a database project where we aim to store genomic data on a long term basis. My interest is with the data modeling (ENA) and the definition of what should be the (meta)data for different file formats like vcf, bam and plink data.
Thanks and looking forward to your kind response. Tony
A lot of this is already standardized. You can take a look at this website for inspiration. For dbSNP similar information can be found here.
Wait, what? You want to re-create ENA? (I sincerely hope I'm interpreting this wrongly)
ah, you want to implement a new database (for a 'new' data type) based on the 'philosophy' of ENA, makes more sense.
Thank you for the links. I am familiar with the first one, but my problem actually is that I am a software developer without bioinformatics background. I was hoping I could get someone who could lend me a hand.
dont put them in a database. Put them somewhere on the server and store paths to the files in the database.