I have data that sort of looks like this:
SAMPLE_ID,DATA_1,DATA_i,...,DATA_n
1, ...
And I want to be able to query it (using any sort of lambda) like so:
select from table where DATA_5834 >= .1
Or:
select from table where ((DATA_x > 0.2) and (DATA_y == "canada"))
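To make the kind of query concrete, here is a minimal sketch in plain Python. It assumes the data arrives as CSV text with a header row and fits in memory, which will not hold for the largest datasets. The helper names `load_rows` and `select`, and the sample values, are made up for illustration only.

```python
import csv
import io

def load_rows(text):
    """Parse CSV text into a list of dicts; column names are read from the header row."""
    reader = csv.DictReader(io.StringIO(text), skipinitialspace=True)
    return [dict(row) for row in reader]

def select(rows, predicate):
    """Return the rows for which the given lambda (predicate) is true."""
    return [row for row in rows if predicate(row)]

# Hypothetical sample data; the real files can have up to 10^5 DATA_* columns.
SAMPLE = """SAMPLE_ID,DATA_1,DATA_2,DATA_3
1,0.05,0.30,canada
2,0.25,0.10,canada
3,0.40,0.90,mexico
"""

rows = load_rows(SAMPLE)

# select from table where DATA_1 >= .1
high = select(rows, lambda r: float(r["DATA_1"]) >= 0.1)

# select from table where ((DATA_1 > 0.2) and (DATA_3 == "canada"))
both = select(rows, lambda r: float(r["DATA_1"]) > 0.2 and r["DATA_3"] == "canada")

print([r["SAMPLE_ID"] for r in high])
print([r["SAMPLE_ID"] for r in both])
```

Nothing here needs the columns to be declared up front, which is the property I am looking for in a database.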
I could use SQL; however I do not know ahead of time what the columns will be. I don't want to define them up front, as there can be anywhere between 1 and 10^5 columns.
What DB most appropriately fits this task?
MySQL is only useful for tables that have a defined number of columns. I think the best idea is to draw up a database schema in third normal form and decide from there.
"however I do not know ahead of time what the columns will be". Exactly, this is a typical scenario where a relational DBMS is not very suitable. This is rather a case for something that can store large matrices or multidimensional arrays of data, e.g. HDF5.
You cannot plan if you don't know what you're planning. So I think the best bet is to wait until he/she receives the dataset.
@harpalls I have the datasets. There are many. It'd be a waste of time to write a schema for each.