I want to store hierarchical, two-dimensional scientific data sets in a relational database (MySQL or SQLite). Each data set contains a table of numerical data with an arbitrary number of columns. In addition, each data set may have one or more children of the same type associated with a given row of the table. Each dataset typically has 1 to 100 columns and 1 to 1,000,000 rows. The database should be able to process many data sets (> 1000), and the read / write data should be fast enough.
What is the best database schema for storing such data? Is it wise to have a “main” table with the names, identifiers, and relationships of individual datasets and, in addition, one table per dataset that contains numerical values?
source
share