Aggregation

Aggregation splits data into subsets, computes summary statistics on each subset, and reports the results in a conveniently summarized form. The aggregate function is one of the most capable functions in the scidb package. The package overloads R’s standard...

Databases Unlock Big Data

Databases Unlock the Power of Medical Big Data. Heard recently at a SciDB tech-talk: “Data in databases are used 100x more than data in files.” Ok. Not too controversial. “If we take ‘use’ (the number of papers published) to be a proxy for...

Innovation at the Intersection of Big Data and Life Sciences

There’s a critical need for a new generation of bioinformatics and clinical informatics software to scale up to handle the vast increase in data. Even so, the challenges ahead in managing, sharing, accessing, and analyzing data are exacerbated not just by the...

Why an ACID DBMS Beats “File Systems” Any Day

Do Not Go Gentle into that “Good Enough”. Bill Kantor. At one of our company lunches, my colleague Paul Brown commented (somewhat in a rage) about how foolish engineering choices sometimes can come back to bite you (or your users) hard. In addition to being a noted...