HUNTSVILLE — The enormous data demands of genomic research has demanded an affordable solution with radical improvements in manageability and scalability.
Quobyte, a deploy-anywhere software storage provider, announced Huntsville’s HudsonAlpha Institute for Biotechnology will use its technology to meet the exponentially growing data demands of its production IT system.
“If we had a new project come online and we needed another petabyte of storage, we could literally order a couple servers, put them online in a matter of minutes, and have that capacity available to the Quobyte file system,” said Richard Johnson, systems architect at HudsonAlpha. “If we were dealing with storage appliances, those abilities would be a lot more constrained.”
HudsonAlpha is dedicated to improving the human condition around the globe through discovery, education, and economic development in the genomic sciences.
With life sciences one of the world’s most data hungry industries, being able to deliver affordable storage performance and capacity to satiate that appetite is key to its success.
In genomics, data volumes are increasing at an incredible rate–Statista notes that from 2009-2014 the field created 1 petabyte of data, in the following five years that number exploded to 20 petabytes.
Traditional parallel file systems and data appliances are proving inadequate to the task of supporting the massive data requirements of these modern workloads.
“With my background in 3D protein structure alignment it has been a real pleasure working through the evaluation process with HudsonAlpha,” said Quobyte CEO Björn Kolbeck. “To keep pace with the possibilities of genomic research you have to remove the data clogs from the system — that’s why we founded Quobyte, to rethink data storage to enable it to handle these previously unimaginable volumes of data.”
A truly software-defined approach overcomes many of these limitations, the company said. For example, while individual drives can offer exceptional performance, they are throttled by NFS’ single-stream processing, and an appliance-based approach holds users hostage to the physical limitations of enclosures.
With Quobyte’s software-defined storage, HudsonAlpha said they were able to provide scale-out performance on any commodity, off-the-shelf platform.
HudsonAlpha said Quobyte is uniquely able to solve–the ability to provide direct API-accessed information. This provided a real-time view of data and storage usage helping ensure the correct level of provisioning and chargeback for each of the organizations it serves.
HudsonAlpha’s 4.5 petabyte production system now runs on a Quobyte clustered software-defined storage system. It is made up of 26 storage nodes each with 22 x 16TB hard disk drives, 2 x 8TB SSDs, and a 1.6TB NVMe drive, and connected to the network with dual 100GbE ports.