Backblaze logo

180TB of Good Vibrations – Storage Pod 3.0



We thought ten people would care; instead a million people read our Storage Pod 1.0 blog post where we open sourced the Backblaze Storage Pod design and introduced the world’s most cost-efficient way to store big data. The interest grew when we published our Petabytes on a Budget: Revealing More Secrets blog post that announced Storage Pod 2.0, which doubled the amount of storage and reduced the price. Since then several companies have built businesses selling Storage Pods inspired by Backblaze to hundreds of organizations around the world who are storing hundreds of petabytes of data on their own Storage Pods. Today we introduce Backblaze Storage Pod 3.0 which stores more data, costs less, is more reliable, and is easier to service.

What’s New?
For Storage Pod 3.0 we redesigned the chassis and upgraded many of the components. Most of the changes are aimed at improving the reliability and flexibility of the Storage Pod. The full parts list is in Appendix A at the end of this post.

Here are the highlights:

  • 180 terabytes of storage. With the availability of 4 TB hard drives a Storage Pod can now be configured to store 180 TB (45 x 4 TB). As a bonus the same chassis and components can be used with any capacity of 3.5” hard drives.
  • Anti-vibration drive bay assemblies. There are now 3 assemblies, one for each row of 15 drives. Each assembly is designed to lock down a row of drives in place. These assemblies replace the “drive bands” around each drive. This saves nearly an hour during Pod assembly and makes drive replacement easier as well.

    The key advantage of the drive bay assemblies is to reduce vibration. These assemblies not only keep the drives still, they also keep them firmly seated in the backplanes. Over the past several months we have tested different models of drives in the new drive bay assemblies and we have seen a dramatic improvement in overall system performance along with lower drive failure rates.

  • Upgraded motherboard. We now use the Supermicro MBD-X9SCL-F motherboard which replaces the previous model. The new motherboard adds a host of advanced processing features from Intel, as well as upgrades the PCIe slots to double throughput. Note that while our currently specified SATA cards do not take advantage of this increased throughput, it’s nice to know we can use it in the future.
  • More motherboard choices. We added standoffs to the chassis to provide better support for Micro ATX motherboards while still supporting the Standard ATX form factor. Specifically the new standoffs support the outer edge of the Micro ATX boards.
  • CPU. We upgraded the CPU to a 2nd generation Intel Core i3-2100 processor to replace the end of lifed (EOL) i3-540 model. This also gets us a little bump in clock speed (3.06 to 3.1 GHz), lower power usage (65 versus 73 watts), and more supported RAM (32GB up from 16GB).
  • Memory. We changed memory suppliers, so now the memory is certified by Supermicro.
  • Boot drive options. With Storage Pod 3.0, boot drives can now be 2.5” or 3.5” and we allow a second 2.5” drive to be attached for use in a redundant RAID1 boot volume. Boot-up drives can also be traditional HD or SSD drives. Backblaze switched to 2.5” boot drives because they are less expensive and are more reliable, but we didn’t want to eliminate support for 3.5” drives in case anyone needs a higher capacity 3.5” drive.
  • Backplanes. While we continue to use the same backplanes (Sil3726 chipset) as with Pod 2.0, there is another backplane based on the newer Silicon Image Sil3826 chipset that can be used. If you do use the Sil3826 based backplanes, you’ll notice that boot up can take a long time due to a large number of time-out/retry errors during the boot process. Eventually the boot up process will succeed. To fix this, you can use these instructions to update the backplane driver in your Linux kernel so you can use the Sil3826 based backplanes.
  • SATA cables. We replaced our SATA cable vendor with Nippon Labs. If you decide to use another vendor, look for cables that are SATA II or SATA III compliant and test them extensively. While any SATA II or SATA III cable should work we have found quality control problems with a number of vendors. In all cases it was the connectors, not the cables, which were defective.
  • Metal standoffs in the chassis. We replaced the plastic standoffs with metal components that can be manufactured as part of the chassis to reduce the cost and assembly time.
  • Improved airflow. The vent design was improved to increase airflow through the pod. We’ve never really had a problem with heat in the pods, and we’d like to keep it that way.
  • Chassis rivets. We replaced many of the screws with rivets. This simplifies the manufacturing process and saves times during the assembly process. Note, if you end up buying a case from Protocase (more on them later), they continue to use screws versus rivets. The two manufacturing processes produce the same basic chassis regardless.
  • Costs Less. For Storage Pod 2.0, the price for the components without drives was $1,984.00. For Pod 3.0 the price is $1,942.59, or $37.41 less – or about a 1.9% decrease in the cost (see Appendix A for a parts breakdown). The main component in the total cost will be the hard drives. The lingering effects from the Thailand drive crisis and consolidation in the drive industry have meant that even today hard drive prices are higher than they were when Pod 2.0 was introduced back in July of 2011 but here’s one way you can save a little on the price of hard drives.

Lessons Learned
Backblaze currently has over 450 Storage Pods deployed and manages nearly 50 petabytes of data. There are also many Storage Pods being used by organizations around the globe. Along the way we’ve learned a few things:

  • Firmware revisions matter. Watch out, manufacturers update hardware and upgrade firmware without changing their model number. These updates are intended to fix bugs but in the process new bugs can be introduced. Read the release notes carefully and downgrade the firmware to the version you’ve tested whenever possible. When updates are unavoidable, test them thoroughly before deploying them.
  • Let your vendors do the testing. Components like memory, PCIe cards and hard drives are modular and should all play well together. Unfortunately, the specs that allow for this interoperability (PCIe, SATA, DDR3, etc) are as complicated as are the components themselves. Because of this, use ‘certified’ components whenever possible. This will minimize problems and avoid finger pointing between vendors if problems do arise.
  • Don’t make random changes to our design. It might be tempting to try a build a pod out of the spare parts lying around your office but don’t do it. The components specified in the parts list in Appendix A are known to work well together. We believe in iteration and experimentation but don’t reinvent the wheel unless you have to.
  • There is more to power than just Watts. ATX power supplies deliver power at several voltages or ‘rails’ (12V, 5V, 3.3V, etc). Each vendor imposes unique limits on the amount of power you can draw off of each rail and unused power on one rail cannot be used on another. In particular, most high end power supplies are designed to deliver most of their power on the 12V rail because that is what high end gamer PCs use. Unfortunately, hard drives draw a lot of power off the 5V rail and can easily overwhelm a high wattage power supply. You will hit serious problems if power requirements for each component are not met so be careful if you don’t use the power supplies we recommend.
  • Keep it simple. For hard drives, SATA cards, SATA cables, backplanes, etc. you should use the same vendor, part number and/or model number as much as possible. By keeping things simple you reduce the number of variables that need to be considered if things go wrong. If you do mix components do it intentionally and for a good reason like comparing the performance of 3 different hard drives. And when you do this, make sure you’ve set things up in a way that allows you to draw clear conclusions. For example: with hard drives, if you want to compare the performance of 3 different models you should arrange them so that each RAID array is homogeneous. You should also take care to make sure each array is spread across all backplanes and SATA cards so that you don’t have IO hotspots which could taint your results.
  • Things change. Just when you get comfortable with a part, it will be discontinued or upgraded. We buy in quantity, we buy spares, and we have substitutes ready to go at any time. We also look for parts that have long-term support policies. We realize you may not be able to do these things, so be prepared when you upgrade one component that something may break.

You can download the 3D design files (24 MB zip) for the design drawings.

The Uses and Users of a Storage Pod
In general there are three types of storage:

  1. Transactional storage that provides real time or near real time access to data.
  2. Bulk storage which stores a large amount of data yet provides access to that data within seconds to minutes.
  3. Archive storage which keeps all of your data and often requires hours to days to access the data you need.

All are important. Our Backblaze Storage Pods were designed, built and implemented to economically hold large amounts of data yet provide access to that data in seconds to minutes, thus they are purpose-built for bulk storage. The economics of using a Backblaze Storage Pod for storage have driven many organizations and people to build and utilize Storage Pods inspired by Backblaze. This community has also built upon the Backblaze design and specifications to build Storage Pods for all types of storage needs. Here are a few examples:

Storage Pod Economics
When we announced Storage Pod 2.0 in July of 2011, the price dropped by 15% and we doubled the amount of storage versus Storage Pod 1.0. So with Storage Pod 3.0, this should be the section where we would say something like “as hard drive prices have dropped, the economics have continued to improve.” We would say that, except that the Thailand drive crisis which began in October of 2011 raised hard drive prices dramatically. That event, and perhaps the continuing consolidation of hard drive manufacturers, has meant that hard drives prices have yet to return to their July 2011 levels. For example, the hard drive we specified in the Storage Pod 2.0 blog post was listed at $120 (Hitachi 3TB 5400 HDSS5C3030ALA630). To purchase that same drive today you would pay $199 with very limited availability. To be fair there are other drives available that we use, but even the least expensive drive is $125 and that’s an external USB drive that must be ‘shucked’ before it can be used. With the increase in drive prices and decrease in availability, we have validated a variety of drives that work in Backblaze Storage Pods. Here is a list of the drives we have tested and currently use:

    4 TB drives

  • Hitachi – HDS5C4040ALE630 (Just starting to use these, but they look good.)
    3 TB drives

  • Hitachi- HDS5C3030ALA630 (low power)
  • Hitachi – HDS723030ALA640 (high power)
  • Western Digital – WDC WD30EFRX‑68AX9N0 (RED)
  • Seagate – ST3000DM001‑9YN166 (slightly higher failure rate)

Another thing we expected to happen was wide availability of 4 TB drives with the subsequent price drop. Only recently have we been able to get a reliable supply of 4 TB drives and prices are starting to drop. While we have built and deployed 180 TB Storage Pods (forty-five 4 TB drives) and we expect to move to exclusively building 180 TB Storage Pods in the near future, the cost per TB for 3 TB systems versus is 4 TB systems is nearly the same.

Storage Costs
The table below compares the total cost of different Backblaze Storage Pod configurations.

    Description Pod 2.0 Pod 3.0 Pod 3.0
    Drive size 3 TB 3 TB 4 TB
    Our cost per drive $120 $125 $195
    Number of drives 45 45 45
    Chassis cost $1,984.00 $1,942.59 $1,942.59
    Drive cost $5,400.00 $5,625.00 $8,775.00
    Total cost $7,384.00 $7,567.59 $10,717.59
    Total size in TB 135 135 180
    Cost per TB $54.70 $56.06 $59.54
    Cost per GB $0.0547 $0.0561 $0.0595

While it looks like the cost of a 4TB drive system is more expensive, when you factor in rack space, electricity, installation labor, etc. the long term cost for Backblaze leans towards using 4 TB drives. Our monthly cost for a full rack of Storage Pods with 3 TB drives is $0.63 per TB, while a full rack of Storage Pods with 4 TB drives is $0.47 per TB. When you factor all the costs together, it takes about 5 months for us to recover the extra cost encountered when building 4 TB based Storage Pods.

Building Your Own Storage Pod
Since Backblaze has given away the design, you can build your own Storage Pod. You’ll want to start with the parts list in Appendix A. From there feel free to change out some of the components, redesign the case, or completely change out the design. It’s up to you, but pay attention to the “lessons learned” noted earlier. To help you out, here’s a nice screen shot assembly walk-through from Protocase you can review and here’s a Storage Pod assembly overview (PDF-1.5MB) that you can download. You may also find some fellow Pod Builders by visiting the openstoragepod.org web site for information on their efforts to develop “affordable & energy-efficient high-capacity storage servers, built from commodity components”.

One thing to know is since we don’t sell the Storage Pods and don’t make money off any ancillary services, we don’t provide support or a warranty for people who build their own. To all of those builders who take up the challenge, we’d love to hear from you and welcome any insights you provide about the experience.

Buying a Storage Pod
If you are interested in buying a Storage Pod or having one built for you we recommend you talk to the folks at Protocase. They have built hundreds of Storage Pods. They can also help you with design changes for a reasonable fee. Also, their 45 Drives wiki site contains useful technical information about their Backblaze inspired Storage Pods.

As noted earlier Protocase continues to use screws versus rivets in the assembly process of their Storage Pods. From a functional point-of-view you will notice no difference in the finished product.

Another thing to know about Protocase is that they do not include hard drives with their Storage Pods. Protocase does test each Storage Pod they build with drives before they ship the unit to you, but it’s up to you to buy and install the hard drives. This is pretty easy to do, but be forewarned; sometimes buying 45 hard drives at Costco can be an issue.

Thanks for All the Fish
Thank you to everyone that has helped with ideas for how to improve the Storage Pods and to those who have helped spread the word on the reality of inexpensive big data storage. We welcome your continued feedback and support. Enjoy the design!

Appendix A – Price List:

The prices listed below are what we pay for the parts. To obtain these prices we do purchase then in quantity.

Item
Qty
Price
Total
4U Custom Case
Includes case, anti-vibration assemblies, power supply bracket, etc. (Available in quantities of 1 from Protocase for more money)
1
$382.35
$382.35
760 Watt Power Supply
Zippy PSM-5760V Power Supply
2
$269.00
$538.00
Dampener Kits
Vantec VDK-PSU Power Supply Vibration Dampener
2
$3.99
$7.98
Motherboard
Supermicro MBD-X9SCL-F (MicroATX)
1
$146.00
$146.00
Port Multiplier Backplanes
CFI-B53PM 5 Port Backplane (SiI3726 Chipset)
9
$40.00
$360.00
8GB DDR3 RAM
4GB Samsung BDDR3-1333 PC3-10600 (Certified by Supermicro)
2
$38.00
$76.00
CPU
Intel Core i3 processor i3-2100
1
$114.00
$114.00
Case Fan
Mechatronics G1238M(OR E)12B1-FSR 12V 3-Wire Fan
6
$12.62
$75.72
Port PCIe SATA II Card
Syba PCI Express SATA II 4-Port RAID Controller Card SY-PEX40008
3
$48.00
$144.00
SATA Cable
SATA cables RA-to-STR 3 ft locking from Nippon Labs
9
$1.50
$13.50
On/Off Switch
FrozenCPU ele-302 Bulgin Vandal Momentary LED Power Switch 12″ 2-pin
1
$24.95
$24.95
Boot Drive
WD SCORPIO BLUE WD1600BPVT 160GB 2.5″ Internal
1
$48.99
$48.99
Screw: 4-40 X 3/16 Phillips: Secure Boot drive
4
$0.02
$0.08

Standoff: Round 6-32 X 1/4 Dia X 5/16 Lng: Motherboard mount
1
$0.90
$0.90

Screw: 6-32 X 3/16 Phillips PAN SST: Motherboard standoff
1
$0.02
$0.02

Screw: 6-32 X 1/4 Phillips PAN ZPS: Motherboard standoff
9
$0.004
$0.04

Screw: 4-40 X 5/16 Phillips PAN ZPS: Backplanes
72
$0.02
$1.44

Crimp Terminal, 22-30 AWG Power (Tin): Fan Assembly
18
$0.02
$0.36

Fan Connection Housing, 1×3 with ramp and rib: Fan Assembly
6
$0.04
$0.24

Soft Fan Mount, AFM02B (1 flat end): Fan Assembly
12
$0.175
$2.10

Soft Fan Mount, AFM03B (2 tab ends): Fan Assembly
12
$0.175
$2.10

Foam Tape, 1″ x 50′ x 1/16 in black: Vibration lids
.5
$7.66
$3.83
 
TOTAL:  $1,942.59
 
 


Honoring Our Drive Farmers


Drive Farmer Drives

Back in November our Drive Farming Brigade was asked to farm hard drives for us over the Thanksgiving holiday. Hundreds of drives showed up and we dutifully sent them to our warehouse. They joined our drive stockpile and patiently waited for their chance to reach their hard drive destiny and be part of a Backblaze Storage Pod. Is there any greater honor for a hard drive than to be part of a Backblaze Storage Pod, backing up our customers’ data?  We think not!

Now it their turn for these farmed drives to be installed in Storage Pods. To honor our intrepid drive farmers, we decided to “tag” drives to their farmer. These photos, courtesy of our Operations Team, are from the first batch of farmed drives, 5 each from Olivier, Jeannine, David and the rest. Once installed, they will undergo rigorous testing to make sure they are ready for the task at hand – safely backing up photos, videos, documents and more for thousands of customers until such time as that data is needed.

So thank you drive farmers in your efforts to help Backblaze deliver truly unlimited online data backup. While everyone else is raising prices we continue to be just $5 a month and we’d like to keep it that way. Thanks again!

Drive Farmer Drives



Quartz Glass Backblaze Storage Pods?


Backblaze Storage Pod with Quartz Crystal

Hitachi announced this morning that they have invented “quartz glass storage”. With this innovation they are able to hold about 40MB of data per square inch on one piece of quartz glass. The Verge’s article doesn’t state the cost of the quartz glass, but the procedure to create such a piece of storage includes using a “high-precision laser to embed dots of binary code across a tiny piece of quartz glass”, and then “an optical microscope (paired with a computer capable of deciphering the imprint) can be used to recover the original data”, so it cannot be cheap!

To put that in to perspective, at 40MB per square inch, it would take 3,375,000 squares to house as much data as a single Backblaze Storage Pod. At 3,375,000 square inches, that many squares would cover almost ½ an acre or about 23,437.5 square feet of space (about 10x the size of Backblaze’s office).

One of the benefits of this is that the quartz glass is more or less fireproof, so it can withstand a lot of natural disaster scenarios and still be readable! The technology was just invented so we’re still a ways away from Backblaze Quartz Pods, but until that technology becomes readily available to the public, don’t forget to backup your current “old-school” non-quartz drives!



Backblaze datacenter grows 1000x.
Datacenter 2.0 needed; apply within.


Datacenter

40 petabytes of data takes up a lot of room. Those little ones and zeroes that make up your photos, videos, spreadsheets – your digital life – that we backup online for you are adding up. We started out nearly five years ago with 20 terabytes of data storage. Shortly after that, we signed the deal for our first quarter-cabinet in our current datacenter and we have steadily added more and more cabinets. Each full cabinet has 8-10 storage pods and when you do the math, we now have about 40 petabytes of storage. Growth is good.

So we are looking for another datacenter. We’ve created and issued an RFP (Request for Proposal) to get the process going. It’s a fairly short document, 8 pages, which outlines our requirements. And what do we need in a datacenter, here are a few things…

  • Space – Air conditioned, clean, raised floor for up to 225 cabinets – Hey, we provide unlimited data storage, so we need lots of space.
  • Network/Bandwidth – Multi-vendor 10Gb redundant symmetric connections – we don’t throttle backup speeds, so big pipes are mandatory.
  • Power – Each cabinet requires a 30 AMP 208Volt circuit, which powers 8-10 storage pods in each cabinet.
  • Physical security – Access controls, CCTV, good location, fences and barriers, manned patrols, etc.
  • Amenities – Space for Guido, parking, storage, loading dock and free back massages (just kidding, but it doesn’t hurt to ask).

Providers must provide documentation on their SSAE 16 or “SAS 70 type II” status. A LEED certified facility is desired. Of course we want a reputable provider with a proven history of providing stellar service.

Backblaze has grown from a commitment of just 1/4 cabinet to committing to up to 225 cabinets, a 1000x increase, and we are looking for a great partner to work with us.

We want to start using our new datacenter in December. It’s aggressive, but we believe it can be done. If you are interested in being our next datacenter provider you can download and view the RFP to learn how to submit your proposal, but act now as initial responses are due on Aug 20th.



Netflix, you flatter us.


Netflix Open Connect

Inspired by the Backblaze Storage Pod, Netflix builds their own Flixapod. Ok it’s not really called that. It’s called the “Open Connect Appliance Hardware”.

Netflix designed a 36-hard drive custom chassis filled with the most popular movies and deployed at every datacenter near their customers. Not only will this this reduce the load on Internet infrastructure and lower the cost for ISPs to provide the service, but it will also increase the speed and improve the user experience for end users. It’s a step that shows why Netflix is a leader in online video: it’s due not only to content, but also to continuous innovation.

When we open sourced our Storage Pod design in 2009, one of the most common reactions was, “Are you crazy?! Why would you give that away?” We are incredibly excited to see companies that need inexpensive storage not only building their own, but customizing it for their specific needs, and contributing their designs and learnings back to the community.

Derrick Harris penned a great analysis at GigaOm today, Why Netflix’s CDN should scare the storage industry. He’s right, of course. The open source software movement started from a desperate need by developers to have access to base components upon which they could build and innovate. That demand created a treasure trove of invaluable applications.

Open source hardware comes from the same goals and is starting to have the same results. Companies operating at “cloud scale” have a critical need for physical systems that can support the needs at efficient costs. With the Backblaze Storage Pods, Facebook Open Compute, and Netflix Open Connect, we may have reached a tipping point.

I love how Netflix puts their goal on their site:

We welcome commentary and improvements, which will be shared with the community with the goal of a faster, less expensive Internet for all.

Next time you stream some Top Gear on Netflix, it could be from one of these Flixapods.



Behind Backblaze – Scaling Big Storage


Many of you enjoyed the Part 1, Behind Backblaze video. Here’s Part 2, Behind Backblaze: Scaling Big Storage, which takes a look at our datacenter and the people behind it that make sure your data is safe and secure with Backblaze.

For those of you who think this kind of stuff is really cool, Backblaze has job openings in our Operations Group for a Director of Web Services, a Systems Administrator and a Datacenter Technician. All positions are in our San Mateo office.



1.6 Petabytes of storage in 1 day


Sometimes I am just amazed how much data we store for our customers – pictures, videos, documents, spreadsheets and more. It takes a lot of storage. Yesterday Kirk, Sean, new guy Dave and Guido, from our operations group, outdid themselves. They racked and powered up 12 new Storage Pods in one day; that is freakin’ awesome and it’s a new Backblaze record. Over the next couple of days the RAID arrays will be synched, the pods will be load tested and then they will be added to the hundreds of other working storage pods. As I write this 5 of these new pods have already come online and are busy backing up customer data.

How much data can you store on 12 storage pods? A little over 1.6 Petabytes. And how much is 1.6 Petabytes? A little over 800,000,000 photos (2 MB each). That’s a lot of memories safely tucked away.

So as you sit down at your computer and upload your photos, update a spreadsheet, or just edit your latest cat video, take a minute and thank Kirk, Sean, new guy Dave and the entire operations team for making sure there’s plenty of room for your data in our data center and don’t worry, there are more pods arriving next week…



The 1.35 petabyte Backblaze Salt Pod


Salt is great, but I prefer pepper on my drives
Researchers have discovered that a pinch of table salt can get more data onto spinning hard drive platters. A lot more: 6 TB in a single platter!

Considering the latest drives have up to 5 platters, that would be 30 TB in a single hard drive. Put 45 of those drives into a Backblaze Storage Pod and you would store 1.35 petabytes (!) in a single storage pod; more than 10 petabytes in a single rack.

Hello, Big Data. Meet your best friend, Salt.



Close