Linux powers low-cost petabyte-level storage
Jun. 22, 2005

Capricorn Technologies says it has completed delivery of more than a petabyte of storage to the Internet Archive, a non-profit organization based in San Francisco that creates periodic snapshots of the Internet. Capricorn's PetaBox products are based on Via mini-ITX boards running Debian or Fedora Linux, and deliver the lowest cost-per-GB and cost-of-ownership available, the company claims.

Capricorn PetaBox racks

Capricorn started as a project within the Internet Archive (IA) to develop inexpensive storage devices based on Linux and commodity PC components. The project was spun out in June 2004 to form Capricorn Technologies. The company has since supplied its PetaBox products to a number of universities, research centers, libraries, and national archives, both within the US and overseas, according to CEO C.R. Saikley. The IA nonetheless remains Capricorn's largest customer, Saikley says.

The IA's PetaBox installation

The IA is an online digital library with very large collections of audio, video, texts, web sites, and software. For example, it claims to host footage of more than 20,000 live concerts, and snapshots of the Internet dating back to 1996, accessible through the well-known Wayback Machine, which currently hosts over 40 billion web pages.

The IA's PetaBox installation comprises about 16 racks housing 600 systems with 2,500 spinning drives, for a total capacity of roughly 1.5 petabytes. Despite its large size, the IA's PetaBox installation draws only about 50kW of power, Saikley says, and is maintained by one full- and one half-time person who spend a disproportionate amount of time working on older systems. "We've improved reliability considerably," Saikley claims.
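The figures quoted for the installation imply some rough per-unit averages. The following is a back-of-the-envelope sketch only: it assumes decimal units (1 PB = 1,000 TB) and treats the rounded numbers above as if they were exact, so the results are approximations rather than measured values.

```python
# Rounded figures quoted for the Internet Archive installation.
RACKS = 16
SYSTEMS = 600
DRIVES = 2_500
CAPACITY_TB = 1_500   # "roughly 1.5 petabytes", assuming 1 PB = 1,000 TB
POWER_W = 50_000      # "about 50kW"


def installation_averages():
    """Return derived averages; all are approximations from rounded inputs."""
    return {
        "tb_per_drive": CAPACITY_TB / DRIVES,
        "drives_per_system": DRIVES / SYSTEMS,
        "systems_per_rack": SYSTEMS / RACKS,
        "watts_per_tb": POWER_W / CAPACITY_TB,
        "watts_per_system": POWER_W / SYSTEMS,
    }


if __name__ == "__main__":
    for name, value in installation_averages().items():
        print(f"{name}: {value:.2f}")
```

The averages work out to about four drives and roughly 83 watts per system, which is in line with the four-drive 1U design and the 80-watt typical draw that Capricorn quotes for its 1.6-terabyte unit.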

The IA systems boot Debian or Fedora Linux from a central PXE boot server, and are remotely monitored using Nagios. "The beauty of Nagios is that it is so readily extensible," says Saikley. "If the register exists on the board, Nagios can figure out how to read it. We typically provide hard disk temperatures, CPU temperatures, ping response, capacity utilization, that sort of thing."
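Nagios is extended through small plugin programs that print one line of status text and report their result through the exit code (0 for OK, 1 for WARNING, 2 for CRITICAL, 3 for UNKNOWN). The article does not describe the IA's actual checks, so the following is only a hypothetical sketch of what a capacity-utilization plugin could look like. The function names, the default path, and the 80/90 percent thresholds are all assumptions made for illustration.

```python
#!/usr/bin/env python3
"""Hypothetical Nagios-style check for disk capacity utilization."""
import shutil
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def classify(used_pct, warn=80.0, crit=90.0):
    """Map a utilization percentage to a Nagios status code."""
    if used_pct >= crit:
        return CRITICAL
    if used_pct >= warn:
        return WARNING
    return OK


def check_capacity(path="/", warn=80.0, crit=90.0):
    """Return (status, message) for the filesystem containing `path`."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        return UNKNOWN, f"DISK UNKNOWN - cannot stat {path}: {exc}"
    if usage.total == 0:
        return UNKNOWN, f"DISK UNKNOWN - {path} reports zero capacity"
    used_pct = 100.0 * usage.used / usage.total
    status = classify(used_pct, warn, crit)
    label = ("OK", "WARNING", "CRITICAL")[status]
    # Text after '|' is performance data in the form label=value;warn;crit
    perfdata = f"used_pct={used_pct:.1f}%;{warn};{crit}"
    return status, f"DISK {label} - {path} {used_pct:.1f}% used | {perfdata}"


if __name__ == "__main__":
    target = sys.argv[1] if len(sys.argv) > 1 else "/"
    code, message = check_capacity(target)
    print(message)
    sys.exit(code)
```

Keeping the threshold logic in `classify` separate from the filesystem query makes the same pattern reusable for other readings mentioned above, such as drive or CPU temperatures, provided some other function supplies the measured value.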

The PetaBox can also be managed by Linux cluster management software, according to Saikley.

The PetaBox

Capricorn claims that its PetaBox storage devices provide the lowest ownership cost and cost-per-GB available. The company offers 40- and 64-terabyte models, each consisting of a rack of 40 1U systems. The 1U systems are available in 1- and 1.6-terabyte models that are essentially the same but for hard-drive capacity. Both systems run Debian or Fedora Linux on Via mini-ITX motherboards.


The PetaBox is based on Via mini-ITX motherboards

Each 1U system includes a Via M-10000 mini-ITX board with a 1GHz Via C3 processor and 512MB of RAM, expandable to 1GB. Each includes four Hitachi ATA hard drives with 8MB caches and a claimed 8.5ms of typical latency.

Saikley says Capricorn did extensive testing to qualify hard drives for capacity, reliability, and cost, finally choosing Hitachi. "Although Hitachi does not offer an 'enterprise' or '24x7' SATA drive, our testing found their drives to be as reliable as anything out there, enterprise distinction or not," Saikley said.

The 1U PetaBox units (shown stacked in a rack) place all I/O on the front panel, reducing the need to access the back panel while maximizing its cooling capacity. Drives are housed in EZ-Latch bays and can be easily changed once the 1U unit has been removed from the rack and its cover taken off. "We experimented with hot-swap, but found it caused as many problems as it solved. It actually induced failures, so we backed away. But you still have to make it easy to replace disks," Saikley said.

Similarly, Saikley says Capricorn tried and then backed away from RAID (redundant array of inexpensive disks), opting instead to recommend JBOD (just a bunch of disks) configurations to most of its clients. "We had a painful experience with RAID 5, which does not scale well to petabyte-level storage," Saikley notes.

PetaBox options include a 16 x 2 character LCD and gigabit Ethernet (10/100 is standard). By default, the PetaBox tries to boot from a USB key, then from a PXE boot server, and finally from the local hard drive. However, the boot order can easily be changed in the BIOS.

Each 1.6-terabyte 1U system typically draws 80 watts of power, or about 50 watts per terabyte, according to Capricorn. Each measures 17.25 x 18 x 1.72 inches (43.8 x 45.7 x 4.4 cm), and weighs 18 lbs, 12 oz (8.5 kg).
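The per-terabyte figure follows directly from the per-system numbers, and the same arithmetic gives a rough estimate for a full rack. This is a sketch only: the rack estimate assumes all 40 systems draw the typical 80 watts and ignores switches and any other rack equipment, which the article does not quantify. The function names are illustrative.

```python
def watts_per_terabyte(system_watts, system_tb):
    """Power draw per terabyte for a single 1U system."""
    return system_watts / system_tb


def rack_power_kw(system_watts, systems_per_rack=40):
    """Estimated rack draw in kW, counting only the 1U systems."""
    return system_watts * systems_per_rack / 1000


if __name__ == "__main__":
    print(f"{watts_per_terabyte(80, 1.6):.0f} W/TB")
    print(f"{rack_power_kw(80):.1f} kW per 40-system rack")
```

On these assumptions a 64-terabyte rack would draw on the order of 3.2 kW.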

According to Saikley, Capricorn is currently positioning itself for increased production levels, following recent improvements to its manufacturing process. "We have been constantly improving the efficiency and effectiveness of our manufacturing processes. By positioning ourselves for increased production levels, we are better able to pursue our relentless commitment to driving the cost of storage down."

Availability

The PetaBox is available now, priced at approximately $2/GB, in 40- and 64-terabyte capacities. Further details are on the company's website.
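As a rough illustration of what "approximately $2/GB" implies per rack, the sketch below multiplies out the quoted figures. It assumes decimal units (1 TB = 1,000 GB) and a flat $2/GB rate. Actual pricing is not given in the article, so the results are order-of-magnitude estimates, not quotes.

```python
GB_PER_TB = 1_000         # assumption: decimal units
PRICE_PER_GB_USD = 2.0    # "approximately $2/GB"


def rack_capacity_tb(systems, tb_per_system):
    """Raw capacity of a rack of identical 1U systems."""
    return systems * tb_per_system


def approx_rack_price_usd(capacity_tb):
    """Estimated rack price at a flat per-GB rate."""
    return capacity_tb * GB_PER_TB * PRICE_PER_GB_USD


if __name__ == "__main__":
    for tb_per_system in (1.0, 1.6):
        capacity = rack_capacity_tb(40, tb_per_system)
        price = approx_rack_price_usd(capacity)
        print(f"{capacity:.0f} TB rack: about ${price:,.0f}")
```

At that rate, the 40- and 64-terabyte racks would come to roughly $80,000 and $128,000 respectively.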



Use of this site is governed by our Terms of Service and Privacy Policy. Except where otherwise specified, the contents of this site are copyright © 1999-2008 Ziff Davis Enterprise Holdings Inc. All Rights Reserved. Reproduction in whole or in part in any form or medium without express written permission of Ziff Davis Enterprise is prohibited. Linux is a registered trademark of Linus Torvalds. All other marks are the property of their respective owners.