Category Archives: Big Data

Facebook – 2 petabytes in a rack

Each disk in the cold storage gear can hold 4 terabytes of data, and each 2U system contains two levels of 15 disks. In other words, each unit can handle 120 terabytes. A rack can hold 16 of these storage systems, allowing for roughly 2 petabytes of cold storage per rack.
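The arithmetic above is easy to sanity-check. A quick sketch in Python, using only the figures quoted from the article:

```python
# All figures come from the article: 4 TB disks, 2 levels of 15 disks
# per 2U system, 16 systems per rack.
TB_PER_DISK = 4
DISKS_PER_LEVEL = 15
LEVELS_PER_UNIT = 2
UNITS_PER_RACK = 16

tb_per_unit = TB_PER_DISK * DISKS_PER_LEVEL * LEVELS_PER_UNIT  # 120 TB per 2U system
tb_per_rack = tb_per_unit * UNITS_PER_RACK                     # 1920 TB, i.e. ~2 PB

print(tb_per_unit, tb_per_rack)  # → 120 1920
```

So "2 petabytes" is a round-up of 1920 TB, which is close enough for a headline.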

Source: http://www.datacenterknowledge.com/archives/2013/10/16/first-look-facebooks-oregon-cold-storage-facility/

You can read more about how to build these servers at

Hacking Conventional Computing Infrastructure » Open Compute 

www.opencompute.org/

By releasing Open Compute Project technologies as open hardware, our goal is to develop servers and data centers following the model traditionally associated …

Hadoop, MapReduce videos

Some nice videos that I’ve found on the famous YouTube…
just posting them here so I can watch them later this week 🙂

Sandy Ryza, of Cloudera, gives you a quick run-down of the basics of MapReduce: a programming abstraction that allows parallel processing of massive data sets without worrying about distributed systems or fault tolerance.

He goes over how it works, some of the applications it’s best suited for, and how it integrates with Hadoop and Java.
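To make the abstraction concrete before watching the videos, here is a minimal sketch of the map → shuffle → reduce flow as plain Python, using the classic word-count example (this simulates in one process what Hadoop does across a cluster; the function names are mine, not Hadoop's):

```python
from collections import defaultdict
from itertools import chain

# Map phase: for each input line, emit (word, 1) pairs.
def map_phase(line):
    return [(word, 1) for word in line.lower().split()]

# Shuffle phase: group all emitted values by key, as the framework
# does between the map and reduce phases.
def shuffle(pairs):
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

# Reduce phase: combine the values for one key (here, sum the counts).
def reduce_phase(key, values):
    return key, sum(values)

lines = ["the quick brown fox", "the lazy dog"]
mapped = chain.from_iterable(map_phase(line) for line in lines)
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())

print(counts["the"])  # → 2
```

In real Hadoop the same three roles are filled by a Mapper class, the framework's shuffle/sort, and a Reducer class, with the data spread over many machines.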

Hadoop, Pivotal HD and HAWQ, some stolen paragraphs

HAWQ is a native, mature and fast SQL Query Engine for Hadoop.

HAWQ lets you apply existing SQL skills to data in Hadoop, with benefits including:

  • Parallel Query Optimizer
  • Dynamic Pipelining
  • Pivotal Extension Frameworks
  • Advanced Analytics Functions

Read the full article at
http://www.gopivotal.com/pivotal-products/data/pivotal-hd#4

More stolen paragraphs (paragraphs, images and diagrams)!!

Continue reading Hadoop, Pivotal HD and HAWQ, some stolen paragraphs