Programming Hive
From the early days of the Internet’s mainstream breakout, the major search engines and e-commerce companies wrestled with ever-growing quantities of data. More recently, social networking sites experienced the same problem. Today, many organizations realize that the data they gather is a valuable resource for understanding their customers, the performance of their business in the marketplace, and the effectiveness of their infrastructure. The Hadoop ecosystem emerged as a cost-effective way of working with such large data sets. It imposes a particular programming model, called MapReduce, for breaking up computation tasks into units that can be distributed around a cluster of commodity, server-class hardware, thereby providing cost-effective, horizontal scalability. Underneath this computation model is a distributed file system called the Hadoop Distributed Filesystem (HDFS). Because the filesystem is “pluggable,” several commercial and open source alternatives are now available.
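The MapReduce model mentioned in the blurb can be illustrated with a small, self-contained sketch. The toy word count below is plain Python, not Hadoop or Hive code; the names map_phase, shuffle, and reduce_phase are illustrative assumptions rather than part of any Hadoop API, and a real job would run the map and reduce tasks on many cluster nodes reading from HDFS.

    # Toy illustration of the MapReduce model: word count, run locally.
    from collections import defaultdict
    from itertools import chain

    def map_phase(document):
        # Map: emit a (word, 1) pair for every word in one input record.
        return [(word, 1) for word in document.split()]

    def shuffle(pairs):
        # Shuffle: group intermediate values by key (the framework does this).
        grouped = defaultdict(list)
        for key, value in pairs:
            grouped[key].append(value)
        return grouped

    def reduce_phase(key, values):
        # Reduce: sum the counts emitted for one word.
        return key, sum(values)

    if __name__ == "__main__":
        documents = ["hive makes hadoop easier",
                     "hadoop scales out on commodity hardware"]
        intermediate = chain.from_iterable(map_phase(d) for d in documents)
        for word, counts in sorted(shuffle(intermediate).items()):
            print(reduce_phase(word, counts))

Hive, the subject of this book, lets users express this kind of computation as SQL-like queries that are compiled into MapReduce jobs, rather than writing the map and reduce functions by hand.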
Edward Capriolo, Dean Wampler, and Jason Rutherglen
978-1-449-31933-5
Information Technology
English
2012
1-350