Press "Enter" to skip to content

Posts tagged as “Hadoop”

Upcoming Mini-tutorial at BigDataCamp: How to Build a Hadoop Cluster from Scratch in 20 Minutes by CTO of Infochimps

Flip  Kromer (@mrflip), CTO of Infochimps will give a an overview and tutorial on using the latest version of Ironfan (which until today was called cluster_chef)  at the BigDataCamp unconference put on by  Dave Nielsen just before O’Reilly’s Strata Conference Feb 27 from 5:30pm to 10pm We’ve been using cluster_chef at Runa as the basis of our chef management for our entire production environment for the…

HBase/Hadoop on Mac OS X (Pseudo-Distributed)

I wanted to do some experimenting with various tools for doing Hadoop and HBase activities and didn’t want to have to bother making it work with our Cluster in the Cloud. I just wanted a simple experimental environment on my Macbook Pro running Snow Leopard Mac OS X. So I thought it was time to revisit installing Hadoop and HBase…

Experience installing Hbase 0.20.0 Cluster on Ubuntu 9.04 and EC2

NOTE (Sep 7 2009): Updated info on need to use Amazon Private DNS Names and clarified the need for the masters, slaves and regionservers files. Also updated to use HBase 0.20.0 Release Candidate 3 Introduction As someone who has “skipped” Java and wants to learn as little as possible about it, and as one who has not had much experience…

Want to work at a Startup with Cool Tech? (HBase, Clojure, Chef, Swarms, Javascript, Ruby & Rails)

Opportunity Knocks Runa.com, the startup where I am CTO, is looking for great developers to join our small agile team. We’re an early stage, pre-series-A startup (presently funded with strategic investments from two large corporations). Runa offers a SaaS to on-line merchant that allows them to offer dynamic product and consumer specific promotions embeded in their website. This will be…

Hadoop, HDFS and Hbase on Ubuntu & Macintosh Leopard

UPDATE: This has been replaced by a newer post Experience installing Hbase 0.20.0 Cluster on Ubuntu 9.04 and EC2 . I found that using the pre-built distributions of Hadoop and HBase much better than trying to build from source. I need more Java/Ant-fu to do the build from scratch. The HBase-0.20.0 Release Candidates are really great and seemingly easier to…

The Commoditization of Massive Data Analysis

Today’s article in O’Reilly’s Radar by Joseph Hellerstein, is a concise synopsis of the state-of-the-art large scale data analysis. It compares the Enterprise IT dominant Relational Database paradigm to the emerging (with a bullet!) MapReduce / Hadoop technologies. Professor Hellerstein, from UC Berkeley lives this stuff as a leading researcher on databases and distributed systems.  He is also an advisor…