Skip to content

hbaseinaction/twitbase

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HBase In Action

http://www.manning.com/dimidukkhurana

Usage

Code is managed by maven. Be sure to install maven2 on your platform.

To build a self-contained jar:

$ mvn assembly:assembly

TwitBase applications can be run using:

$ java -cp target/HBaseIA-1.0.0-SNAPSHOT-jar-with-dependencies.jar <app> [options...]

Utilities for interacting with TwitBase include:

  • HBaseIA.TwitBase.cli.InitTables :: create TwitBase tables
  • HBaseIA.TwitBase.cli.TwitsTool :: tool for managing Twits
  • HBaseIA.TwitBase.cli.UsersTool :: tool for managing Users
  • HBaseIA.TwitBase.cli.LoadUsers :: bulk-load random Users
  • HBaseIA.TwitBase.cli.LoadTwits :: bulk-load random Twits

The following MapReduce jobs can be launched the same way:

  • HBaseIA.TwitBase.mapreduce.TimeSpent :: run TimeSpent log processing MR job
  • HBaseIA.TwitBase.mapreduce.CountShakespeare :: run Shakespearean counter MR job
  • HBaseIA.TwitBase.mapreduce.HamletTagger :: run hamlet-tagging MR job

License

Copyright (C) 2012 Nick Dimiduk, Amandeep Khurana

Distributed under the Apache License, version 2.0, the same as HBase.

About

TwitBase is a running example used throughout HBase In Action

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published