Meeting: Nov 17th 2011

We will be meeting at the University Heights in room 209 @ 7:15pm.

  • Cluster Deployments, Pig and Python:
    • Chris Henry of Audience Science and the UW Math Dept will talk about automatically provisioning Hadoop clusters locally and in the cloud.
    • Chris Wilkes will present what is new in Pig 0.9.1, including Avro and Jython support.
    • Sean Jensen-Grey will cover Python support for MapReduce.

Meeting: April 21st 2011

We will be meeting at the University Heights in room 110 @ 7:15pm.

  • Avro Serialization Special:
    • Avro Schema definition and the design of quality long term data structures.
    • Examples of reading and writing Avro files in Java, Python and C.
    • Writing MapReduce jobs that operate over Avro files using the AvroMapper and AvroReducer.

Meeting: February 17th 2011

We will be meeting at the University Heights in the basement @ 7:15pm.

  • HBase with Google Ngram Data: Chris will talk about how he uses HBase with Google's Ngram Dataset.
  • Using Python with Hadoop: Overview of the different libraries available for writing MapReduce applications on Hadoop using Python.

Meeting: December 2nd 2010

We will be meeting at the University Heights in Room 110 @ 7:15pm.

  • Apache Mahout: Chris will give an introduction to Apache Mahout, a machine learning library that runs on Hadoop. He'll demonstrate how you can use K-Means clustering to group rows of data together and show how this can be done with minimal Java programming. The takeaway is that you can start using this library soon without much investment.
  • Streaming MapReduce: We will talk about HOP (Hadoop Online Prototype) and Yahoo S4.

Meeting: October 21st 2010

We will be meeting at the University Heights in Room 110 @ 7:15pm.

  • Cascading Intro: Chris Wilkes will give a talk on Cascading, a different way of using Hadoop without having to write MapReduce programs. With this you can think in terms of data sources and pipes instead of the mechanics of mapping and reducing.
  • Hive: Sean Jensen-Grey gives a hands-on introduction to Hive, a SQL-like query language for Hadoop. If you think in the relational algebra of SQL, you will feel right at home with Hive.

Meeting: August 19th 2010

We will be meeting at the University Heights in Room 209 @ 7:15pm.

PLEASE NOTE THE ROOM CHANGE, ONE FLOOR UP, ROOM 209.

  • Cascalog Demo: Cascalog is an expressive query language for Hadoop that uses Clojure and Cascading. It fits in the same space as Pig and Hive. Chris Wilkes will give a demo and walkthrough of using Cascalog from a non-Clojure programmer's perspective.
  • Disco DDFS Tagging: Disco is a MapReduce implementation in Python and Erlang. The DDFS (Disco Distributed File System) is based on sets of referential tags. Sean Jensen-Grey will go over some tagging strategies for managing file sets.

Meeting: July 15th 2010

We will be meeting at the University Heights in Room 211 @ 7:15pm.

PLEASE NOTE THE ROOM CHANGE, ONE FLOOR UP, ROOM 211.

  • Parallel Distributed Image Stacking and Mosaicing with Hadoop: Keith Wiley from the University of Washington draws on real-world experience to explain how Hadoop can be used for processing astronomy data.
  • Intro to Disco, MapReduce in Python and Erlang: Sean Jensen-Grey will give an introduction to Disco, a MapReduce implementation from Nokia Research.

Meeting: June 17th 2010

We will be meeting at the University Heights in Room 110 @ 7:15pm.

  • Hands-on Pig UDF: Chris Wilkes will walk through code samples on how to create and use your own User Defined Functions with Pig.
  • Compute-Bound MapReduce: Sean Jensen-Grey will show some strategies for running compute-bound tasks on Hadoop.

Announce List

We now have an announcement email list! To subscribe, visit seattlehadoop-announce on Google Groups or send mail to seattlehadoop-announce+subscribe@googlegroups.com. Please sign up to get late-breaking information about the meetings. The list is private, low-traffic, and spam-free.

Inaugural Meeting: May 20th 2010

We are happy to announce the first meeting of SeattleHadoop on May 20th at the University Heights Community Center at 7:15 pm in room 110.

Location

University Heights Community Center

5031 University Way NE
Seattle WA 98105
We are excited to be hosting the meeting at the University Heights Community Center, between 50th and 52nd streets in the U-District. There is plenty of onsite and off-street parking, as well as street parking.