Sam Ritchie archive about projects race results

About Me

I'm Sam Ritchie. I'm currently (as of August, 2014) living in Boulder and working on PaddleGuru. I've authored a number of prominent open source projects in Scala and Clojure. I'm currently a TechStars Mentor in the Boulder program!

A Short History

I left the US Sprint Kayaking Team on 2009, moved to New York and worked for a year at the TED Conference, working the phones of the Partnerships team by day and picking up what I could of Objective-C at night. I fell in love with coding that year and built a number of iOS games for ThinkFun, a brainteaser puzzle company started by my parents about a year before I was born. Rush Hour is the most well-known and lucrative of these, though Solitaire Chess or Chocolate Fix seem to have more loyal tribes.

I moved out of New York and south to Philadelphia in the summer of 2010 to code full-time and be a bit closer to the Schuykill river and a few old friends from my kayaking days. One of these old friends, Dan Hammer, knew about my coding forays and asked me help out with a large coding project called FORMA, or FORest Monitoring for Action. Something to do with satellites and NASA data, he said. Dan and his partner Robin Kraft were trying to develop a system that could classify various "pixels" of forest in the world's tropical areas according to their risk of deforestation. They had written a prototype in Python, running a MapReduce-ish algorithm across a bunch of EC2 servers, but the systems issues had begun to bog the project down.

I came on and, to Dan's horror, decided that the only way to save the project from endless systems nightmares was to rewrite everything in Clojure. Which we did. None of us knew Clojure at the time, but from the reading I'd done, Clojure seemed like the surest bet for developing low-maintenance MapReduce applications like the one we needed for FORMA.

I picked up Cascalog to write the MapReduce portions of the FORMA workflow – timeseries generation, feature extraction, data warehousing and the like. My experience with Clojure and Cascalog turned me into an open source fanatic. During that period, some of the highlights I produced were a Hadoop cluster deploy tool, a number of submodules for Cascalog and what may be the most pleasurable Hadoop testing suite out there. Somehow along the way, I became Cascalog's maintainer. (FORMA is now open source as well!)

In the summer of 2011, my work on Cascalog earned me an interview at BackType, one of the sexier startups around (and one of the only west coast startups I'd really followed from afar, out in the Philadelphia tech desert). BackType released and maintained Cascalog and had a large open source presence in the Clojure community. At the time, unbeknownst to me, BackType was being acquisition-courted by Twitter. I knew them only as a group of guys knew how to kick ass using the language I loved.

BackType offered me a job out west, which I deferred for three months to train for the Texas Water Safari, a 262 mile canoe race that deserves its own post. I won the race, but lost the standing offer when BackType was acquired by Twitter. (Glory has its price.) Ten interviews later, I received an offer from Twitter to go work with the former BackType team, which I quickly accepted, not wanting to make the same mistake again.

I left work as a senior software engineern in Twitter's Analytics and Infrastructure group in November of 2013. I had the pleasure of working with some amazing folks and open-sourcing a tremendous amount of code. Much of that code is related to the Summingbird project, an open-source platform for writing streaming aggregations that can run in batch mode on Hadoop, realtime mode on Storm, or in a combined batch/realtime mode in both execution environments. See the Projects page for more information about the specific projects I'm working on right now.

I'm currently working on a startup called PaddleGuru, living in Boulder, Colorado and spending as much time as I can in the mountains.

Fork me on GitHub