azymnis / KMeansJob.scala
Created October 23, 2014 23:07
K-Means in scalding
import com.twitter.algebird.{Aggregator, Semigroup}
import com.twitter.scalding._
import scala.util.Random
/**
* This job is a tutorial of sorts for Scalding's Execution[T] abstraction.
* It is a simple implementation of Lloyd's algorithm for k-means on 2D data.
*
* http://en.wikipedia.org/wiki/K-means_clustering