Dan Osipov | Abstract Content Factory

Snowed In

Posted January 11, 2016 by Dan Osipov & filed under Photography.

Spark on AWS EMR – The Missing Manual

Posted July 1, 2015 by Dan Osipov & filed under Big Data.

Apache Spark recently received top level support on Amazon Elastic MapReduce (EMR) cloud offering, joining applications such as Hadoop, Hive, Pig, HBase, Presto, and Impala. This is exciting for me, because most of my workloads run on EMR, and utilizing Spark required either standing up manual EC2 clusters, or using EMR bootstrap, which was very… Read more »

Handling Avro records in Scalding

Posted May 27, 2015 by Dan Osipov & filed under Big Data, Programming.

In this post I’ll try to cover how to write and read Avro records in Scalding pipelines. To begin, a reminder that Avro is a serialization format, and Scalding is a scala API on top of Hadoop. If you’re not using Scalding, this post is probably not too interesting for you. Let’s begin by defining… Read more »

Scala Days 2015 Recap

Posted March 19, 2015 by Dan Osipov & filed under Programming.

I was very fortunate to attend Scala Days 2015 in San Francisco this week. It was incredible to talk to the people behind the language, and to get a glimpse of a wide ecosystem. I wanted to post a short recap of some of my thoughts and feelings after the conference. Martin Odersky delivered the… Read more »

Camouflage

Posted March 9, 2015 by Dan Osipov & filed under Photography.

Caterpilar masking for tree bark

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Posts By: Dan Osipov