hadoop | Abstract Content Factory

Handling Avro records in Scalding

Posted May 27, 2015 by Dan Osipov & filed under Big Data, Programming.

In this post I’ll try to cover how to write and read Avro records in Scalding pipelines. To begin, a reminder that Avro is a serialization format, and Scalding is a scala API on top of Hadoop. If you’re not using Scalding, this post is probably not too interesting for you. Let’s begin by defining… Read more »

DataPhilly: Apache Pig

Posted June 5, 2013 by Dan Osipov & filed under Programming.

I’m giving a presentation on Apache Pig at DataPhilly. For those who want to access the slides later, or those who were not able to make it in person, below is the presentation I’m using to keep my talk on track.

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Posts Tagged: hadoop

Handling Avro records in Scalding

DataPhilly: Apache Pig

Consulting

Recent Posts

Posts Tagged: hadoop

Consulting

Recent Posts

Tags