Kostiantyn's blog

My tech blog. Java development and other

четвер, 24 липня 2014 р.

Hadoop 2.2 Distributed Cache and Map Join

›
It's very common to use Distributed Cache for Map joins - it gives a possibility to implement extremely fast join of huge dataset with a...
четвер, 3 липня 2014 р.

Runing Spark Unit Test on Windows 7

›
It's common situation in enterprises when developers are working on Windows platform. When you are working with Hadoop, it sounds as a f...
25 коментарів:
четвер, 24 квітня 2014 р.

Hue Notifier for Hadoop goes wild

›
Several months ago I developed Chrome browser plugin for my own needs. As a Hadoop engineer I faced with one problem everyday. I run a lot o...
1 коментар:
пʼятниця, 18 квітня 2014 р.

Building BuilData ETL with Hive and Oozie

›
Perhaps, Hive is the most successful component of today's Hadoop infrastructure. It provides simple and efficient way of creating Hadoop...
1 коментар:
вівторок, 1 квітня 2014 р.

Spark on HDP2

›
There is my first experience with Apache Spark, running it on Hadoop . I faced in several issues during running my piece of code. To be ho...
1 коментар:
четвер, 20 березня 2014 р.

XQuery on Hadoop

›
Java is mother language for the most of Hadoop engineers. In recent years, Python became popular, R is used by data scientist on Hadoop. Pig...
середа, 19 лютого 2014 р.

How to write good unit test for Hadoop MapReduce?

›
Without a doubt, there is avery common situation when UnitTest (or IntegrationTest) is required to test functionality of MapReduce job. This...
‹
›
Головна сторінка
Переглянути веб-версію
На платформі Blogger.