Kostiantyn's blog

My tech blog: Java development and more

Friday, January 30, 2015

Demystify BloomFilter on Hadoop

I believe most of you have seen the BloomFilter class. But how do you use it correctly? According to Wikipedia, "A Bloom filter is a space...
Friday, January 23, 2015

Composite join with MapReduce

As everyone knows, a map-side join is one of the most effective techniques for joining datasets on Hadoop. However, at the same time it gives a possibili...

Kafka web console with Docker

My first Dockerfile aims to run Kafka Web Console (an application for monitoring Apache Kafka): FROM ubuntu:trusty RUN apt-get update; ...
Tuesday, November 11, 2014

Spark and Location Sensitive Hashing, part 2

This is the second part of the topic on Locality Sensitive Hashing, and here is a working example built with Apache Spark. Let...
3 comments
Friday, November 7, 2014

Spark and Location Sensitive Hashing, part 1

Locality Sensitive Hashing is the name of a special algorithm designed to address the complexity of Big Data processing. Let's consider t...
Friday, October 10, 2014

Tuning the MapReduce job

java.lang.OutOfMemoryError: GC overhead limit exceeded. That's what I got yesterday while running my shiny new MapReduce job. Ou...
Thursday, October 9, 2014

Unit test for Hive query

Sometimes the soul wants something really extraordinary... for example, to write a unit test for a Hive query :) Let's see how it is possibl...