Kostiantyn's blog
My tech blog. Java development and other
пʼятниця, 30 січня 2015 р.
Demystify BloomFilter on Hadoop
›
I believe most of you have seen BloomFilter class. But how to correctly use it? Accordint to Wikipedia, " A Bloom filter is a space...
пʼятниця, 23 січня 2015 р.
Composite join with MapReduce
›
As everyone knows, map-side join is the most effective techniques to join datasets on Hadoop. However, at the same time it gives a possibili...
Kafka web console with Docker
›
My first Docker file aims to run Kafka Web Console (application for monitoring Apache Kafka): FROM ubuntu:trusty RUN apt-get update; ...
вівторок, 11 листопада 2014 р.
Spark and Location Sensitive Hashing, part 2
›
This is a second part of topic about Locality Sensitive Hashing, and here is example of creating working example using Apache Spark. Let...
3 коментарі:
пʼятниця, 7 листопада 2014 р.
Spark and Location Sensitive Hashing, part 1
›
Location Sensitive Hashing is the name of special algorithm designed to address complexity of BigData processing. Let's consider t...
пʼятниця, 10 жовтня 2014 р.
Tuning the MapReduce job
›
java.lang.OutOfMemoryError: GC overhead limit exceeded that's what I got yesterday while running my new shining MapReduce job. Ou...
четвер, 9 жовтня 2014 р.
Unit test for Hive query
›
Sometimes the soul wants something really extraordinaly... for example, to write a unit test for Hive query :) Let's how it is possibl...
‹
›
Головна сторінка
Переглянути веб-версію