Harish Mallipeddi RSS

Avid Pythonista with a secret love for Erlang.

harish.mallipeddi at gmail

 Photos

 LinkedIn

 Twitter

 Projects

Older posts

Nov
20th
Fri
permalink

Riak’s dets backend - too many files open error

I was playing with Riak yesterday and ran into this error on my Macbook running Snow Leopard:


{
 {badmatch,{error,{{badmatch,{error,
 {file_error,"./dets-store/1392993748081016843912887106182707253109560705024",emfile}}},
                    [   {riak_vnode,init,1},
                        {gen_server2,init_it,6},
                        {proc_lib,init_p_do_apply,3}    ]}}
 },
 [{riak_vnode_master,get_vnode,2},
  {riak_vnode_master,handle_cast,2},
  {gen_server,handle_msg,5},
  {proc_lib,init_p_do_apply,3}]
}

When using the dets storage backend, Riak seems to create/open a dets database file for each vnode (partition) in your ring. When there’s only one node in your cluster, I’m guessing all the vnodes/partitions are owned by this node which results in a whole bunch of files being opened (under the dets-store folder you configured). In Snow Leopard, I’d to do ulimit -n 8192 to increase the limit on the no. of fds a process could take. You probably won’t notice this normally - I increased the partition size in the config file and hence ran into this problem.

Comments
Nov
3rd
Tue
permalink Comments
Oct
30th
Fri
permalink Comments
Oct
22nd
Thu
permalink Comments
Oct
20th
Tue
permalink

Speaking at FOSS.MY 2009

I’ll be speaking about Hadoop at FOSS.MY 2009 in KL. They’ve an interesting bunch of speakers this year including Brain Aker, David Axmark, RMS. This will be my first time to FOSS.MY!

If you’re attending, do drop by my talk on Sunday afternoon :)

Comments
permalink

Log-analysis software

There were a couple of interesting articles on DBMS2 on log analysis software vendors, Splunk and SenSage. I’m amazed that both companies have around 150+ employees and are profitable. There’s also a good summary of who their main customers are and how they’re using their products.

Comments
Sep
15th
Tue
permalink Comments
Sep
14th
Mon
permalink Comments
permalink Comments
Aug
31st
Mon
permalink Comments
Aug
30th
Sun
permalink Comments
Aug
27th
Thu
permalink Comments
Aug
25th
Tue
permalink Comments
Aug
19th
Wed
permalink

LZO compression for Hadoop-0.20+

Starting from Hadoop-0.20 onwards, any code related to LZO compression has been removed from the Hadoop source tree. This is because the LZO code is licensed under GPL and hence incompatible with Hadoop’s Apache license. One more thing you should know is LZO compression is only supported via a native library (AFAIK there’s no pure Java implementation of it). LzoCodec and LzopCodec are almost the same (LzopCodec is compatible with the output from the lzop unix utility).

Here are the steps to get LzopCodec working with Hadoop-0.20 (see the gist embed below). I’m assuming you’ve already downloaded and installed the Hadoop-0.20 release tarball. We’ll be adding the compiled library to Hadoop-0.20’s lib/ folder. Repackage it into a tarball and push it to your cluster using whatever magic you use and you should have LZO compression working.

Comments
Aug
5th
Wed
permalink Comments