Return to Answer

added 169 characters in body

edited Aug 19, 2013 at 16:38

2.2k
5
28
33

I tried to use threading with python, there were issues with the Global Interpreter Lock. Ported code to use the multiprocessing module and it worked with 2x speedup, internally hadoop assigns as many mappers as there are cores in the cluster, hence multiprocessing is not the way to go if you need speed up. Multithreading if performed right might give some speedup

Source Link

answered Aug 15, 2013 at 0:01

viper

2.2k
5
28
33

I tried to use threading with python, there were issues with the Global Interpreter Lock. Ported code to use the multiprocessing module and it worked with 2x speedup.

Collectives™ on Stack Overflow

Return to Answer