  • This sounds like a good solution. I will try to implement it. Commented Aug 5, 2015 at 9:34
  • If the tars are few and very large, the small number of mappers will become a performance bottleneck. Commented Aug 5, 2015 at 9:40
  • The question states "I have a large number of compressed tar files" Commented Aug 5, 2015 at 9:42
  • That's what I'm wondering about: what does "large" mean in the OP's understanding? Commented Aug 5, 2015 at 9:43
  • A better idea would probably be to use one job to decompress the tars and a second job to process the unpacked files. Commented Aug 5, 2015 at 9:45
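
For what it's worth, the two-stage idea from the last comment can be sketched locally in plain Python. This is only an illustration under assumptions, not a Hadoop job: `unpack_archives` and `process_files` are hypothetical names, and the "processing" is just a line count standing in for the real per-file work. The point is the separation: once stage 1 has unpacked everything, stage 2 can be parallelized over individual files rather than over whole archives.

```python
import shutil
import tarfile
from pathlib import Path


def unpack_archives(archive_paths, out_dir):
    """Stage 1 (hypothetical helper): extract each tar into its own subdirectory."""
    out_dir = Path(out_dir)
    extracted = []
    for archive in map(Path, archive_paths):
        # "logs.tar.gz" -> subdirectory "logs"
        target = (out_dir / archive.name.split(".")[0]).resolve()
        target.mkdir(parents=True, exist_ok=True)
        # Mode "r:*" lets tarfile detect gzip/bz2/xz compression on its own.
        with tarfile.open(archive, "r:*") as tar:
            for member in tar.getmembers():
                if not member.isfile():
                    continue
                dest = (target / member.name).resolve()
                # Skip members whose path would land outside the target directory.
                if target not in dest.parents:
                    continue
                dest.parent.mkdir(parents=True, exist_ok=True)
                with tar.extractfile(member) as src, open(dest, "wb") as dst:
                    shutil.copyfileobj(src, dst)
                extracted.append(dest)
    return extracted


def process_files(paths):
    """Stage 2 (placeholder work): count the lines in every unpacked file."""
    counts = {}
    for path in paths:
        with open(path, "rb") as handle:
            counts[str(path)] = sum(1 for _ in handle)
    return counts
```

Usage would look like `process_files(unpack_archives(["logs.tar.gz"], "unpacked"))`. Note that archives sharing the same base name would unpack into the same subdirectory in this sketch.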