I was tasked with making a program that uploads a .csv to a NoSQL cluster. We were doing 2-5 gigThe files, and it was taking 4 are larger (typically 2-8 hours. Well when we hit 17 gig files that was just to slow17GB).
When I remade my My program I realized workingworks in batch made is much much faster. I could getmode and can process a 17 gig17GB file done in 6 hours. So then
I decided to make a consumer-producer multithreading structure. This caused it to be just as slow as beforesignificantly slower. Although my program is fast and working great, I want to know why the producer-consumer construct was slower than a batch produce, batch consume method.
As compairedcompared to
The bottom half are infinite loops, but I just left it like that for a speed test.