Linked Questions
15 questions linked to/from Solving embarassingly parallel problems using Python multiprocessing
28 votes
14 answers
32k views
Search for string allowing for one mismatch in any location of the string
I am working with DNA sequences of length 25 (see examples below). I have a list of 230,000 and need to look for each sequence in the entire genome (toxoplasma gondii parasite). I am not sure how ...
23 votes
6 answers
34k views
Parallel processing of a large .csv file in Python
I'm processing large CSV files (on the order of several GBs with 10M lines) using a Python script. The files have different row lengths, and cannot be loaded fully into memory for analysis. Each ...
7 votes
3 answers
6k views
"Can't pickle <type '_csv.reader'>" error when using multiprocessing on Windows
I'm writing a multiprocessing program to handle a large .CSV file in parallel, using Windows. I found this excellent example for a similar problem. When running it under Windows, I receive an error ...
2 votes
1 answer
4k views
Python multiprocessing EOF error on csv files
I am trying to implement a multiprocessing approach for reading and comparing two csv files. To get me started, I started with the code example from embarassingly parallel problems, which sums ...
4 votes
3 answers
2k views
spring boot API - document processing and executing python script on documents in parallel
Scenario: In my application, there are 3 processes which are copying documents on a shared drive in their respective folders. As soon as any document is copied on shared drive (by any process), ...
0 votes
2 answers
842 views
Loop through collision detection with multiple processes
I'm using python to write an ideal gas simulator, and right now the collision detection is the most intensive part of the program. At the moment though, I'm only using one of my 8 cores. (I'm using ...
0 votes
1 answer
1k views
Multiprocessing with progress
I've never worked with multiprocessing before so bear with me if I'm asking a basic question. This answer provided a very nice processing class that I adapted to my needs and it works very well. I'm ...
1 vote
1 answer
1k views
python itertools product slow is the write speed to a output file a bottleneck
I have a simple python function performing itertools product function. As seen below. def cart(n, seq): import itertools b = 8 while b < n: n = n - 1 for p in itertools....
1 vote
1 answer
1k views
Multiple instances of subprocess.Popen
Hi I have just started programming in python and I am trying to use subprocess.Popen to run multiple instances of a program that i compile using "make". But before i do a "make", I have to do some ...
0 votes
1 answer
907 views
Q: How to write a function output in a .CSV file with multi-threading / multiprocessing ? (Using a String array as input)
I am coding a little web scraper where I would like to implement multiprocessing / multi-threading. I have written my function webScraper() which receives a String with a website URL as input, ...
1 vote
1 answer
247 views
Is there a cost to calling python multiprocessing .join() method
My question is inspired by a comment on the solving embarassingly parallel problem with multiprocessing post. I am asking about the general case where python multiprocessing is used to (1) read data ...
0 votes
1 answer
427 views
python asynchronous data recorder
I have a simulation running inside a class "Simulation", and a class "DataRecorder" in charge of saving data on disk (after several manipulations). Here is a simplified mockup: class DataRecorder(...
0 votes
2 answers
126 views
Python threading or multiprocessing for my 'tool'
I have created a script that : Imports a list of IP's from .txt ( around 5K ) Connects to a REST API and performs a query based on the IP ( web logs for each IP) Data is returned from the API and ...
1 vote
1 answer
127 views
Simple parallel scientific programming
I quite often write simple optimization routines that look something like this: def createinstance(n): while(True): #create some instance called instance yield instance loopno = ...
2 votes
0 answers
112 views
Embarassingly parallel task not fast enough
I have seen other related questions (like this one) but none of them actually answers my questions, so here it goes: I have an obviously embarassingly parallel task to perform, my own rolled version ...