Sorting a vector using threads

Question

Are the vectors defined in the C++ STL re-entrant or thread safe? Can I use two threads and work(in this case sort) on two halfs of a vector without using a mutex? For example:

int size = (Data_List.size())/2; Thread A() { ................ // Do we need to lock Data_list with a mutex sort(Data_List.begin(),Data_List.begin()+size,cmp); } Thread B() { ....// Do we need to lock Data_list with a mutex sort(Data_List.begin()+(size+1),Data_List.end(),cmp); }

My Question is do we need to lock the access of Data_List using a mutex?

Note: the cmp function is a regular int comparision function.

I wonder whether this really buys you anything, when you finish sorting each half, the list is still not sorted. You still would need to sort/merge the two halves together — John Knoeller
– John Knoeller, Commented Mar 3, 2010 at 2:59
@John: I am implementing a sample sort using Threads and then later use recursive doubling to combine the halves and sort it together ... I was wondering if I can just use the offsets and work on the same data structure without the overhead of creating extra copies and working on the copies separately... — tomkaith13
– tomkaith13, Commented Mar 3, 2010 at 3:05
Ok, I concur with all of the other answers - It should be fine as long as you aren't changing the number of elements in the vector during the sort. — John Knoeller
– John Knoeller, Commented Mar 3, 2010 at 3:17
@John - divide and conquer is one of the principles behind parallelization. Of course it will buy something, provided the dataset is big enough. — Tom
– Tom, Commented Mar 3, 2010 at 3:19
@Tom: If the sort is saturating memory bandwidth (and a simple int compare suggests it will be), then moving every element twice is going to make things worse rather than better. For sorts that spill out of memory, it can see how this would help, but I'm skeptical that it will help if the whole data set fits in a vector. Even so, he seems to know what he's doing, so I'll shut up now ;) — John Knoeller
– John Knoeller, Commented Mar 3, 2010 at 3:28

tvanfosson · Accepted Answer · 2010-03-03 02:51:35Z

6

As long as the threads are working on different regions of memory and your comparison function only works with that memory and local variables, you should be ok. Essentially, you are "locking" each half of the table by dividing the work between the threads and only letting the thread work on its half of the data.

answered Mar 3, 2010 at 2:51

tvanfosson

534k102 gold badges703 silver badges801 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Steve Jessop Over a year ago

"only letting the thread work on its half of the data". Unless your architecture does RMW writes, and your partition of the vector straddles a word boundary. Then the two threads are accessing the same memory.

tvanfosson Over a year ago

@Steve Jessop -- I'm assuming that the compiler would pad any user defined types to word boundaries, but for types that are smaller than the word size of the machine, you certainly do need to be more careful.

Steve Jessop · Accepted Answer · 2010-03-03 12:36:19Z

Nearly, but not quite. This code is in general not thread-safe, even assuming the implementation makes reasonable guarantees about the thread-safety of vector.

Suppose Data_List::value_type is a type for which your architecture does not provide atomic access. For example on x86, byte-writes and aligned word- and dword-writes are atomic, but short-writes are not, and writes of user-defined types aren't unless they happen to be a good size and alignment. If your UDT has size 3, you could have a problem.

To perform a non-atomic short write, an implementation might do two byte writes. Or it might load the word, modify two of the bytes, and write the word back. On architectures where a byte write is expensive, the latter is quite plausible.

Now, suppose your implementation does the latter. Suppose further that your division of the vector happens to leave the first half of a word in one portion, and the second half of the word in the other. Suppose finally that the two different threads both try to modify that word simultaneously. This can go wrong - they might both read the same value, both modify it, but then one write back will occur before the other, so one of the modifications will be overwritten and lost.

Atomic byte-access by default isn't universal, and I doubt any implementation does atomic bit-access by default. So a vector<bool> is a possible problem even if other vector types are safe: your division of the vector could go down the middle of a byte. Not that you'd generally sort bools this way, but there are other operations where you might try to divide vectors, or your code might be generic.

You can usually expect word-access to be atomic (and in C or C++, int access). But it's not guaranteed by the language standard, hence sig_atomic_t. You say your cmp is an int comparison function, which suggests that maybe your vector contains ints. But you can compare shorts perfectly well using an int comparison function, so I don't know.

What you actually need to do is check your implementation/platform very carefully, and see what it says about safely accessing memory from multiple threads. It's probably fine for a vector of int.

@Steve: thnx for that .. Really appreciate it!!! ... I decide to copy each offset in separate buckets and sort them individually. I was kinda comparing this problem with the Strtok ( the need for strtok_r). So I was wondering if the iterator will be corrupted by two threads if we dont push in a static variable.. After implementing , it doesn seem to be corrupted ... so i am good .. as for the sorting part... i need to test it !!

Tronic · Accepted Answer · 2010-03-03 02:59:10Z

You can use the parallel mode of GCC (if you use GCC) for threaded versions of various algorithms. http://gcc.gnu.org/onlinedocs/libstdc++/manual/parallel_mode.html

Chris H · Accepted Answer · 2010-03-03 03:08:12Z

technically the standard says that these classes aren't thread safe, so if you're setting the data from things like the [] operator then technically that's multiple writes on one object. On the other hand, it's safe to operate on separate parts of a c array...so there seems to be a conflict here. If you want to be squeaky clean take the address of the first element and treat it as a c-array. A lot of answers here say it's fine as long as you're on separate parts of the array, and although that is probably true on many implementations

->I think it's important to note that an implementation is free not make this safe.

Tom · Accepted Answer · 2010-03-03 02:50:49Z

1

You shouldnt have any problems if the threads work on disjoint parts of the vector.

answered Mar 3, 2010 at 2:50

Tom

45.4k30 gold badges142 silver badges171 bronze badges

Collectives™ on Stack Overflow

Sorting a vector using threads

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

2 Comments

1 Comment

Comments

Comments

Comments

Related