MPI_Gatherv

Let's say I have two vectors a and b and I am going to get the result vector c[i]=a[i]*b[i]. Each part of a and b are constructed in different processors. For each part I'm going to use scaterv to distribute the elements of vector to all processors and then use gatherv to collect each local vector in the current processor. Then I'm going to use MPI_Allreduce to sum up all vectors in all processors and now all processors have the final result. (all vector in different processors and result vector are in the same size).
I don't know why when I run the code it stucks. The problem starts when I include myrank in gatherv command line.

int myrank, num_procs;
	MPI_Comm_size(MPI_COMM_WORLD, &num_procs);
	MPI_Comm_rank(MPI_COMM_WORLD, &myrank);

	MPI_Gatherv(local_res.data(), local_n, MPI_DOUBLE, res.data(), scounts.data(), displs.data(), MPI_DOUBLE, myrank ,MPI_COMM_WORLD);

MPI_Allreduce(res.data(), result.data(), res.size(), MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);

Last edited on

lastchance (6980)

use gatherv to collect each local vector in the current processor.

No, collect the scattered data back onto root, not myrank in the MPI_Gatherv call. It's a master-slave relationship (are we allowed to say that any more?). All processors need to issue this call, but most of them will be sending chunks of data, not receiving it.

If all you are going to do is sum the data then it is questionable if you need an MPI_Gatherv statement. You don't need to send it back to root for that.

Last edited on

Cplusc (457)

Thanks for your answer. I read in the MPI Guidelines that it can't be other processors than root. I understand what you are saying but I'm looking for a way to do something like this
The vector a and b are as follows,
a={2,3,5,7,4,2,5,6,7}
b={3,5,7,2,7,3,5,4,8}
These two vector are created in part in different processes.

Processor 0:
a={ 2,3,1,2,0,2,3,1,4}
b={ 2,4,5,0,7,0,2,1,0}

Processor 1:
a={0,0,4,5,4,0,2,5,3}
b={1,1,2,2,0,3,3,3,5}

Now in each processor I need to do multiplication operations. I want something that distributes the data for each processor between all processors and does a[i]*b[i] and then collect into a vector in the current processor then sum them all up in all processors.
How can I do something like this? If I collect them in root processor how can I sum them?

Last edited on

lastchance (6980)

You are already summing them with the
MPI_Allreduce( ... MPI_SUM ...);
Each processor will do its own sum The "reduce part" will combine those sums. The "All" part will then distribute the total sum to all processors.

Cplusc (457)

thanks for your answer.

Topic archived. No new replies allowed.