Zheng Meyer-Zhao

I work as a software engineer for HPC applications at ASTRON in The Netherlands. I spend most of my time developing software, giving training courses on HPC-related topics, and developing training materials.

Location: ASTRON, Dwingeloo, The Netherlands

Activity

  • MPI manages system memory that is used for buffering messages and for storing internal representations of various MPI objects such as groups, communicators, datatypes, etc. This memory is not directly accessible to the user, and the objects stored there are opaque: their size and shape are not visible to the user. Opaque objects are accessed via handles, which...
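
    A minimal sketch (program and variable names are illustrative) of how such opaque objects are reached only through handles:

      program handles_demo
        use mpi_f08
        implicit none
        type(MPI_Group) :: world_group   ! handle to an opaque group object
        integer :: group_size

        call MPI_Init()
        ! MPI_COMM_WORLD is itself a handle; the communicator it refers to
        ! lives inside the MPI library and cannot be inspected directly.
        call MPI_Comm_group(MPI_COMM_WORLD, world_group)
        call MPI_Group_size(world_group, group_size)
        print *, 'size of the group behind MPI_COMM_WORLD:', group_size
        ! Freeing the handle lets MPI release its internal memory for the object.
        call MPI_Group_free(world_group)
        call MPI_Finalize()
      end program handles_demo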

  • Thank you for reporting the bugs! The options of the question and the answers to these options have been updated and corrected.

  • An epoch in the sense of one-sided communication is the time between two consecutive synchronization calls. Such a period is usually used for RMA calls to a remote window (i.e., in the role of being an origin process) and/or local loads and stores to the local window (i.e., in the role of being a target process).
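
    A minimal sketch of such a fence-delimited epoch (variable names are illustrative): between the two MPI_Win_fence calls each process acts as an origin (MPI_Put to its right neighbour) and as a target (a local store into its own window):

      program fence_epoch
        use mpi_f08
        implicit none
        integer, parameter :: n = 10
        integer, asynchronous :: win_buf(n)   ! window memory
        type(MPI_Win) :: win
        integer :: my_rank, num_ranks, right
        integer(MPI_ADDRESS_KIND) :: lb, extent, win_size

        call MPI_Init()
        call MPI_Comm_rank(MPI_COMM_WORLD, my_rank)
        call MPI_Comm_size(MPI_COMM_WORLD, num_ranks)
        right = mod(my_rank + 1, num_ranks)

        call MPI_Type_get_extent(MPI_INTEGER, lb, extent)
        win_size = n * extent
        call MPI_Win_create(win_buf, win_size, int(extent), MPI_INFO_NULL, &
                            MPI_COMM_WORLD, win)

        call MPI_Win_fence(0, win)        ! synchronization call: the epoch starts
        win_buf(1) = my_rank              ! local store to the own window (target role)
        call MPI_Put(win_buf(1), 1, MPI_INTEGER, right, 1_MPI_ADDRESS_KIND, &
                     1, MPI_INTEGER, win) ! RMA call to a remote window (origin role)
        call MPI_Win_fence(0, win)        ! synchronization call: the epoch ends

        call MPI_Win_free(win)
        call MPI_Finalize()
      end program fence_epoch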

  • It is the same as used in other collective MPI routines. The communicator can be e.g. MPI_COMM_WORLD.

  • No, we don't have any benchmark results on this. And it depends on the quality of your MPI library.

  • @GeorgGeiser Could you elaborate what you mean by "the calling process"? Do you mean the process that calls MPI_Win_lock?

  • The problem may occur with all buffer arguments of nonblocking MPI routines, i.e., independent of whether the buffer is an array or a scalar variable, whether it is an MPI_Put/Get/Accumulate buffer or is accessed by direct loads and stores to the one-sided window by the target process within a local load/store epoch, or whether it is used in other nonblocking routines like MPI_Isend or MPI_Irecv.
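
    A minimal sketch (hypothetical names) of the rule for the MPI_Irecv case: the buffer handed to the nonblocking call should carry the ASYNCHRONOUS attribute and must not be accessed until the matching MPI_Wait has returned:

      program nonblocking_buffer
        use mpi_f08
        implicit none
        integer, asynchronous :: recv_val   ! a single variable, not an array
        type(MPI_Request) :: request
        integer :: my_rank, num_ranks

        call MPI_Init()
        call MPI_Comm_rank(MPI_COMM_WORLD, my_rank)
        call MPI_Comm_size(MPI_COMM_WORLD, num_ranks)

        if (my_rank == 0 .and. num_ranks > 1) then
          call MPI_Irecv(recv_val, 1, MPI_INTEGER, 1, 0, MPI_COMM_WORLD, request)
          ! recv_val must not be accessed here: the transfer may still be in flight
          call MPI_Wait(request, MPI_STATUS_IGNORE)
          print *, 'received', recv_val     ! safe only after MPI_Wait
        else if (my_rank == 1) then
          call MPI_Send(my_rank, 1, MPI_INTEGER, 0, 0, MPI_COMM_WORLD)
        end if
        call MPI_Finalize()
      end program nonblocking_buffer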

  • Sorry, I don't have the answer. You may contact PRACE https://prace-ri.eu/contact-us/ for this question.

  • No. When a window is locked by process A, another process can only acquire a lock on this window after the lock of process A has been released. So the MPI_Win_lock of the other processes will automatically be granted once MPI_Win_unlock of process A has returned.
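
    A minimal sketch with exclusive locks (variable names are illustrative): every process increments a counter in the window on rank 0, and because the lock is exclusive, each access epoch is granted only after the previous holder has called MPI_Win_unlock:

      program lock_unlock
        use mpi_f08
        implicit none
        integer, asynchronous :: counter(1)
        type(MPI_Win) :: win
        integer :: my_rank, one
        integer(MPI_ADDRESS_KIND) :: lb, extent, win_size

        call MPI_Init()
        call MPI_Comm_rank(MPI_COMM_WORLD, my_rank)
        counter(1) = 0
        call MPI_Type_get_extent(MPI_INTEGER, lb, extent)
        win_size = extent
        call MPI_Win_create(counter, win_size, int(extent), MPI_INFO_NULL, &
                            MPI_COMM_WORLD, win)

        one = 1
        call MPI_Win_lock(MPI_LOCK_EXCLUSIVE, 0, 0, win)   ! exclusive access to rank 0's window
        call MPI_Accumulate(one, 1, MPI_INTEGER, 0, 0_MPI_ADDRESS_KIND, &
                            1, MPI_INTEGER, MPI_SUM, win)
        call MPI_Win_unlock(0, win)                        ! releases the lock for the next process

        call MPI_Win_free(win)                             ! collective; all epochs are completed here
        if (my_rank == 0) print *, 'counter =', counter(1) ! equals the number of processes
        call MPI_Finalize()
      end program lock_unlock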

  • You can find the digital version on the website of MPI-forum.org at https://www.mpi-forum.org/docs/mpi-3.1/mpi31-report.pdf

  • Yes, that's indeed the problem: the extra MPI_Win_fence is not needed. The question is to select the "most correct and accurate way".

  • On the origin and target side (as with message passing on the sender and receiver side), the combination of sendcount*sendtype and the actually used recvcount*recvtype must reflect the same sequence of basic datatypes (the recvcount in the argument list may be larger than the actually used count). This means, for example, you may (e.g., with MPI_Put) send 10 doubles located very...
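
    A minimal sketch (names are illustrative) of such a match: the origin describes 10 contiguous double precision values, while the target uses a strided vector type that also consists of 10 doubles, so both sides reflect the same sequence of basic datatypes:

      program put_datatype_match
        use mpi_f08
        implicit none
        integer, parameter :: n = 10
        double precision, asynchronous :: win_buf(2*n)   ! target window, used with a stride
        double precision :: origin_buf(n)                ! contiguous origin buffer
        type(MPI_Win) :: win
        type(MPI_Datatype) :: strided
        integer :: my_rank
        integer(MPI_ADDRESS_KIND) :: lb, extent, win_size

        call MPI_Init()
        call MPI_Comm_rank(MPI_COMM_WORLD, my_rank)
        origin_buf = my_rank

        call MPI_Type_get_extent(MPI_DOUBLE_PRECISION, lb, extent)
        win_size = 2 * n * extent
        call MPI_Win_create(win_buf, win_size, int(extent), MPI_INFO_NULL, &
                            MPI_COMM_WORLD, win)

        ! Target datatype: 10 doubles with stride 2, i.e., the same sequence of
        ! basic datatypes (10 x double) as the contiguous origin description.
        call MPI_Type_vector(n, 1, 2, MPI_DOUBLE_PRECISION, strided)
        call MPI_Type_commit(strided)

        call MPI_Win_fence(0, win)
        if (my_rank == 1) then
          call MPI_Put(origin_buf, n, MPI_DOUBLE_PRECISION, 0, &
                       0_MPI_ADDRESS_KIND, 1, strided, win)
        end if
        call MPI_Win_fence(0, win)

        call MPI_Type_free(strided)
        call MPI_Win_free(win)
        call MPI_Finalize()
      end program put_datatype_match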

  • Unless you are working with legacy code that uses mpif.h, please use mpi_f08 for new applications. Most mpif.h implementations do not include compile-time argument checking; therefore, many bugs in MPI applications remain undetected at compile time (e.g., a missing ierror as the last argument in most Fortran bindings).
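
    A minimal sketch of the recommended style (a hypothetical hello-world, not code from the course):

      program use_mpi_f08_demo
        use mpi_f08            ! instead of:  include 'mpif.h'
        implicit none
        type(MPI_Comm) :: comm ! communicators are typed handles, not plain integers
        integer :: my_rank

        call MPI_Init()                    ! ierror is optional with mpi_f08
        comm = MPI_COMM_WORLD
        call MPI_Comm_rank(comm, my_rank)  ! wrong argument types would not compile
        print *, 'Hello from rank', my_rank
        call MPI_Finalize()
      end program use_mpi_f08_demo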

  • C_F_POINTER(cptr_buf, buf, (/max_length/)) associates the Fortran pointer buf with the target of the C pointer cptr_buf and specifies its shape. 'buf' is then used in the rest of the application.
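
    A minimal sketch of the surrounding pattern, here assuming cptr_buf is returned by MPI_Win_allocate_shared (which requires that all processes of the communicator run on the same shared-memory node):

      program cfpointer_demo
        use mpi_f08
        use, intrinsic :: iso_c_binding, only : c_ptr, c_f_pointer
        implicit none
        integer, parameter :: max_length = 100
        type(c_ptr) :: cptr_buf
        double precision, pointer :: buf(:)   ! Fortran view of the same memory
        type(MPI_Win) :: win
        integer(MPI_ADDRESS_KIND) :: lb, extent, win_size

        call MPI_Init()
        call MPI_Type_get_extent(MPI_DOUBLE_PRECISION, lb, extent)
        win_size = max_length * extent

        ! MPI hands back a C pointer to the allocated window memory ...
        call MPI_Win_allocate_shared(win_size, int(extent), MPI_INFO_NULL, &
                                     MPI_COMM_WORLD, cptr_buf, win)
        ! ... and C_F_POINTER associates the Fortran pointer buf with that
        ! memory, giving it the shape (/max_length/).
        call c_f_pointer(cptr_buf, buf, (/max_length/))

        buf(1) = 3.14d0                       ! buf is used like a normal array from here on
        call MPI_Win_free(win)
        call MPI_Finalize()
      end program cfpointer_demo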

  • The example described here perfectly illustrates a race condition. A block of RMA operations is not atomic, which is why synchronisations are needed.

  • Parallel computing is done at the programming level: it is the software developer's responsibility to make sure that the program runs correctly in parallel. With modern processors, vectorization is also possible. To use it, you can compile your programs with compilation options that enable vectorization.
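
    A minimal sketch (the file name and the GCC flags are only an example; other compilers use different options): a loop with independent iterations, such as the one below, can be auto-vectorized when compiled with, e.g., gfortran -O3 -march=native -fopt-info-vec saxpy.f90:

      program saxpy
        implicit none
        integer, parameter :: n = 100000
        real :: x(n), y(n), a
        integer :: i

        a = 2.0
        x = 1.0
        y = 0.0
        do i = 1, n    ! independent iterations: a candidate for SIMD vectorization
          y(i) = y(i) + a * x(i)
        end do
        print *, y(1), y(n)
      end program saxpy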

  • There are more and more libraries available nowadays, which allow you to write a few lines of code to offload the computation to GPUs without having to write CUDA code yourself. Therefore, there are more and more GPU users, but not that many CUDA developers.

  • Hi Michael, I agree with you. However, this is a FutureLearn policy that we cannot do much about :(.

  • Hi John, the concept of parallel programming will be explained in Week 3. However, there are no coding exercises in this course.

  • In the case of hyper-threading, two threads are running on one CPU core. The instructions that need to be carried out by the two threads are placed in the pipeline of that CPU core for execution.

  • When multiple cores try to access the memory intensively at the same time, the memory becomes the bottleneck, so everything slows down. A single program can be assigned to run on more cores if it is programmed to do so.

  • When using a distributed-memory architecture, each machine has its own memory, but there are multiple machines; you need to write the software yourself so that it knows how to split the tasks across the different machines. More about this will be explained in Week 3.

  • The difference between the two architectures is explained later in the course.

  • Hi everyone, I am Zheng, an HPC consultant. I am interested in the learning process of robots.