Jussi Enkovaara

Senior application scientist at the Finnish national supercomputing center CSC - IT Center for Science.

Location Finland

Activity

Jussi Enkovaara replied to Lisa Landuyt

Vectorised operations

15 OCT 2020

Hi @LisaLanduyt, how did you measure the timings? Generally, the larger the array, the more benefit there should be from vectorization.
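
One possible way to measure this, as a minimal sketch: time an explicit Python loop and the equivalent vectorised NumPy expression with `timeit` for a few array sizes. The function names and the sizes are illustrative assumptions, not taken from the course material, and the actual numbers depend on the machine.

```python
import timeit

import numpy as np


def loop_square_sum(values):
    """Sum of squares with an explicit Python loop."""
    total = 0.0
    for v in values:
        total += v * v
    return total


def vectorised_square_sum(values):
    """Same computation as a single vectorised NumPy expression."""
    return float(np.sum(values * values))


timings = {}
for n in (10, 1_000, 100_000):
    data = np.arange(n, dtype=float)
    # Take the best of a few repeats to reduce noise from other processes.
    t_loop = min(timeit.repeat(lambda: loop_square_sum(data), number=3, repeat=3))
    t_vec = min(timeit.repeat(lambda: vectorised_square_sum(data), number=3, repeat=3))
    timings[n] = (t_loop, t_vec)
    print(f"n={n:>7}: loop {t_loop:.2e} s, vectorised {t_vec:.2e} s")
```

For very small arrays the fixed cost of calling into NumPy can dominate, so the ratio between the two columns is the interesting quantity to follow as `n` grows.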
Jussi Enkovaara replied to Ihar Suvorau

Different ways to use compiled code with Python

28 MAY 2020

As pointed out by @IharSuvorau, optimization often makes readability worse, so remember to document any optimizations in the code!
Jussi Enkovaara replied to Camille Clouard

Collective communication: many to one

26 MAY 2020

Hi @CamilleClouard, note that in the example above `data` is a ten-element array, while `rank` is a scalar value. When you gather, the root receives the values from all the ranks, and the non-root ranks do not get anything, i.e. `n` is `None`. If you gather the `data` variable, the `n` in root should contain a four-element list (with 4 MPI tasks) where each...
Jussi Enkovaara replied to Camille Clouard

Collective communication: many to one

25 MAY 2020

Hi @CamilleClouard, lower case and upper case routines work differently: lower case routines always return the data, while for upper case routines one needs to provide the "return" array as an argument. In the above example with `n = comm.gather(rank, root=0)`, in rank 0 `n` will be a list containing all the ranks, whereas in other ranks `n` will be `None`.
Jussi Enkovaara replied to Patrick Kürschner

Communicators

25 MAY 2020

Creating a communicator is a collective operation with some cost (at very large parallel scale the cost may be surprisingly large).
Jussi Enkovaara replied to Dominik Loroch

Communication modes

25 MAY 2020

Hi @DominikLoroch, the semantics of isend is that the send buffer can be reused only after the communication is completed (i.e. wait or test has been called). Whether internal buffering is used depends on the MPI implementation and the message size. We should probably emphasize that also in the article.

You can try a simple test:
----
from mpi4py import...
Jussi Enkovaara replied to Keichi Takahashi

Hands-on: Non-blocking communication

25 MAY 2020

Isendrecv would be just the same as Isend followed by Irecv
Jussi Enkovaara replied to Vikas Kushwaha

Course summary

20 MAY 2020

Hi @VikasKushwaha, FutureLearn policy is to provide certificates only for a fee (that's not for us to decide), but please provide feedback directly to FutureLearn via the Contact link at the bottom of the page.
Jussi Enkovaara replied to Ronald Cohen

Parallel programming with Python

20 MAY 2020

Hi @RonaldCohen, yes, it is possible to have OpenMP threading in C or Fortran code that is called from Python. One can also have MPI calls both in C/Fortran and in Python within the same program; one just needs to pass the communicator to the C/Fortran function.
Jussi Enkovaara replied to Maurice Karrenbrock

Welcome to week 4

20 MAY 2020

Hi @MauriceKarrenbrock you are correct, OpenMP cannot be used in Python code (you can still have a Python module written in C utilizing OpenMP). Generally, the global interpreter lock in CPython makes it difficult to efficiently parallelize pure Python code with threads.
Jussi Enkovaara replied to Juan F Rivero

Arithmetics and elementary functions

20 MAY 2020

If you need multidimensional arrays in a "simple" calculator, NumPy might also be convenient (even if you do not care about the performance).
Jussi Enkovaara replied to Jari Perttunen

Parallel programming concepts

20 MAY 2020

The equation shows the maximum speed-up with an "infinite" number of CPU cores. If the whole problem (100 %) is parallelizable, then the maximum speed-up is infinite :-)
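
As a small worked example, assuming the equation in question is Amdahl's law (the function names below are illustrative): the speed-up with `n_cores` cores is 1 / ((1 - p) + p / n_cores), and letting the number of cores grow without bound gives 1 / (1 - p).

```python
import math


def amdahl_speedup(parallel_fraction, n_cores):
    """Speed-up predicted by Amdahl's law for a given number of cores."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / n_cores)


def max_speedup(parallel_fraction):
    """Limit of the speed-up as the number of cores goes to infinity."""
    serial_fraction = 1.0 - parallel_fraction
    if serial_fraction == 0.0:
        return math.inf  # fully parallelizable: no upper bound
    return 1.0 / serial_fraction


for p in (0.5, 0.9, 1.0):
    print(f"p={p}: 16 cores -> {amdahl_speedup(p, 16):.2f}, "
          f"limit -> {max_speedup(p):.2f}")
```

With half of the run time parallelizable the speed-up can never exceed 2, no matter how many cores are used.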
Jussi Enkovaara replied to Camille Clouard

Parallel programming concepts

20 MAY 2020

If you use local single core as reference, then yes.
Jussi Enkovaara made a comment

Hands-on: Message exchange

20 MAY 2020

The semantics of send and recv are that their completion *might* depend on other processes, i.e. send might return only if the corresponding recv has been called.

In practice, for small messages MPI libraries typically perform some internal buffering, so that send returns "immediately", while for large messages the corresponding recv has to be called.
Jussi Enkovaara replied to Kalle Prorok

Fast communication of large arrays

20 MAY 2020

No, in this case in rank 0 you want to both send to 1 and receive from 1 (and similarly for rank 1)
Jussi Enkovaara replied to Jari Perttunen

Interfacing C code with Cython

15 MAY 2020

As cimport and import do completely different things (importing Cython definitions vs. importing Python modules), one can cimport and import under the same name (you can also just 'import numpy' and 'cimport numpy'). This is the practice that most Cython tutorials use, but personally I prefer to use different names.
Jussi Enkovaara replied to Sarah H

Avoiding function call overheads

15 MAY 2020

Even though `cdef` functions can be called only from within the same Cython module (the .pyx file), they can return either Python values or pure C values. Thus, `cdef int add(...)` returns a pure C integer, while with `cdef add` the return value is converted from a pure C value to a Python object, which adds some overhead.

Generally, when the type of...
Jussi Enkovaara replied to Ignace Pelckmans

Where to add types?

15 MAY 2020

Yes, it should be mandel.pyx, I have no idea why I used .cyt in the video... :-) (the `cython` command does not actually care about the file extension, but the `cythonize` function in setup.py requires a '.py' or '.pyx' extension)
Jussi Enkovaara replied to Sarah H

Using static typing

15 MAY 2020

Yes, in this case the function call overhead is much larger than the actual computation so you do not see much difference.
Jussi Enkovaara replied to Gianni Procida

Creating Cython modules

15 MAY 2020

Hi @GianniProcida as the first rule of optimization you should first make sure that the pure Python module works correctly. The Python debugger can be useful in this: `python3 -m pdb myfile.py`.

If it seems that cythonization introduces bugs, you can try to follow the steps in the above link pointed by @IainS
Jussi Enkovaara replied to Noora H

Hands-on: utilizing Fortran code

13 MAY 2020

Hi @NooraH and @PatrickKürschner I realized that Fortran compiler is missing from the virtual image, sorry for that!

You can install it: sudo apt install gfortran

After that also f2py3 should work.
Jussi Enkovaara replied to Yannick Gansemans

Interfacing C code with CFFI

13 MAY 2020

Thanks @YannickGansemans this is fixed now. Dangers of not doing edits in real code...
Jussi Enkovaara replied to Ihar Suvorau

Hands-on: Optimising heat equation solver

13 MAY 2020

Yes, that's right, you cannot call `cdef` functions from Python (i.e. from heat_main.py) but only within the Cython module. Thus, evolve can be made `cdef` but other functions would need to be `cpdef` (if one wants to cythonize them).
Jussi Enkovaara replied to Laura Vuorinen

Using NumPy with Cython

13 MAY 2020

@LauraVuorinen I still have somewhat limited experience with Numba, but it looks really promising. At least in simple cases one seems to get the same performance as with Cython (or C/C++ or Fortran) with tiny effort, similar to what @giordanozanoli wrote.

We would definitely like to include also Numba in this course, unfortunately we haven't yet had enough...
Jussi Enkovaara replied to Ihar Suvorau

Hands-on: Using C-functions

13 MAY 2020

@IharSuvorau the lesson here is that a good non-optimized algorithm beats an optimized bad algorithm 10000000 - 0 :-) Premature optimization is ...
Jussi Enkovaara replied to Fraser Kennedy

Hands-on: Static typing in a simple extension

13 MAY 2020

@FraserKennedy this is expected behaviour, Cython is more strict about implicit type conversions.
Jussi Enkovaara replied to Juan F Rivero

Arithmetics and elementary functions

13 MAY 2020

Hi Rafael, just to point out that you are also measuring the implicit creation of a NumPy array from lst; of course, with the math example there is also the overhead from the list comprehension. Generally you are still right that math is typically more efficient for scalars even if you neglect the array creation:

In [2]: a = 0.37 ...
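
A comparison along these lines could look like the sketch below. The choice of `sin` and the list contents are assumptions for illustration (the original session is cut off above); the array is created once, outside the timed code, so that its creation is not part of the measurement.

```python
import math
import timeit

import numpy as np

a = 0.37
lst = [0.01 * i for i in range(1000)]
arr = np.array(lst)  # created once, so array creation is not timed

# Scalar argument: math works directly on a Python float.
n_calls = 10_000
t_math = timeit.timeit(lambda: math.sin(a), number=n_calls)
t_numpy = timeit.timeit(lambda: np.sin(a), number=n_calls)
print(f"scalar: math {t_math:.2e} s, numpy {t_numpy:.2e} s")

# Many values: list comprehension with math vs. one NumPy call.
t_list = timeit.timeit(lambda: [math.sin(x) for x in lst], number=100)
t_arr = timeit.timeit(lambda: np.sin(arr), number=100)
print(f"1000 values: math {t_list:.2e} s, numpy {t_arr:.2e} s")
```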
Jussi Enkovaara replied to Dmytro Kryvokhyzha

Hands-on: Array creation

13 MAY 2020

Hi Christof,
sys.getsizeof(a) returns the size of the whole object 'a', which in the case of a NumPy array includes all the metadata (array shape etc.) in addition to the actual data. In this case this metadata takes up 96 bytes:

In [2]: a = np.zeros(1, dtype='S1')
In [3]: sys.getsizeof(a) ...
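
The split between data and metadata can be checked by comparing `sys.getsizeof` with the array's `nbytes` attribute, which counts only the data buffer. This is a minimal sketch; the variable names are illustrative, and the exact overhead depends on the NumPy version and platform, so it may differ from the 96 bytes quoted above.

```python
import sys

import numpy as np

small = np.zeros(1, dtype='S1')
large = np.zeros(1000, dtype='S1')

# nbytes is the size of the data buffer only; the difference to
# sys.getsizeof is the metadata of the array object.
overhead_small = sys.getsizeof(small) - small.nbytes
overhead_large = sys.getsizeof(large) - large.nbytes

print(f"small: data {small.nbytes} B, overhead {overhead_small} B")
print(f"large: data {large.nbytes} B, overhead {overhead_large} B")
```

The overhead is the same for both arrays, so it matters only when one creates a large number of very small arrays.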
Jussi Enkovaara replied to Ingrid Strandberg

Creating and accessing NumPy arrays

13 MAY 2020

Hi, as Germain pointed out, a NumPy array has a reference only to a single "data buffer" in memory (there cannot be a hierarchy of references), and thus there is really no concept of a shallow copy with NumPy arrays.

If one uses "shallow" `copy.copy` from Python stdlib (even though the array's own copy method is the recommended one) with NumPy arrays, one still gets...
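
The behaviour is easy to check directly. The sketch below (variable names are illustrative) compares a view, `copy.copy` and the array's own `copy` method for a numeric array, using `np.shares_memory` to see whether two arrays use the same data buffer.

```python
import copy

import numpy as np

a = np.arange(5)
view = a[:]             # a view: new array object, same data buffer
shallow = copy.copy(a)  # "shallow" copy from the standard library
own = a.copy()          # the array's own copy method

a[0] = 100              # modify the original afterwards

print(view[0], shallow[0], own[0])
print(np.shares_memory(a, view), np.shares_memory(a, shallow))
```

Only the view follows the change in `a`; both copies have their own data buffer.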
Jussi Enkovaara replied to Camille Clouard

Creating and accessing NumPy arrays

07 MAY 2020

One can also assign multiple values at the same time:
a[[4, 6]] = [-1, -2]

Generally, one can index NumPy arrays with integer lists / arrays and boolean mask arrays:
In [1]: a = np.arange(10)
In [2]: m = a < 5
In [3]: a[m] = -1 ...
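
Written out as a complete script, the two kinds of indexing look as follows (a minimal sketch; `b`, `mask` and `idx` are illustrative names):

```python
import numpy as np

# Integer list: assign to positions 4 and 6 in one statement.
a = np.arange(10)
a[[4, 6]] = [-1, -2]
print(a.tolist())   # [0, 1, 2, 3, -1, 5, -2, 7, 8, 9]

# Boolean mask: assign to every element where the condition holds.
b = np.arange(10)
mask = b < 5
b[mask] = -1
print(b.tolist())   # [-1, -1, -1, -1, -1, 5, 6, 7, 8, 9]

# An integer array works the same way as an integer list.
idx = np.array([0, 9])
print(b[idx].tolist())   # [-1, 9]
```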
Jussi Enkovaara replied to Tom Couch

Hands-on: Array creation

05 MAY 2020

The link describes the array API, which is in principle a slightly different thing than the array constructor "array". But I admit it is a bit confusing. Furthermore, as one can provide as dtype also 'c8' and 'c16', meaning single or double precision complex numbers... NumPy does not fully comply with the Zen of Python :-) ("There should be one-- and preferably...
Jussi Enkovaara replied to Lassi Lehto

Hands-on: Array creation

05 MAY 2020

Hi Lassi, floating point numbers can be a bit peculiar, as many simple decimal numbers cannot be represented exactly; double precision numbers are accurate only to ~16 digits. Here, the culprit is 0.2:
In [1]: format(0.2, '.18f')
Out[1]: '0.200000000000000011'
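
The same effect can be seen with the standard library alone. In this short sketch `Decimal(0.2)` prints the exact binary value stored for the literal, and `math.isclose` is one common way to compare floating point results with a tolerance instead of `==`:

```python
import math
from decimal import Decimal

# Decimal(float) shows the exact value stored for the literal 0.2.
exact = Decimal(0.2)
print(exact)

total = 0.1 + 0.2
print(total == 0.3)              # False
print(math.isclose(total, 0.3))  # True
print(format(total, '.17g'))
```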
Jussi Enkovaara replied to Tom Couch

Hands-on: Array creation

05 MAY 2020

Hi Tom, could you point out where NumPy documentation says that 'c' is a complex number?

Frankly, I think that 'c' for character array is there for historical backward compatibility, and according to the NumPy documentation it is not recommended. However, it is still the easiest way to create such an array from a string.

In addition to 'fromiter', one can also use...
Jussi Enkovaara replied to Outi Vilhelmiina Kontkanen

Creating and accessing NumPy arrays

04 MAY 2020

You can create NumPy arrays both from tuples and lists, i.e.
numpy.array([1, 2, 3, 4]) and numpy.array((1, 2, 3, 4)) create exactly the same NumPy array. Also, the input data (either from a list or a tuple) is always copied: if you have something like
myarray = numpy.array(mylist)
modifying myarray does not change mylist.
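
A quick check of both statements (a minimal sketch; `from_list` and `from_tuple` are illustrative names):

```python
import numpy as np

mylist = [1, 2, 3, 4]

from_list = np.array(mylist)
from_tuple = np.array((1, 2, 3, 4))
print(np.array_equal(from_list, from_tuple))  # True

# The array holds its own copy of the data.
myarray = np.array(mylist)
myarray[0] = 99
print(mylist)            # [1, 2, 3, 4]
print(myarray.tolist())  # [99, 2, 3, 4]
```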
Jussi Enkovaara replied to giancarlo marra

Week 1 summary

30 APR 2020

Hi Giancarlo, that is a bug indeed, thanks for spotting it! We will fix it shortly.
Jussi Enkovaara replied to Ihar Suvorau

Pros and cons of various performance analysis approaches

30 APR 2020

Hi Ihar, you can investigate it yourself by running the heat equation with and without cProfile :-) . Generally speaking, there is some overhead from cProfile, but I think it depends also on how many time() calls you need. Internally, cProfile is implemented in C so its timing routines are in principle more efficient, but on the other hand it is performing a...
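
The same experiment can be tried on a toy workload before running the full heat equation solver. The sketch below uses made-up functions (`tiny` and `workload` are not part of the course code) with many cheap function calls, which is roughly the least favourable case for a profiler; the measured overhead will vary between machines and Python versions.

```python
import cProfile
import time


def tiny(x):
    """A very cheap function, so call overhead dominates."""
    return x + 1


def workload(n):
    """Call tiny() n times and return the accumulated value."""
    total = 0
    for _ in range(n):
        total = tiny(total)
    return total


n_calls = 200_000

# Plain run, timed with the application's own timer.
start = time.perf_counter()
plain_result = workload(n_calls)
t_plain = time.perf_counter() - start

# Same run under cProfile; runcall returns the function's result.
profiler = cProfile.Profile()
start = time.perf_counter()
profiled_result = profiler.runcall(workload, n_calls)
t_profiled = time.perf_counter() - start

print(f"plain: {t_plain:.3f} s, with cProfile: {t_profiled:.3f} s")
```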
Jussi Enkovaara replied to giordano zanoli

Hands-on: Performance analysis of heat equation solver

30 APR 2020

Hi, I assume you are using Windows? This same input file is used in quite a few places in the exercises, and on Linux/Mac this is conveniently handled with symbolic links (when modifying the file, the changes are seen everywhere without the need to copy). Unfortunately, symbolic links do not work in Windows, so you need to copy it manually.
Jussi Enkovaara replied to giancarlo marra

Hands-on: Performance analysis of heat equation solver

30 APR 2020

Hi Giancarlo, the nice thing with an older machine is that you really feel the difference when optimising the code later on :-) (not that more powerful machines would not see any benefits.)
Jussi Enkovaara replied to Michele Pellegrino

Introducing heat equation

30 APR 2020

There was indeed a typo in the formula, thanks for spotting it!
Jussi Enkovaara replied to Alexandre Fournier

Introducing heat equation

30 APR 2020

Hi Alexander, you are right that boundary condition is also needed.
Jussi Enkovaara replied to Marko Mišić

Measuring small code snippets with timeit

30 APR 2020

Hi Marko, see demos/performance/matmul/test_matmul.py (or directly in github at: https://github.com/csc-training/hpc-python/blob/master/demos/performance/matmul/test_matmul.py)
Jussi Enkovaara replied to Catherine Shevchenko

Using applications own timers

30 APR 2020

Measuring performance always incurs some overhead, but just "import time" should be negligible in most real situations. Note that Python caches imports: when a module is imported multiple times from multiple places in the program, the disk is read only once, and in subsequent imports everything is readily in memory.
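
The caching can be observed through `sys.modules`, the dictionary where the interpreter keeps already imported modules. A minimal sketch (the names `first` and `again` are illustrative):

```python
import sys
import time
import timeit

first = time            # module object bound by the import above
import time as again    # repeated import: looked up in sys.modules

print(again is first)                # True
print(sys.modules['time'] is first)  # True

# Cost of an import statement for a module that is already cached.
n_imports = 100_000
t_total = timeit.timeit('import time', number=n_imports)
print(f"{t_total / n_imports:.2e} s per repeated import")
```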

The process_time() function has...
Jussi Enkovaara replied to SUAT TANIR

Why are Python programs slow?

30 APR 2020

The choice between compiled vs. interpreted is, generally speaking, a compromise between programmer efficiency and computer efficiency. Nowadays an active area of development is just-in-time compilation, which tries to get the best of both worlds.
Jussi Enkovaara replied to Hal Euphrates

Why are Python programs slow?

30 APR 2020

Hi, the dynamic typing and data structure are quite generic areas and the issues Dominik mentioned are directly related to both of them.
Jussi Enkovaara replied to Cristina Russo

Setting up the programming environment

30 APR 2020

Hi Cristina, just out of curiosity, which browser/operating system are you using?
Jussi Enkovaara replied to Boris Vandemoortele

Setting up the programming environment

30 APR 2020

The pythonuser has admin rights, so you can authenticate with the same 'hpc1python' password.
Jussi Enkovaara replied to Eva Kontogianni

Setting up the programming environment

30 APR 2020

Hi Eva, can you provide a bit more detail about your problem? What operating system are you using and what is the actual error message?
Jussi Enkovaara replied to Eike Cramer

Setting up the programming environment

30 APR 2020

Hi Eike, if the tests pass, you should be fine for the rest of the course.
Jussi Enkovaara replied to Sameer Hassan

Setting up the programming environment

30 APR 2020

Hi Sameer, installing MPI can sometimes be a bit tricky in Mac and Windows environment which is one of the reasons we are supporting only Linux. You can try the brew install as suggested by Qing above or just use the virtual machine.
Jussi Enkovaara replied to Ashwin Vishnu Mohanan

Setting up the programming environment

30 APR 2020

Hi Ashwin, thanks for spotting this bug! We will fix it shortly
Jussi Enkovaara replied to Outi Vilhelmiina Kontkanen

Hands-on: Performance analysis of heat equation solver

27 APR 2020

Hi Outi, you are right that for-loops can be quite inefficient in Python (as said in the assignment, this is an inefficient implementation :-) ). During the course we will look for some better ways.
Jussi Enkovaara replied to Keichi Takahashi

Hands-on: Performance analysis of heat equation solver

27 APR 2020

Hi, you are right, the correct module is `heat_main.py`, sorry for the typo!
Jussi Enkovaara replied to Santiago Helbig

Different ways to use compiled code with Python

30 SEP 2019

In numerical calculations a Fortran compiler can usually optimize the code more easily. However, in most cases C code can be made equally fast, but you might need to add extra guidance for the compiler (pragmas, restrict qualifiers for pointers etc.)
Jussi Enkovaara replied to Hamidreza Ardeshiri

Setting up the programming environment

17 SEP 2019

Hi @DariaS, using the provided image should not be too much work if you already have VirtualBox running, as you can skip step 2 in the instructions (in principle the .ova image should work also with other virtualization systems such as VMware, but we have not tested them).

Downloading the image takes of course some time (depending on your bandwidth),...
Jussi Enkovaara replied to Bill Roberts

Hands-on: Performance analysis of heat equation solver

11 SEP 2019

Hi @BillRoberts , the file path in bottle.dat is due to use of symbolic links in the Linux side, which work a bit differently in Windows. You will probably encounter the same issue also later in the course.
Jussi Enkovaara replied to Micah Young

Hands-on: Performance analysis of heat equation solver

11 SEP 2019

Hi @MicahYoung , I guess you are using Windows? We use symbolic links for bottle.dat which work a bit differently in Windows, so you will probably encounter the same issue also later in the course.
Jussi Enkovaara replied to Pauliina Mäkinen

Hands-on: Performance analysis of heat equation solver

11 SEP 2019

Hi @PauliinaMäkinen, in principle, the virtual machine should have only a small effect on the execution time, but if your system is otherwise busy (e.g. some system update is running in the background) it can make the simulation within the virtual machine run slower. Also, large variation in the execution time suggests that the system is busy also with something...
Jussi Enkovaara made a comment

Hands-on: Performance analysis of heat equation solver

10 SEP 2019

Sorry for the deprecated `plt.hold` in this exercise. The provided virtual machine works (although with a warning). If you are using your own matplotlib installation and it gives an error, you can try replacing `plt.hold(False)` with `plt.gca().clear()` (or issue git pull, as the material has been fixed on GitHub).
Jussi Enkovaara replied to Alice Redermeier

Setting up the programming environment

09 SEP 2019

Hi Alice, can you be a bit more specific, what kind of trouble you are having with the password?
Jussi Enkovaara replied to Bill Roberts

Setting up the programming environment

09 SEP 2019

Hi Bill, in principle everything we discuss can be done also in Windows provided you have the necessary packages installed.
Jussi Enkovaara replied to Hamidreza Ardeshiri

Setting up the programming environment

09 SEP 2019

Hi @HamidrezaArdeshiri what does "apt list --installed | grep numpy" print out? Are you testing the installation with python or python3 ? The installation instructions are for Python 3, so you probably need to use python3 (although practically everything should work also with Python 2 if you install the correct version).
