Speed-up `build` by avoiding an expensive operation #3395

abravalheri · 2022-06-20T19:45:41Z

I decided to look into the execution time of setuptools' PEP 517 backend after reading some criticism of its speed (it is reportedly much slower than the alternatives) on the Python Discourse and elsewhere. I don't really think the difference is relevant, but it would be nice if we could address the negative publicity.

So I hacked together this dirty profiling script to see which parts of setuptools take longer to run.
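The profiling script itself is not reproduced in this thread. As a rough illustration only, a minimal sketch of this kind of measurement with the standard library's `cProfile` could look like the following. The helper `profile_call` and the stand-in workload `busy_work` are hypothetical names; a real run would call a build-backend hook instead.

```python
import cProfile
import io
import pstats


def profile_call(func, *args, top=10, **kwargs):
    """Run func under cProfile and return (result, report_text)."""
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args, **kwargs)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    # Sort by cumulative time so the most expensive call chains come first.
    stats.sort_stats("cumulative").print_stats(top)
    return result, buffer.getvalue()


def busy_work(n):
    # Stand-in workload (hypothetical); replace with the hook under study.
    return sum(i * i for i in range(n))


result, report = profile_call(busy_work, 1000)
print(report)
```

Sorting by cumulative time is one reasonable choice here because it surfaces wrappers such as `exec` that spend most of their time in callees.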

The first time I ran it, I saw the following:

(The bright pink shows the function I was selecting to see, in this case exec).

It seems that we are currently paying the price of running exec twice in most PEP 517 hooks... This made me curious, so I went back to a line in build_meta that I never really understood:

setuptools/setuptools/build_meta.py

Line 174 in 9a60016

exec(compile(code, __file__, 'exec'), locals())

In this line, we are compiling the setup script before executing it, which is curious. I don't know why this is necessary, since exec should also be able to compile the code...

The other part that caught my attention in the profile was dist._install_dependencies (bright pink again):

which seems to be one of the main contributors to the execution time.
When I looked at the code for this function, I saw that we are trying to find missing dependencies for entry-points that have extras (which is not very common nowadays, since pip simply ignores extras in entry-points).
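To illustrate what "entry-points that have extras" means, here is a small sketch using the standard library's `importlib.metadata.EntryPoint`. This is an assumption for illustration: setuptools' own code path at the time did not necessarily use this class, and the entry points and the helper `needs_dependency_check` below are hypothetical.

```python
from importlib.metadata import EntryPoint

# Hypothetical entry points in the "module:attr [extra]" syntax.
entry_points = [
    EntryPoint(name="plain", value="pkg.cli:main", group="console_scripts"),
    EntryPoint(name="fancy", value="pkg.cli:main [color]", group="console_scripts"),
]


def needs_dependency_check(ep):
    # Only entry points that declare extras could require extra dependencies.
    return bool(ep.extras)


to_check = [ep.name for ep in entry_points if needs_dependency_check(ep)]
print(to_check)
```

Filtering this way skips the dependency lookup entirely for the common case of entry points without extras.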

I decided to quickly hack these two code paths, and I was positively surprised by the huge impact:

(It is clear to me that the precise timings will vary from computer to computer and also depend on which programs the user has open in their machine... but this little experiment seems to be showing a general tendency).

This PR contains these 2 simple changes.
(I don't have a lot of experience with Python profiling, so I might also have implemented something wrong or underestimated something).

Summary of changes

build_meta: Avoid running compile before exec
- I honestly don't know if this change has another impact that I am not able to foresee...
dist: Avoid trying to install dependencies for entry-points that don't have extras.

Closes

Pull Request Checklist

Changes have tests
News fragment added in changelog.d/.
(See documentation for details)

abravalheri · 2022-06-20T20:08:30Z

setuptools/build_meta.py

 code = f.read().replace(r'\r\n', r'\n')

- exec(compile(code, __file__, 'exec'), locals())
+ exec(code, locals())


The only difference I can think of here is that if the setup script has a syntax error, compile would fail, so no instruction gets executed. I don't know how exec works, but it could be the case that it gets to execute something before failing... I don't think this is relevant though, because the PEP 517 hooks are supposed to be executed in isolated subcommands, so it is fine for the hook to crash.

However, I might be failing to see other problems...

Alternatively, we could also try:

try:
    exec(code, locals())
except Exception:
    exec(compile(code, __file__, 'exec'), locals())

It's my understanding that if there is a syntax error, nothing will ever be executed, so no concerns here.
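A quick way to check this claim is the following sketch: the source string has a side effect on its first line and a syntax error later on, and the side effect should never happen because the whole string is compiled before anything runs. The variable names are illustrative only.

```python
namespace = {"log": []}

# First line has a visible side effect; the function definition is invalid.
source = "log.append('ran')\ndef broken(:\n    pass\n"

try:
    exec(source, namespace)
except SyntaxError:
    failed = True
else:
    failed = False

print(failed, namespace["log"])
```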

An important difference is that — when an exception gets raised — the stacktrace is now broken. The second argument to compile tells python where to find the source code when displaying the stacktrace. That's exactly what #3577 wanted to improve.
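The effect on the stacktrace can be sketched as follows. The file name `hypothetical_setup.py` and the helper `last_frame_filename` are made up for illustration; the point is only which file name ends up in the innermost traceback frame.

```python
import traceback

source = "raise RuntimeError('boom')\n"


def last_frame_filename(code_or_source):
    """Execute the code and return the file name of the innermost frame."""
    try:
        exec(code_or_source, {})
    except RuntimeError as exc:
        return traceback.extract_tb(exc.__traceback__)[-1].filename


# Passing a string: exec compiles it under the placeholder name "<string>".
plain = last_frame_filename(source)

# Passing a code object: the frame carries the name given to compile().
named = last_frame_filename(compile(source, "hypothetical_setup.py", "exec"))

print(plain, named)
```

With a real path as the second argument to compile, the traceback machinery can also look up and display the offending source line.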

Calling compile before exec doesn't seem to have any noticeable effect on performance:

$ python -m timeit -s 'code = "1 + 1 #" + "A"*10000' 'exec(code)'
20000 loops, best of 5: 10.7 usec per loop
$ python -m timeit -s 'code = "1 + 1 #" + "A"*10000' 'exec(compile(code, __file__, "exec"))'
20000 loops, best of 5: 10.7 usec per loop

I don't see how that could make any difference. To execute the code, it has to be compiled first; it doesn't matter whether compile is called explicitly.
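The same comparison can be run from a script with the `timeit` module. This is a sketch under the same assumptions as the commands above (a trivial expression followed by a long comment); the file name passed to compile is hypothetical, and absolute numbers will vary by machine.

```python
import timeit

source = "1 + 1 #" + "A" * 10000


def run_plain():
    # exec compiles the string internally before running it.
    exec(source, {})


def run_precompiled():
    # Explicit compile step with a (hypothetical) file name, then exec.
    exec(compile(source, "hypothetical_setup.py", "exec"), {})


loops = 1000
plain_time = min(timeit.repeat(run_plain, number=loops, repeat=5))
compiled_time = min(timeit.repeat(run_precompiled, number=loops, repeat=5))

print(f"exec(source):          {plain_time / loops * 1e6:.1f} usec per loop")
print(f"exec(compile(source)): {compiled_time / loops * 1e6:.1f} usec per loop")
```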

abravalheri · 2022-06-20T20:30:49Z

@jaraco, is there any chance you could have a look at this?
At first this seems like an easy gain, but I don't know the original motivation for the code being the way it is, so maybe I am failing to see a side effect...

The `exec` function in Python should be able to execute code directly. Using `compile` and then `exec` seem to cause an overhead.

abravalheri · 2022-07-04T12:27:38Z

Part of the improvements in this PR were superseded by #3421, so I rebased the proposed changes.

Now the PR only targets the compile + exec issue.

KOLANICH · 2022-09-02T11:32:31Z

compile calls Py_CompileStringObject

https://github.com/python/cpython/blob/bbcf42449e13c0b62f145cd49d12674ef3d5bf64/Python/bltinmodule.c#L738-L840

exec calls PyRun_StringFlags for the case of strings and PyEval_EvalCodeEx for the case of code.

https://github.com/python/cpython/blob/bbcf42449e13c0b62f145cd49d12674ef3d5bf64/Python/bltinmodule.c#L996-L1111

PyRun_StringFlags calls _PyParser_ASTFromString to parse the AST and then run_mod, which, through a chain of functions, calls PyEval_EvalCode, which looks much simpler than PyEval_EvalCodeEx.

https://github.com/python/cpython/blob/8a0d9a6bb77a72cd8b9ece01b7c1163fff28029a/Python/pythonrun.c#L1587-L1607

run_mod calls _PyAST_Compile (with the optimization level -1, which is the default for compile too, which means retrieving it from _Py_GetConfig()->optimization_level) and then run_eval_code_obj

So the optimization flags should be the same.

https://github.com/python/cpython/blob/8a0d9a6bb77a72cd8b9ece01b7c1163fff28029a/Python/pythonrun.c#L1720-L1737

and Py_CompileStringObject calls _PyParser_ASTFromString and then _PyAST_Compile.
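The observation that both paths go through the same parser and compiler can be checked from Python as well. The following sketch relies on the CPython-specific `sys._getframe` to grab the code object that is actually running in each case; `hypothetical_setup.py` is a made-up file name.

```python
# The executed snippet stores the code object of its own frame (CPython only).
source = "import sys\ncode_obj = sys._getframe().f_code\n"

ns_plain, ns_named = {}, {}
exec(source, ns_plain)
exec(compile(source, "hypothetical_setup.py", "exec"), ns_named)

# Same bytecode either way; only the recorded file name differs.
same_bytecode = ns_plain["code_obj"].co_code == ns_named["code_obj"].co_code
filenames = (
    ns_plain["code_obj"].co_filename,
    ns_named["code_obj"].co_filename,
)
print(same_bytecode, filenames)
```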

abravalheri marked this pull request as ready for review June 20, 2022 20:03

abravalheri commented Jun 20, 2022


abravalheri requested a review from jaraco June 20, 2022 20:29

abravalheri mentioned this pull request Jun 27, 2022

[BUG] performance after switching to nspektr / importlib_metadata as cratered #3418

Closed

benoit-pierre mentioned this pull request Jun 28, 2022

Drop support for installing dependencies when resolving distutils commands #3421

Merged

2 tasks

abravalheri added 3 commits July 4, 2022 13:21

build_meta: execute code directly

199deb1

The `exec` function in Python should be able to execute code directly. Using `compile` and then `exec` seem to cause an overhead.

Add news fragment

e1c6701

Remove outdated part in news fragment

241b529

abravalheri force-pushed the some-optimisation branch from 1abc6db to 241b529 Compare July 4, 2022 12:27

abravalheri changed the title ~~Speed-up build by avoiding 2 expensive operations~~ Speed-up build by avoiding a expensive operation Jul 4, 2022

jaraco approved these changes Jul 4, 2022


jaraco merged commit 5619b39 into pypa:main Jul 4, 2022

abravalheri deleted the some-optimisation branch July 4, 2022 13:43

abravalheri mentioned this pull request Sep 2, 2022

Implemented right file name in the stack frame of setup.py being executed. #3577

Closed

2 tasks

abravalheri mentioned this pull request Apr 28, 2025

[BUG] Error in build_meta.py:run_setup: exec(code, locals()) results in ValueError: not enough values to unpack (expected 2, got 1) #4964

Closed


jaraco Jul 4, 2022

Joshix-1 May 31, 2025
