Conversation

@pfebrer (Contributor) commented Oct 31, 2025

The PET tests were all using, and some of them modifying, the global variables DEFAULT_HYPERS and MODEL_HYPERS, which is of course very bad. I realised this because, in a branch, running the tests of a single file passed but running the tests of all files failed.

I just took the approach that some of the tests were already using: deepcopy the hypers before modifying them.
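For reference, the pattern looks like this (a minimal sketch; the import path and the hypers keys are illustrative, not the exact ones in the test files):

```python
import copy

# Illustrative import path; the hypers live somewhere in the PET test helpers
from metatrain.pet.tests.utils import DEFAULT_HYPERS


def test_model_with_modified_hypers():
    # Mutate a private copy, never the module-level DEFAULT_HYPERS
    hypers = copy.deepcopy(DEFAULT_HYPERS)
    hypers["model"]["cutoff"] = 4.0  # illustrative key
    ...
```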


📚 Documentation preview 📚: https://metatrain--882.org.readthedocs.build/en/882/

@pfebrer requested a review from abmazitov as a code owner on October 31, 2025, 16:02
@pfebrer force-pushed the fix_test_notfrozen branch from 6c5f766 to 060e9ae on October 31, 2025, 16:06
@pfebrer (Contributor, Author) commented Oct 31, 2025

Now that the tests are isolated, they fail 😆 If someone knows how to fix them, please go ahead.

@HaoZeke (Member) commented Nov 1, 2025

I did a little bit of digging into this... FWIW there are only two tests which need to be run in strict order; the rest can be run independently:

tox -e pet-tests -- src/metatrain/pet/tests/test_{finetuning,regression}.py -vvv -s

So test_finetuning must be run before test_regression, otherwise test_regression will fail. I couldn't figure out what exactly the finetuning tests mutate to cause this mess. FWIW, test_finetuning alone does work independently...

So maybe we need a more targeted fix than scattering deep copies, because that's a major performance hit.
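One way to pinpoint what exactly is being mutated would be an autouse guard fixture that snapshots the globals and fails the mutating test itself, rather than whichever test runs after it (a sketch; the module holding the hypers is an assumption):

```python
# conftest.py (sketch)
import copy

import pytest

from metatrain.pet.tests import utils  # hypothetical location of DEFAULT_HYPERS


@pytest.fixture(autouse=True)
def guard_default_hypers():
    """Fail the test that mutates DEFAULT_HYPERS, not a later victim test."""
    snapshot = copy.deepcopy(utils.DEFAULT_HYPERS)
    yield
    assert utils.DEFAULT_HYPERS == snapshot, "this test mutated DEFAULT_HYPERS"
```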

@pfebrer (Contributor, Author) commented Nov 1, 2025

Is deepcopying a very shallow dict really a "major performance hit"? I don't see how.

I think the fact that one unit test depends on having run some other tests in some other file, in a given order, is much more worrying...

@HaoZeke (Member) commented Nov 2, 2025

> Is deepcopying a very shallow dict really a "major performance hit"? I don't see how.
>
> I think the fact that one unit test depends on having run some other tests in some other file, in a given order, is much more worrying...

Oh, definitely, the order dependence is much worse. I just meant that the fix should probably be targeted at the two tests I pointed out above.

Edit: can't remember what it's called offhand, but we should use a test order randomizer.
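For example, either of the usual pytest plugins would do (a sketch; neither is currently in the pet-tests tox environment, so it would have to be added to its deps):

```console
# pytest-randomly shuffles tests by default once installed and prints the seed,
# so an order-dependent failure can be reproduced with --randomly-seed=<seed>
pip install pytest-randomly
pytest src/metatrain/pet/tests

# pytest-random-order is opt-in instead
pip install pytest-random-order
pytest src/metatrain/pet/tests --random-order
```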

@pfebrer (Contributor, Author) commented Nov 4, 2025

@abmazitov can you try to fix the PET tests (while keeping them isolated)? I don't know whether the PET tests were designed to be run in a given sequence (in which case we would need to rethink how we define that order) or whether it was just by pure chance that they were passing.

@abmazitov (Contributor) commented

What do you mean by "keeping them isolated"?

@pfebrer (Contributor, Author) commented Nov 4, 2025

That whether a test passes does not depend on having run some other test previously, unless that order is specified somehow. The isolation is already done: we just needed to deepcopy DEFAULT_HYPERS so that each test does not modify the global variable. Isolating them has now shown that some tests were passing just by chance, or because an implicit order was assumed. A fixture would make the isolation explicit instead of relying on every test remembering to deepcopy; see the sketch below.
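(A sketch of that fixture; the fixture name and import path are illustrative:)

```python
import copy

import pytest

from metatrain.pet.tests.utils import DEFAULT_HYPERS  # hypothetical import path


@pytest.fixture
def hypers():
    """Hand every test its own mutable copy of the default hypers."""
    return copy.deepcopy(DEFAULT_HYPERS)


def test_training_with_one_epoch(hypers):
    hypers["training"]["num_epochs"] = 1  # safe: only this test's copy changes
    ...
```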

@pfebrer (Contributor, Author) commented Nov 4, 2025

For full context: I was modifying functionality that is covered by a unit test. After the modification, that test still passes on its own, but not when I run the full suite, because state leaks from one test to another (through DEFAULT_HYPERS).
