Force OPf_PARENS on "if/elsif/unless" optree branches #23850

richardleach · 2025-10-15T01:05:02Z

Previously, only else {} branches would have the OPf_PARAMS flag set.

Perl_op_scope uses this flag to determine whether its optree argument (o)
should be wrapped in an ENTER/LEAVE pair or only get a SCOPE OP, which
is typically optimized away (nulled out) before runtime.

This has at least two consequences visible for Perl users:

Differing lifetimes for things depending upon whether they occur in an
if block or an else block. This could cause bugs that cannot be
understood from Perl source code alone.

For example, consider a Foo class that has a DESTROY sub. In the
following code, $object2 goes out of scope at the completion of the
else {} block and the DESTROY sub fires. In contrast, $object1
does NOT go out of scope at the completion of the if {} block -
because there is no scope - and the DESTROY sub won't fire until
some later time.

    if ($_) {
        my $object1 = Foo->new();
    } else {
        my $object2 = Foo->new();
    }

The NEXTSTATE OP immediately following a SCOPE OP is typically
nulled out before runtime, but the first NEXTSTATE after an
ENTER OP is not.

NEXTSTATE OPs update the interpreter with the line number associated
with the currently executing statement. (PL_curcop.) The interpreter
outputs this in warnings or fatal error messages. Not having the first
NEXTSTATE present in if blocks means that error messages triggered
by the first line of code will typically report an incorrect line
number.

This PR addresses the above two concerns, but with the downside
that if/elsif/unless blocks now have the same OP overhead as
else blocks. (The ENTER, first NEXTSTATE, and LEAVE OPs.)

This set of changes requires a perldelta entry, and I need help writing it.

Previously, only `else {}` branches would have the OPf_PARAMS flag set. `Perl_op_scope` uses this flag to determine whether its optree argument (`o`) should be wrapped in an `ENTER/LEAVE` pair or only get a `SCOPE` OP, which is typically optimized away (nulled out) before runtime. This has at least two consequences visible for Perl users: 1. Differing lifetimes for things depending upon whether they occur in an `if` block or an `else` block. This could cause bugs that cannot be understood from Perl source code alone. For example, consider a `Foo` class that has a `DESTROY` sub. In the following code, `$object2` goes out of scope at the completion of the `else {}` block and the `DESTROY` sub fires. In contrast, `$object1` does NOT go out of scope at the completion of the `if {}` block - because _there is no scope_ - and the `DESTROY` sub won't fire until some later time. ``` if ($_) { my $object1 = Foo->new(); } else { my $object2 = Foo->new(); } ``` 2. The `NEXTSTATE` OP immediately following a `SCOPE` OP is typically nulled out before runtime, but the first `NEXTSTATE` after an `ENTER` OP is not. `NEXTSTATE` OPs update the interpreter with the line number associated with the currently executing statement. (`PL_curcop`.) The interpreter outputs this in warnings or fatal error messages. Not having the first `NEXTSTATE` present in `if` blocks means that error messages triggered by the first line of code will typically report an incorrect line number. This commit addresses the above two concerns, but with the downside that `if`/`elsif`/`unless` blocks now have the same OP overhead as `else` blocks. (The `ENTER`, first `NEXTSTATE`, and `LEAVE` OPs.)

richardleach · 2025-10-15T08:49:47Z

Line number problems could also be fixed by not nulling out NEXTSTATE kids of SCOPE OPs, which would also fix additional "wrong line number" issues - such as ##8216 - but doing that alone and not this PR wouldn't fix the discrepancy in DESTROY behaviour, which I still think is a bug that should be fixed.

Something that we could do if this PR is merged and we fix the above is to teach Perl_op_scope to check whether the optree argument really needs an ENTER/LEAVE pair and to emit a SCOPE if not. Strategies might include:

Perl_op_scope scans a certain number of OPs in the optree (limited to prevent a noticeable slowdown in compilation) to see if there's anything that warrants ENTER/LEAVE.
Toggle an OP flag when adding an OP that needs an ENTER/LEAVE to a LINESEQ. Perl_op_scope might then just be able to read that flag and make its decision based on that. (This sounds like a nicer approach, but I have no idea how plausible it is!)

richardleach · 2025-10-15T09:30:45Z

teach Perl_op_scope to check whether the optree argument really needs an ENTER/LEAVE pair and to emit a SCOPE if not

It might be easier than I suggested above. 😆 Will look into that sometime this month.

richardleach · 2025-10-15T12:02:29Z

Hmmm, a better fix for the if/else scoping discrepancy might be:

diff --git a/perly.y b/perly.y
index 53d4279b98..64eb3a63b3 100644
--- a/perly.y
+++ b/perly.y
@@ -965,7 +965,6 @@ else
        :       empty
        |       KW_ELSE mblock
                        {
-                         ($mblock)->op_flags |= OPf_PARENS;
                          $$ = op_scope($mblock);
                        }
        |       KW_ELSIF PERLY_PAREN_OPEN mexpr PERLY_PAREN_CLOSE mblock else[else.recurse]
diff --git a/toke.c b/toke.c
index 3a3b1aa9e5..31c45a262b 100644
--- a/toke.c
+++ b/toke.c
@@ -8445,6 +8445,7 @@ yyl_word_or_keyword(pTHX_ char *s, STRLEN len, I32 key, I32 orig_keyword, struct
     case KEY_our:
     case KEY_my:
     case KEY_state:
+        PL_hints |= HINT_BLOCK_SCOPE;
         return yyl_my(aTHX_ s, key);
 
     case KEY_next:

That does nothing for the line number warnings though - actually means lines from the start of else blocks are more likely to be reported incorrectly! (That is arguably a distinct problem though.)

tonycoz · 2025-10-15T23:20:40Z

"Force OPf_PARAMS..."

in subject, commit message.

but you force OPf_PARENS (OPf_PARAMS isn't a thing)

richardleach · 2025-10-15T23:27:56Z

"Force OPf_PARAMS..."

in subject, commit message.

but you force OPf_PARENS (OPf_PARAMS isn't a thing)

D'oh. Too much thinking about parameters recently. Thanks for spotting it. I'm going to see if the alternative route works without breaking B, before continuing with this PR.

tonycoz · 2025-10-15T23:43:26Z

I like fixes, but I am worried it will silently change lifetimes and break downstream code.

Though it should really only depend on destruction side effects, code that depends on the object really needs its own reference will be prevent any early destruction.

richardleach · 2025-10-15T23:50:00Z

I like fixes, but I am worried it will silently change lifetimes and break downstream code.

Yeah, that's definitely a potential downside.

Any such code is already brittle; any minor refactoring that moves logic out of an if and into an else block, or vice-versa, will see a change to time-of-destruction. I wonder if any such code has already been made safe / had guardrails added as a result of the developer encountering this bug.

richardleach added 3 commits October 15, 2025 00:56

Test fixes (to squash)

6418a72

New tests (to squash)

064e1a0

This was linked to issues Oct 15, 2025

Inconsistent block scoping #22204

Open

caller's line number off by one on first line inside if with variable #23175

Open

Wrong line number for FILEHANDLE reported by Xref (Xref.pm V1.01) #7947

Open

This was linked to issues Oct 15, 2025

wrong line number in error message #12573

Open

caller(0) returns wrong line number when called from single-stmt else{...} #16872

Open

richardleach marked this pull request as draft October 15, 2025 11:56

richardleach changed the title ~~Force OPf_PARAMS on "if/elsif/unless" optree branches~~ Force OPf_PARENS on "if/elsif/unless" optree branches Oct 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Force OPf_PARENS on "if/elsif/unless" optree branches #23850

Force OPf_PARENS on "if/elsif/unless" optree branches #23850

richardleach commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

tonycoz commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

tonycoz commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Force OPf_PARENS on "if/elsif/unless" optree branches #23850

Are you sure you want to change the base?

Force OPf_PARENS on "if/elsif/unless" optree branches #23850

Conversation

richardleach commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

tonycoz commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

tonycoz commented Oct 15, 2025

Uh oh!

richardleach commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants