-
Notifications
You must be signed in to change notification settings - Fork 133
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
InlineMultistateSDFG
Destroys SSA Invariant
#1959
Comments
philip-paul-mueller
added a commit
to philip-paul-mueller/gt4py
that referenced
this issue
Mar 4, 2025
The DaCe issue is described [here](spcl/dace#1959), essentially running `InlineMultistateSDFG` destroys the SSA properties, i.e. that in any control path there is only one write to a transient. An example for this can be found in `TestFusedSolveNonhydroStencil39To40` an ICON4Py test. It was noticed that `InlineSDFG`, which only handles nested SDFGs with a single state is not affected (because it does not create additional states). The fix, which has to be revisited later, is to run the `InlineSDFG` transformation. Note that we still allow and perform multi state inlining, because we need it. Furthermore, another help is once we have the chain remover for `concat_where`.
philip-paul-mueller
added a commit
to GridTools/gt4py
that referenced
this issue
Mar 10, 2025
This PR adds the `CopyChainRemover` transformation. The introduction of `concat_where` introduced a new pattern, which is essentially a chain of copies. As an example imagine the case that a domain is split into 3 subdomain. The result on the first subdomain is stored in `T1`, the one of the second in `T2` and the one for the third domain in `T3`. `T1` and `T2` are then copied into `T4`, finally `T4` together with `T3` are then copied into `T5`. This transformation will remove `T1`, `T2`, `T3` and `T4` thus the results will be written into `T5` directly. There are some limitation, if we have the pattern `(A1) -> (A2)`, then we eliminate `A1` only if: - `A1` is fully read; this is to avoid some nasty adjustments of map bounds. - There can only be one connection between `A1` and `A2`. The transformation was added twice to the simplify pass, which allows us to mitigate DaCe [issue#1959](spcl/dace#1959). --------- Co-authored-by: edopao <edoardo16@gmail.com>
philip-paul-mueller
added a commit
to philip-paul-mueller/gt4py
that referenced
this issue
Mar 19, 2025
This [issue](spcl/dace#1959) leaded to some problem, especially in ICON4Py stencil `39_to_40`. However, through an [update](GridTools#1915) in GT4Py this leads to now to other problems (`ConstantPropagation` involving AcccessNodes) in `19_to_20`. The underlying issue in DaCe was solved in [PR#1980](spcl/dace#1980) and this PR now deactivate the workaround. Only merge if the DaCe dependency in GT4Py was updated to include that fix.
github-merge-queue bot
pushed a commit
that referenced
this issue
Mar 21, 2025
This PR addresses [issue#1959](#1959), the issue was about that the inlining transformation destroys the SSA invariant we maintain inside GT4Py. This invariant states that in every code path there is exactly one AccessNode with that is used to write to a data container. The PR solves this issue by modifying how the nested SDFG is isolated, before this was done in two steps and is now performed in a single step. The main idea is that the number of writes is preserved.
philip-paul-mueller
added a commit
to GridTools/gt4py
that referenced
this issue
Mar 24, 2025
This [issue](spcl/dace#1959) leaded to some problem, especially in ICON4Py stencil `39_to_40`. However, through an [update](#1915) in GT4Py this leads to now to other problems (`ConstantPropagation` involving AcccessNodes) in `19_to_20`. The underlying issue in DaCe was solved in [PR#1980](spcl/dace#1980) and this PR now deactivate the workaround. It also updates the version of DaCe.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
This is not a bug per se, as the resulting SDFGs are correct, however, the
InlineMultistateSDFG
transformation destroys the SSA invariant of SDFGs, which is a less than optimal behaviour.Consider the following SDFG:
The first nested SDFG writes into
T1
which is then written intoT3[0:a]
, the second nested SDFG writes its output intoT2
which is then copied intoT3[a:N]
.T3
is then finally written into the global memoryG
.As you can see al transients are written to exactly once, for certain reason (I am looking at you FORTRAN) we can not enforce this on global data, but this is not the issue here.
After we run
InlineMultistateSDFG
we get the following SDFG:As you can see now the writing of the output, i.e.
{(T1), (T2)} -> (T3) -> (G)
is now split and put into different states.Thus the transients are no longer SSA.
Since SSA is a very nice and useful property I would think/expect/hope that the transformation would maintain this invariant.
Thus from this perspective it is a bug.
The text was updated successfully, but these errors were encountered: