A function similar to ForAllThunksMutable, but capable of wrapping or replacing nested thunks with new ones. In case the thunk has some specific requirements (e.g. assumes the directly nested thunk is a `SequentialThunk`), the implementation needs to ensure the assumptions still hold. The transform function must be infallible, but it may be a no-op and return the original `unique_ptr<Thunk>`.
The use for this is in buffer checksumming: to cover all thunks, we need to recursively insert pre-/post-execution checksum thunks around execution of nested thunks. Unfortunately, ForAllThunksMutable doesn't allow mutating the thunk's *container*, which would create the need to switch() over thunk kind, special-case any thunk that may contain nested thunks, and expose the internals to the caller to reimplement the recursion.
PiperOrigin-RevId: 826014413
Imported from GitHub PR https://github.com/openxla/xla/pull/33188📝 Summary of Changes
Fix tests when executing on rbe
🎯 Justification
Fix hermetic dependencies so right libs are loaded.
Libs are put as a data dependency as many of the rocm libs may have their own
RPATH=$ORIGIN/..., they will try to load the libs from the place where they are.
This leads to a situation that one lib located in _solid_data dir will try to find its
dependencies in the same dir. However dependency might be located in the data
directory. Then either test can't load the lib or tries to load one from the system libs.
🚀 Kind of Contribution
Please remove what does not apply: 🐛 Bug Fix
📊 Benchmark (for Performance Improvements)
Build fix
🧪 Unit Tests:
Build fix
🧪 Execution Tests:
Build fix
Copybara import of the project:
--
43913bfbacc042c7a81e692cfa28e0a0733c3989 by Alexandros Theodoridis <atheodor@amd.com>:
Add minimal hip runtime deps to kernel_headers
--
ffb62f536608b00445fda4fce0a3323c5bf6fe9d by Alexandros Theodoridis <atheodor@amd.com>:
Fix hermetic build invalid local rpath
Merging this change closes#33188
PiperOrigin-RevId: 826014075
This change allows broadcast and reshape operations on the scale operands of `scaled-dot` to be fused into the Triton kernel. It generalizes the operand fusion logic to handle all four operands of `scaled-dot` and adds support for `BroadcastOp` and `ExpandDimsOp` in the Triton MLIR conversion. A new test case is added to verify this fusion.
PiperOrigin-RevId: 826012089
Currently it is assumed that GetTupleElement is never the root of a
computation. That assumption is not necessarily true, e.g. during autotuning of
Cublas Gemm calls we can have a GetTupleElement op as root.
PiperOrigin-RevId: 825932301
* Extend alternate memory chunk reservations for aliased uses.
* Add pinned allocations in alternate memory for aliased uses.
* Mark all aliased allocations as colocated.
Pin all values aliased with the prefetched source value to default memory.
PiperOrigin-RevId: 825801181
It is pure mechanical move cl.
The goal is to use the pass for all the buffer debug checks. We have checksum and nan_counter kernels at the moment.
PiperOrigin-RevId: 825602375
Update the production code to not set the `*_allocated_bytes` fields
when using address/memory/thread sanitizer, since those sanitizers don't
support `mallinfo()` -- `mallinfo()` may return invalid results when sanitizers
are enabled .
Update test to not expect the `*_allocated_bytes` fields to be valid when using
address/memory/thread sanitizer.
Also update docs in `memory_info.h` to make it clearer that even if `isSupported()`
returns true, it is possible that only _some_ of the fields are supported.
Also document that `memory_info.h` supports Windows; this was already clear from
other parts of the header file that described the behaviour on Windows.
PiperOrigin-RevId: 825505016
Imported from GitHub PR https://github.com/openxla/xla/pull/33085📝 Summary of Changes
Fix too strict spawn strategy for rbe builds
🎯 Justification
remote only execution is not possible for all the tests
🚀 Kind of Contribution
Please remove what does not apply: 🐛 Bug Fix
📊 Benchmark (for Performance Improvements)
Not relevant
🧪 Unit Tests:
Not relevant
🧪 Execution Tests:
Not relevant
Copybara import of the project:
--
df73e6e006c47d5ada1e14ced8f2ae94c0df7dd8 by Alexandros Theodoridis <atheodor@amd.com>:
Fix too strict default spanw strategy for rbe builds
Merging this change closes#33085
PiperOrigin-RevId: 825463234
Encoding extra metadata about an debug log entry within its ID limits how much
information we can pass. To remove the limitation without the need to pass
extra data between host and device, introduce a metadata store that provides a
opaque ID -> metadata mapping.
Follow up patches will make checksum/NaN tracing use
BufferDebugLogEntryMetadataStore shared between all thunks that operate on
BufferDebugLog:
- BuffersChecksumThunks put the metadata into the store and use the returned
entry_ids to identify the checksums from BufferDebugLog,
- xla_gpu_buffer_debug_log_dump reads the BufferDebugLog and uses the store to
resolve the entry_ids into the metadata.
PiperOrigin-RevId: 825462635
Imported from GitHub PR https://github.com/openxla/xla/pull/33141
Bumps [github/codeql-action](https://github.com/github/codeql-action) from 4.30.9 to 4.31.0.
<details>
<summary>Release notes</summary>
<p><em>Sourced from <a href="https://github.com/github/codeql-action/releases">github/codeql-action's releases</a>.</em></p>
<blockquote>
<h2>v4.31.0</h2>
<h1>CodeQL Action Changelog</h1>
<p>See the <a href="https://github.com/github/codeql-action/releases">releases page</a> for the relevant changes to the CodeQL CLI and language packs.</p>
<h2>4.31.0 - 24 Oct 2025</h2>
<ul>
<li>Bump minimum CodeQL bundle version to 2.17.6. <a href="https://redirect.github.com/github/codeql-action/pull/3223">#3223</a></li>
<li>When SARIF files are uploaded by the <code>analyze</code> or <code>upload-sarif</code> actions, the CodeQL Action automatically performs post-processing steps to prepare the data for the upload. Previously, these post-processing steps were only performed before an upload took place. We are now changing this so that the post-processing steps will always be performed, even when the SARIF files are not uploaded. This does not change anything for the <code>upload-sarif</code> action. For <code>analyze</code>, this may affect Advanced Setup for CodeQL users who specify a value other than <code>always</code> for the <code>upload</code> input. <a href="https://redirect.github.com/github/codeql-action/pull/3222">#3222</a></li>
</ul>
<p>See the full <a href="https://github.com/github/codeql-action/blob/v4.31.0/CHANGELOG.md">CHANGELOG.md</a> for more information.</p>
</blockquote>
</details>
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a href="https://github.com/github/codeql-action/blob/main/CHANGELOG.md">github/codeql-action's changelog</a>.</em></p>
<blockquote>
<h1>CodeQL Action Changelog</h1>
<p>See the <a href="https://github.com/github/codeql-action/releases">releases page</a> for the relevant changes to the CodeQL CLI and language packs.</p>
<h2>[UNRELEASED]</h2>
<p>No user facing changes.</p>
<h2>4.31.0 - 24 Oct 2025</h2>
<ul>
<li>Bump minimum CodeQL bundle version to 2.17.6. <a href="https://redirect.github.com/github/codeql-action/pull/3223">#3223</a></li>
<li>When SARIF files are uploaded by the <code>analyze</code> or <code>upload-sarif</code> actions, the CodeQL Action automatically performs post-processing steps to prepare the data for the upload. Previously, these post-processing steps were only performed before an upload took place. We are now changing this so that the post-processing steps will always be performed, even when the SARIF files are not uploaded. This does not change anything for the <code>upload-sarif</code> action. For <code>analyze</code>, this may affect Advanced Setup for CodeQL users who specify a value other than <code>always</code> for the <code>upload</code> input. <a href="https://redirect.github.com/github/codeql-action/pull/3222">#3222</a></li>
</ul>
<h2>4.30.9 - 17 Oct 2025</h2>
<ul>
<li>Update default CodeQL bundle version to 2.23.3. <a href="https://redirect.github.com/github/codeql-action/pull/3205">#3205</a></li>
<li>Experimental: A new <code>setup-codeql</code> action has been added which is similar to <code>init</code>, except it only installs the CodeQL CLI and does not initialize a database. Do not use this in production as it is part of an internal experiment and subject to change at any time. <a href="https://redirect.github.com/github/codeql-action/pull/3204">#3204</a></li>
</ul>
<h2>4.30.8 - 10 Oct 2025</h2>
<p>No user facing changes.</p>
<h2>4.30.7 - 06 Oct 2025</h2>
<ul>
<li>[v4+ only] The CodeQL Action now runs on Node.js v24. <a href="https://redirect.github.com/github/codeql-action/pull/3169">#3169</a></li>
</ul>
<h2>3.30.6 - 02 Oct 2025</h2>
<ul>
<li>Update default CodeQL bundle version to 2.23.2. <a href="https://redirect.github.com/github/codeql-action/pull/3168">#3168</a></li>
</ul>
<h2>3.30.5 - 26 Sep 2025</h2>
<ul>
<li>We fixed a bug that was introduced in <code>3.30.4</code> with <code>upload-sarif</code> which resulted in files without a <code>.sarif</code> extension not getting uploaded. <a href="https://redirect.github.com/github/codeql-action/pull/3160">#3160</a></li>
</ul>
<h2>3.30.4 - 25 Sep 2025</h2>
<ul>
<li>We have improved the CodeQL Action's ability to validate that the workflow it is used in does not use different versions of the CodeQL Action for different workflow steps. Mixing different versions of the CodeQL Action in the same workflow is unsupported and can lead to unpredictable results. A warning will now be emitted from the <code>codeql-action/init</code> step if different versions of the CodeQL Action are detected in the workflow file. Additionally, an error will now be thrown by the other CodeQL Action steps if they load a configuration file that was generated by a different version of the <code>codeql-action/init</code> step. <a href="https://redirect.github.com/github/codeql-action/pull/3099">#3099</a> and <a href="https://redirect.github.com/github/codeql-action/pull/3100">#3100</a></li>
<li>We added support for reducing the size of dependency caches for Java analyses, which will reduce cache usage and speed up workflows. This will be enabled automatically at a later time. <a href="https://redirect.github.com/github/codeql-action/pull/3107">#3107</a></li>
<li>You can now run the latest CodeQL nightly bundle by passing <code>tools: nightly</code> to the <code>init</code> action. In general, the nightly bundle is unstable and we only recommend running it when directed by GitHub staff. <a href="https://redirect.github.com/github/codeql-action/pull/3130">#3130</a></li>
<li>Update default CodeQL bundle version to 2.23.1. <a href="https://redirect.github.com/github/codeql-action/pull/3118">#3118</a></li>
</ul>
<h2>3.30.3 - 10 Sep 2025</h2>
<p>No user facing changes.</p>
<h2>3.30.2 - 09 Sep 2025</h2>
<ul>
<li>Fixed a bug which could cause language autodetection to fail. <a href="https://redirect.github.com/github/codeql-action/pull/3084">#3084</a></li>
<li>Experimental: The <code>quality-queries</code> input that was added in <code>3.29.2</code> as part of an internal experiment is now deprecated and will be removed in an upcoming version of the CodeQL Action. It has been superseded by a new <code>analysis-kinds</code> input, which is part of the same internal experiment. Do not use this in production as it is subject to change at any time. <a href="https://redirect.github.com/github/codeql-action/pull/3064">#3064</a></li>
</ul>
<!-- raw HTML omitted -->
</blockquote>
<p>... (truncated)</p>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a href="4e94bd11f7"><code>4e94bd1</code></a> Merge pull request <a href="https://redirect.github.com/github/codeql-action/issues/3235">#3235</a> from github/update-v4.31.0-1d36546c1</li>
<li><a href="8f11182164"><code>8f11182</code></a> Update changelog for v4.31.0</li>
<li><a href="1d36546c14"><code>1d36546</code></a> Merge pull request <a href="https://redirect.github.com/github/codeql-action/issues/3234">#3234</a> from github/mbg/changelog/post-processing</li>
<li><a href="08ada26e6a"><code>08ada26</code></a> Add changelog entry for post-processing change</li>
<li><a href="b843cbeed0"><code>b843cbe</code></a> Merge pull request <a href="https://redirect.github.com/github/codeql-action/issues/3233">#3233</a> from github/mbg/getOptionalEnvVar</li>
<li><a href="1ecd563919"><code>1ecd563</code></a> Use <code>getOptionalEnvVar</code> in <code>writePostProcessedFiles</code></li>
<li><a href="e576807920"><code>e576807</code></a> Merge pull request <a href="https://redirect.github.com/github/codeql-action/issues/3223">#3223</a> from github/henrymercer/bump-minimum</li>
<li><a href="ad35676669"><code>ad35676</code></a> Add <code>getOptionalEnvVar</code> function</li>
<li><a href="d75645b13f"><code>d75645b</code></a> Merge pull request <a href="https://redirect.github.com/github/codeql-action/issues/3222">#3222</a> from github/mbg/upload-lib/post-process</li>
<li><a href="710606cc35"><code>710606c</code></a> Check that <code>outputPath</code> is non-empty</li>
<li>Additional commits viewable in <a href="16140ae1a1...4e94bd11f7">compare view</a></li>
</ul>
</details>
<br />
[](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores)
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`.
[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)
---
<details>
<summary>Dependabot commands and options</summary>
<br />
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
</details>
Copybara import of the project:
--
cbe7908eed34d441708d7360f23dad04e5b48ee1 by dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>:
Bump github/codeql-action from 4.30.9 to 4.31.0
Bumps [github/codeql-action](https://github.com/github/codeql-action) from 4.30.9 to 4.31.0.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](16140ae1a1...4e94bd11f7)
---
updated-dependencies:
- dependency-name: github/codeql-action
dependency-version: 4.31.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Merging this change closes#33141
PiperOrigin-RevId: 825449574
Imported from GitHub PR https://github.com/openxla/xla/pull/33205📝 Summary of Changes
The simplification was unintentionally disabled in
2accf052cb.
🎯 Justification
Bug fix.
🚀 Kind of Contribution
🐛 Bug Fix, ⚡️ Performance Improvement
📊 Benchmark (for Performance Improvements)
No
🧪 Unit Tests:
Yes.
🧪 Execution Tests:
No.
Copybara import of the project:
--
2fb682c10ff49212044dd995ba97aa329e52bb71 by Ilia Sergachev <isergachev@nvidia.com>:
[GPU] Fix reduce-precision simplification.
Was unintentionally disabled in
2accf052cb.
Merging this change closes#33205
PiperOrigin-RevId: 825449067