Highlights
- Easier training on Cloud TPUs with TorchPrime
- A new Pallas-based kernel for ragged paged attention, enabling further optimizations on vLLM TPU (#8791)
- Usability improvements
- Experimental interoperability with JAX operations (#8781, #8789, #8830, #8878); see the sketch after this list
- Re-enabled the GPU CI build (#8593)
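The JAX interoperability work is still experimental and its surface may change. The sketch below is a hedged illustration only: the `xb.call_jax` entry point and its signature are assumptions here, not a documented, stable API.

```python
# Hedged sketch of the experimental JAX interop (#8781, #8789, #8830, #8878).
# The `xb.call_jax` bridge and its signature are assumptions, not a stable API.
import jax.numpy as jnp
import torch
import torch_xla
import torch_xla.core.xla_builder as xb


def jax_fn(a, b):
    # A pure JAX computation to be embedded into the XLA graph.
    return jnp.sin(a) + jnp.cos(b)


device = torch_xla.device()
a = torch.randn(4, 4, device=device)
b = torch.randn(4, 4, device=device)

# Assumed bridge: trace the JAX function and call it on XLA tensors.
c = xb.call_jax(jax_fn, (a, b))
torch_xla.sync()  # materialize the pending computation
print(c.shape)
```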
Stable Features
- Operator lowering
  - Lower `as_strided_copy` to use a fast path with `slice` (#8374)
  - Lower `_conj_copy` (#8686)
- Support splitting a physical axis in the SPMD mesh (#8698); see the sharding sketch after this list
- Support placeholder tensors (#8785)
- Dynamo/AOTAutograd traceable flash attention (#8654)
- C++11 ABI builds are now the default
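As a quick illustration of SPMD mesh construction and sharding, here is a minimal sketch; the mesh shape, axis names, and partition spec below are illustrative choices, and splitting a physical axis (#8698) goes through the same `Mesh` interface.

```python
# Minimal SPMD sharding sketch; mesh shape and axis names are illustrative.
import numpy as np
import torch
import torch_xla
import torch_xla.runtime as xr
import torch_xla.distributed.spmd as xs

xr.use_spmd()  # enable SPMD execution mode

num_devices = xr.global_runtime_device_count()
# Build a 2D logical mesh over all devices, e.g. ("data", "model").
mesh = xs.Mesh(np.arange(num_devices), (num_devices, 1), ("data", "model"))

t = torch.randn(8, 128, device=torch_xla.device())
# Shard dim 0 across the "data" axis; dim 1 maps to the trivial "model" axis.
xs.mark_sharding(t, mesh, ("data", "model"))
```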
Experimental Features
- Gated Recurrent Unit (GRU) implemented with `scan` (#8777); see the `scan` sketch after this list
- Introduce `apply_xla_patch_to_nn_linear` to improve `einsum` performance (#8793)
- Support splitting a physical axis in the SPMD mesh (#8698)
- Enable default buffer donation for step barriers (#8721, #8982)
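The scan-based GRU (#8777) builds on the `scan` primitive in `torch_xla.experimental.scan`. The sketch below shows `scan` itself on a trivial cumulative sum, not the GRU implementation; it is a minimal illustration under that assumption.

```python
# Minimal sketch of torch_xla's scan primitive (the building block behind the
# scan-based GRU in #8777). This computes a cumulative sum, not a GRU.
import torch
import torch_xla
from torch_xla.experimental.scan import scan


def step(carry, x):
    # carry: running sum; x: current slice along the leading dimension.
    new_carry = carry + x
    return new_carry, new_carry  # (next carry, per-step output)


device = torch_xla.device()
init = torch.zeros(4, device=device)
xs = torch.ones(10, 4, device=device)  # scanned over the leading dimension

final_carry, ys = scan(step, init, xs)
torch_xla.sync()
print(final_carry)  # tensor of 10s
print(ys.shape)     # (10, 4)
```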
Usability
- Better profiling control: the start and end of a profiling session can now be controlled through the new profiler API (#8743); see the profiling sketch after this list
- API to query the number of cached compilation graphs (#8822)
- Improved host-to-device transfers (#8849)
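For context, here is a minimal profiling sketch using the long-standing profiler server and trace annotations in `torch_xla.debug.profiler`; the new explicit session start/stop control from #8743 lives in the same module, and its exact entry points are not shown here.

```python
# Hedged profiling sketch using the existing profiler API; the explicit
# session start/stop control added in #8743 builds on the same module.
import torch
import torch_xla
import torch_xla.debug.profiler as xp

# Start the profiler server so a capture client (e.g. TensorBoard) can attach.
server = xp.start_server(9012)

device = torch_xla.device()
x = torch.randn(1024, 1024, device=device)

for step in range(3):
    with xp.StepTrace("train_step", step_num=step):
        with xp.Trace("matmul"):
            x = x @ x
torch_xla.sync()
```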
Bug fixes
- Fix a bug in `tensor.flatten` (#8680)
- Fix `cummax` reduction over 0-sized dimensions (#8653)
- Fix a dk/dv autograd error in TPU flash attention (#8685); see the flash attention sketch after this list
- Fix a bug in flash attention where `kv_seq_len` should divide `block_k_major` (#8671)
- [scan] Ensure inputs into `fn` are not `device_data` IR nodes (#8769)
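Several of these fixes touch the TPU flash attention kernel. Below is a minimal, hedged usage sketch; shapes are chosen so the sequence length is compatible with the kernel's block sizes, and exact signature details may vary by version.

```python
# Hedged sketch of calling the TPU flash attention kernel touched by the
# fixes above. seq_len is a multiple of the kernel's block sizes.
import torch
import torch_xla
from torch_xla.experimental.custom_kernel import flash_attention

device = torch_xla.device()
batch, heads, seq_len, head_dim = 2, 8, 1024, 128

q = torch.randn(batch, heads, seq_len, head_dim, device=device, requires_grad=True)
k = torch.randn(batch, heads, seq_len, head_dim, device=device, requires_grad=True)
v = torch.randn(batch, heads, seq_len, head_dim, device=device, requires_grad=True)

out = flash_attention(q, k, v, causal=True)
out.sum().backward()  # exercises the dk/dv autograd path fixed in #8685
torch_xla.sync()
```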
Libtpu stable version
- Pin the 2.7 release to stable libtpu version `0.0.11.1`
Deprecations
- Deprecate `torch.export`; instead, use torchax to export graphs to StableHLO for full dynamism support
- Remove `torch_xla.core.xla_model.xrt_world_size`; replace with `torch_xla.runtime.world_size`
- Remove `torch_xla.core.xla_model.get_ordinal`; replace with `torch_xla.runtime.global_ordinal`
- Remove `torch_xla.core.xla_model.parse_xla_device`; replace with `_utils.parse_xla_device`
- Remove `torch_xla.experimental.compile`; replace with `torch_xla.compile` (see the migration sketch after this list)
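A minimal migration sketch for the removed and renamed helpers listed above, assuming a single-process run (the reported values are per-process under a multiprocessing launcher):

```python
# Migration sketch for the removed xla_model helpers and experimental.compile.
import torch
import torch_xla
import torch_xla.runtime as xr

# torch_xla.core.xla_model.xrt_world_size()  ->  torch_xla.runtime.world_size()
# torch_xla.core.xla_model.get_ordinal()     ->  torch_xla.runtime.global_ordinal()
print("world size:", xr.world_size())
print("global ordinal:", xr.global_ordinal())


def step_fn(x):
    return (x * 2).sum()


# torch_xla.experimental.compile  ->  torch_xla.compile
compiled_step = torch_xla.compile(step_fn)
y = compiled_step(torch.ones(4, device=torch_xla.device()))
print(y)
```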