Download Latest Version arm64-v8a-android-1.13.4.tgz (675.6 MB)
Email in envelope

Get an email when there's a new version of MegEngine

Home / v1.13.4
Name Modified Size InfoDownloads / Week
Parent folder
arm64-v8a-android-1.13.4.tgz 2024-04-30 675.6 MB
arm64-v8a-linux-1.13.4.tgz 2024-04-30 684.8 MB
armeabi-v7a-hardfp-linux-1.13.4.tgz 2024-04-30 588.3 MB
armeabi-v7a-softfp-linux-1.13.4.tgz 2024-04-30 588.2 MB
armeabi-v7a-android-1.13.4.tgz 2024-04-30 590.1 MB
MegEngine v1.13.4 source code.tar.gz 2024-04-11 8.0 MB
MegEngine v1.13.4 source code.zip 2024-04-11 11.4 MB
README.md 2024-04-11 1.9 kB
Totals: 8 Items   3.1 GB 4

MegBrain

Bug fixes

通用组件

  • 修复 dump 开启 CD4 + FP16 时 clip 阶段图优化异常, MIN op 相关 bug 导致 dump 出错的问题
  • 修复 megengine tensor 类型为 bool 时 index 操作未能正确定位地址的问题

XLA

  • 修复多机训练时,device 设置错误的问题

CUDA

  • 修复由于缺少一个 void ** 的强制转换而引发无法通过编译的问题。

New Features

Python API

  • 添加 FillPoly 算子
  • 增加 erf 接口

CUDA

  • 增加对 Hopper 系列 GPU 的支持

通用组件

  • 修复在 io16xc32 模式下 reduce 算子无法执行的问题

XLA

  • XLA 增加对 FP16 数据类型的支持
  • 新增支持 xla 打包的脚本,自 v8.20.3(包含)及以后可以用以下方式安装 xla: megbrain[xla]==8.20.3+cu111

Dataloader

  • Dataloader 支持 cuda 数据转换

Bug fixes

Common components

  • Fixed the issue where the clip stage diagram optimization was abnormal when CD4 + FP16 was turned on for dump, and MIN op related bugs caused dump errors.
  • Fix the problem that the index operation fails to correctly locate the address when the megengine tensor type is bool.

XLA

  • Fixed the problem of incorrect device settings during multi-machine training.

CUDA

  • Fixed the problem of failing to compile due to the lack of a void ** cast.

New Features

Python API

  • Add FillPoly operator.
  • Add erf interface.

CUDA

  • Add support for Hopper series GPUs.

Common components

  • Fix the problem that reduce operator cannot be executed in io16xc32 mode.

XLA

  • XLA adds support for FP16 data type.
  • Added scripts that support xla packaging. From v8.20.3 (included) and later, xla can be installed in the following way: megbrain[xla]==8.20.3+cu111.

Dataloader

  • Dataloader supports cuda data conversion.
Source: README.md, updated 2024-04-11