| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| arm64-v8a-android-1.13.4.tgz | 2024-04-30 | 675.6 MB | |
| arm64-v8a-linux-1.13.4.tgz | 2024-04-30 | 684.8 MB | |
| armeabi-v7a-hardfp-linux-1.13.4.tgz | 2024-04-30 | 588.3 MB | |
| armeabi-v7a-softfp-linux-1.13.4.tgz | 2024-04-30 | 588.2 MB | |
| armeabi-v7a-android-1.13.4.tgz | 2024-04-30 | 590.1 MB | |
| MegEngine v1.13.4 source code.tar.gz | 2024-04-11 | 8.0 MB | |
| MegEngine v1.13.4 source code.zip | 2024-04-11 | 11.4 MB | |
| README.md | 2024-04-11 | 1.9 kB | |
| Totals: 8 Items | 3.1 GB | 4 | |
MegBrain
Bug fixes
通用组件
- 修复 dump 开启 CD4 + FP16 时 clip 阶段图优化异常, MIN op 相关 bug 导致 dump 出错的问题
- 修复 megengine tensor 类型为 bool 时 index 操作未能正确定位地址的问题
XLA
- 修复多机训练时,device 设置错误的问题
CUDA
- 修复由于缺少一个 void ** 的强制转换而引发无法通过编译的问题。
New Features
Python API
- 添加 FillPoly 算子
- 增加 erf 接口
CUDA
- 增加对 Hopper 系列 GPU 的支持
通用组件
- 修复在 io16xc32 模式下 reduce 算子无法执行的问题
XLA
- XLA 增加对 FP16 数据类型的支持
- 新增支持 xla 打包的脚本,自 v8.20.3(包含)及以后可以用以下方式安装 xla: megbrain[xla]==8.20.3+cu111
Dataloader
- Dataloader 支持 cuda 数据转换
Bug fixes
Common components
- Fixed the issue where the clip stage diagram optimization was abnormal when CD4 + FP16 was turned on for dump, and MIN op related bugs caused dump errors.
- Fix the problem that the index operation fails to correctly locate the address when the megengine tensor type is bool.
XLA
- Fixed the problem of incorrect device settings during multi-machine training.
CUDA
- Fixed the problem of failing to compile due to the lack of a void ** cast.
New Features
Python API
- Add FillPoly operator.
- Add erf interface.
CUDA
- Add support for Hopper series GPUs.
Common components
- Fix the problem that reduce operator cannot be executed in io16xc32 mode.
XLA
- XLA adds support for FP16 data type.
- Added scripts that support xla packaging. From v8.20.3 (included) and later, xla can be installed in the following way: megbrain[xla]==8.20.3+cu111.
Dataloader
- Dataloader supports cuda data conversion.