MegEngine Files

Easy-to-use deep learning framework with 3 key features

This is an exact mirror of the MegEngine project, hosted at https://github.com/MegEngine/MegEngine. SourceForge is not affiliated with MegEngine. For more information, see the SourceForge Open Source Mirror Directory.

Name	Modified	Size	Downloads / Week
arm64-v8a-android-1.13.4.tgz	2024-04-30	675.6 MB	3
arm64-v8a-linux-1.13.4.tgz	2024-04-30	684.8 MB	1
armeabi-v7a-hardfp-linux-1.13.4.tgz	2024-04-30	588.3 MB	0
armeabi-v7a-softfp-linux-1.13.4.tgz	2024-04-30	588.2 MB	0
armeabi-v7a-android-1.13.4.tgz	2024-04-30	590.1 MB	0
MegEngine v1.13.4 source code.tar.gz	2024-04-11	8.0 MB	0
MegEngine v1.13.4 source code.zip	2024-04-11	11.4 MB	0
README.md	2024-04-11	1.9 kB	0
Totals: 8 Items		3.1 GB	4

MegBrain

Bug fixes

Common components

Fixed dump errors when CD4 + FP16 is enabled, caused by abnormal graph optimization in the clip stage and by a bug related to the MIN op.
Fixed index operations failing to locate the correct address when a megengine tensor has type bool.

XLA

Fixed incorrect device settings during multi-machine training.

CUDA

Fixed a compilation failure caused by a missing void ** cast.

New Features

Python API

Added the FillPoly operator.
Added the erf interface.
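For context on why an erf interface matters in a deep learning framework: the exact GELU activation is defined through the error function, GELU(x) = 0.5 * x * (1 + erf(x / sqrt(2))). The sketch below uses Python's standard-library math.erf on scalars, not MegEngine's own erf interface, whose exact name and signature these notes do not state. The helper name gelu_exact is hypothetical.

```python
import math


def gelu_exact(x: float) -> float:
    """Exact GELU of a scalar, written in terms of the error function.

    Uses math.erf from the standard library as a stand-in; this is not
    MegEngine's erf interface (an assumption made for illustration only).
    """
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


if __name__ == "__main__":
    for value in (-1.0, 0.0, 1.0):
        print(value, gelu_exact(value))
```

A tensor-level erf would apply the same function elementwise, which is what makes it a useful building block for activations of this kind.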

CUDA

Added support for Hopper series GPUs.

Common components

Fixed the reduce operator failing to execute in io16xc32 mode.
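As a rough illustration of why a reduce operator benefits from this mode: io16xc32 is read here, following its name, as FP16 inputs and outputs with FP32 internal computation. The sketch below emulates that in pure Python by rounding through the struct module's half and single precision formats. It is not MegEngine code, and the helper names (to_fp16, to_fp32, reduce_sum_io16xc32, reduce_sum_fp16) are hypothetical.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_fp32(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def reduce_sum_io16xc32(values):
    """Sum with FP16 inputs and output but an FP32 accumulator."""
    acc = 0.0
    for v in values:
        # Inputs arrive as FP16; the running sum is kept in FP32.
        acc = to_fp32(acc + to_fp16(v))
    # The result is written back as FP16.
    return to_fp16(acc)


def reduce_sum_fp16(values):
    """Sum where the accumulator itself is also rounded to FP16."""
    acc = 0.0
    for v in values:
        acc = to_fp16(acc + to_fp16(v))
    return acc


if __name__ == "__main__":
    ones = [1.0] * 4096
    print(reduce_sum_io16xc32(ones), reduce_sum_fp16(ones))
```

With 4096 ones, the pure FP16 accumulator stalls at 2048 because 2049 is not representable in half precision, while the FP32 accumulator reaches 4096 and converts back to FP16 exactly.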

XLA

Added FP16 data type support to XLA.
Added a script for packaging xla. Starting with v8.20.3 (inclusive), xla can be installed with the following requirement: megbrain[xla]==8.20.3+cu111.

Dataloader

Dataloader now supports CUDA data conversion.

Source: README.md, updated 2024-04-11
