Curated Transformers Files

PyTorch library of curated Transformer models and their components

This is an exact mirror of the Curated Transformers project, hosted at https://github.com/explosion/curated-transformers. SourceForge is not affiliated with Curated Transformers. For more information, see the SourceForge Open Source Mirror Directory.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2024-04-16	972 Bytes	0
v2.0.0 (Superposition) source code.tar.gz	2024-04-16	373.6 kB	0
v2.0.0 (Superposition) source code.zip	2024-04-16	489.0 kB	1
Totals: 3 Items		863.6 kB	1

✨ New features and improvements

Register models using catalogue to support external models in Auto{Decoder,Encoder,CausalLM} (#351, [#352]).
Add support for loading parameters in-place (#370).
Support for ELECTRA models (#358).
Add support for write/upload operations with HFHubRepository (#354).
Add support for converting Curated Transformer configs to HF-compatible configs (#333).

🔴 Bug fixes

Support PyTorch 2.2 (#360).

⚠️ Backwards incompatibilities

Support for TorchScript tracing is removed (#361).
The qkv_split argument is now mandatory for AttentionHeads, AttentionHeads.uniform, AttentionHeads.multi_query, and AttentionHeads.key_value_broadcast (#374).
All FromHFHub mixins are renamed to FromHF (#374).
FromHF.convert_hf_state_dict is removed in favor of FromHF.state_dict_from_hf (#374).

👥 Contributors

@danieldk, @honnibal, @ines, @KennethEnevoldsen, @shadeMe

Source: README.md, updated 2024-04-16

Other Useful Business Software

Our Free Plans just got better! | Auth0 Icon

Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now

Run applications fast and securely in a fully managed environment Icon

Run applications fast and securely in a fully managed environment

Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.

Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.

Try for free

Recommended Projects

Transformer Engine
A library for accelerating Transformer models on NVIDIA GPUs
Transformers-Interpret
Model explainability that works seamlessly with Hugging Face
Diffusers
State-of-the-art diffusion models for image and audio generation
Oumi
Everything you need to build state-of-the-art foundation models
OpenLLM
Operating LLMs in production