Download Latest Version v2.0.1_ Fix Python 3.12.3 compatibility source code.tar.gz (373.5 kB)
Email in envelope

Get an email when there's a new version of Curated Transformers

Home / v2.0.0
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2024-04-16 972 Bytes
v2.0.0 (Superposition) source code.tar.gz 2024-04-16 373.6 kB
v2.0.0 (Superposition) source code.zip 2024-04-16 489.0 kB
Totals: 3 Items   863.6 kB 1

✨ New features and improvements

  • Register models using catalogue to support external models in Auto{Decoder,Encoder,CausalLM} (#351, [#352]).
  • Add support for loading parameters in-place (#370).
  • Support for ELECTRA models (#358).
  • Add support for write/upload operations with HFHubRepository (#354).
  • Add support for converting Curated Transformer configs to HF-compatible configs (#333).

🔴 Bug fixes

  • Support PyTorch 2.2 (#360).

⚠️ Backwards incompatibilities

  • Support for TorchScript tracing is removed (#361).
  • The qkv_split argument is now mandatory for AttentionHeads, AttentionHeads.uniform, AttentionHeads.multi_query, and AttentionHeads.key_value_broadcast (#374).
  • All FromHFHub mixins are renamed to FromHF (#374).
  • FromHF.convert_hf_state_dict is removed in favor of FromHF.state_dict_from_hf (#374).

👥 Contributors

@danieldk, @honnibal, @ines, @KennethEnevoldsen, @shadeMe

Source: README.md, updated 2024-04-16