| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2024-04-16 | 972 Bytes | |
| v2.0.0 (Superposition) source code.tar.gz | 2024-04-16 | 373.6 kB | |
| v2.0.0 (Superposition) source code.zip | 2024-04-16 | 489.0 kB | |
| Totals: 3 Items | 863.6 kB | 1 | |
✨ New features and improvements
- Register models using
catalogueto support external models inAuto{Decoder,Encoder,CausalLM}(#351, [#352]). - Add support for loading parameters in-place (#370).
- Support for ELECTRA models (#358).
- Add support for write/upload operations with
HFHubRepository(#354). - Add support for converting Curated Transformer configs to HF-compatible configs (#333).
🔴 Bug fixes
- Support PyTorch 2.2 (#360).
⚠️ Backwards incompatibilities
- Support for TorchScript tracing is removed (#361).
- The
qkv_splitargument is now mandatory forAttentionHeads,AttentionHeads.uniform,AttentionHeads.multi_query, andAttentionHeads.key_value_broadcast(#374). - All
FromHFHubmixins are renamed toFromHF(#374). FromHF.convert_hf_state_dictis removed in favor ofFromHF.state_dict_from_hf(#374).
👥 Contributors
@danieldk, @honnibal, @ines, @KennethEnevoldsen, @shadeMe