Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
py_data_juicer-1.3.2-py3-none-any.whl | 2025-04-27 | 485.2 kB | |
README.md | 2025-04-25 | 1.0 kB | |
Release v1.3.2_ Enhancements on usability _ two OPs_ some bugs fixes source code.tar.gz | 2025-04-25 | 32.2 MB | |
Release v1.3.2_ Enhancements on usability _ two OPs_ some bugs fixes source code.zip | 2025-04-25 | 32.7 MB | |
Totals: 4 Items | 65.4 MB | 0 |
What's Changed
- Human OP enhancements, in https://github.com/modelscope/data-juicer/pull/642 https://github.com/modelscope/data-juicer/pull/645
- update label-studio version
- make service script more robust
- add documentation
- optimizing fields mapping
- OP efficiency optimization of
document_minhash_deduplicator
, in https://github.com/modelscope/data-juicer/pull/639 - set temp_parser.usage to argparse.SUPPRESS, skip too much help log in https://github.com/modelscope/data-juicer/pull/643
- fix date typo by in https://github.com/modelscope/data-juicer/pull/648
- Fix docker building failure in https://github.com/modelscope/data-juicer/pull/650
- Fix StreamToLoguru compatibility issue with torch._dynamo in https://github.com/modelscope/data-juicer/pull/651
- add init file for annotation module, fix dj-process command error in https://github.com/modelscope/data-juicer/pull/652
New Contributor
- @cmgzn made their first contribution in https://github.com/modelscope/data-juicer/pull/651