| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| grobid-trainer-0.8.1.jar | 2024-09-14 | 15.1 MB | |
| grobid-service-0.8.1.jar | 2024-09-14 | 3.2 MB | |
| grobid-core-0.8.1.jar | 2024-09-14 | 15.8 MB | |
| 0.8.1 source code.tar.gz | 2024-09-14 | 406.5 MB | |
| 0.8.1 source code.zip | 2024-09-14 | 411.1 MB | |
| README.md | 2024-09-14 | 1.5 kB | |
| Totals: 6 Items | 851.8 MB | 0 | |
Added
- Identified URLs are now added in the TEI output [#1099]
- Added DL models for patent processing [#1082]
- Copyrights owner and licenses identification models [#1078]
- Add research infrastructure recognition for funding processing [#1085]
- Add paragraphs coordinates in the TEI output [#1068]
- Specify configuration file with DL models enabled for the full docker image [#1117]
- Support for biblio-glutton 0.3 [#1086]
Changed
- Update affiliation process [#1069]
- Improved the recognition of URLs using (when available) PDF annotations, such as clickable links
- Updated TEI schema [#1084]
- Review patent process [#1082]
- Add Kotlin language to support development and testing [#1096]
Fixed
- Avoid splitting URLs between sentences [#1097]
- Add missing sentence segmentation in funding and acknowledgement [#1106]
- Docker image was optimized to reduce the needed space [#1088]
- Fixed OOBE when processing large quantities of notes [#1075]
- Corrected
<title>coordinate attribute name [#1070] - Fix missing coordinates in paragraph continuation [#1076]
- Fixed JSON log output
- Fixed notes identification [#1124]
- Fixed extraneous semicolon in the training data [#1133]
- Reduced security vulnerabilities in the dependencies [#1136] [#1137]
New Contributors
- @tanaynayak made their first contribution in https://github.com/kermitt2/grobid/pull/1133
- @vipulg13 made their first contribution in https://github.com/kermitt2/grobid/pull/1137