Download Latest Version grobid-core-0.8.1.jar (15.8 MB)
Email in envelope

Get an email when there's a new version of GROBID

Home / 0.8.1
Name Modified Size InfoDownloads / Week
Parent folder
grobid-trainer-0.8.1.jar 2024-09-14 15.1 MB
grobid-service-0.8.1.jar 2024-09-14 3.2 MB
grobid-core-0.8.1.jar 2024-09-14 15.8 MB
0.8.1 source code.tar.gz 2024-09-14 406.5 MB
0.8.1 source code.zip 2024-09-14 411.1 MB
README.md 2024-09-14 1.5 kB
Totals: 6 Items   851.8 MB 0

Added

  • Identified URLs are now added in the TEI output [#1099]
  • Added DL models for patent processing [#1082]
  • Copyrights owner and licenses identification models [#1078]
  • Add research infrastructure recognition for funding processing [#1085]
  • Add paragraphs coordinates in the TEI output [#1068]
  • Specify configuration file with DL models enabled for the full docker image [#1117]
  • Support for biblio-glutton 0.3 [#1086]

Changed

  • Update affiliation process [#1069]
  • Improved the recognition of URLs using (when available) PDF annotations, such as clickable links
  • Updated TEI schema [#1084]
  • Review patent process [#1082]
  • Add Kotlin language to support development and testing [#1096]

Fixed

  • Avoid splitting URLs between sentences [#1097]
  • Add missing sentence segmentation in funding and acknowledgement [#1106]
  • Docker image was optimized to reduce the needed space [#1088]
  • Fixed OOBE when processing large quantities of notes [#1075]
  • Corrected <title> coordinate attribute name [#1070]
  • Fix missing coordinates in paragraph continuation [#1076]
  • Fixed JSON log output
  • Fixed notes identification [#1124]
  • Fixed extraneous semicolon in the training data [#1133]
  • Reduced security vulnerabilities in the dependencies [#1136] [#1137]

New Contributors

Source: README.md, updated 2024-09-14