jesus_zen_drod

  • Consider adopting a consistent, structured syntax (i.e. XML- or JSON-based) for all future BGEs.
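
As a rough illustration of what a JSON-based form could look like, here is a minimal sketch. The field names (`identifier`, `references`, `art`, `abs`, `lit`) and the `EXAMPLE-1` identifier are hypothetical placeholders, not an agreed schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List, Optional


@dataclass
class LegalReference:
    """One cited provision; field names are illustrative assumptions."""
    art: str
    abs: Optional[str] = None
    lit: Optional[str] = None


@dataclass
class Decision:
    """A single decision record; 'identifier' is a hypothetical key."""
    identifier: str
    references: List[LegalReference] = field(default_factory=list)


def to_json(decision: Decision) -> str:
    # asdict() recurses into nested dataclasses and lists.
    return json.dumps(asdict(decision), ensure_ascii=False, sort_keys=True)


if __name__ == "__main__":
    example = Decision("EXAMPLE-1", [LegalReference("8", "2", "a")])
    print(to_json(example))
```

A fixed record shape like this would let a script validate each entry instead of guessing at free-text layout.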
2 years ago

Conclusions regarding the HTML parser:

  • Works reasonably well so far, though some occasional data loss may occur.
  • The text data has too many inconsistencies to be restructured reliably; human post-processing of the script output is a must.
2 years ago

Splitting references to extract their Art./Abs./lit. components.
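
A minimal sketch of such a split, assuming references look roughly like "Art. 8 Abs. 2 lit. a" with the Abs. and lit. parts optional. The pattern and the `split_reference` helper are illustrative assumptions, not the actual script; real references are likely to contain variants this does not cover.

```python
import re
from typing import Dict, Optional

# Assumed shape: "Art. <number>[suffix] [Abs. <number>[suffix]] [lit. <letters>]"
REFERENCE_PATTERN = re.compile(
    r"Art\.\s*(?P<art>\d+[a-z]*)"
    r"(?:\s+Abs\.\s*(?P<abs>\d+[a-z]*))?"
    r"(?:\s+lit\.\s*(?P<lit>[a-z]+))?"
)


def split_reference(text: str) -> Optional[Dict[str, Optional[str]]]:
    """Return the Art./Abs./lit. parts of the first reference found, or None."""
    match = REFERENCE_PATTERN.search(text)
    if match is None:
        return None
    # Components that are absent from the reference come back as None.
    return {
        "art": match.group("art"),
        "abs": match.group("abs"),
        "lit": match.group("lit"),
    }


if __name__ == "__main__":
    print(split_reference("Art. 8 Abs. 2 lit. a"))
    print(split_reference("Art. 12a"))
```

Returning `None` for unmatched input makes it easy to collect the references that still need manual review.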

2 years ago