jesus_zen_drod

Personal log

  • consider implementing a consistent, structured syntax (i.e. XML- or JSON-based) for all future BGEs
1 year ago

Conclusions regarding the HTML parser:
  • works reasonably well so far, though there may be occasional data loss
  • too many inconsistencies in the text data to reliably restructure it; human post-processing of the script output is a must

1 year ago

splitting references to extract their Art./Abs./lit. components.

1 year ago
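
A minimal sketch of the reference splitting noted above, assuming references look roughly like "Art. 8 Abs. 1 lit. a ZGB". The pattern, the function name `split_reference`, and the optional trailing law abbreviation are illustrative assumptions, not the actual script; real references are inconsistent enough that the output would still need manual checking.

```python
import re

# Assumed reference shape: "Art. 8 Abs. 1 lit. a ZGB".
# Abs., lit. and the trailing law abbreviation are all treated as optional.
REF_PATTERN = re.compile(
    r"Art\.\s*(?P<art>\d+[a-z]*)"                      # article number, e.g. 8 or 29a
    r"(?:\s+Abs\.\s*(?P<abs>\d+(?:bis|ter|quater)?))?"  # paragraph, e.g. 1 or 2bis
    r"(?:\s+lit\.\s*(?P<lit>[a-z]))?"                   # letter, e.g. a
    r"(?:\s+(?P<law>[A-Z][A-Za-z]+))?"                  # law abbreviation, e.g. ZGB
)


def split_reference(ref):
    """Return the components found in the first reference in `ref`, or None."""
    match = REF_PATTERN.search(ref)
    if match is None:
        return None
    # Keep only the components that were actually present.
    return {key: value for key, value in match.groupdict().items() if value is not None}


if __name__ == "__main__":
    for text in ("Art. 8 Abs. 1 lit. a ZGB", "Art. 29a BV", "no reference here"):
        print(text, "->", split_reference(text))
```

Because the law abbreviation is matched as any capitalised word after the reference, a capitalised word that merely follows the reference in running text would be picked up as well; that is one of the cases where human post-processing stays necessary.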