I have several questions about the new note_nlp table
- offset field is described as an integer in the google document (https://docs.google.com/document/d/1ykYVJTQ5MuI7eh_Nk7xzt44EzNjVs71nq2LIsC_RlOg/edit#) However, is is described and implemented as a character(250) every where else. It makes sense to be an “integer”.
- Offset means the index of the extracted part ? Then why not replace it with “begin”, “end” that are represents begining index and end index of the extracted part? Tools such UIMA should works on such information. offset is not sufficient.
- lexical_variant field: it is only 250 character long.
- What about Notes Sections ? Are they supposed to be stored in the note_nlp table or in the note table ? FHIR share the Notes as Composition resource. And it stores only sections(https://www.hl7.org/fhir/valueset-doc-section-codes.html). But splitting notes into section is a NLP task, that is why I ask the question