...
Mark regions to extract from Lab Reports. (Using Opensource Label Studio)
From the regions, extract the text (OCR of printed text) using Tesseract models.
Use NLP libraries like MedCat and Spacy for extraction of "meaning" from text (like identifying patient ID, name or clinical term).
Receive a JSON representation of original Lab report, with appropriate data elements extracted and identified.
Tip |
---|
Video recording of the PoC showcase: https://talk.openmrs.org/t/bahmni-pat-call-on-20-apr-2022-wednesday/36454/7 |
Reference materials
Digital Scanned Documents for Bahmni - Initial Proposal by KCDH: (Presentation Link)
IIT/KCDH: https://rnd.iitb.ac.in/research-glimpse/adaptive-framework-end-end-corrections-indic-ocr
Sample Lab Reports
...