Objective

Provide OCR abilities in Bahmni

Due date

Key outcomes

Phase 1, Outcome A: Ability to scan COVID RT-PCR test results into Bahmni via OCR.

Status

in-Analysis (yellow)
Prototype (yellow)

Collaborators

KCDH/IIT + Thoughtworks

Slack

#bahmni-ocr

Code Repo

https://github.com/venkatapathy/ocr-editor

Issue List

https://github.com/venkatapathy/ocr-editor/issues

Problem Statement

Provide OCR abilities in Bahmni

...

Code: https://github.com/document-analysis-tools/ocr-ner-extractor

Mark the regions to extract from lab reports, using the open-source Label Studio.
From those regions, extract the printed text via OCR, using Tesseract models.
Use NLP libraries such as MedCAT and spaCy to extract "meaning" from the text (for example, identifying a patient ID, a name or a clinical term).
Receive a JSON representation of the original lab report, with the appropriate data elements extracted and identified.
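
The four steps above can be sketched roughly as follows. This is a minimal illustration, not the code in the repository: the region labels, the `build_report_json` helper and the result vocabulary are all hypothetical. The OCR step is passed in as a function so that a Tesseract call (for example `pytesseract.image_to_string` on a cropped region) could be plugged in, and a simple regular expression stands in for the MedCAT/spaCy step.

```python
import json
import re
from typing import Callable, Dict, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), in pixels

# Hypothetical result vocabulary; real reports may use other wording.
RESULT_PATTERN = re.compile(
    r"\b(not detected|detected|negative|positive)\b", re.IGNORECASE
)


def build_report_json(regions: Dict[str, Box], ocr: Callable[[Box], str]) -> str:
    """OCR each labelled region and return a JSON document of the fields.

    `regions` maps a label (as marked in the annotation tool) to its box.
    `ocr` returns the raw text found inside a box.
    """
    fields = {}
    for label, box in regions.items():
        # Collapse the line breaks and repeated spaces that OCR output contains.
        text = " ".join(ocr(box).split())
        fields[label] = {"box": list(box), "text": text}

    # Regex stand-in for the NLP step: read the outcome from the "result" region.
    result_text = fields.get("result", {}).get("text", "")
    match = RESULT_PATTERN.search(result_text)
    interpretation = match.group(1).lower() if match else None

    return json.dumps({"fields": fields, "interpretation": interpretation}, indent=2)


if __name__ == "__main__":
    # Canned OCR output, so the sketch runs without Tesseract installed.
    sample_regions = {"patient_id": (0, 0, 200, 30), "result": (0, 40, 200, 70)}
    canned = {
        (0, 0, 200, 30): "Patient ID:  ABC123\n",
        (0, 40, 200, 70): "SARS-CoV-2 RNA:\nNot Detected\n",
    }
    print(build_report_json(sample_regions, lambda box: canned[box]))
```

Keeping the OCR step behind a plain callable means the region handling and the JSON shape can be tested without image files or a Tesseract installation.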

Attached files

sample_covid_report.pdf
PDFReportServlet - 139900.pdf
Sample Report.pdf
ReportPrint.pdf