Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/17803
Title: A SCALABLE, LOW-COST FRAMEWORK FOR MULTILINGUAL INTELLIGENT DOCUMENT PROCESSING FOR CONTINUITY OF CARE
Authors: Kale, Apoorwa
Khandelwal, Yash
Pandhare, Vibhor
Ghosh, Atreyee
Pathak, Nidhi
Lad, Bhupesh Kumar
Issue Date: 2025
Publisher: Institution of Engineering and Technology
Citation: Kale, A., Khandelwal, Y., Pandhare, V., Ghosh, A., Pathak, N., Saitya, B. S., & Lad, B. K. (2025). A SCALABLE, LOW-COST FRAMEWORK FOR MULTILINGUAL INTELLIGENT DOCUMENT PROCESSING FOR CONTINUITY OF CARE. IET Conference Proceedings, 2025(28), 161–166. https://doi.org/10.1049/icp.2025.3682
Abstract: Paper-based prescriptions and reports constitute a major part of the medical health records across the globe. Accordingly, paper-based manual data recording is a common practice in the Community Health Centers (CHCs) in India. These records result in poor handling, fragmented information, inefficient data retrieval, sharing, and storage of clinical data. To address this gap, we present Intelligent Document Processing Application (IDPA), a low-cost, scalable data digitization pipeline combining Optical Character Recognition (OCR) and Vision-Language Models (VLMs) for converting bilingual (Hindi-English), handwritten, and numerical medical records into structured digital formats. IDPA comprises a two-stage pipeline, where Stage 1 employs table cell segmentation using OpenCV and Stage 2 uses OCR extraction with PaliGemma VLM. As a proof-of-concept, the application was tested using a dataset of 150 patient records of the Indian population, which exhibited prevalent data input issues including overwritten texts, obscured columns, and the application of whiteners. PaliGemma, refined using over 650 labelled table cell images, attained 74% accuracy and a 13% Character Error Rate (CER), outperforming other open-source VLM models in extracting the medical records. The extracted data is organized into structured dataframes, served through a FastAPI endpoint, and accessible through a Progressive Web App (PWA). The interface supports secure user authentication via Clerk API and enables real-time image upload, editable tabular outputs, and data export in CSV/PDF formats. Together, these digital tools offer an affordable, user-centric approach to improve healthcare data management in low-resource settings. They hold strong potential for integration with national health systems, improvement of continuity of care, enablement of longitudinal monitoring, and expansion into predictive analytics for clinical decision support. © This is an open access article published by the IET under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/)
URI: https://dx.doi.org/10.1049/icp.2025.3682
https://dspace.iiti.ac.in:8080/jspui/handle/123456789/17803
ISBN: 9781807050351
9781807050207
9781837247257
9781837249916
9781807050375
9781837245277
9781837247295
9781837247264
9781837247325
9781839537776
Type of Material: Conference Paper
Appears in Collections:Department of Mechanical Engineering
IITI DRISHTI CPS Foundation
Mehta Family School of Biosciences and Biomedical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: