Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/17803
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKale, Apoorwaen_US
dc.contributor.authorKhandelwal, Yashen_US
dc.contributor.authorPandhare, Vibhoren_US
dc.contributor.authorGhosh, Atreyeeen_US
dc.contributor.authorPathak, Nidhien_US
dc.contributor.authorLad, Bhupesh Kumaren_US
dc.date.accessioned2026-02-10T15:50:11Z-
dc.date.available2026-02-10T15:50:11Z-
dc.date.issued2025-
dc.identifier.citationKale, A., Khandelwal, Y., Pandhare, V., Ghosh, A., Pathak, N., Saitya, B. S., & Lad, B. K. (2025). A SCALABLE, LOW-COST FRAMEWORK FOR MULTILINGUAL INTELLIGENT DOCUMENT PROCESSING FOR CONTINUITY OF CARE. IET Conference Proceedings, 2025(28), 161–166. https://doi.org/10.1049/icp.2025.3682en_US
dc.identifier.isbn9781807050351-
dc.identifier.isbn9781807050207-
dc.identifier.isbn9781837247257-
dc.identifier.isbn9781837249916-
dc.identifier.isbn9781807050375-
dc.identifier.isbn9781837245277-
dc.identifier.isbn9781837247295-
dc.identifier.isbn9781837247264-
dc.identifier.isbn9781837247325-
dc.identifier.isbn9781839537776-
dc.identifier.otherEID(2-s2.0-105027169575)-
dc.identifier.urihttps://dx.doi.org/10.1049/icp.2025.3682-
dc.identifier.urihttps://dspace.iiti.ac.in:8080/jspui/handle/123456789/17803-
dc.description.abstractPaper-based prescriptions and reports constitute a major part of the medical health records across the globe. Accordingly, paper-based manual data recording is a common practice in the Community Health Centers (CHCs) in India. These records result in poor handling, fragmented information, inefficient data retrieval, sharing, and storage of clinical data. To address this gap, we present Intelligent Document Processing Application (IDPA), a low-cost, scalable data digitization pipeline combining Optical Character Recognition (OCR) and Vision-Language Models (VLMs) for converting bilingual (Hindi-English), handwritten, and numerical medical records into structured digital formats. IDPA comprises a two-stage pipeline, where Stage 1 employs table cell segmentation using OpenCV and Stage 2 uses OCR extraction with PaliGemma VLM. As a proof-of-concept, the application was tested using a dataset of 150 patient records of the Indian population, which exhibited prevalent data input issues including overwritten texts, obscured columns, and the application of whiteners. PaliGemma, refined using over 650 labelled table cell images, attained 74% accuracy and a 13% Character Error Rate (CER), outperforming other open-source VLM models in extracting the medical records. The extracted data is organized into structured dataframes, served through a FastAPI endpoint, and accessible through a Progressive Web App (PWA). The interface supports secure user authentication via Clerk API and enables real-time image upload, editable tabular outputs, and data export in CSV/PDF formats. Together, these digital tools offer an affordable, user-centric approach to improve healthcare data management in low-resource settings. They hold strong potential for integration with national health systems, improvement of continuity of care, enablement of longitudinal monitoring, and expansion into predictive analytics for clinical decision support. © This is an open access article published by the IET under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/)en_US
dc.language.isoenen_US
dc.publisherInstitution of Engineering and Technologyen_US
dc.sourceIET Conference Proceedingsen_US
dc.titleA SCALABLE, LOW-COST FRAMEWORK FOR MULTILINGUAL INTELLIGENT DOCUMENT PROCESSING FOR CONTINUITY OF CAREen_US
dc.typeConference Paperen_US
Appears in Collections:Department of Mechanical Engineering
IITI DRISHTI CPS Foundation
Mehta Family School of Biosciences and Biomedical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: