Increasing the accuracy of handwriting text recognition in medical prescriptions with generative artificial intelligence

Authors

DOI:

https://doi.org/10.15587/2706-5448.2023.284998

Keywords:

handwriting recognition, , generative artificial intelligence, recognition algorithms, deep neural networks

Abstract

The object of the research is a system for recognizing handwritten text in medical prescriptions. The peculiarities of handwriting, the variety of calligraphy styles, as well as the specificity of medical prescriptions, create many problems and challenges for recognition algorithms, causing errors and reducing recognition accuracy.

The work presents a new system with additional components of post-processing the recognition results to increase the accuracy of the final results. An algorithm for combining words into lines and blocks is proposed, which makes it possible to group words while preserving contextual connections between them. Also, a generative neural network with a large language model is used to analyze the recognition result and correct possible errors. The results of the testing show an improvement in recognition accuracy by 0.13 %. Successful cases of generative artificial intelligence usage are analyzed, as well as examples of the results deterioration, that are related to grammatical errors in the initial input data.

The obtained results show the use of generative artificial intelligence as an additional step for processing the recognition results really can improve the accuracy of text recognition systems. The results of the study can be used for further experiments to improve recognition results in other tasks related to text recognition and in related fields.

Author Biographies

Oleg Yakovchuk, National Technical University of Ukraine «Igor Sikorsky Kyiv Polytechnic Institute»

Assistant, Postgraduate Student

Department of System Design

Maksym Vasin, National Technical University of Ukraine «Igor Sikorsky Kyiv Polytechnic Institute»

Department of System Design

References

  1. Baniulyte, G., Rogerson, N., Bowden, J. (2023). Evolution – removing paper and digitising the hospital. Health and Technology, 13 (2), 263–271. doi: https://doi.org/10.1007/s12553-023-00740-8
  2. Dhar, D., Garain, A., Singh, P. K., Sarkar, R. (2020). HP_DocPres: a method for classifying printed and handwritten texts in doctor’s prescription. Multimedia Tools and Applications, 80 (7), 9779–9812. doi: https://doi.org/10.1007/s11042-020-10151-w
  3. Hucka, M. (2022). Caltechlibrary/handprint: Release 1.5.6 (v1.5.6). CaltechDATA. doi: https://doi.org/10.22002/D1.20059
  4. Schmidt, R. (2019). Recurrent Neural Networks (RNNs): A gentle Introduction and Overview. doi: https://doi.org/10.48550/arXiv.1912.05911
  5. Graves, A., Fernández, S., Gomez, F., Schmidhuber, J. (2006). Connectionist temporal classification. Proceedings of the 23rd International Conference on Machine Learning – ICML ’06. doi: https://doi.org/10.1145/1143844.1143891
  6. Dhar, D., Garain, A., Singh, P. K., Sarkar, R. (2020). HP_DocPres: a method for classifying printed and handwritten texts in doctor’s prescription. Multimedia Tools and Applications, 80 (7), 9779–9812. doi: https://doi.org/10.1007/s11042-020-10151-w
  7. Yakovchuk, O., Cherneha, A., Zhelezniakov, D., Zaytsev, V. (2020). Methods for Lines and Matrices Segmentation in RNN-based Online Handwriting Mathematical Expression Recognition Systems. 2020 IEEE Third International Conference on Data Stream Mining & Processing (DSMP). doi: https://doi.org/10.1109/dsmp47368.2020.9204273
  8. Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I. (2019) Language Models are Unsupervised Multitask Learners. Available at: https://www.semanticscholar.org/paper/Language-Models-are-Unsupervised-Multitask-Learners-Radford-Wu/9405cc0d6169988371b2755e573cc28650d14dfe
  9. Child, R., Gray, S., Radford, A., Sutskever, I. (2019). Generating Long Sequences with Sparse Transformers. doi: https://doi.org/10.48550/arXiv.1904.10509
  10. Vaswani, A., Shazeer, N., Parmar, N. (2017). Attention Is All You Need. doi: https://doi.org/10.48550/arXiv.1706.03762
  11. Brown, B., Mann, B., Ryder, N., Subbiah, M. (2020). Language Models are Few-Shot Learners. Available at: https://arxiv.org/pdf/2005.14165.pdf
Increasing the accuracy of handwriting text recognition in medical prescriptions with generative artificial intelligence

Downloads

Published

2023-08-28

How to Cite

Yakovchuk, O., & Vasin, M. (2023). Increasing the accuracy of handwriting text recognition in medical prescriptions with generative artificial intelligence. Technology Audit and Production Reserves, 4(2(72), 18–21. https://doi.org/10.15587/2706-5448.2023.284998

Issue

Section

Information Technologies