IAD Index of Academic Documents
Journal : Journal of Old Turkic Studies
Volume : 9, Issue : 2

Old Uyghur OCR: The First Work-in-Progress via Reproducing Fine-tuning of VLMs

Authors : Mehmet Oguz Derin, Erdem Uçar
Pages : 313-318
DOI : 10.35236/jots.1762980
Publication Date : 2025-08-18
Article Type : Research Paper
Abstract : Old Uyghur manuscripts remain largely inaccessible because no optical character recognition (OCR) system is aligned with modern scholarly practices. To address this, we present a work‑in‑progress OCR approach that fine-tunes vision–language models (VLMs) for Old Uyghur transliteration. Specifically, LLaMA‑3.2‑11B‑Vision was fine‑tuned with 4‑bit Unsloth quantization and LoRA on 525 manually annotated pages from Radloff and Malov’s edition of Altun Yaruk Sudur, following Unicode‑based transliteration guidelines. On a held‑out test set of 16 pages from one bölük (part) of the fifth tägzinč, the approach yielded a character error rate (CER) of 5.46% and a normalized edit distance (NED) of 0.286. These results demonstrate feasibility; however, the approach is limited by binarized scans, a single woodblock style, a small training set, and the lack of positional supervision. Consequently, the findings provide a baseline and a reproducible path toward robust Old Uyghur OCR.
Keywords : OCR, Vision–Language Models, Old Uyghur, Unicode‑based transliteration, Low‑resource, Reproducibility
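
The abstract reports a character error rate (CER) and a normalized edit distance (NED) but does not spell out how either is computed. The following is a minimal sketch under common definitions, which are assumptions here and not confirmed by the source: CER as Levenshtein distance divided by the reference length, and NED as Levenshtein distance divided by the longer of the two strings. The function names and the sample strings are hypothetical, and the NFC normalization step is an illustrative choice for diacritic-heavy transliteration, not the authors' documented procedure.

```python
import unicodedata


def levenshtein(ref: str, hyp: str) -> int:
    """Character-level edit distance (insertions, deletions, substitutions)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting i reference characters to reach an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def normalize(text: str) -> str:
    """Assumed preprocessing: compose diacritics so each letter is one code point."""
    return unicodedata.normalize("NFC", text)


def cer(ref: str, hyp: str) -> float:
    """Assumed definition: edit distance divided by reference length."""
    ref, hyp = normalize(ref), normalize(hyp)
    if not ref:
        raise ValueError("reference must be non-empty")
    return levenshtein(ref, hyp) / len(ref)


def ned(ref: str, hyp: str) -> float:
    """Assumed definition: edit distance divided by the longer string's length."""
    ref, hyp = normalize(ref), normalize(hyp)
    longest = max(len(ref), len(hyp))
    return 0.0 if longest == 0 else levenshtein(ref, hyp) / longest


if __name__ == "__main__":
    # Hypothetical reference and model output, not taken from the paper's data.
    reference = "altun yaruk"
    hypothesis = "altun yaruq"
    print(f"CER = {cer(reference, hypothesis):.4f}")
    print(f"NED = {ned(reference, hypothesis):.4f}")
```

Under these definitions the two metrics coincide whenever the hypothesis is no longer than the reference, so reported values that differ substantially may reflect a different normalization or aggregation (for example, per-page versus corpus-level averaging) than the one sketched here.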


