IAD Index of Academic Documents
  • Home Page
  • About
    • About Izmir Academy Association
    • About IAD Index
    • IAD Team
    • IAD Logos and Links
    • Policies
    • Contact
  • Submit A Journal
  • Submit A Conference
  • Submit Paper/Book
    • Submit a Preprint
    • Submit a Book
  • Contact
  • Turkish Journal of Electrical Engineering and Computer Science
  • Volume:26 Issue:3
  • Implementing universal dependency, morphology, and multiword expression annotation standards for Tur...

Implementing universal dependency, morphology, and multiword expression annotation standards for Turkish language processing

Authors : Umut SULUBACAK, Gülşen ERYİĞİT
Pages : 1662-1672
View : 16 | Download : 12
Publication Date : 0000-00-00
Article Type : Research Paper
Abstract :Released only a year ago as the outputs of a research project insert ignore into journalissuearticles values(``Parsing Web 2.0 Sentences``, supported in part by a TÜBİTAK 1001 grant insert ignore into journalissuearticles values(No. 112E276); and a part of the ICT COST Action PARSEME insert ignore into journalissuearticles values(IC1207););, IMST and IWT are currently the most comprehensive Turkish dependency treebanks in the literature. This article introduces the final states of our treebanks, as well as a newly integrated hierarchical categorization of the multiheaded dependencies and their organization in an exclusive deep dependency layer in the treebanks. It also presents the adaptation of recent studies on standardizing multiword expression and named entity annotation schemes for the Turkish language and integration of benchmark annotations into the dependency layers of our treebanks and the mapping of the treebanks to the latest Universal Dependencies insert ignore into journalissuearticles values(v2.0); standard, ensuring further compliance with rising universal annotation trends. In addition to significantly boosting the universal recognition of Turkish treebanks, our recent efforts have shown an improvement in their syntactic parsing performance insert ignore into journalissuearticles values(up to 77.8{\%}/82.8{\%} LAS and 84.0{\%}/87.9{\%} UAS for IMST/IWT, respectively);. The final states of the treebanks are expected to be more suited to different natural language processing tasks, such as named entity recognition, multiword expression detection, transfer-based machine translation, semantic parsing, and semantic role labeling.
Keywords : Turkish, treebanks, natural language processing, dependency parsing, deep dependencies, multiword expressions, universal dependencies

ORIGINAL ARTICLE URL
VIEW PAPER (PDF)

* There may have been changes in the journal, article,conference, book, preprint etc. informations. Therefore, it would be appropriate to follow the information on the official page of the source. The information here is shared for informational purposes. IAD is not responsible for incorrect or missing information.


Index of Academic Documents
İzmir Academy Association
CopyRight © 2023-2025