Zawilinski: a library for studying grammar in Wiktionary
Track: Posters
Authors: Zachary Kurmas (Grand Valley State University, USA)
Abstract
We present Zawilinski, a Java library that supports the extraction and analysis of grammatical data in Wiktionary. Zawilinski can efficiently (1) filter Wiktionary for content pertaining to a specified language and (2) extract a word’s inflections from its Wiktionary entry. We have thus far used Zawilinski to (1) measure the correctness of the inflections for a subset of the Polish words in the English Wiktionary and (2) show that this grammatical data is very stable: only 131 of 4748 Polish words have had their inflection data corrected. We also explain Zawilinski’s key features and discuss how it can be used to simplify the development of additional grammar-based analyses.
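The two tasks named in the abstract, isolating one language's portion of an entry and locating its inflection data, can be illustrated with a small, independent sketch. It does not use Zawilinski's API, which is not described here; it relies only on `java.util.regex` and on the English Wiktionary convention that each language section begins with a level-2 heading such as `==Polish==`. The class name, the method names, the sample entry, and the template-name prefix `pl-decl` are all assumptions made for illustration, and the template search ignores nested templates.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; not part of Zawilinski.
public class LanguageSectionSketch {

    // A level-2 heading on its own line, e.g. "==Polish==".
    // Deeper headings such as "===Noun===" are rejected by [^=\n].
    private static final Pattern LEVEL2_HEADING =
            Pattern.compile("^==\\s*([^=\\n].*?)\\s*==\\s*$", Pattern.MULTILINE);

    // Returns the wikitext between "==language==" and the next level-2
    // heading (or the end of the entry); returns "" if the language is absent.
    static String extractLanguageSection(String wikitext, String language) {
        Matcher m = LEVEL2_HEADING.matcher(wikitext);
        int start = -1;
        while (m.find()) {
            if (start >= 0) {
                // First level-2 heading after the one we wanted: section ends here.
                return wikitext.substring(start, m.start()).trim();
            }
            if (m.group(1).equals(language)) {
                start = m.end();
            }
        }
        return start >= 0 ? wikitext.substring(start).trim() : "";
    }

    // Returns every non-nested template call whose name starts with namePrefix.
    // The prefix passed by callers (e.g. "pl-decl") is an assumed naming scheme.
    static List<String> findTemplates(String section, String namePrefix) {
        Pattern p = Pattern.compile(
                "\\{\\{\\s*" + Pattern.quote(namePrefix) + "[^{}]*\\}\\}");
        Matcher m = p.matcher(section);
        List<String> found = new ArrayList<>();
        while (m.find()) {
            found.add(m.group());
        }
        return found;
    }

    public static void main(String[] args) {
        // Made-up entry text, for illustration only.
        String entry = String.join("\n",
                "==English==",
                "===Noun===",
                "{{en-noun}}",
                "",
                "==Polish==",
                "===Noun===",
                "====Declension====",
                "{{pl-decl-noun|dom|domy}}",
                "",
                "==Czech==",
                "===Noun===");

        String polish = extractLanguageSection(entry, "Polish");
        System.out.println(polish);
        System.out.println(findTemplates(polish, "pl-decl"));
    }
}
```

A regex-based approach like this is enough to show the idea on a single entry. Processing a full Wiktionary dump would also require streaming the XML export rather than holding entries in one string.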