This weekend, I decided to run the automated parser with what little analysis I have – that is productive noun morphology and about a third of the productive adjective morphology – hopefully I’ll have adjectives done today and I can move on to either to pronouns or the huge task of verbs.
Anyway, since I didn’t have much analysis to start with, I wasn’t if anything would happen when it ran and at the time I didn’t see and change, so I assumed: 1) the parser is designed to only work with the International phonetic alphabet or 2) because I have so little analysis done, there wasn’t anything for it to parse.
Well both were wrong. When I pulled up the program this morning, I was going along analyzing adjectives and I saw this:
Now the blue boxes are simply words/forms that are identical to ones I’ve already parsed – that is, since I’ve done the Accusative Masculine Singular form of δίκαιος, it makes the suggestion that these should be parsed the same – that in of itself is convenient since if I have a word analyzed, it would suggest an analysis for every other occurrence of that word – which basically means that every instances of the article, when I’ve gotten to it, will be taken care of instantly. Anyway, because of the masculine/neuter neutralization, the parser is, of course, wrong here. But that’s not what’s exciting. Well, its still exciting up to this point, I wasn’t sure if I could make it work at all.
What’s exciting is the orangish-pink box (salmon?). That color marks the analysis of automated parser. When I had ran it, I had *δικαί in my lexicon as an alternate form of *δίκαι. And I also had the morpheme –ου “gen.m.sg” in my lexicon. But never had I actually analyzed or parsed the form. The program did itself -granted it did it wrong since it should actually be neuter, not masculine in gender. But even still, it did and that means that I can make it work, though it will take a good amount of effort more before its ready.
In my excitement, I decided to put the parser to the text more directly. I completed entering the analysis for the *δίκαι type of adjective and then added the root *κόκκιν to my lexicon with its alternate stress form (*κοκκίν). The difference between adjectives *δίκαι and *κόκκιν adjectives is in the feminine (hence, below its labeled Productive Ib instead of Productive II). So if the parser works, then I should be able to parse all the masculine and neuter forms. Here’s the result:
It got everything right except for the neutralization between the vocative and nominative cases, which is easy to fix. That’s simply beautiful. I have since updated the lexicon so that now, the nominative is the primary sense for the morpheme –οι.
The final goal, I think (read: I hope), for my presentation at BibleTech:2009 in terms of parsing will be to complete as much morphological analysis as possible and build a small lexicon of the most common words in Ephesians and demonstrate the parser. How much can I get done before the end of March and write the paper? Well, we’ll find out…but its encouraging that it actually works! Especially after the hours upon hours of work I’ve put into this particular database over the past three months and into the program over the past year. I think I’ve restarted from scratch four times out of sheer frustration. I’m finally making headway.
Looking ahead, what worries me the most is dealing with contract verbs…