Saturday, 29 August 2009

Dutch Text Interpretation Aid 4

Today I added support for nouns and other lexical categories to the Dutch Text Interpretation Aid software tool. I downloaded the lists of adjectives, adverbs, conjunctions, nouns, prepositions, and pronouns available at www.muiswerk.nl. I then used some software routines to retrieve dictionary information for each lemma from Wiktionary, where available. As you can see in the following screenshot, many more words are now recognised. To further expand the lexicon, I will develop functionality to add lemmas (with conjugation information for verbs) and to manage the lemmatization rules within the software tool.
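To make the idea concrete, here is a minimal sketch of how word lists per lexical category could be merged into one lexicon and used to recognise words in a text. It is not the tool's actual code: the sample words, the suffix rules in `LEMMA_RULES`, and the function names `build_lexicon`, `lookup`, and `recognise` are all assumptions made for illustration, and the Wiktionary retrieval step is left out.

```python
from collections import defaultdict

# Hypothetical sample data; a real lexicon would be loaded from full word lists.
WORD_LISTS = {
    "noun": ["boek", "tafel", "huis"],
    "adjective": ["groot", "klein"],
    "adverb": ["niet", "vaak"],
    "conjunction": ["en", "maar"],
    "preposition": ["op", "in"],
    "pronoun": ["ik", "het"],
}

# Hypothetical lemmatization rules: (suffix to strip, replacement), tried in order.
# Real Dutch needs spelling adjustments too (e.g. "grote" -> "groot"); not handled here.
LEMMA_RULES = [("en", ""), ("s", ""), ("e", "")]


def build_lexicon(word_lists):
    """Map each lowercase lemma to the set of categories it appears in."""
    lexicon = defaultdict(set)
    for category, words in word_lists.items():
        for word in words:
            lexicon[word.lower()].add(category)
    return dict(lexicon)


def lookup(token, lexicon, rules=LEMMA_RULES):
    """Return (lemma, sorted categories) for a token, or None if unrecognised."""
    word = token.lower()
    if word in lexicon:
        return word, sorted(lexicon[word])
    # Fall back to the suffix rules and retry with each candidate lemma.
    for suffix, replacement in rules:
        if word.endswith(suffix) and len(word) > len(suffix):
            candidate = word[: -len(suffix)] + replacement
            if candidate in lexicon:
                return candidate, sorted(lexicon[candidate])
    return None


def recognise(text, lexicon):
    """Pair every whitespace-separated token with its lookup result."""
    return [(tok, lookup(tok.strip(".,;:!?"), lexicon)) for tok in text.split()]


if __name__ == "__main__":
    lexicon = build_lexicon(WORD_LISTS)
    for token, result in recognise("Ik lees boeken op het huis.", lexicon):
        print(token, "->", result)
```

Keeping the rules in a plain list is one way to make them manageable from within the tool later on: adding or reordering a rule changes the data, not the lookup code. Words that match no entry and no rule, such as the verb "lees" here, simply come back as unrecognised.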

