
zaterdag 29 augustus 2009
Dutch Text Interpretation Aid 4
Today I added support for nouns and other lexical categories in the Dutch Text Interpretation Aid software tool. I downloaded lists of adjectives, adverbs, conjunctions, nouns, prepositions, and pronouns available at www.muiswerk.nl. Then I used some software routines to retrieve dictionary information for the lemmas, if available on Wiktionary. As you can see in the following screenshot a lot of words are now being recognised. To futher expand the lexicon, I will develop functionality to add lemmas (with conjugation information for verbs), and to manage the lemmatization rules within the software tool.


Labels:
Dutch,
Software,
Text interpretation
Abonneren op:
Reacties posten (Atom)
Geen opmerkingen:
Een reactie posten