Today I wrote a small software tool that retrieves dictionary information from Wiktionary. This technique is often called screen scraping since it involves scraping information from the screen (Internet browser).
Most of the verb lemmas recognized by the Dutch verb lemmatizer are already described on Wiktionary. This dictionary information could thus be added to the electronic dictionary used in the Dutch Text Interpretation Aid software tool. Dictionary entries for verb lemmas that currently do not exist, I will add manualy later on.
Next, I will add dictionary entries for nouns and other lexical categories.
woensdag 19 augustus 2009
Abonneren op:
Reacties posten (Atom)
Geen opmerkingen:
Een reactie posten