Bjarke Walling in profile looking relaxed with closed eyes.

2017

First look at Universal Dependencies — part 2

In part 1 I introduced you to Universal Dependencies and touched on some of the concepts about treebanks. We are now going to take a deeper look at the German treebank. Read full article…

First look at Universal Dependencies — part 1

This and the next post will dive into downloading a treebank, how to parse it, and look at some basic statistics. A treebank is a structured list of words in connection to a text corpus annotated with word type and other essential information. The word type is called a POS tag. Read full article…

Fooling around with German declension

The first posts will be about acquiring some data sets to play around with German language in general. I'm currently learning about German declension, i.e. how to form articles, nouns, and adjectives according to grammatical gender, case, and singular/plural. I want to dig a bit deeper into this topic and look at it from different perspectives using statistics, NLP, and more. Read full article…